This weekend in Generative Media
Studios adopt genAI slowly; AI finds more Nazca lines; Report from AI film screening
News and Opinion
- Hundreds More Nazca Lines Emerge in Peru’s Desert (New York Times) With drones and A.I., researchers managed to double the number of mysterious geoglyphs in a matter of months. 
- One big room full of hopes and schemes: An anecdotal report of a major AI film screening and the reception that followed. (Mike Gioia on Substack) 
- Labelers training AI say they're overworked, underpaid and exploited by big American tech companies (CBS News) 
- Moonvalley wants to build more ethical video models (TechCrunch) 
- Want to speak Italian? Microsoft AI can make it sound like you do. (Washington Post) 
- ChatGPT’s Poetry is Incompetent and Banal (Ernest Davis, NYU) [PDF] 
Software
- Introducing Frames: An image generation model offering unprecedented stylistic control (RunwayML) 
- aiOla unveils open source AI audio transcription model that obscures sensitive info in realtime (VentureBeat) 
- ComfyUI-LTXTricks: A set of nodes that provide additional controls for the LTX Video model (GitHub) 
- ComfyUI_AdvancedReduxControl (GitHub) 
Research
- 🎞️ MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views (NeurIPS 2024 project page) 
- ARM: Appearance Reconstruction Model for Relightable 3D Generation (project page) 
- Direct and Explicit 3D Generation from a Single Image (project page) 
- Find Any Part in 3D (project page) 
- Stylecodes: Encoding Stylistic Information For Image Generation (project page) 
- Oscillation inversion: Understanding the structure of large flow models through the lens of inversion methods (project page) 
- StableV2V: Stablizing Shape Consistency in Video-to-Video Editing (project page) 
- 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes (project page) 
- Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels (arXiv) 
- OminiControl: Minimal and Universal Control for Diffusion Transformer (arXiv) 

