This weekend in Generative Media
Studios adopt genAI slowly; AI finds more Nazca lines; Report from AI film screening
News and Opinion
Hundreds More Nazca Lines Emerge in Peru’s Desert (New York Times) With drones and A.I., researchers managed to double the number of mysterious geoglyphs in a matter of months.
One big room full of hopes and schemes: An anecdotal report of a major AI film screening and the reception that followed. (Mike Gioia on Substack)
Labelers training AI say they're overworked, underpaid and exploited by big American tech companies (CBS News)
Moonvalley wants to build more ethical video models (TechCrunch)
Want to speak Italian? Microsoft AI can make it sound like you do. (Washington Post)
ChatGPT’s Poetry is Incompetent and Banal (Ernest Davis, NYU) [PDF]
Software
Introducing Frames: An image generation model offering unprecedented stylistic control (RunwayML)
aiOla unveils open source AI audio transcription model that obscures sensitive info in realtime (VentureBeat)
ComfyUI-LTXTricks A set of nodes that provide additional controls for the LTX Video model (GitHub)
ComfyUI_AdvancedReduxControl (GitHub)
Research
🎞️ MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views (NeurIPS 2024 project page)
ARM: Appearance Reconstruction Model for Relightable 3D Generation (project page)
Direct and Explicit 3D Generation from a Single Image (project page)
Find Any Part in 3D (project page)
Stylecodes: Encoding Stylistic Information For Image Generation (project page)
Oscillation inversion: Understanding the structure of large flow models through the lens of inversion methods (project page)
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing (project page)
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes (project page)
Beyond Gaussians: Fast and High-Fidelity 3D Splatting with Linear Kernels (arXiv)
OminiControl: Minimal and Universal Control for Diffusion Transformer (arXiv)