Today in Generative Media
Lionsgate & RunwayML; SocialAI: Twitter without humans; YouTube gets genAI videos
News
Runway Partners with Lionsgate (RunwayML)
SocialAI: we tried the Twitter clone where no other humans are allowed (The Verge)
I Stared Into the AI Void With the SocialAI App (Wired)
YouTube will use AI to generate ideas, titles, and even full videos (The Verge)
How we’re increasing transparency for gen AI content with the C2PA (Google Keyword blog)
A bottle of water per email: the hidden environmental costs of using AI chatbots (Washington Post)
Google just stepped up its AI fight with Apple (Business Insider)
AI Adds Human Cries and Creepy Laughs to People’s Songs Without Warning (Vice)
OpenAI threatening to ban users for asking Strawberry about its reasoning (Futurism)
Software
Real Time Face Swap (FaceCam)
Pixtral 12B - the first-ever multimodal Mistral model. Apache 2.0 (Mistral)
UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks (HuggingFace)
CogVideoX-5B Huggingface Space🤗 (HuggingFace)
Research
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction (ECCV 2024, project page)
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion (project page)
Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think (project page)
NVLM: Open Frontier-Class Multimodal LLMs (project page)
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion (project page)
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer (project page)
Learning Source Disentanglement in Neural Audio Codec (project page)
OSV: One Step is Enough for High-Quality Image to Video Generation (arXiv)