Today in Generative Media

Luma's AI video model; Real photo wins AI contest; Google's AI alphabet generator

Jun 13, 2024

News

‘We don’t need Sora anymore’: Luma’s new AI video generator Dream Machine slammed with traffic after debut (VentureBeat)
Luma launches Dream Machine (Luma Labs) Some fun examples:
Photographer Disqualified From AI Image Contest After Winning With Real Photo (PetaPixel)
I tried Google's new AI alphabet generator, and it's way more fun than it sounds (ZDNet)
Adobe employees slam the company over AI controversy: 'Let's avoid becoming like IBM' (Business Insider)
Forget DALL-E: Apple's new AI image generator runs on-device and works like magic (ZDNet)
AI speech-to-text can hallucinate violent language (TechXplore)

Software

Research

Image Neural Field Diffusion Models (CVPR 2024, project page)
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language (CVPR 2024, project page)
Neural Gaffer: Relighting Any Object via Diffusion (project page)
ID-to-3D: Expressive ID-guided 3D Heads via Score Distillation Sampling (project page)
An Image is Worth 32 Tokens for Reconstruction and Generation (project page)
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model (project page)
MultiEdits: Simultaneous Multi-Aspect Editing with Text-to-Image Diffusion Models (project page)
MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model (project page)
4Real Towards Photorealistic 4D Scene Generation via Video Diffusion Models (project page)
AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising (project page)
MimicBrush: Zero-shot Image Editing with Reference Imitation (project page)
Human 3Diffusion: Realistic Avatar Creation via Explicit 3D Consistent Diffusion Models (project page)
What If We Recaption Billions of Web Images with LLaMA-3 ? (project page)
Cinematic Gaussians: Real-Time HDR Radiance Fields with Depth of Field (arXiv)

Discussion about this post

No posts

#nojs-banner { position: fixed; bottom: 0; left: 0; padding: 16px 16px 16px 32px; width: 100%; box-sizing: border-box; background: red; color: white; font-family: -apple-system, "Segoe UI", Roboto, Helvetica, Arial, sans-serif, "Apple Color Emoji", "Segoe UI Emoji", "Segoe UI Symbol"; font-size: 13px; line-height: 13px; } #nojs-banner a { color: inherit; text-decoration: underline; } This site requires JavaScript to run correctly. Please turn on JavaScript or unblock scripts