Today in Generative Media
3-Mile Island for Microsoft AI; Creative process for AI film; UAE invests $1T in US AI
News
Microsoft deal would reopen Three Mile Island nuclear plant to power AI (Washington Post)
AI filmmaking is "making the creative process so much smoother," says the director of the new Jordan Rudess music video (Creative Bloq)
UAE hoping to expand $1 trillion partnership with U.S. through AI, Investment (cnbc.com)
Software
Upscale with Universal Upscaler (Leonardo AI)
Moshi: a speech-text foundation model for real time dialogue (GitHub) fully voice to voice ai running locally on my mac (X)
This is a custom node for ComfyUI that allows you to use the Luma AI API directly in ComfyUI (GitHub)
KoolCogVideoX is fine-tuned on CogVideoX specifically for interior design scenarios (HuggingFace)
Research
LVCD: Reference-based Lineart Video Colorization with Diffusion Models (SIGGRAPH Asia 2024, project page)
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt (project page)
FlexiTex: Enhancing Texture Generation via Visual Guidance (project page)
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities (project page)
SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation (arXiv)
StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation (arXiv)