Today in Generative Media
Promise, a genAI studio; Voice clones in Microsoft Teams; AI publisher Spines
News
Microsoft will soon let you clone your voice for Teams meetings (TechCrunch)
Itching to write a book? AI publisher Spines wants to make a deal (TechCrunch)
The US Patent and Trademark Office Banned Staff From Using Generative AI (Wired)
Coca-Cola causes controversy with AI-made ad (NBC News)
Software and Data
AnyChat brings together ChatGPT, Google Gemini, and more for ultimate AI flexibility (VentureBeat)
Perplexity’s AI search engine can now buy products for you (The Verge)
Jan is an open source ChatGPT-alternative that runs 100% offline. (jan.ai)
Kolors Virtual Try-On in the Wild (HuggingFace)
Text to Math Animations powered by manim engine + gpt-4o (VisualMath AI)
MeshGPT: Generating Triangle Meshes with Decoder-Only Transformers (GitHub)
Research
GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views (project page)
LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models (project page)
StdGEN: Semantic-Decomposed 3D Character Generation from Single Images (project page)
SmoothCache: A Universal Inference Acceleration Technique for Diffusion Transformers (arXiv)
Misc
Generative AI in media and entertainment (FX Guide) In this new Field Guide to Generative AI, fxguide’s Mike Seymour was commissioned by NVIDIA to unpack the impact of generative AI on the media and entertainment industries, offering practical applications, ethical considerations, and a roadmap for the future.
Justin Tranter x Music AI | Google Lab Sessions | Full Film (YouTube)