Today in Generative Media
OpenAI's o1 improves logic; NotebookLM adds audio; AI leaders at the White House
News and Software
NotebookLM now lets you listen to a conversation about your sources (Google Keyword Blog)
Tech, US officials discuss AI development, power needs at White House (Reuters)
US sets reporting requirements for AI models, infrastructure operators (The Register)
Why comparing AI image editing to Photoshop downplays the risks (The Verge)
Gemini Live is rolling out to all Android users - for free. Here's how to access it (ZDNet)
Introducing Covers (Suno) Covers let you reimagine your songs by keeping the melody and adapting the track to a different style.
Introducing HeyGen Avatar 3.0 (X) 🧠 Dynamic Script Understanding: Our avatars now grasp the nuances of your words. 😀 Spot-On Facial Expressions: Emotions that match your message, beat for beat. 🗣️ Precise Voice Inflections: Every word delivered with the perfect tone. 🎵 Singing Capabilities: From soulful ballads to spittin' rhymes, our avatars do it all!
Announcing the AI Character Shape Generator by @yellow_3d_ (X) This handy Daz Studio plug-in allows you to generate character mesh shapes from simple text prompts, transforming the character creation process.
The new open-source Text to Speech model: Fish Speech 1.4 is brilliant! (X)
-Instant voice cloning 🗣️- Ultra-low latency ⚡
- Compact model (~1GB weights) 🏋️♂️