🎉 Google Veo 2 VideoGen Stuns, ChatGPT Feature Launches, Multi-Agent Story Generation, Pika Labs 2.0, AI Customer Support for All
Google Launches Its VideoGen Tool Veo 2 and Blows Away the Competition, ChatGPT Continues to Expand, New AI Agents Take on Storytelling, Pika Labs Launches Updated VideoGen Tool 2.0
Welcome to this week’s edition of AImpulse, a four-point summary of the most significant advancements in the world of Artificial Intelligence, followed by a cool new AI tool I’m trying out this week.
Here’s the pulse on this week’s top stories:
What’s Happening: Google just announced the release of Veo 2, a state-of-the-art video generation model that creates high-resolution outputs with stunning realism and detail — along with Imagen 3, an upgraded image model also offering state-of-the-art quality.
Veo 2:
Veo 2 can generate 8-second clips at up to 4K resolution (720p at launch), and its cinematic controls have been significantly upgraded.
The model also shows massive improvements in physics simulation and reduced hallucinations, leading to more realistic movement and detail.
In head-to-head human evaluations, Veo 2 outperformed the competing models it was tested against, including OpenAI’s recently released Sora, on both overall preference and prompt adherence.
The model is rolling out gradually through the VideoFX waitlist, with YouTube Shorts integration planned for 2025.
Imagen 3:
The upgraded model delivers enhanced color vibrancy and composition across artistic styles, with better handling of fine details, textures, and text rendering.
New capabilities include more accurate prompt interpretation and better rendering of complex scenes that match user intentions.
In human evaluations of preference, visual quality, and prompt adherence, Imagen 3 outperformed the models it was compared against, including Midjourney, Flux, and Ideogram.
The model is now available through Google Labs’ ImageFX and is rolling out to over 100 countries.
Why it matters: Google is having an absolutely massive end to 2024, first with Gemini 2.0 and now with Veo 2 and Imagen 3. These models appear to raise the bar in both categories, giving Google state-of-the-art performance across nearly every area of AI. OpenAI may have the hype this holiday season, but Google is showing the results.
What’s Happening: OpenAI just announced a major expansion of its ChatGPT Search feature on Day 8 of the company’s livestream event, making it freely available to all users alongside added voice search capabilities and improved mobile features.
The details:
The previously premium search feature is now open to all logged-in users, with faster responses, and is accessible through a globe icon on the platform.
Search has also been added to Advanced Voice Mode for premium users, allowing them to conduct searches through natural spoken prompts.
The Search mobile experience has been revamped, with enhanced visual layouts for local businesses and native integration with Google and Apple Maps.
Users can also set ChatGPT Search as a default search engine, with results displaying relevant links before ChatGPT text responses for faster access.
OpenAI also teased a ‘mini Dev Day’ for tomorrow.
Why it matters: ChatGPT’s ability to access the web and up-to-date information is an important step towards an agentic future, particularly within Advanced Voice Mode — turning the tool into a much more intelligent and capable version of Siri (and maybe powering it eventually). Search is about to change in a big way in the AI era.
What’s Happening: AI startup Higgsfield just introduced ReelMagic, a multi-agent platform that transforms story concepts into complete 10-minute videos, claiming to streamline the entire production process into a single workflow.
The details:
The tool uses specialized AI agents for production roles like scriptwriting and editing, creating cohesive long-form outputs in under 10 minutes.
ReelMagic starts with a short synopsis, and then AI agents handle script refinement, virtual actor casting, filming, sound/music, and editing.
ReelMagic's smart reasoning engine automatically selects optimal AI models for each shot, and it has partnerships with Kling, Minimax, ElevenLabs, and more.
The platform is already being tested by leading Hollywood studios, and Higgsfield is also planning to launch Hera, an AI video streaming platform.
Access is available to Project Odyssey participants via a waitlist, with no info on a broader release.
Why it matters: There has been a gap between what AI video generators produce and cohesive, longer-form content, with heavy manual editing needed to bridge it. While not yet publicly available, ReelMagic looks to be a workflow that combines the strengths of multiple AI tools to unlock broader storytelling capabilities.
What’s Happening: Pika Labs just released version 2.0 of its AI video generator, introducing a new ‘Scene Ingredients’ tool that lets users incorporate their own images into AI-generated videos, alongside improved motion, prompting, and animation features.
The details:
A new 'Scene Ingredients' system allows users to upload and mix characters, objects, and backgrounds that the AI automatically recognizes and animates.
Pika’s updated model shows impressive realism, smooth movement, and prompt/image adherence, giving users more control over outputs.
The new video generator also features a significant update to text alignment, showcasing the ability to craft realistic branded scenes and advertising content.
Pika has already attracted over 11M users and secured $80M in funding, and the new version follows its viral ‘effects’ launch in October.
Why it matters: Pika’s new upgrades are wild, continuing to move video outputs out of the ‘slot machine’ luck phase into a more customizable, personalized experience. While we patiently waited for Sora, the AI video scene leveled up in a major way — with Pika, Luma, Runway, Kling, Hailuo, and others dulling the impact of OpenAI’s latest release.
Cool New Tool: ElevenLabs’ new Conversational AI Agents let you embed an AI-powered voice agent on your website that can interact naturally with visitors. If you don’t already have an e-comm side hustle, maybe it’s time you jumped in.
Step-by-step:
Create an ElevenLabs account and navigate to the Agents section.
Configure your AI agent's personality and initial message.
Choose or create a custom voice for natural interactions.
Customize the widget's appearance and embed it on your site.
Pro tip: Use the "Test AI agent" button to test your agent thoroughly before deploying. This helps ensure responses align with your expectations and brand voice.