What You Missed in AI This Week (Google, Apple, ChatGPT)

a16z Podcast

Summary

This episode features a16z partners Justine and Olivia Moore discussing rapid advances in consumer AI. They cover breakthroughs in AI video and voice models, analyze how AI is changing startup revenue models, and demonstrate how AI tools can build a complete brand, from ideation to product imagery.

Key Points

  • Google's Veo 3 model marks a "ChatGPT moment" for AI video by generating video with native audio from text prompts, opening new possibilities for AI storytelling despite current limitations in clip length and character consistency.
  • OpenAI recently upgraded ChatGPT's advanced voice mode to sound more human, with natural conversational elements. The upgrade follows a period in which it lagged behind competitors, possibly because of caution around human-like AI companions.
  • Apple's AI strategy, branded "Apple Intelligence," has been perceived as underwhelming, largely relying on outsourced integrations with ChatGPT and focusing on features like Genmoji and call translation, suggesting a cautious approach to fully integrated AI personal assistants.
  • ElevenLabs' Eleven v3 model enhances AI voice generation by enabling natural emotional inflections, accents, and sound effects through text tagging, significantly improving realistic multi-character conversational storytelling.
  • Consumer AI startups are experiencing unprecedented revenue growth, with median annualized revenue run rates (ARR) of $4.2 million in their first year, exceeding traditional B2B benchmarks due to direct consumer subscriptions and users' willingness to pay higher prices for powerful AI-native products.
  • AI tools are democratizing brand creation, allowing users to generate comprehensive brand assets from logos and product photos to potential storefronts and marketing materials with consistent visual branding, significantly reducing the technical skills required for entrepreneurship.

Conclusion

The rapid pace of innovation in AI video and voice generation is opening up vast new avenues for creative content and narrative storytelling, requiring creators to constantly adapt.

AI is fundamentally shifting consumer startup economics, enabling faster monetization through direct subscriptions and higher average revenue per user, even allowing for enterprise-like revenue expansion.

The accessibility of advanced AI tools empowers individuals to conceptualize and execute full-fledged brands and businesses with unprecedented ease, lowering barriers to entry for entrepreneurs.

Discussion Topics

  • How do you envision the future of AI-generated content on social media, especially with advancements in realistic AI video and voice?
  • What are the ethical considerations and potential societal impacts of increasingly human-like AI companions and AI-generated identities?
  • Given the rapid revenue growth of consumer AI startups, what new business models or industries do you predict will emerge or be disrupted next?

Key Terms

Veo 3
Google DeepMind's latest video model, which generates video and audio natively from text prompts.
API
Application Programming Interface; a set of definitions and protocols for building and integrating application software.
ARR
Annualized Revenue Run Rate; a projection of a company's yearly revenue based on its current monthly or quarterly recurring revenue.
Inference Cost
The computational cost associated with running an AI model to generate an output (e.g., text, image, audio) based on an input.
Genmoji
AI-generated emojis.
Text-to-speech
Technology that converts written text into spoken audio.
Tags (ElevenLabs)
Specific textual commands used to prompt AI voice models to generate speech with particular emotions, inflections, or sound effects.
Ideogram
An AI image generation and editing platform particularly strong in generating logos and typography.
Krea (FLUX.1 Kontext)
Krea is an AI creative platform. FLUX.1 Kontext, an image editing model from Black Forest Labs that is available through it, allows users to modify images using natural language prompts while maintaining object and character consistency.
Full-stack AI brands
Brands where all aspects, from logo and product design to marketing, advertising, and even social media influencers, are generated or assisted by AI.
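
As a rough illustration of the ARR entry above, the projection amounts to multiplying the most recent period's recurring revenue by the number of such periods in a year. The sketch below assumes that simple definition; the function name and the dollar figures are hypothetical and are not taken from the episode.

```python
def annualized_run_rate(period_revenue: float, periods_per_year: int) -> float:
    """Project yearly revenue by assuming the latest period repeats all year."""
    if periods_per_year <= 0:
        raise ValueError("periods_per_year must be positive")
    return period_revenue * periods_per_year


# Hypothetical inputs, chosen only to illustrate the arithmetic.
from_monthly = annualized_run_rate(350_000, 12)     # one month of recurring revenue
from_quarterly = annualized_run_rate(1_050_000, 4)  # one quarter of recurring revenue
print(from_monthly, from_quarterly)
```

Because a run rate extrapolates from a single period, a fast-growing startup's figure can differ substantially from the revenue it actually collected over the preceding twelve months.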

Timeline

00:01:08

Google's Veo 3 model marks a "ChatGPT moment" for AI video by generating video with native audio from text prompts, opening new possibilities for AI storytelling despite current limitations in clip length and character consistency.

00:03:33

OpenAI recently upgraded ChatGPT's advanced voice mode to sound more human, with natural conversational elements. The upgrade follows a period in which it lagged behind competitors, possibly because of caution around human-like AI companions.

00:05:23

Apple's AI strategy, branded "Apple Intelligence," has been perceived as underwhelming, largely relying on outsourced integrations with ChatGPT and focusing on features like Genmoji and call translation, suggesting a cautious approach to fully integrated AI personal assistants.

00:06:14

ElevenLabs' Eleven v3 model enhances AI voice generation by enabling natural emotional inflections, accents, and sound effects through text tagging, significantly improving realistic multi-character conversational storytelling.

00:07:40

Consumer AI startups are experiencing unprecedented revenue growth, with median annualized revenue run rates (ARR) of $4.2 million in their first year, exceeding traditional B2B benchmarks due to direct consumer subscriptions and users' willingness to pay higher prices for powerful AI-native products.

00:11:27

AI tools are democratizing brand creation, allowing users to generate comprehensive brand assets from logos and product photos to potential storefronts and marketing materials with consistent visual branding, significantly reducing the technical skills required for entrepreneurship.

Episode Details

Podcast
a16z Podcast
Episode
What You Missed in AI This Week (Google, Apple, ChatGPT)
Published
June 13, 2025