Back to The Twenty Minute VC (20VC)

20VC: Open Models vs Frontier Models: Who Actually Wins? | The...

The Twenty Minute VC (20VC)

Full Title

20VC: Open Models vs Frontier Models: Who Actually Wins? | The $100,000 Token Budget Every Engineer Will Need | Why Forward-Deployed Engineers Are the Future of Enterprise AI with Clay Bavor, Co-Founder of Sierra

Summary

This episode features Clay Bavor, co-founder of Sierra, discussing the future of AI models, the economics of AI, and the operational strategies of a rapidly growing enterprise AI company.

Key topics include the interplay between open-source and frontier AI models, the cost of AI compute, the evolution of enterprise AI adoption, and the importance of company culture and leadership.

Key Points

  • Clay Bavor left Google after 18 years to co-found Sierra, driven by the opportunity in language models and agents during a period of technological upheaval.
  • Sierra deliberately chose not to train its own foundational models, opting instead to fine-tune existing open-weight models to manage capital expenditure while still building proprietary AI architectures.
  • The demand for frontier-level intelligence is considered unbounded, particularly in complex domains like science and law, and will likely continue to drive the development of frontier models despite the increasing capability of open-weight models.
  • Token economics are complex, influenced by hardware advancements, the migration of workloads to open-weight models, and the fundamental constraint of GPU availability, which acts as a price floor.
  • The future of enterprise AI will involve a mix of open and frontier models, with companies adopting a strategy of using the most cost-effective and capable model for specific tasks.
  • Locally running AI models on personal devices is seen as beneficial for some consumer applications but unlikely to alleviate the significant compute demands of large-scale enterprise AI training and inference.
  • The US open-weights AI ecosystem may be lagging behind due to a perceived reluctance of major US labs to compete with their own frontier models by releasing highly capable open-weights alternatives, unlike Chinese companies which appear more focused on distilling and offering advanced models.
  • Sierra's success in serving large enterprises is attributed to a "forward-deployed engineer" (FDE) model, inspired by Palantir, where engineers work closely with clients to ensure successful AI implementation and adoption, though it's not strictly binary for sales.
  • The future of teams will likely be leaner and more high-leverage due to AI, but serving large, complex enterprises like those Sierra works with still requires significant human expertise in understanding customer needs, integrating systems, and building trust.
  • Sierra utilizes internal AI agents, like "Pinecone," to significantly boost employee productivity across various functions, from engineering to hiring, streamlining operations and enhancing decision-making.
  • The company culture at Sierra is built on three core values: craftsmanship, intensity, and family, emphasizing excellence in execution, a high-impact work ethic, and a balance that respects personal life.
  • The hiring process has been significantly adapted to embrace AI, with interviews now often including tasks where candidates use AI tools to build applications, reflecting an AI-native approach to talent acquisition.
  • The increasing use of AI in code generation is seen as a potential boon for cybersecurity, as AI can also be leveraged to identify and mitigate vulnerabilities, though it also presents new challenges.
  • Founders Clay Bavor and Brett Adcock have a "major/minor" division of responsibilities, with Bavor focusing on company operations and Adcock on sales and engineering, both deeply trusting each other's expertise.
  • The company's board meetings are held every six weeks rather than quarterly to adapt to the rapid pace of AI development, and they utilize detailed board memos instead of decks to foster deeper engagement.
  • The perceived "unbounded demand" for AI solutions is a unique market condition, driving rapid adoption and making this a crucial time for companies in the AI space.
  • The high token spend for top engineers ($100,000+ annually) is seen as an indicator of AI adoption and productivity gains, with potential for this to become a normalized part of engineering budgets.
  • A key piece of advice for young people entering the workforce is to master AI tools, as this provides an "unfair advantage" and high value in the current job market.
  • The importance of in-person work for building culture, camaraderie, and facilitating apprenticeship and mentorship in a young company is strongly emphasized.
  • The founders' values of "craftsmanship, intensity, and family" are seen as critical for building an enduring and excellent company.

Conclusion

The rapid advancement of AI is creating unbounded demand for intelligence, influencing both open and frontier models and fundamentally changing how companies operate and build products.

Companies that embrace AI, invest deeply in their technology, and foster strong cultures of craftsmanship, intensity, and family are best positioned to navigate and succeed in this transformative era.

The future of work will be heavily shaped by AI, necessitating a focus on adaptability, continuous learning, and leveraging AI tools to enhance human capabilities rather than replace them.

Discussion Topics

  • How do companies balance the cost of developing or utilizing frontier AI models versus leveraging more accessible open-weight models for specific business needs?
  • What are the ethical considerations and potential long-term impacts of highly capable AI agents performing complex tasks traditionally handled by humans?
  • As AI continues to automate and augment human capabilities, what skills and mindsets will be most critical for individuals to cultivate to thrive in the future workforce?

Key Terms

Frontier Models
Highly advanced, state-of-the-art AI models that represent the cutting edge of performance and capability, often developed by large research labs.
Open-Weight Models
AI models whose parameters (weights) are publicly released, allowing for broader access, fine-tuning, and use by the community.
Scaled Distillation
A process where a smaller, more efficient model is trained to mimic the behavior of a larger, more complex "frontier" model, making advanced AI capabilities more accessible.
Token Economics
The economic principles governing the use and cost of "tokens" (pieces of text processed by AI models) in AI language models, influencing pricing and usage patterns.
Compute
The processing power required to train and run AI models, typically involving significant hardware resources like GPUs.
FDE (Forward-Deployed Engineer)
A model where engineers are embedded within client organizations to directly implement and support technology solutions.
ARR (Annual Recurring Revenue)
A metric used to track the predictable revenue a company expects to receive from its customers over a year.
MCP Gateway
A system that aggregates main company systems and services, allowing agents to access them.
Agent Architecture
The underlying design and framework for building and operating AI agents that can perform tasks and interact with systems.
OMOTE NASHI
A Japanese cultural concept of selfless hospitality, implying deep care and attention to customer experience.
AI-Pilled
A term used to describe individuals who have become highly enthusiastic and knowledgeable about AI.

Timeline

00:00:03

Unbounded demand for frontier intelligence is not yet appreciated.

00:06:07

Bavor explains his 18-year tenure at Google and the reasons for finally leaving to co-found Sierra.

00:08:03

Bavor details key takeaways from Google, including a willingness to invest deeply in technology stacks.

00:09:11

Discussion on the decision not to train foundational models from scratch due to capital expense.

00:10:37

The future of open versus fine-tuned models and whether it diminishes the need for frontier models.

00:13:32

The evolution of token economics with the shift from chat to agent-based AI.

00:15:50

The possibility and impact of running models locally on devices.

00:17:30

Concerns about the US open-weights AI ecosystem compared to China's scaled distillation approach.

00:18:27

The structure of Sierra's team and how it differs for enterprise clients compared to leaner startups.

00:19:17

The general trend towards smaller, higher-leverage teams due to AI productivity gains.

00:20:48

Details on Sierra's internal agent, Pinecone, and its impact on company operations.

00:23:03

Explanation of "Sierra Brain," an internal system for strategic reasoning and decision-making.

00:24:14

Approach to managing and budgeting token spend for engineers.

00:25:35

The shift in developer bottlenecks from writing code to reviewing code, and future implications.

00:26:01

Discussion on the percentage of developer salaries likely to be spent on tokens in the future.

00:27:20

Maintaining product focus and customer closeness while serving large enterprises.

00:29:26

Market maturation and competitive landscape of enterprise AI, comparing it to Uber/Lyft versus AWS/GCP/Azure models.

00:30:38

The necessity of a forward-deployed engineer (FDE) motion for enterprise AI sales and implementation.

00:33:37

The unique market condition of high buyer pull for AI solutions.

00:34:33

Sierra's evolution towards sales and conversion platforms, exemplified by customer use cases.

00:35:40

Comparison of Sierra to Palantir and the strategy for scaling beyond a niche customer base.

00:36:44

The potential for building customer-specific product features versus a unified platform approach.

00:38:12

Strategies for running effective board meetings with a six-week cadence and board memos.

00:40:09

Insights into writing about company weaknesses and memorable lessons learned.

00:41:07

The process of setting company valuation and funding rounds.

00:42:05

Explanation of Sierra's core values: craftsmanship, intensity, and family.

00:45:24

Methods for maintaining intensity and drive within a growing company.

00:46:52

The role of ambitious goals in shaping company trajectory.

00:50:07

The strong opinion on the importance of in-person work for company culture.

00:51:35

Advice for young people on learning and compounding skills.

00:52:40

Guidance for recent graduates entering the AI-driven job market.

00:54:01

Changes to Sierra's hiring process to accommodate an AI-native workforce.

00:54:47

The impact of AI on cybersecurity and the potential for AI-generated code vulnerabilities.

00:55:35

The dynamic between Bavor and co-founder Brett Adcock, and their disagreement resolution process.

00:56:43

The "major/minor" division of responsibilities between Bavor and Adcock.

00:57:56

Lessons learned from working with Sundar Pichai and observing his dynamic range.

00:58:44

Underestimated aspects of Google's culture and its ability to solve complex problems.

00:59:38

Recommended reading, "The Wright Brothers" by David McCullough.

01:00:34

Advice for parenting and raising children.

01:01:47

The importance of shared interests and enabling partners in a marriage.

01:03:31

The kindest thing done by parents for Bavor.

01:04:54

Reflections on Bavor's personal qualities and the impact of the conversation.

Episode Details

Podcast
The Twenty Minute VC (20VC)
Episode
20VC: Open Models vs Frontier Models: Who Actually Wins? | The $100,000 Token Budget Every Engineer Will Need | Why Forward-Deployed Engineers Are the Future of Enterprise AI with Clay Bavor, Co-Founder of Sierra
Published
July 4, 2026