Back to a16z Podcast

Steven Sinofsky on Apple at 50, Microsoft, and the Future of...

a16z Podcast

Full Title

Steven Sinofsky on Apple at 50, Microsoft, and the Future of Computing

Summary

The podcast discusses the recent NVIDIA RTX Spark Super Chip announcement at Computex, focusing on its potential to shift AI compute to local devices and away from token-based cloud services. Steven Sinofsky shares insights on the evolution of PC hardware and software, drawing parallels to historical shifts in computing and offering predictions for the future.

Key Points

  • NVIDIA's RTX Spark Super Chip, an ARM CPU integrated with NVIDIA's parallel processing, signifies a move towards AI-native PCs by enabling local compute and reducing reliance on costly cloud tokens, mirroring past shifts where resource constraints moved to personal devices.
  • The shift in AI compute burden from CPUs to GPUs and TPUs is a continuation of trends seen with graphics rendering, making local AI processing on devices the next significant evolution in personal computing.
  • The cost of cloud-based AI, specifically the "dollars per token" model, is unsustainable for widespread personal AI use, driving the need for local, free-to-use compute on devices.
  • Past component shortages have historically resolved themselves, suggesting that current memory or chip shortages will similarly correct over time.
  • The PC ecosystem faces a challenge in offering a consistent, high-quality user experience, unlike Macs and phones which have become more integrated and reliable, leading to a potential shift towards more unified hardware-software solutions.
  • Microsoft's embrace of ARM and NVIDIA's CUDA stack on future PCs, while offering backward compatibility, risks perpetuating old PC issues like fan noise, registry editing, and instability, contrary to the user experience on Macs and phones.
  • The true value of future AI-native PCs will lie in their ability to run agents and not solely in backward compatibility, highlighting a need for forward-looking design in the PC industry.
  • Surface, originally conceived as a platform discontinuity towards ARM and mobile factors, pivoted to Intel-based "objection handlers" but AI presents a new opportunity for Microsoft to redefine the PC for a forward-looking, mobile-centric era.

Conclusion

The future of personal computing is moving towards AI-native devices where compute is localized, driven by the high cost of cloud-based AI.

Companies like Microsoft and NVIDIA are poised to redefine the PC by embracing ARM and integrated AI capabilities, but must avoid repeating past mistakes of prioritizing backward compatibility over a modern, forward-looking user experience.

The industry needs to focus on creating devices that are not only powerful but also reliable, efficient, and user-friendly, mirroring the strengths of existing mobile ecosystems.

Discussion Topics

  • How will the shift to local AI processing impact the current cloud computing landscape and the business models of major tech companies?
  • Given the historical challenges in the PC market for consistent hardware and user experience, what strategies will truly differentiate future AI-native PCs from current offerings?
  • Beyond backward compatibility, what core user needs and desires should drive the design of next-generation personal computing devices to ensure their adoption and success?

Key Terms

ARM
A family of reduced instruction set computing (RISC) architectures for computer processors. Often used in mobile devices due to its power efficiency.
GPU
Graphics Processing Unit. A specialized electronic circuit designed to rapidly manipulate and alter memory to accelerate the creation of images intended for output to a display device.
TPU
Tensor Processing Unit. A specialized hardware accelerator for machine learning developed by Google.
CUDA
A parallel computing platform and programming model created by NVIDIA. It allows software developers to use a CUDA-enabled graphics processing unit (GPU) for general purpose processing.
Win32
An API for Microsoft Windows. It is the core programmatic interface for the Windows operating system.
DirectX
A collection of application programming interfaces (APIs) for handling tasks related to multimedia, especially game programming and video, on Microsoft platforms.
OpenGL
A cross-language, cross-platform application programming interface (API) for rendering 2D and 3D vector graphics.

Timeline

00:00:13

NVIDIA's RTX Spark Super Chip represents a shift towards AI-native PCs, moving compute to local devices and away from cloud token costs, reminiscent of past trends where resource constraints led to on-device solutions.

00:06:42

The compute burden for AI is shifting from CPUs to GPUs and TPUs, an evolution of the earlier GPU shift for rendering, making local AI processing on devices a key future development.

00:07:40

The current model of paying "dollars per token" for cloud AI services is a significant limitation, creating a strong incentive for AI compute to move to local devices for cost-free usage.

00:15:30

Historical component shortages, such as DRAM or hard drives, have consistently resolved themselves over time, suggesting that current memory or chip issues will also be overcome.

00:17:42

The PC ecosystem struggles with hardware consistency compared to Macs, which offer a more predictable and integrated experience, suggesting a need for a more unified approach to future PC development.

00:25:33

Microsoft's support for NVIDIA chips on ARM PCs, while offering backward compatibility with existing Windows programs, risks carrying over old PC issues like fans and registry editing, potentially undermining the user experience.

(00:480) AI presents a new opportunity for Microsoft to reorient the PC industry towards forward-looking technology and mobile-centric usage, moving away from the backward-looking compatibility focus.

(00:453) Steven Sinofsky reflects on the original vision for Surface as a platform shift to ARM and mobile form factors, noting that AI now offers a similar opportunity to redefine the PC for a modern era.

Episode Details

Podcast
a16z Podcast
Episode
Steven Sinofsky on Apple at 50, Microsoft, and the Future of Computing
Published
June 2, 2026