TWiT 1065: AI Action Park - DeepSeek's mHC Model Training Breakthrough! - This Week in Tech (Audio) | Ryan Randels | Mobile & SaaS Strategy | Agentic AI

Full Title

TWiT 1065: AI Action Park - DeepSeek's mHC Model Training Breakthrough!

Summary

The episode discusses the rapid advancements and evolving landscape of AI in 2025 and anticipation for 2026, highlighting the productization of AI by companies like Anthropic and Google, the ongoing debate about AI's impact on programming professions, and the cybersecurity implications of AI.

The hosts also touch upon regulatory challenges and consumer-facing AI developments, as well as the historical parallels of technological disruption.

Key Points

The AI industry is seeing a significant shift towards productization, with companies like Anthropic and Google focusing on business applications, while consumer AI products from Google and OpenAI are showing rapid improvement and adoption.
There is a growing sentiment among AI practitioners, like Andrej Karpathy, that the pace of AI development is overwhelming, leading to a feeling of being left behind, which mirrors past technological shifts like the advent of the internet.
Concerns about the environmental impact of AI, specifically water consumption, are being re-evaluated, with evidence suggesting these concerns may be overblown compared to other industries like golf courses.
The practice of "hackquisitions," where companies acquire talent by buying other firms without necessarily acquiring the entire company, is a growing trend, exemplified by NVIDIA's deal with Grok.
OpenAI is actively seeking a Head of Preparedness to manage the risks associated with advanced AI, indicating a focus on safety and mitigation alongside development.
The increasing capability and accessibility of AI tools are transforming workflows and productivity, drawing parallels to the impact of early software like Photoshop.
Regulatory efforts are underway to address AI-related issues, but also pose potential threats to internet freedoms, such as age verification requirements and attempts to undermine Section 230.
The cybersecurity risks associated with AI are significant, ranging from AI-powered phishing attacks to data leakage through AI applications, necessitating robust security solutions like zero trust architectures.
The semiconductor industry faces supply chain challenges and geopolitical tensions impacting AI development, particularly concerning China's access to advanced chip-making technology.
Waymo's autonomous vehicles have encountered operational issues during power outages, highlighting the dependence of AI on infrastructure and the need for robust fallback mechanisms.
The evolution of payment systems for public transportation, like New York's MetroCard phasing out in favor of tap-to-pay, reflects broader technological shifts towards digital and contactless solutions.
The passing of Stuart Sheffey, a pioneer in computer television shows, is remembered, underscoring the historical roots of tech communication and the evolution of the industry.

Conclusion

The AI landscape is rapidly evolving, with significant progress in productization and integration, but also facing challenges in accessibility, ethics, and societal impact.

Historical parallels of technological disruption provide context for understanding the current AI revolution, suggesting that while overwhelming, these changes can also lead to unprecedented opportunities.

The importance of balancing technological advancement with user safety, ethical considerations, and robust cybersecurity remains paramount as AI becomes more pervasive.

Discussion Topics

How are AI advancements shaping the future of work for programmers and creative professionals, and what strategies are essential for staying relevant?
What are the most significant ethical and societal challenges posed by the rapid development of AI, and how can they be addressed effectively?
With the increasing complexity and interconnectedness of AI, what are the most critical cybersecurity measures individuals and organizations need to adopt to protect themselves from emerging threats?

Key Terms

TPU: Tensor Processing Unit, a custom ASIC developed by Google for neural network machine intelligence, specifically for machine learning.
Vibe Coding: A term coined by Andrej Karpathy referring to writing code in a more intuitive and fluid way, often facilitated by AI coding assistants.
Generative AI: A type of artificial intelligence capable of generating new content, such as text, images, music, or code, often in response to prompts.
LLM: Large Language Model, a type of AI model trained on massive amounts of text data, capable of understanding and generating human-like text.
Hallucination (AI): When an AI model generates false or misleading information presented as factual.
Hackquisition: A term suggesting the acquisition of a company primarily to hire its talent and intellectual property, rather than the company as a whole.
Manifold Constrained Hyperconnections (mHC): A new AI training technique proposed by DeepSeek aimed at improving training stability, scale, and efficiency.
Burkhoff Polytope: A mathematical concept related to matrices and stochastic processes, used in the description of DeepSeek's mHC technique.
Stochastic: Involving random variables or processes; having a probability distribution.
OLAP: Online Analytical Processing, a category of software technology that enables analysts, managers and executives to access, more rapidly, from complex summarization and comparison of business data.
Flipper Zero: A portable, tamper-resistant multi-tool for geeks and hackers, designed for penetration testing and digital/physical interaction with various systems.
Raspberry Pi: A series of small single-board computers based on ARM architecture, popular for hobbyist and educational projects.
CES: Consumer Electronics Show, an annual trade show organized by the Consumer Technology Association, showcasing new technology and products.
QD-OLED: A type of OLED display technology that uses quantum dots to enhance color and brightness.
DRAM: Dynamic Random-Access Memory, a type of semiconductor memory that uses capacitors to store each bit of data.
VRAM: Video Random-Access Memory, a specialized type of RAM used by graphics cards for storing image data.
RSU: Restricted Stock Unit, a company-issued stock grant that is subject to vesting conditions.
TSMC: Taiwan Semiconductor Manufacturing Company, the world's largest contract chip manufacturer.
EUV: Extreme Ultraviolet Lithography, a technique used in semiconductor manufacturing to etch patterns onto silicon wafers.
MTA: Metropolitan Transportation Authority, the public authority responsible for most public transportation in the New York metropolitan area.
MetroCard: A contactless fare card used for public transportation in the New York metropolitan area.
Omni (One Metro New York): A new contactless fare payment system for the MTA, replacing the MetroCard.
Computer Chronicles: A long-running television show that documented the personal computer industry from 1981 to 2002.
CPM: Control Program for Microcomputers, an operating system developed by Digital Research.
Digital Research: A software company founded by Gary Kildall.
HYPERCARD: A software construction kit, combining a database with an object-oriented programming language and a graphical user interface.
SUPERCard: An extension of HyperCard.
DIRECTOR: A multimedia authoring tool developed by Macromedia.
HYPERCARD: A software construction kit, combining a database with an object-oriented programming language and a graphical user interface.

Timeline

00:01:19:760

Introduction of Joey Davila, AI developer advocate, and Dan Patterson, Senior Director of Content at Blackbird.ai.

00:03:13:400

Discussion on the rapid pace of AI development and its potential to change everything.

00:04:23:480

Analysis of AI's economic impact and the productization trend of AI companies.

00:06:10:840

Debunking myths about AI's water consumption and discussing the broader environmental context.

00:07:37:936

Conversation on the progress made in AI, referencing companies like Anthropic and Google.

00:08:55:296

Discussion on the strategic direction of AI companies, particularly their focus on consumer products.

00:10:09:456

Comparison of AI's impact on journalism and productivity to the early days of Photoshop.

00:11:35:776

Discussion of Andrej Karpathy's sentiment about falling behind in the AI programming field.

00:13:17:552

Reflection on the overwhelming feeling of technological change, drawing parallels to the early internet.

00:15:59:312

Anecdote about using Mosaic browser in the early days of the internet.

00:16:27:352

Reflection on the shift from utopian visions of the internet to current cynicism and consequences.

00:17:19:791

Analogy of YouTube's content explosion to the potential for AI to generate vast amounts of useful or terrible output.

00:20:03:888

Discussion on the history of domain name ownership and the early internet.

00:21:15:088

Reflection on the nature of technology podcasting and its slightly OCD audience.

00:21:59:929

Observation about business acquisitions and moves happening during holiday periods in the AI industry.

00:27:39:089

News about NVIDIA's acquisition of Grok's technology and talent, termed a "hackquisition."

00:30:31:168

Discussion of Mark Zuckerberg's acquisition of the Singapore startup Manus.

00:32:25:824

OpenAI's search for a Head of Preparedness and the associated salary and responsibilities.

00:34:14:103

Joey Davila's use of Claude Code for job searching and evaluating his fit for roles.

00:37:14:224

Introduction of DeepSeek's mHC (Manifold Constrained Hyperconnections) model training breakthrough.

00:39:49:160

Discussion of DeepSeek's analogy for mHC, comparing it to a "crazy water park with perfect safety controls."

00:44:42:175

Debate between two schools of thought on LLMs: one viewing them as a dead end, the other advocating for more compute.

00:45:58:895

Reflection on the rapid advancement of AI since ChatGPT's release three years ago.

00:50:39:694

Discussion of Cory Doctorow's use of Joey's quote and the concept of "shitification."

00:52:31:743

Personal anecdotes about the impact of SARS and COVID-19 on public health.

00:54:09:694

Overview of proposed federal "bad internet bills" like the SCREEN Act and the COOPER Davis Act.

00:59:55:276

Discussion of Things Canary, a security device that acts as a honeypot.

01:00:22:756

Mention of the FCC banning foreign-made drones and its implications.

01:03:16:296

Concerns about the potential for new legislation to further attack Section 230.

01:05:49:816

New York state law requiring social media platforms to display warning labels on content.

01:06:39:176

A federal judge blocks Texas's App Store age verification law, citing First Amendment concerns.

01:09:00:336

Discussion of Apple's potential role in age verification for app stores and user privacy.

01:11:05:397

Recalling early peer-to-peer VPN projects like PeekABoot from Cult of the Dead Cow.

01:13:13:077

Sharing of past interactions and the history of tech media, including G4 Tech TV.

01:15:30:397

The hosts look forward to CES coverage next week and mention special content for Club Twit members.

01:21:38:557

Reflections on the history of tech journalism and the early days of TWiT.

01:22:45:121

Discussion about US government policies on Chinese chips and the semiconductor industry.

01:27:27:641

The US FCC bans all foreign-made drones, not just DJI, impacting various industries.

01:32:54:166

The FCC kills the Cybertrust Mark Program, intended to improve home security device cybersecurity.

01:34:23:046

Waymo vehicles stopped operating in San Francisco during a power outage, causing traffic disruptions.

01:37:50:571

Discussion on the impact of ride-sharing services like Uber on New York City's traffic and taxi industry.

01:39:39:143

San Francisco's new mayor bans Flipper Zeros and Raspberry Pis at the inauguration.

01:40:47:891

Discussion about the portability and potential misuse of devices like Flipper Zero.

01:41:01:411

The MetroCard is being phased out in New York City in favor of tap-to-pay systems like Omni.

01:43:47:902

Apple's supply chain and potential impact of DRAM prices on future MacBook Pro models.

01:52:27:944

The "dumbest things that happened in tech" in 2025, including a lawyer suing Mark Zuckerberg over his name.

01:58:22:249

The story of Suhail Doshi exposing an engineer working multiple jobs simultaneously.

02:00:21:668

Mark Zuckerberg's methods for recruiting talent for OpenAI, including delivering soup.

02:01:31:947

Elon Musk's Grok AI and its issues with generating inappropriate content, including child nudity.

02:04:09:227

Discussion on how technology, historically driven by adult content, is evolving.

02:06:45:788

A humorous tangent about New York pizza and sandwich shops.

02:09:08:827

Concern that children are losing the ability to read analog clocks due to reliance on digital devices.

02:11:43:543

Nostalgic discussion about radio show "hot clocks" and the use of cart machines.

02:13:47:052

Remembering the KFRC mobile studio and the history of radio technology.

02:15:17:543

The significance of the cart machine in the show "Stranger Things" and its 1980s setting.

02:20:00:358

Joey Davila's accordion playing and the historical context of accordion music.

02:21:30:238

Introduction of Redis Cloud as a real-time data platform for AI applications.

02:24:35:358

Preview of CES 2026, including Sony's Afila car and Samsung's new QD-OLED TVs.

02:27:59:743

Discussion of the surprising price point and form factor of a new BlackBerry-style Android phone.

02:31:40:024

The skyrocketing price of DRAM and VRAM, impacting the cost of consumer electronics and AI hardware.

02:35:47:583

The MTA's transition from MetroCards to tap-to-pay systems, referred to as Omni.

02:38:00:143

A tribute to Stuart Sheffey, creator of "Computer Chronicles," and the history of early computer television shows.

02:43:47:902

Joey Davila's ongoing projects, including accordion music and AI development.

02:45:27:743

Thanks to guests and a look ahead to next week's show covering CES.

02:46:47:543

A reflection on TWiT's 20-year history and a call to action for listeners to spread the word.

TWiT 1065: AI Action Park - DeepSeek's mHC Model Training Breakthrough!...