Autonomous VehiclesIndustry InsightsElectric Vehicles

Rethinking FSD: Insights from Waymo's John Krafcik on Tesla's Strategy

JJordan Avery

2026-02-04

14 min read

A deep analysis of Krafcik’s critique of Tesla FSD — comparing approaches, safety tradeoffs, and practical advice for buyers and fleets.

Rethinking FSD: Insights from Waymo's John Krafcik on Tesla's Strategy

By taking John Krafcik’s public critique as a starting point, this deep-dive unpacks what Tesla’s Full Self-Driving (FSD) methodology means for safety, business models, and the future of autonomous vehicles. We compare technical approaches, validation philosophies, deployment strategies and give practical guidance for drivers, fleet operators and buyers who must decide whether to trust FSD today or wait for a different path.

Introduction: Why Krafcik’s View Matters

John Krafcik — former CEO of Waymo and a central voice in AV policy and engineering circles — has been critical of single-pronged AV strategies that prioritize rapid consumer rollouts over validated, redundant safety engineering. His critique is less about any single company and more about the tradeoffs the industry faces between fast iteration and rigorous validation. If you're researching autonomous vehicles, understanding that tradeoff is critical for assessing product risk, safety claims and long-term value.

To ground this analysis, we draw parallels to software and AI deployment practices. For example, engineers building quick prototypes follow guides like how to build a 48-hour ‘micro’ app with ChatGPT and Claude and then stress-test those concepts with secured development patterns from pieces such as building secure LLM-powered desktop agents. Those techniques translate into AVs as rapid simulation, controlled pilots and rigorous post-mortem processes.

Throughout this guide we link to practical resources and analogies from other technical domains — because the industry is converging: AV stacks increasingly resemble cloud-native, AI-driven platforms that require secure pipelines, validation playbooks and clear governance. See our section on incident handling later for playbook analogies like the cloud post-mortem guidance at post-mortem playbook: responding to Cloudflare and AWS outages.

1. Two Philosophies of Autonomy: Waymo vs Tesla

Waymo’s Conservative, Systems-First Approach

Waymo has historically chosen redundancy, extensive simulation and constrained operational design domains (ODDs) as the path to safety. Krafcik’s perspective reflects an engineering culture that insists on layered sensing, conservative decision-making and acceptably low incident rates before broad consumer deployments. This philosophy treats autonomy as a systems-engineering problem as much as a machine-learning one, and the company invests accordingly in validation infrastructure, mapping and dedicated compute per vehicle.

Tesla’s Iterative, Vision-First Strategy

Tesla’s FSD leans heavily on vision-based neural nets, vast fleet data and over-the-air updates to iterate quickly. The idea: scale the model with massive, real-world data and let continuous updates address edge cases. That gives faster feature rollout to customers but raises questions about how edge cases are validated before software reaches the consumer fleet — a central point of Krafcik’s critique.

Implications for Safety and Certification

The two approaches impact certification and regulatory oversight differently. Waymo’s cautious route produces smaller incremental deployments, easier-to-defend validation records and clearer metrics for regulators. Tesla’s approach asks regulators and consumers to accept a more dynamic safety envelope where behavior evolves via updates — a model that complicates both oversight and consistent safety assurances.

2. Sensors, Perception & The Argument for Redundancy

Sensor Suites: Cost vs Coverage

Sensor choice — lidar, radar, cameras, ultrasonic — is both a technical and business decision. Lidar adds precise depth at a cost; cameras are cheap and high-bandwidth for semantic understanding. Krafcik argues that redundancy (multiple modalities) reduces single-point failure modes. Tesla’s camera-first architecture bets that advanced perception nets can close the gap economically. The debate comes down to how you value safety margin versus deployment cost.

Perception Networks and Training Data

Quality of training data matters as much as quantity. Networks trained on well-labeled, diverse datasets generalize better to unusual scenarios. Teams that follow practices from the machine-learning and micro-app world — such as iterative validation and controlled releases described in guides like how to build ‘micro’ apps with LLMs — are better positioned to manage skew between training and reality.

Edge Processing & Safety in Depth

Redundancy should extend to compute and decision layers, not only sensors. Hardened agent workflows and safe failover logic, similar to processes recommended in security articles like how to harden desktop AI agents, are crucial in an AV context. That means predicting sensor failure, gracefully handing control back to drivers, and extensively testing fallback states.

3. Data Collection: Quantity, Quality, and Labeling

Fleet Scale vs Labeling Depth

Tesla’s advantage is raw scale: millions of vehicle-miles producing diverse, real-world events. But raw recordings are only part of the story — labeling depth and context are essential. Waymo’s approach prioritizes richly annotated datasets, HD maps and edge-case rehearsal; they invest heavily in curated data to compensate for smaller fleet size.

Automating Annotation and Human-in-the-Loop

Automation can accelerate labeling, but human-in-the-loop remains necessary for rare edge cases. Teams well-versed in building internal microservice platforms — for example from playbooks like how to build internal micro-apps with LLMs — can create workflows that intelligently escalate ambiguous frames to human labelers, improving training signal without exploding cost.

Edge AI Tools for Real-World Collection

Edge devices and prototyping hardware — think of practical kits like the Raspberry Pi AI HAT guides at getting started with the Raspberry Pi 5 AI HAT+ 2 — inform how companies instrument sensors for field diagnostics and localized testing before full integration. Smaller teams can use these approaches to reproduce rare sensor behaviors and validate fixes in lab conditions before fleet rollout.

4. Validation & Simulation: The Hidden Backbone

Why Simulation Is Not Enough — But Still Essential

High-fidelity simulation scales scenario coverage and safely exercises edge cases that would be rare or dangerous in the real world. However, simulation fidelity matters: missing dynamics or incorrect sensor noise models can give false confidence. Robust teams run mixed-in-the-loop validation cycles that marry simulated stress tests with staged field trials.

Incident Response and Post-Mortems

AV companies must adopt incident response playbooks similar to those in IT operations. The post-mortem methodologies in responding to Cloudflare and AWS outages and multi-provider outage playbooks at responding to a multi-provider outage are useful analogies. A disciplined post-mortem culture — documenting root causes, fixes, and preventative measures — accelerates safety improvements and supports regulatory transparency.

Metrics that Matter: Beyond Miles Driven

Miles driven alone are a blunt instrument. Better metrics include validated edge-case resolution rates, safe disengagements per relevant scenario, and closed-loop validation coverage. Transparent reporting of these metrics would help regulators and the public evaluate claims. Companies that adopt structured metric-reporting will find it easier to build trust.

5. Security, Privacy & Governance

Securing the AI Stack

Autonomous stacks combine vehicle firmware, perception models, mapping data, and cloud services — an attack surface with many vectors. Applying hardening practices from secure agent and desktop AI deployments, such as those outlined in from Claude to Cowork: building secure desktop agent workflows and how to harden desktop AI agents, reduces the risk of model-tampering, data exfiltration or malicious OTA updates.

Privacy, Data Sovereignty, and Local Regulations

Where fleet data is stored and how it’s processed are regulatory issues that differ by jurisdiction. Discussions around cloud alternatives like is Alibaba Cloud a viable alternative to AWS illustrate the importance of sovereignty when datasets contain personally identifiable information. AV companies should design data architectures that support regional compliance and enable auditability.

Governance: Who Signs Off?

Clear governance — who approves a release, what tests must pass, and what rollback criteria look like — matters. Borrowing governance patterns from micro-app platforms and internal dev playbooks such as building a micro-app platform for non-developers ensures that non-engineering stakeholders (safety, legal, ops) can participate in release gating and audit trails.

6. Business Model & Go-To-Market Tradeoffs

Mass Market Rolls vs Controlled Fleet Deployments

Tesla’s consumer-first rollout attempts to rapidly expose the model to many real-world scenarios and monetize features through subscriptions. Waymo favors controlled deployments (robotaxis and closed geographies) to limit ODD scope. Each model has revenue and liability tradeoffs: mass-market rollouts accelerate user feedback and adoption but increase regulatory exposure and liability risk.

Customer Expectations & Product Positioning

Consumers expect clear labeling about capability: is the system driver-assist or self-driving? Clear positioning reduces risky user behavior — people misuse features that seem more capable than they are. Teams that coordinate product messaging and timing are more likely to manage expectations, a practice similar to timing ad campaigns around events detailed in how to time your listing ads around big live TV events, where messaging cadence affects audience behavior.

Monetization Paths and Long-Term Value

Subscription models, fleet-as-a-service and robotaxi revenue are all viable. The key to sustainable monetization is demonstrable safety and predictable operating costs. Companies that can show reproducible safety improvements and transparent metrics will find insurers and regulators more willing to underwrite services.

7. What Buyers and Drivers Need to Know

Practical Checklist Before Enabling Anything

If you own or plan to buy a car with FSD-style features, treat each update like software: review release notes, check local regulations and verify that driver monitoring exists. Always test features in low-risk environments before relying on them in complex traffic. For general vehicle selection guidance, our practical reviews like the best dog-friendly cars in the UK show how feature sets and real-world usability matter — the same diligence applies to autonomy.

Insurance and Liability Considerations

Insurance markets are still grappling with AV liability. If a system is partially autonomous but requires driver supervision, most carriers will hold the driver responsible in many jurisdictions. Documenting your usage and keeping logs of incidents can be valuable for claims. Expect insurance products to evolve as regulators and manufacturers clarify responsibility.

How to Evaluate Claims and Marketing Language

Look for transparent metrics: sets of tests completed, simulated hours, and independent audits. Beware of broad language like “full self-driving” without accompanying validation. Companies that adopt public reporting of safety metrics and clear ODD boundaries make it easier for consumers to compare offerings objectively.

8. Roadmap Recommendations: A Middle Path

Hybrid Solutions: Best of Both Worlds

A pragmatic strategy adopts a hybrid: use vision rapidly for feature development but layer additional modalities and conservative decision-making for safety-critical behaviors. That hybrid path can reduce single-point failure risks while preserving agility for continuous improvement.

Incremental, Measurable Releases

Use feature flags, staged rollouts and canary tests — classic techniques from software engineering and micro-app rollouts — to manage risk. Resources like build a 7-day micro-app to automate invoice approvals and build a 7-day microapp to validate preorders illustrate rapid validation cycles that reduce exposure before broader release.

Platformization & Ecosystem Play

Companies should invest in modular platforms for perception, validation and safety monitoring. Architecting for non-developer operators and safety reviewers — inspired by building a micro-app platform for non-developers — lets cross-functional teams participate in safety workflows and accelerates trustworthy scale.

9. Tech & Operations: Lessons from Other AI-Driven Systems

Leverage Secure Agent Patterns

AV stacks will benefit from secure agent patterns: explicit least-privilege, signed updates, anomaly detection and immutable audit logs. The design patterns in secure desktop agent workflows and building secure LLM-powered desktop agents are directly applicable to OTA update pipelines and onboard model serving.

Organize for Rapid Iteration, Not Reckless Changes

Iteration is valuable when coupled with gating: tests that must pass before deployment, telemetry that must be reported, and rollback criteria that are automatic. Teams that model their release pipelines on internal micro-app practices such as practical guides for micro-apps with LLMs will see fewer downstream surprises.

Comprehensive Monitoring & Feedback Loops

Monitoring must go beyond crash logs to capture nuanced behavior signals: near-miss detections, hesitation events, and sensor degradation. Establishing feedback loops from these signals into labeling and retraining cycles accelerates safe improvements and reduces repeated incidents.

10. Final Assessment: What Krafcik’s Critique Means for the Industry

A Call for Balanced Engineering

Krafcik’s critique is a call to balance innovation speed with engineered safety. The industry cannot succeed on hype alone; it needs measurable, auditable safety progress that supports regulatory and insurer confidence. This doesn’t mean stifling innovation; it means structuring innovation so it’s accountable and observable.

Buyers’ Bottom Line

For shoppers and fleet buyers: ask for data. Request disclosure of validation metrics, rollback plans and human oversight safeguards. Treat autonomy as you would any complex software purchase: demand transparency, measurable performance and robust support for edge cases.

Where to Watch Next

Watch for independent third-party audits, standardized safety reporting, and cross-industry data sharing initiatives. Companies that invest in these practices will likely be the ones that scale safely and profitably in the years ahead.

Comparison: Tesla FSD vs Waymo and Competitors

The table below summarizes the major differences in approach across key dimensions that matter to buyers, regulators and engineers. Use it as a starting point to evaluate claims and product positioning.

Dimension	Tesla FSD (Vision-First)	Waymo (Redundancy-First)	Cruise / Robotaxi	Aurora / Tier-1 (Hybrid)
Primary sensors	Cameras, radar (de-emphasized lidar)	Lidar + radar + cameras	Lidar + cameras (urban-focused)	Mix: lidar in constrained ops, cameras for perception
Engineering philosophy	Scale data + continuous updates	Systems engineering + redundancy	Conservative geofenced rollout	Platform + partner integration (OEM-focus)
Deployment	Consumer fleet, subscription model	Robotaxi & controlled pilots	Urban taxi trials	Logistics and commercial pilots
Validation emphasis	Real-world miles, OTA iterates	Simulation + curated datasets	Extensive simulation + controlled field tests	Integration tests + OEM validation
Regulatory & insurance risk	Higher near-term ambiguity	Lower (easier to audit)	Moderate (fleet ops focused)	Variable (B2B contracts ease risk)

Pro Tip: When comparing vendor claims, demand three things: a clear description of the operational design domain (ODD), independent validation metrics, and an audit trail for software releases. These three provide a defensible baseline for safety comparisons.

Actionable Checklist for Fleet Operators & Consumers

Below are concrete steps you can take today to reduce risk and make better-informed purchases or deployment decisions.

For Fleet Operators

1) Insist on staged rollouts with safety gates; 2) require signed firmware and model releases with rollback capability; 3) mandate independent audits and post-mortem reports modeled after incident playbooks like those at post-mortem playbook.

For Consumers & Buyers

1) Verify what the feature actually does in your jurisdiction; 2) test features in low-risk settings; 3) demand clear documentation on monitoring and driver responsibilities, and check trustworthy review sources.

For Policymakers

1) Standardize metrics beyond miles; 2) require transparency for OTA updates; 3) incentivize shared data frameworks for rare-event learning and third-party auditing to accelerate trust-building.

FAQ

1. Is Tesla FSD unsafe?

"Unsafe" is too binary. Tesla FSD has enabled advanced driver-assist features for many users, but the safety envelope depends on drivers adhering to supervision requirements and the software being used within its tested ODD. Krafcik’s critique highlights that rapid rollout without layered redundancy increases risk of unexpected edge-case failures.

2. Why does Waymo favor lidar?

Waymo emphasizes lidar for depth accuracy and redundancy. Lidar helps in low-light and complex sensor-fusion scenarios by offering metric scene understanding that complements cameras and radar, supporting conservative decision-making.

3. Can simulation replace real-world miles?

Simulation is necessary but not sufficient. High-fidelity sim expands coverage quickly and safely, but real-world validation uncovers unmodeled dynamics. A mixed validation approach is essential.

4. What should consumers ask manufacturers?

Ask for the devices’ ODD, independent safety metrics (edge-case resolution, validated disengagement rates), release gating processes, and telemetry policies for privacy. These answers reveal how mature a system’s safety practices are.

5. How will insurance and regulation change?

Insurers will demand clearer accountability and proven safety metrics before providing broad coverage. Regulators will likely require more auditable validation and rollback procedures for OTA updates. Expect incremental policy changes and pilot-focused approvals before generalized consumer acceptance.

Jordan Avery

Senior Editor & Automotive Technology Strategist

Senior editor and content strategist. Writing about technology, design, and the future of digital media. Follow along for deep dives into the industry's moving parts.