Field Notes

We'd Better Build Some Damn Good Brakes

Waymo is objectively better than an average human driver.

Data shows it reduces injury-causing crashes by over 80%. Yet, if an autonomous vehicle makes a single mistake, it's front-page news. We demand near-perfection from machines while accepting massive error rates from humans. And rightfully so.

We should apply that exact same zero-tolerance policy to "AI Twins" in the professional world.

Lately, I've been building a "Data Scientist twin" to handle chat requests on routine experimentation analysis in my domain. In testing, it's brilliant, it's fast, it never complains about an ad-hoc request.

And yet, I haven't deployed it widely.

Why? Because when I make a mistake, I understand the organizational fallout. I know when a quick estimate is "good enough," and when the stakes are high, I double-check the code, get it peer-reviewed and am on my toes till a decision is made and the consequences are known.

This is exactly why my DS twin, despite working well in tests, stays in its sandbox. I'm not convinced you can prompt a 'spidey sense' into the skill files for spotting a bad result. And even if you could inject enough context to get close, an AI isn't on the line when a decision goes sideways. Maybe it's the Google in me, but I will always lean toward "be right and useful" over "move fast and hallucinate." Being an AI-First Data Scientist means recognizing that the higher an AI's capability, the more rigorous our safety guardrails must be.

I'd love an AI Gilfoyle in my life. But until we can code true accountability into our AI doubles, I'm keeping mine in a heavily audited sandbox.

If we want AI to drive our workflows, we'd better build some damn good brakes.

Originally posted on LinkedIn.

Stepping Out of the Deep Work Chamber

When my architect asked for the vision behind the house I'm building in India, I sketched a lighthouse. Five floors. One central column. The bottom floor was a gallery — open, social, collaborative. The top floor was a sealed chamber for uninterrupted deep work. I was low-key proud of

My Poster at the Google Data Science Summit

Grateful to have had my poster featured at the 2026 Google Data Science Summit in Sunnyvale today. Autorater Context Enrichment tackles something I've been thinking about for a while: static prompts don't age well. Autoraters built on fixed grounding miss nuance as products evolve, and the

No Plan Survives Contact with the Data: Why AI Blueprints Need a Commander's Intent

A few months ago, during a coffee chat, a very smart AI mentor at Google told me something: "The best way to think about these agents," she said,"is like a very smart intern. Give them enough context, define the workflow, and they will execute the code

I Gave My AI Amnesia on Purpose

In 2010, Tom Preston-Werner, co-founder of GitHub, wrote a short essay arguing that engineers should write the README before writing a single line of code. His reason was simple: A perfect implementation of the wrong specification is worthless. A few months ago, I was having an amazing brainstorming session with

Read more

Stepping Out of the Deep Work Chamber

My Poster at the Google Data Science Summit

No Plan Survives Contact with the Data: Why AI Blueprints Need a Commander's Intent

I Gave My AI Amnesia on Purpose