Lenny Distilled

AI Engineering 101 with Chip Huyen

October 23, 2025

Featuring: Chip Huyen (Core developer at Nvidia, AI researcher, Stanford instructor, Author)

8 quotes · 8 insights

Watch Full Episode

Metrics without action are entertainment

"Eval" refers to evaluation frameworks used to systematically test and measure AI model performance across different metrics and user segments.

The goal of eval is to guide the product development. So you see eval, because I think I'm a big fan of eval, is that it helps you uncover opportunities where the progress are doing well.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:27:54

Your data preparation matters more than your tech stack

RAG (Retrieval-Augmented Generation) is an AI approach where systems first retrieve relevant information from databases before generating responses to user queries.

In a lot of the companies that I have seen, that's the biggest performance, in their RAG solutions coming from better data preparations, not agonizing over what vector databases to use.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:34:02

Functions must integrate, not coordinate

"Eval" refers to evaluation systems used to test and measure AI model performance in machine learning applications.

I think there's a lot of value plays in... So before we have a lot of disjointed teams. We have very clear engineering team, product team, but then there's a question of who should write eval? Who should own the metrics? And it turns out, eval, it's not a separate problem. It's a system problem because you need to look into different components, how they interact with each other.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 58:44

AI is reshaping everything - adapt urgently or become obsolete

We are in an ideal crisis. Now, we have all this really cool tools to do everything from scratch and have new design. It can have you write code. You can have new website. So in theory, we should see a lot more, but at the same time, people are somehow stuck. They don't know what to build.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:00:19

AI breaks traditional productivity metrics

It's really hard to measure productivity. So you actually think about what actually drive productivity metrics for you.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:45:37

AI adoption happens bottom-up while executives remain blind

I do ask people to ask their managers, 'Would you rather give everyone on the team very expensive coding agent subscriptions or you get an extra head count?' Almost every one, the managers will say head count. But if you ask VP level or someone who manage a lot of teams, they would say, 'Want AI assistant.' Because as managers, you are still growing, so for you having one HR head count is big. Whereas for executives, maybe you have more business metrics that you care about.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:38:00

MVP is dead, MLP (Minimum Lovable Product) is the future

You don't have to be absolutely perfect, I think, to win. You just need to be good enough and being consistent about it.
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 00:24:39

Build what frustrates you daily

One tip is go look from the last week. For a week, just pay attention to what you do and what frustrates you. And when something frustrates you, think about, is there anything we can do? Can it be done a different way?
Chip HuyenCore developer at Nvidia, AI researcher, Stanford instructor, Author 01:09:58

The Missing Stamp

Every episode of Lenny's Podcast, distilled into the insights that matter and the quotes that make them stick.

LENNY WAS HERE__STAMP_DATE__

Lenny, if you're reading this, the stamp's ready when you are. 🧡🔥