Lenny Distilled

Count first, complicate later

Supporting

"Before" refers to traditional data science approaches, and "the confusion" refers to the debate over whether AI products need specialized evaluation methods versus standard data analysis.

This is the same data science as before, and I think that's what's causing the confusion. It's, 'Hey, we need data science thinking,' and it's helpful to have that thinking in AI products like it is in any product. That's my take on that.

Nuanced

"This agreement" refers to measuring how often an AI judge system agrees with human evaluators when assessing AI model outputs.

A lot of people go straight to this agreement. They say, 'Okay, my judge agrees with the human some percentage of the time.' That sounds appealing, but it's a very dangerous metric to use, because a lot of the time, errors only happen on the long tail and they don't happen as frequently.
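
To make the point concrete, here is a minimal sketch of why raw agreement can mislead when failures are rare. Everything in it is an assumption for illustration, not something from the episode: the `judge_metrics` helper, the 100-output sample, and the choice to report true positive and true negative rates alongside agreement.

```python
def judge_metrics(human, judge):
    """Compare judge labels to human labels.

    Both inputs are lists of booleans where True means
    "this output is a failure". Hypothetical helper for illustration.
    """
    pairs = list(zip(human, judge))
    tp = sum(h and j for h, j in pairs)              # both say failure
    tn = sum((not h) and (not j) for h, j in pairs)  # both say fine
    fp = sum((not h) and j for h, j in pairs)        # judge flags a good output
    fn = sum(h and (not j) for h, j in pairs)        # judge misses a real failure

    agreement = (tp + tn) / len(pairs)
    # Rates are undefined when a class is absent from the sample.
    tpr = tp / (tp + fn) if (tp + fn) else float("nan")
    tnr = tn / (tn + fp) if (tn + fp) else float("nan")
    return {"agreement": agreement, "tpr": tpr, "tnr": tnr}


# Made-up sample: 100 outputs, of which humans marked 5 as failures.
human = [True] * 5 + [False] * 95
# A useless judge that never flags anything.
judge = [False] * 100

metrics = judge_metrics(human, judge)
print(metrics)
```

In this made-up sample the judge agrees with the humans 95% of the time while catching none of the five failures, which is the long-tail problem the quote describes. Counting each cell separately is just the "count first" habit applied to the judge itself.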

Every episode of Lenny's Podcast, distilled into the insights that matter and the quotes that make them stick.