The fastest agent in the race has the best evals

14 Nov 2025 • 32 min • EN

Ryan welcomes Benjamin Klieger, lead engineer at Groq, to explore the infrastructure behind AI agents, how you can turn a one-minute agent into a ten-second agent, and how they used fast inference and effective evals to build their efficient and reliable Compound agent.

Episode notes:

Groq delivers fast, low-cost inference using their custom-designed LPU, the first chip built for inference. Check out their agent, Compound, which can search the web and run code.

Connect with Benjamin on LinkedIn and X.

Congrats to user Bart Kiers for winning a Stellar Answer badge on their response to "Regular expression to match a line that doesn't contain a word".

See Privacy Policy at https://art19.com/privacy and California Privacy Notice at https://art19.com/privacy#do-not-sell-my-info.

From "The Stack Overflow Podcast"
