QuestionsLeaderboardAppendixBlogPracticeProfile
Back to Repository
AI System DesignMedium

How do you evaluate an agent when the "correct path" might involve 5 different tool calls in any order?

Practice Your Response

Similar Questions in AI System Design

Medium

How do you implement "Long-Term Memory" for an agent so it remembers a user's preference from a conversation that happened three weeks ago?

View
Medium

What is the fundamental difference between a "Chain" (hardcoded steps) and an "Agent" (model-decided steps)? When is a Chain actually better than an Agent?

View
Hard

If a tool returns a massive 50MB JSON file, you can't feed that to the LLM. How do you "summarize" or "filter" tool observations for the agent?

View

Built for the AI Engineering community.

BlogPrivacyTermsContact