How do you calculate the ROI of a prompt change? If a new prompt is 5% more accurate but 50% more expensive in tokens, how do you decide if it’s worth it?

Accepted Answer

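Track the cost per correct answer (total spend divided by the number of correct outputs) rather than accuracy or token cost alone. If switching to a larger model makes the system 2% more accurate but 10x more expensive, the cost per successful outcome may be too high for a sustainable business model. The same test applies to a prompt change: a prompt that is 5% more accurate but 50% more expensive in tokens is worth it only if the value of the extra correct answers, or the cost of the errors avoided, exceeds the added token spend.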
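As a rough illustration, here is a minimal sketch of that comparison in Python. The numbers are assumptions, not measurements: the question does not give a baseline, so this assumes 80% accuracy at $0.010 per request, and reads "5% more accurate" as five percentage points (85%) and "50% more expensive" as $0.015 per request. The names `PromptVariant`, `cost_per_correct_answer`, and `net_value_per_request`, along with the dollar values assigned to a correct answer and to an error, are hypothetical and would come from your own business metrics.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PromptVariant:
    name: str
    accuracy: float          # fraction of requests answered correctly, 0..1
    cost_per_request: float  # average token spend per request, in dollars


def cost_per_correct_answer(v: PromptVariant) -> float:
    """Average spend needed to obtain one correct answer."""
    if v.accuracy <= 0:
        raise ValueError("accuracy must be positive")
    return v.cost_per_request / v.accuracy


def net_value_per_request(v: PromptVariant,
                          value_of_correct: float,
                          cost_of_error: float) -> float:
    """Expected value of one request after token spend and error costs."""
    gain = v.accuracy * value_of_correct
    loss = (1 - v.accuracy) * cost_of_error
    return gain - loss - v.cost_per_request


# Assumed figures for illustration only.
baseline = PromptVariant("baseline", accuracy=0.80, cost_per_request=0.010)
candidate = PromptVariant("candidate", accuracy=0.85, cost_per_request=0.015)

for variant in (baseline, candidate):
    print(variant.name, round(cost_per_correct_answer(variant), 4))

# High-stakes case: a correct answer is worth $0.50, an error costs $0.25.
high_stakes_gain = (net_value_per_request(candidate, 0.50, 0.25)
                    - net_value_per_request(baseline, 0.50, 0.25))

# Low-stakes case: a correct answer is worth $0.05, an error costs nothing.
low_stakes_gain = (net_value_per_request(candidate, 0.05, 0.0)
                   - net_value_per_request(baseline, 0.05, 0.0))

print("high-stakes gain per request:", round(high_stakes_gain, 4))
print("low-stakes gain per request:", round(low_stakes_gain, 4))
```

Under these assumed figures the candidate's cost per correct answer is higher than the baseline's, yet the candidate still comes out ahead in the high-stakes case and behind in the low-stakes case. Cost per correct answer shows what each success costs; whether that price is acceptable depends on what a correct answer earns and what an error costs.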
