Define Exact Match (EM) vs. F1 Score in the context of an extraction task (e.g., extracting dates from a PDF). When should you use EM?

Question

Accepted Answer

Exact Match (EM) requires every character to be identical. F1 Score measures the overlap of words between the predicted and ground truth. EM doesn't work well for summaries or chat, but is necessary for structured data like SKU numbers or status codes.

Define Exact Match (EM) vs. F1 Score in the context of an extraction task (e.g., extracting dates from a PDF). When should you use EM?

Practice Your Response

Similar Questions in Reliability & Evaluation