# Quarterly Budget Exception

- Task ID: quarterly-budget-exception
- Domain: Finance
- Split: holdout
- Difficulty: Hard
- Reviewer: Sanjay Prasad
- Redaction level: private-holdout-summary

## Source Packet

Using the emails and approval rules in the packet, assess whether a department budget exception should be approved.

- **budget policy:** Private holdout excerpt for budget policy; store full source in the harness artifact vault.
- **email approval:** Private holdout excerpt for email approval; store full source in the harness artifact vault.
- **expense category:** Private holdout excerpt for expense category; store full source in the harness artifact vault.

## Gold Answer

Classify the approval path and cite the missing approval, if any. The answer must cite budget policy, email approval, and expense category, and must not add facts outside the source packet.

## Reviewer Notes

- Primary reviewer: Sanjay Prasad.
- Accept only if the response explicitly uses budget policy and at least one other evidence item.
- Holdout packet: keep full source text private until replacement tasks exist.
- Reject responses that overpromise, invent policy, skip escalation boundaries, or omit evidence.
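
The evidence rule in the notes above (budget policy plus at least one other evidence item) can be sketched as a simple pre-check. This is a minimal illustration, not the harness implementation: the function name `meets_evidence_rule` is hypothetical, and case-insensitive substring matching on the artifact labels is an assumption that a human reviewer would still need to confirm.

```python
# Artifact labels taken from the Source Packet section.
EVIDENCE_LABELS = ("budget policy", "email approval", "expense category")


def meets_evidence_rule(response: str) -> bool:
    """Return True if the response names budget policy and at least one
    other evidence item.

    Assumption: a label counts as "used" when it appears verbatim
    (case-insensitive) in the response text.
    """
    text = response.lower()
    cited = {label for label in EVIDENCE_LABELS if label in text}
    return "budget policy" in cited and len(cited) >= 2


if __name__ == "__main__":
    sample = "Per the budget policy and the email approval, escalate."
    print(meets_evidence_rule(sample))
```

A check like this only screens for missing labels; it cannot detect invented policy or overpromising, which remain reviewer judgments.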

## Scoring Checklist

- **Outcome correctness (35%):** Classifies the approval path correctly and names the missing approval, if any, as described in the Gold Answer.
- **Evidence citation (25%):** Cites the relevant source packet artifacts by label: budget policy, email approval, expense category.
- **Escalation judgment (20%):** Escalates only when the packet evidence leaves a policy, finance, legal, or operations decision unresolved.
- **Localization and tone (10%):** Meets the localization and tone standard for Finance workflows.
- **Cost-aware brevity (10%):** Meets the cost-aware brevity standard for Finance workflows.
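
The checklist weights sum to 100%, so per-criterion ratings can be combined into one score. The sketch below shows one way to do that, assuming each criterion is rated on a 0.0 to 1.0 scale; the rating scale, the dictionary keys, and the function name `weighted_score` are illustrative assumptions rather than part of the task definition.

```python
# Weights in percentage points, copied from the Scoring Checklist.
WEIGHTS = {
    "outcome_correctness": 35,
    "evidence_citation": 25,
    "escalation_judgment": 20,
    "localization_and_tone": 10,
    "cost_aware_brevity": 10,
}


def weighted_score(ratings: dict[str, float]) -> float:
    """Combine per-criterion ratings (assumed 0.0 to 1.0) into a 0-100 score."""
    missing = set(WEIGHTS) - set(ratings)
    if missing:
        raise ValueError(f"missing ratings for: {sorted(missing)}")
    for name in WEIGHTS:
        if not 0.0 <= ratings[name] <= 1.0:
            raise ValueError(f"rating for {name} must be between 0.0 and 1.0")
    return sum(WEIGHTS[name] * ratings[name] for name in WEIGHTS)


if __name__ == "__main__":
    example = {
        "outcome_correctness": 1.0,
        "evidence_citation": 0.5,  # hypothetical: cited only part of the evidence
        "escalation_judgment": 1.0,
        "localization_and_tone": 1.0,
        "cost_aware_brevity": 1.0,
    }
    print(weighted_score(example))
```

Keeping the weights as integer percentage points avoids rounding drift when they are summed.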
