
Document AI Co.
Challenge — Generic LLM APIs couldn't hit accuracy on domain-specific contracts — fine-tuning attempts stalled without eval discipline.
What we did — Custom model with golden-set evals and regression gates. Extraction F1 improved from 71% to 93% on production docs.



