Resources · Case Studies

Annotation engineering in the field

Real projects. Real data challenges. Technical deep dives into how Concave AI has solved annotation quality problems across computer vision, RLHF, agentic AI, fintech document intelligence, and precision agriculture — with the numbers to back it up.

Case Study 01
Beyond the Box: Precision Annotation for Heavily Occluded Objects in Urban Traffic Scenes
Standard bounding-box pipelines break down when objects are 60–90% occluded. This case study covers the multi-pass annotation protocol, inter-annotator agreement thresholds, and quality gates that produced production-grade training data for an autonomous vehicle perception model operating in dense urban environments.
Read the full case study →
Case Study 03
From Chaos to Code: Transforming Low-Resolution Financial Documents into Structured JSON with 99.9% Field Accuracy
OCR field accuracy drops to 41% on barely legible financial documents. This case study covers the human-in-the-loop correction protocol, CA-qualified annotator selection, and three-tier QA architecture that raised accuracy to 99.1% — reducing false loan rejections from 8.2% to 0.9%.
Read the full case study →
Case Study 04
High-Granularity Labeling: Distinguishing Crop Phenotypes from Invasive Weed Species in Variable Lighting
Standard annotators achieve 54% accuracy on the hardest crop-weed confusion pair — barely above random chance. Expert agronomist annotators reach 89% on the same pair. This case study documents the methodology, calibration protocol, and 1,140:1 ROI on expert annotation investment.
Read the full case study →
Case Study 02
RLHF for Agentic Workflows: Multi-Step Logic Verification in Autonomous Web Browsing Agents
Preference annotation for agentic AI requires evaluating entire reasoning chains — not just individual responses. This case study details the annotation rubric, logical coherence scoring protocol, and inter-annotator calibration approach used to build RLHF preference data for a multi-step autonomous web browsing agent.
Read the full case study →
Coming soon
Legal AI
Claim-level hallucination detection in contract review — 40,000 clause annotations
How we built a hallucination-detection annotation pipeline for legal AI, with claim-level verification and citation tracing across 12 contract types.
Medical AI
Radiology report annotation for rare disease detection — 12,000 chest X-rays with radiologist review
Building ground-truth annotation data for rare pulmonary conditions where AI models consistently underperform — expert radiologist calibration, multi-label uncertainty scoring, and downstream model impact.
Legal AI
Multi-label crop disease annotation for Indian varietals absent from public datasets
PlantVillage contains zero Indian crop varietals. How we built ground-truth annotation data for Pusa Ruby, IR-64, and MCU-5 from scratch with agronomist review at every tier.

Want a case study on your annotation challenge?

We run a free data audit — 50 outputs evaluated, quality findings in 5 working days. No cost, no commitment.

Request Free Audit →