Resources — Concave AI

12 min read

RLHF Data Quality Alignment

Why 38% of RLHF preference data trains your model to lie — and how to detect it before training

Sycophancy enters RLHF pipelines when annotators reward confident agreement over factual accuracy. Here is the mechanism, the measurement protocol, and the annotation-level fix.

Read the full analysis →

18 min read

AI Training Fine-Tuning Data Quality

How to Train an AI Model: The Complete 2026 Guide to Workflow, Data, and Getting It Right

Pre-training, fine-tuning, or RLHF — choosing the wrong approach costs six weeks and hundreds of thousands of rupees. This guide covers the full training workflow, modality-specific data requirements, real cost breakdowns, and the six annotation mistakes that silently cap your model's ceiling.

Read the full guide →

10 min read

Quality Metrics Annotation Best Practices

Cohen's kappa explained for ML engineers — the annotation quality metric your pipeline probably is not measuring

Inter-annotator agreement is the single most important quality metric in data annotation. Here is what it measures, how to interpret it, and why "98% accuracy" without kappa is meaningless.

Read the full guide →

Deep dives into data quality for AI

Want to see these principles applied to your data?