Curated training packs for model work: cleaned records, sources, and schema files. Use them for fine-tuning, embeddings, and evals.