Agent Post-training Data Flywheel
Turned production Agent badcases into evaluable, labelable, trainable preference data loops.
DossierKit / AI Expert
Building intelligent systems and delightful experiences with AI.
Each role page is generated from roles.yaml and reassembles work, lab notes, and proof for a target role.
A fit for teams moving Agents from demos to production, covering runtime, tool-use, evaluation, permissions, and enterprise integration.
A fit for teams that need to turn production Agent feedback into preference data, evaluation sets, and training loops.
A fit for teams that need business process fluency, field integration, and LLM product engineering in one role.
Deep case studies covering architecture, collaboration, evaluation, and lessons.
Lab notes showing evaluation, data loops, and applied algorithm depth.
DossierKit favors proof through work, experiments, metrics, and lessons rather than keyword density.
Can decompose Agent platforms into runtime, tool-use, evaluation, and permission audit modules.
Experienced in turning badcases into preference data and post-training experiments.
A fit for roles requiring architecture, implementation, and business collaboration.
Contact
Best for conversations around Agent platforms, LLM application architecture, post-training data loops, and AI product delivery.