We Got Claude to Fine-Tune an Open Source LLM
Source: HuggingFace Date: 2025-12-04 URL: https://huggingface.co/blog/hf-skills-training
Summary
An integration tutorial for HF Skills Training. Given a single natural-language instruction, Claude Code (or Codex or Gemini CLI) handles the entire fine-tuning pipeline: dataset validation, hardware selection, script generation, job submission, monitoring, and pushing the result to the Hub. It supports SFT, DPO, GRPO, and GGUF conversion. Training a 0.6B model costs roughly $0.30-0.75 per hour. An HF Pro, Team, or Enterprise plan is required. This post is the predecessor to the more detailed Codex-focused version.
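To put the hourly figure in context, here is a minimal arithmetic sketch that turns the quoted $0.30-0.75 per hour range into a cost range for a whole job. The function name `estimate_job_cost` and the 2-hour duration are hypothetical, chosen only for illustration. The source does not state how long a run takes, and actual billing may differ.

```python
def estimate_job_cost(hours, rate_low=0.30, rate_high=0.75):
    """Return a (low, high) USD cost range for a job of the given duration.

    The default rates are the hourly range quoted in the summary for a
    0.6B model; the duration is supplied by the caller (an assumption,
    since the source gives no run time).
    """
    if hours < 0:
        raise ValueError("hours must be non-negative")
    return (round(hours * rate_low, 2), round(hours * rate_high, 2))


# Hypothetical 2-hour run, for illustration only.
low, high = estimate_job_cost(2.0)
print(f"2-hour job: ${low:.2f} to ${high:.2f}")
```

At these rates even a multi-hour run stays in the low single digits of dollars, which suggests the plan requirement, rather than compute cost, is the larger barrier to entry.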
Implications
Thread: HF as open-source ML hub / agentic patterns. Same strategic signal as the Codex variant: HF is turning fine-tuning into a tool that AI coding agents can call. The Claude Code + HF Jobs combination is the most interesting development here: fine-tuning a domain model now takes little more than a prompt to Claude. The Trackio real-time metrics integration and GGUF conversion for local deployment complete the loop from “natural language goal” to “locally runnable model.” The HF Pro requirement is the main point of friction; watch whether HF lowers this barrier over time.