Argilla 2.4: Easily Build Fine-Tuning and Evaluation Datasets on the Hub — No Code Required
Source: Hugging Face
Date: 2024-11-04
URL: https://huggingface.co/blog/argilla-ui-hub
Summary
Library update: Argilla 2.4 adds no-code dataset creation directly from the Hugging Face Hub. Users can import any of the 230k+ public Hub datasets, define annotation questions, and collect human feedback, all from the Argilla UI. Argilla can be deployed on HF Spaces with OAuth sign-in so that community members can contribute annotations. Auto-configuration suggests question types based on the dataset’s features. Import currently works only with public datasets; private dataset support has been requested.
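To make the auto-configuration idea concrete, here is a minimal sketch of how question types might be suggested from dataset features. It is an illustration only: the `SuggestedSettings` class, the `suggest_settings` function, the simplified feature format, and the mapping rules are all hypothetical and are not taken from Argilla's implementation or API.

```python
from dataclasses import dataclass, field


@dataclass
class SuggestedSettings:
    """Hypothetical container for an auto-suggested annotation setup."""
    fields: list = field(default_factory=list)     # columns shown to annotators
    questions: list = field(default_factory=list)  # (column, question type, options)
    metadata: list = field(default_factory=list)   # columns kept only for filtering


def suggest_settings(features: dict) -> SuggestedSettings:
    """Map simplified dataset features to a suggested annotation setup.

    `features` maps a column name either to a dtype string such as "string"
    or "int64", or to a list of class names for a categorical column.
    The rules below are assumptions made for illustration.
    """
    suggestion = SuggestedSettings()
    for name, spec in features.items():
        if isinstance(spec, list):
            # Categorical column: suggest a single-label question with its classes.
            suggestion.questions.append((name, "label", list(spec)))
        elif spec == "string":
            # Free text: show it to annotators as a field.
            suggestion.fields.append(name)
        else:
            # Anything else (numbers, booleans, ...) is kept as metadata.
            suggestion.metadata.append(name)
    return suggestion


# Example: a small sentiment-style dataset description (made up for this sketch).
features = {"text": "string", "label": ["negative", "positive"], "length": "int64"}
suggestion = suggest_settings(features)
print(suggestion.fields)
print(suggestion.questions)
print(suggestion.metadata)
```

In the no-code flow described above, a suggestion like this would only be a starting point; the user can still adjust the questions before collecting feedback.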
Implications
Thread: HF as open-source ML hub. Argilla’s no-code integration brings the barrier to dataset annotation close to zero for domain experts who can’t write Python. Combining HF Spaces with OAuth makes community annotation workflows simple to set up. The restriction to public Hub datasets is a meaningful constraint for enterprise use, but the pattern (annotate Hub data, then push fine-tuning datasets back to the Hub) tightens the data flywheel. Watch whether Argilla’s tight HF integration makes it the default annotation layer for community model fine-tuning campaigns.