Setup & Installation
What This Skill Does
Adds and manages evaluation results in Hugging Face model cards using the model-index metadata format. Supports extracting benchmark tables from README files, importing scores from the Artificial Analysis API, and running evaluations with vLLM or lighteval on local GPUs or HF Jobs infrastructure. Instead of manually converting markdown tables to model-index YAML and resolving merge conflicts, this skill handles extraction, formatting, deduplication, and PR creation in a single CLI workflow.
When to use it
- Working with hugging face evaluation functionality
- Implementing hugging face evaluation features
- Debugging hugging face evaluation related issues
