hugging-face-evaluation

ai-tools

Setup & Installation

npx skills add https://github.com/huggingface/hugging-face-evaluation --skill hugging-face-evaluation
Alternatively, paste the link below and ask your coding assistant to install it:
https://github.com/huggingface/hugging-face-evaluation

What This Skill Does

Adds and manages evaluation results in Hugging Face model cards using the model-index metadata format. Supports extracting benchmark tables from README files, importing scores from the Artificial Analysis API, and running evaluations with vLLM or lighteval on local GPUs or HF Jobs infrastructure. Instead of manually converting markdown tables to model-index YAML and resolving merge conflicts, this skill handles extraction, formatting, deduplication, and PR creation in a single CLI workflow.
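
To make the table-to-metadata step concrete, here is a minimal Python sketch of the kind of conversion described above. It is not the skill's own code: the function names (`parse_markdown_table`, `build_model_index`), the column headers `Benchmark` and `Score`, the `accuracy` metric type, the model name, and the scores are all hypothetical, chosen only for illustration. The output follows the general shape of model-index metadata (a named entry with a list of results, each holding a task, a dataset, and metrics).

```python
import json
import re

# Made-up README table for illustration; note the duplicated MMLU row.
SAMPLE_README_TABLE = """
| Benchmark | Score |
|-----------|-------|
| MMLU      | 71.3  |
| GSM8K     | 58.2  |
| MMLU      | 71.3  |
"""


def parse_markdown_table(text):
    """Parse a simple pipe-delimited markdown table into a list of row dicts."""
    header = None
    rows = []
    for line in text.strip().splitlines():
        line = line.strip()
        if not line.startswith("|"):
            continue  # ignore anything that is not a table row
        cells = [cell.strip() for cell in line.strip("|").split("|")]
        # Skip the separator row (cells such as --- or :---:).
        if all(re.fullmatch(r":?-+:?", cell) for cell in cells):
            continue
        if header is None:
            header = cells  # the first table row holds the column names
            continue
        rows.append(dict(zip(header, cells)))
    return rows


def build_model_index(model_name, rows, task_type="text-generation"):
    """Build a model-index style dict, keeping the first row per benchmark."""
    seen = set()
    results = []
    for row in rows:
        benchmark = row["Benchmark"]  # assumed column name
        key = benchmark.lower()
        if key in seen:
            continue  # deduplicate repeated benchmarks
        seen.add(key)
        results.append({
            "task": {"type": task_type},
            "dataset": {"name": benchmark, "type": key.replace(" ", "_")},
            "metrics": [{
                "type": "accuracy",  # assumed metric type
                "name": benchmark,
                "value": float(row["Score"]),  # assumed column name
            }],
        })
    return {"model-index": [{"name": model_name, "results": results}]}


rows = parse_markdown_table(SAMPLE_README_TABLE)
model_index = build_model_index("my-org/my-model", rows)
print(json.dumps(model_index, indent=2))
```

The sketch prints JSON to stay dependency-free; in a real model card the same structure would be serialized as YAML in the README front matter. Deduplication here keeps the first row per benchmark name, which is one possible policy and not necessarily the one the skill applies.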

When to use it

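  • Extracting benchmark tables from a README and converting them to model-index metadata
  • Importing benchmark scores from the Artificial Analysis API into a Hugging Face model card
  • Running evaluations with vLLM or lighteval on local GPUs or HF Jobs infrastructure
  • Deduplicating evaluation entries and opening a PR with the updated model card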