Top Agent Skills

Hugging Face Evaluation Manager

Development Tools

The Hugging Face Evaluation Manager skill provides comprehensive tools for managing evaluation results in Hugging Face model cards. It supports extracting evaluation tables from README content, importing benchmark scores from Artificial Analysis API, and running custom model evaluations with vLLM or lighteval. Works with model-index metadata format for leaderboard integration and supports both inference providers and local GPU evaluation.

Hugging FaceModel EvaluationBenchmarksMLAI
4.9rating
3reviews
1.4kdownloads
Trending
Quick Info
AuthorCommunity
Version1.3.0
Last Updated2024-11-01
Key Features
Extract eval tables from README
Import from Artificial Analysis
Run custom model evaluations
vLLM and lighteval integration
Model-index metadata management
GPU inference support
Pull request automation
Batch evaluation operations

Ready to Get Started?

Explore the documentation and start integrating this skill into your projects today.