Hugging Face Evaluation Manager

Development Tools

The Hugging Face Evaluation Manager skill provides comprehensive tools for managing evaluation results in Hugging Face model cards. It supports extracting evaluation tables from README content, importing benchmark scores from Artificial Analysis API, and running custom model evaluations with vLLM or lighteval. Works with model-index metadata format for leaderboard integration and supports both inference providers and local GPU evaluation.

Hugging FaceModel EvaluationBenchmarksMLAI

4.9rating

3reviews

1.4kdownloads

Trending

Quick Info

AuthorCommunity

Version1.3.0

Last Updated2024-11-01

View Documentation GitHub Repository

Key Features

Extract eval tables from README

Import from Artificial Analysis

Run custom model evaluations

vLLM and lighteval integration

Model-index metadata management

GPU inference support

Pull request automation

Batch evaluation operations

Ready to Get Started?

Explore the documentation and start integrating this skill into your projects today.

Read Documentation Browse More Skills