A user-friendly Streamlit UI for running various lm_eval-supported benchmarks on large language models and comparing the results with one another.
Supported Benchmarks:
- gpqa_diamond_zeroshot
- gsm8k
- winogrande
- arc_challenge
- hellaswag
- truthfulqa_mc2
- mmlu
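The task names above are lm_eval (EleutherAI's lm-evaluation-harness) task identifiers. As a rough illustration of what the app drives under the hood, here is a minimal sketch that runs one benchmark through the harness's `simple_evaluate` Python API; the model ID and sample limit are placeholders, and the actual app may invoke the harness differently:

```python
# Illustrative sketch only: runs a single benchmark via the lm-evaluation-harness Python API.
# "EleutherAI/pythia-160m" is a placeholder; substitute any Hugging Face model you want to test.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",                                   # Hugging Face backend
    model_args="pretrained=EleutherAI/pythia-160m",
    tasks=["gsm8k"],                              # any task name from the list above
    limit=20,                                     # small subset for a quick smoke test
)
print(results["results"]["gsm8k"])                # per-task metrics for the selected benchmark
```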
Clone the repo:
git clone https://github.com/TeichAI/Model-Benchmark-Suite.git
cd Model-Benchmark-Suite
Install the dependencies and start the app:
pip install -r requirements.txt
streamlit run app.py
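For orientation, the sketch below shows one hypothetical way a Streamlit front end can wrap lm_eval. It is not the repository's actual app.py; the widget labels, the benchmark selector, and the placeholder model ID are illustrative assumptions only:

```python
# minimal_app.py -- hedged sketch of a Streamlit wrapper around lm_eval (not the repo's app.py).
import streamlit as st
import lm_eval  # EleutherAI lm-evaluation-harness

BENCHMARKS = [
    "gpqa_diamond_zeroshot", "gsm8k", "winogrande",
    "arc_challenge", "hellaswag", "truthfulqa_mc2", "mmlu",
]

st.title("Model Benchmark Suite")
model_id = st.text_input("Hugging Face model ID", "EleutherAI/pythia-160m")  # placeholder default
task = st.selectbox("Benchmark", BENCHMARKS)
limit = st.number_input("Sample limit (0 = full set)", min_value=0, value=50)

if st.button("Run benchmark"):
    with st.spinner(f"Running {task} on {model_id}..."):
        results = lm_eval.simple_evaluate(
            model="hf",
            model_args=f"pretrained={model_id}",
            tasks=[task],
            limit=limit or None,   # 0 means evaluate the full test set
        )
    # results["results"] maps each task name to its metric dictionary
    st.json(results["results"][task])
```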