-
Notifications
You must be signed in to change notification settings - Fork 1
feat: add lighteval support #23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
timurcarstensen
merged 41 commits into
main
from
codex/add-oellm-multilingual-task-group
Nov 10, 2025
Merged
Changes from all commits
Commits
Show all changes
41 commits
Select commit
Hold shift + click to select a range
3a3d6b7
Use lm-eval harness for INCLUDE and global MMLU
timurcarstensen a8104fc
Remove mypy pre-commit hook
timurcarstensen 3e9a6b6
chore: remove tests
timurcarstensen 1a0b16a
fix: lighteval integration
timurcarstensen f9c5bce
fix: lumi paths
timurcarstensen 64287d4
fix: faster compression
timurcarstensen 2674439
fix: faster compression
timurcarstensen 10d4217
chore: remove unnecessary files
timurcarstensen e2c866a
fix: ruff formatting target version
timurcarstensen 20f04e9
chore: restructure task-groups into groups and super-groups
timurcarstensen 73e2377
feat: task-cache prototype
timurcarstensen f831fbc
feat: task super groups
timurcarstensen 5fe62ee
task cache fix
timurcarstensen e816bfd
fix: task cache; moving data files to oellm/resources
timurcarstensen a97d92d
Update README.md
timurcarstensen c9db766
misc
timurcarstensen 34d7723
Merge branch 'codex/add-oellm-multilingual-task-group' of https://git…
timurcarstensen 10b26ff
temporarily adding AGENTS>md for development
timurcarstensen e8e3b38
fix: task caching for lighteval
timurcarstensen d8c8ed5
fix
timurcarstensen d37b532
fix: compression algorithm
timurcarstensen 79ace47
fix: updated apptainer definitions to include correct uv install
timurcarstensen 13e985c
fix: lighteval cli args
timurcarstensen c9160d5
feat: wrapper to suppress tqdm output
timurcarstensen ccf4c5a
misc
timurcarstensen 97b3d69
fix: lighteval tool python version
timurcarstensen 541d387
nltk setup
timurcarstensen 006ab8d
nltk setup
timurcarstensen 15bea15
fix: downloading nltk data for lighteval during container setup
timurcarstensen 9c97d25
suppressing all tqdm progress bars
timurcarstensen f11d4a4
lighteval fixes
timurcarstensen 096cbc0
misc
timurcarstensen 6e888d7
feat: aya-expanse tasks
timurcarstensen 9d87217
chore: schedule-eval logic cleanup
timurcarstensen 4f9f8a8
feat: adding spinners
timurcarstensen fe067fa
chore: making pre-commit happy
timurcarstensen f552c96
misc
timurcarstensen 9bbf5c1
fix: restrict model parallel
timurcarstensen 1b81460
fix: result collection
timurcarstensen c3e0b41
fix: leonardo directory
timurcarstensen d510921
Merge branch 'main' into codex/add-oellm-multilingual-task-group
timurcarstensen File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -13,3 +13,4 @@ | |
| **/*.egg-info | ||
| **/*.csv | ||
| **/uv.lock | ||
| **/task_map_cache.json | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,5 @@ | ||
| Rules: | ||
| - no try...Except unless absolutely necessary | ||
| - no unnecessary comments | ||
| - don't worry about tests | ||
| - if you need to run stuff, assume there is a .venv at the root of the project. you can also just use uv |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -2,24 +2,37 @@ Bootstrap: docker | |
| From: nvcr.io/nvidia/pytorch:25.06-py3 | ||
|
|
||
| %labels | ||
| Author multi-cluster-eval | ||
| Description Apptainer image for JURECA cluster (converted from dockerfile) | ||
| Author oellm-cli | ||
| Description Apptainer image for JURECA JSC cluster | ||
|
|
||
| %post | ||
| # 1. Install uv package manager | ||
| curl -LsSf https://astral.sh/uv/install.sh | sh | ||
| echo 'export PATH=$HOME/.local/bin:$PATH' >> /etc/profile | ||
| # Install uv into a global bin | ||
| curl -LsSf https://astral.sh/uv/install.sh | env UV_INSTALL_DIR=/usr/local/bin sh | ||
|
|
||
| # Make uv visible for subsequent commands during build | ||
| export PATH=/root/.local/bin:$PATH | ||
| # Put uv-installed tool shims in a global bin too | ||
| export UV_TOOL_BIN_DIR=/usr/local/bin | ||
| uv --version | ||
|
|
||
| # 2. Install Python dependencies | ||
| uv pip install --system --break-system-packages lm-eval \ | ||
| "transformers<=4.53.0" "datasets<4.0.0" wandb sentencepiece tiktoken accelerate | ||
|
|
||
| # Optional: keep tool envs under /opt to avoid $HOME | ||
| export UV_TOOL_DIR=/opt/uv-tools | ||
| uv tool install --python 3.12 "lighteval[multilingual] @ git+https://github.com/huggingface/lighteval.git@63424f4e795ecc577b90646381b374af3a627978" | ||
|
Collaborator
Author
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. all this is to make lighteval happy |
||
| uv pip install --system --break-system-packages nltk | ||
| mkdir -p /opt/nltk_data | ||
| python - <<'PY' | ||
| import nltk | ||
| nltk.download('punkt', download_dir='/opt/nltk_data') | ||
| nltk.download('punkt_tab', download_dir='/opt/nltk_data') | ||
| PY | ||
|
|
||
| %environment | ||
| # Ensure uv is present inside the container runtime as well | ||
| export PATH=/root/.local/bin:$PATH | ||
| export PATH=/usr/local/bin:$PATH | ||
| export UV_TOOL_BIN_DIR=/usr/local/bin | ||
| export UV_TOOL_DIR=/opt/uv-tools | ||
| export NLTK_DATA=/opt/nltk_data | ||
|
|
||
|
|
||
| %runscript | ||
| exec bash "$@" | ||
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
explanation: trades off space vs aws ec2 time