feat: Add code_x_glue code_to_code task #317
Open
Pull Request: Add CodeXGLUE Code-to-Code Translation Task with CodeBLEU Metric
Summary
This PR adds support for the CodeXGLUE Code-to-Code Translation benchmark, enabling evaluation of models on translating code between Java and C# using the CodeBLEU metric.
Motivation
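CodeBLEU is the recommended metric for code translation tasks because it goes beyond standard BLEU: in addition to n-gram overlap, it considers keyword-weighted n-gram matches, syntactic (AST) matches, and semantic (data-flow) matches.
This makes it particularly well suited to evaluating code generation and translation quality.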
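As a rough illustration of how the metric goes beyond BLEU, CodeBLEU as defined in its original paper is a weighted sum of four component scores. The sketch below shows only that final combination step. It is not the code in this PR or in the `codebleu` package; the `CodeBleuComponents` and `combine_codebleu` names are hypothetical, and the equal default weights are an assumption.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class CodeBleuComponents:
    """Hypothetical container for the four CodeBLEU component scores (each in [0, 1])."""
    ngram_match: float           # standard BLEU n-gram overlap
    weighted_ngram_match: float  # n-gram overlap with language keywords weighted higher
    syntax_match: float          # fraction of matching AST subtrees
    dataflow_match: float        # fraction of matching data-flow edges


def combine_codebleu(
    components: CodeBleuComponents,
    weights: Sequence[float] = (0.25, 0.25, 0.25, 0.25),  # assumed equal weighting
) -> float:
    """Return the weighted sum of the four component scores."""
    if len(weights) != 4:
        raise ValueError("expected exactly four weights")
    parts = (
        components.ngram_match,
        components.weighted_ngram_match,
        components.syntax_match,
        components.dataflow_match,
    )
    return sum(w * p for w, p in zip(weights, parts))


if __name__ == "__main__":
    # Illustrative scores only; these are not results from this PR.
    example = CodeBleuComponents(0.8, 0.6, 0.4, 0.2)
    print(combine_codebleu(example))
```

A translation that shares few surface tokens with the reference but preserves its syntax tree and data flow can still score well under this combination, which plain BLEU would not capture.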
Changes
New Files
bigcode_eval/tasks/codexglue_code_to_code_trans.py - Task implementation
bigcode_eval/tasks/few_shot_examples/codexglue_code_to_code_trans_few_shot_prompts.json - Few-shot examples
requirements-codebleu.txt - CodeBLEU dependency
Modified Files
bigcode_eval/tasks/__init__.py - Register the new task
requirements.txt - Add tree-sitter dependencies
README.md - Document the new task and installation instructions
New Tasks
codexglue_code_to_code_trans-java_cs
codexglue_code_to_code_trans-cs_java
Installation
Because codebleu conflicts with newer tree-sitter versions, installation takes two steps: codebleu is installed first without its dependencies (bypassing its tree-sitter>=0.22.0,<0.23.0 constraint), and then the compatible tree-sitter==0.25.2 packages required by the language parsers are installed.
Usage
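A minimal sketch of how one of the new tasks might be launched, written as a small Python helper that assembles the command line. It assumes the harness is started through `accelerate launch main.py` and accepts the `--model`, `--tasks`, `--max_length_generation`, `--n_samples`, and `--batch_size` flags; the helper names, the default values, and the `<MODEL_NAME>` placeholder are hypothetical and not taken from this PR.

```python
from typing import List

TASK_PREFIX = "codexglue_code_to_code_trans"
DIRECTIONS = ("java_cs", "cs_java")  # the two task variants added by this PR


def task_name(direction: str) -> str:
    """Build the registered task name for a translation direction."""
    if direction not in DIRECTIONS:
        raise ValueError(f"unknown direction {direction!r}; expected one of {DIRECTIONS}")
    return f"{TASK_PREFIX}-{direction}"


def build_eval_command(
    model: str,
    direction: str,
    max_length_generation: int = 512,  # assumed default, adjust as needed
    n_samples: int = 1,
    batch_size: int = 1,
) -> List[str]:
    """Return the argument list for an evaluation run (hypothetical helper)."""
    return [
        "accelerate", "launch", "main.py",
        "--model", model,
        "--tasks", task_name(direction),
        "--max_length_generation", str(max_length_generation),
        "--n_samples", str(n_samples),
        "--batch_size", str(batch_size),
    ]


if __name__ == "__main__":
    # Replace <MODEL_NAME> with the model to evaluate.
    print(" ".join(build_eval_command("<MODEL_NAME>", "java_cs")))
```

The returned list can be passed to `subprocess.run` or joined into a shell command. Swapping `"java_cs"` for `"cs_java"` selects the C#-to-Java direction.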
Dataset
References
Checklist