[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
-
Updated
Dec 5, 2025 - Python
[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
Implementation of TSDS: Data Selection for Task-Specific Model Finetuning. An optimal-transport framework for selecting domain-specific and task-specific training data to improve LLM finetuning and instruction tuning.
Codes and data for a published work "A method to evaluate task-specific importance of spatio-temporal units based on explainable artificial intelligence"
Add a description, image, and links to the task-specific topic page so that developers can more easily learn about it.
To associate your repository with the task-specific topic, visit your repo's landing page and select "manage topics."