LLaDA-8B Fine-Tuning

This repository contains code and sample data for supervised fine-tuning of the LLaDA-8B model. LLaDA (Large Language Diffusion with mAsking) is a diffusion-based language model that offers an alternative to traditional autoregressive models.

Repository Structure

  • sft_data/conversations.json: Sample conversation data for fine-tuning
  • preprocess_sft_data.py: Script to preprocess the conversation data
  • finetune_llada.py: Main script for fine-tuning LLaDA
  • inference_example.py: Script to test the fine-tuned model
  • run_fine_tuning.sh: Shell script to run the entire fine-tuning pipeline

How to Use

  1. Prepare Your Data:

    • Place your conversation data in the sft_data/conversations.json file
    • The data should follow the format in the sample file (a hypothetical record layout is sketched after this list)
  2. Run the Fine-Tuning Pipeline:

    chmod +x run_fine_tuning.sh
    ./run_fine_tuning.sh
  3. Customize Fine-Tuning Parameters:

    • Edit run_fine_tuning.sh to adjust parameters such as the model name, batch size, and learning rate
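
For reference, the snippet below shows one plausible shape for a multi-turn record. The field names are an assumption made for illustration, so treat the shipped sft_data/conversations.json as the source of truth for the actual schema.

    import json

    # Hypothetical record layout -- check sft_data/conversations.json for the real keys.
    example = [
        {
            "conversations": [
                {"role": "user", "content": "What is LLaDA?"},
                {"role": "assistant", "content": "A diffusion-based language model."},
                {"role": "user", "content": "How is it fine-tuned?"},
                {"role": "assistant", "content": "With masked prompt-response pairs."},
            ]
        }
    ]
    print(json.dumps(example, indent=2))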

Fine-Tuning Process

The fine-tuning process follows the guidelines from the LLaDA paper:

  1. Data Preprocessing:

    • Format data as prompt-response pairs
    • Handle multi-turn dialogues
    • Pad sequences with EOS tokens so all examples share the same length (see the first sketch after this list)
  2. Forward Process:

    • Apply noise only to the response part
    • Keep the prompt unchanged (the masking and the loss are shown in the second sketch after this list)
  3. Loss Calculation:

    • Calculate loss only on masked tokens in the response
    • Normalize by the answer length rather than the full sequence length
  4. Sampling Strategies:

    • Semi-autoregressive sampling with low-confidence remasking
    • Divide the generation into blocks that are filled in left to right for finer control (see the third sketch after this list)
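
To make the steps concrete, the three sketches below walk through the pipeline in order. First, step 1: a minimal padding helper, assuming examples have already been tokenized into prompt and response id lists. The field names (prompt_ids, response_ids, and so on) are illustrative rather than the schema used by preprocess_sft_data.py, and counting the EOS padding toward the answer length is one reasonable convention for teaching the model when to stop.

    def pad_batch(examples, eos_id):
        """Pad tokenized prompt+response pairs with EOS so the batch has equal lengths.

        Each example is a dict with "prompt_ids" and "response_ids" (lists of token ids);
        the names are illustrative, not the preprocess_sft_data.py schema.
        """
        max_len = max(len(e["prompt_ids"]) + len(e["response_ids"]) for e in examples)
        batch = []
        for e in examples:
            ids = e["prompt_ids"] + e["response_ids"]
            pad = max_len - len(ids)
            batch.append({
                "input_ids": ids + [eos_id] * pad,               # EOS padding to equal length
                "prompt_length": len(e["prompt_ids"]),
                "answer_length": len(e["response_ids"]) + pad,   # padding counted as response
            })
        return batch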
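
Next, steps 2 and 3: a PyTorch sketch of the forward (masking) process and the supervised fine-tuning loss, assuming a Hugging Face-style model whose forward call returns logits and batches that carry per-example prompt and answer lengths. The function names are illustrative rather than the actual API of finetune_llada.py; 126336 is the mask token id used by the official LLaDA release.

    import torch
    import torch.nn.functional as F

    MASK_ID = 126336  # reserved mask token id of the official LLaDA release

    def forward_process(input_ids, prompt_lengths, mask_id=MASK_ID):
        """Mask response tokens with a per-sequence ratio t ~ U(0, 1); prompts stay intact."""
        b, l = input_ids.shape
        # Sample one masking ratio per sequence; clamp away from 0 to keep 1/t finite.
        t = torch.rand(b, device=input_ids.device).clamp_(min=1e-3)
        masked = torch.rand(b, l, device=input_ids.device) < t[:, None]
        # Never mask the prompt: only positions at or after prompt_lengths are eligible.
        is_response = torch.arange(l, device=input_ids.device)[None, :] >= prompt_lengths[:, None]
        masked = masked & is_response
        noisy_ids = torch.where(masked, torch.full_like(input_ids, mask_id), input_ids)
        return noisy_ids, masked, t

    def sft_loss(model, input_ids, prompt_lengths, answer_lengths):
        """Cross-entropy on masked response tokens only, 1/t-weighted, normalized by answer length."""
        noisy_ids, masked, t = forward_process(input_ids, prompt_lengths)
        logits = model(input_ids=noisy_ids).logits                                 # (B, L, V)
        ce = F.cross_entropy(logits.transpose(1, 2), input_ids, reduction="none")  # (B, L)
        per_token = ce * masked / t[:, None]                      # loss only where masked
        return (per_token.sum(dim=1) / answer_lengths).mean()     # normalize by answer length

The 1/t weighting compensates for how many response tokens are actually supervised at a given masking ratio: when t is small, few tokens are masked, so each one carries more weight.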
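
Finally, step 4: semi-autoregressive, block-by-block decoding with low-confidence remasking for a batch of one, assuming gen_length is a multiple of block_length. Again, this illustrates the general strategy rather than the exact loop in inference_example.py.

    import torch
    import torch.nn.functional as F

    MASK_ID = 126336  # reserved mask token id of the official LLaDA release

    @torch.no_grad()
    def generate(model, prompt_ids, gen_length=128, block_length=32, steps_per_block=32,
                 mask_id=MASK_ID):
        """Semi-autoregressive block sampling with low-confidence remasking (sketch)."""
        device = prompt_ids.device
        prompt_len = prompt_ids.shape[1]
        # Start from the prompt followed by a fully masked response.
        x = torch.cat([prompt_ids,
                       torch.full((1, gen_length), mask_id, dtype=torch.long, device=device)], dim=1)

        for block_start in range(prompt_len, prompt_len + gen_length, block_length):
            block_end = block_start + block_length
            for step in range(steps_per_block):
                logits = model(input_ids=x).logits                                  # (1, L, V)
                confidence, predictions = F.softmax(logits, dim=-1).max(dim=-1)

                candidates = (x == mask_id)          # still-masked positions
                candidates[:, block_end:] = False    # blocks to the right stay masked for now
                n_left = int(candidates.sum())
                if n_left == 0:
                    break

                # Commit roughly an equal share of the block each step, keeping the
                # highest-confidence predictions and remasking the low-confidence rest.
                n_keep = max(1, n_left // (steps_per_block - step))
                confidence = confidence.masked_fill(~candidates, float("-inf"))
                keep = confidence.topk(n_keep, dim=-1).indices
                x.scatter_(1, keep, predictions.gather(1, keep))

        return x[:, prompt_len:]

Smaller blocks make decoding behave more autoregressively, while a single block spanning the whole gen_length recovers fully parallel diffusion sampling.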

Requirements

  • PyTorch
  • Transformers (version 4.38.2 or later)
  • CUDA-capable GPU (recommended)

Reference

For more details on LLaDA, refer to the original paper and the official repository.
