SecondYearProject

This project was created in connection with our Second Year Project, "Optimizing for cross-lingual learning for multilingual language models on unseen languages of similar structures". The goal of the project is to answer the following research question: How can model adaptation be adjusted to better utilize the cross-domain knowledge obtained by multilingual LLMs during pre-training, and do these changes impact transferability to unseen languages?

Recreating model training

To recreate the model training, follow these steps:

  1. Clone the repository
  2. Create a virtual environment and install the requirements, e.g. using pip:
    pip install -r requirements.txt
  3. To train the baseline model, run:
    python3 train.py
  4. To train the other variations, pass the following arguments to the training script:
    • To add discriminative learning rates for different layers (see the sketch after these steps):
      --discriminative_lr True
    Example:
    python3 train.py --discriminative_lr True
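
The actual implementation lives in train.py; as a rough illustration of what layer-wise discriminative learning rates can look like in PyTorch, here is a minimal sketch (the model name, base learning rate, and decay factor are assumptions for illustration, not values from this project):

    # Sketch: layer-wise discriminative learning rates via optimizer parameter groups.
    # NOTE: model name, base_lr, and decay are illustrative assumptions, not this repo's values.
    import torch
    from transformers import AutoModel

    model = AutoModel.from_pretrained("bert-base-multilingual-cased")  # assumed backbone

    base_lr = 2e-5   # assumed learning rate for the top encoder layer
    decay = 0.95     # assumed multiplicative decay per layer, going toward the input

    num_layers = model.config.num_hidden_layers
    param_groups = []
    for i, layer in enumerate(model.encoder.layer):
        # Lower layers get smaller learning rates: lr = base_lr * decay^(depth from top)
        lr = base_lr * decay ** (num_layers - 1 - i)
        param_groups.append({"params": layer.parameters(), "lr": lr})

    # Embeddings sit below all encoder layers, so they get the smallest rate
    param_groups.append({"params": model.embeddings.parameters(),
                         "lr": base_lr * decay ** num_layers})

    optimizer = torch.optim.AdamW(param_groups)

The intuition is that lower layers encode more language-general features that should change less during adaptation, while the top layers are tuned more aggressively to the target task.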

Other hyperparameters can be set with the following arguments:

  • batch_size: --batch_size
  • learning_rate: --lr
  • epochs: --epochs
  • seed: --seed
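
For example, a single run combining several of these flags might look like this (the hyperparameter values are illustrative, not recommendations; the seed is one of the baseline seeds listed below):

    python3 train.py --batch_size 32 --lr 2e-5 --epochs 5 --seed 94664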

Recreating significance testing of models

All results will be saved to the eval_lists folder.
  1. To get the results for the significance testing, run:
    python3 significance_testing.py
    This will print the results of the significance testing to the terminal and save them as a LaTeX table in the results folder.
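
The actual procedure is implemented in significance_testing.py; as a rough sketch of what a significance test over per-seed scores can look like (the scores are hypothetical and the choice of test is an assumption, not necessarily what the script uses):

    # Sketch: two-sample t-test over per-seed evaluation scores.
    # NOTE: scores are hypothetical and the test choice is an assumption,
    # not necessarily what significance_testing.py implements.
    from scipy.stats import ttest_ind

    baseline_scores = [0.71, 0.69, 0.72, 0.70, 0.68]        # hypothetical
    discriminative_scores = [0.74, 0.73, 0.75, 0.72, 0.74]  # hypothetical

    # Independent samples, since the two variants were trained with different seeds
    stat, p_value = ttest_ind(baseline_scores, discriminative_scores)
    print(f"t = {stat:.3f}, p = {p_value:.4f}")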

Recreating model evaluation

To recreate the model evaluation, follow these steps:

  1. To evaluate the baseline model, run:

    python3 eval.py
  2. To evaluate the other variations, pass the following arguments to the evaluation script:

    • To evaluate the model with discriminative learning rates for different layers:
      --discriminative_lr True
    • To evaluate the model trained with a cosine learning-rate schedule:
      --cosine_schedule True

    Example:

    python3 eval.py --discriminative_lr True --cosine_schedule True

Other hyperparameters can be set with the following arguments:

  • batch_size: --batch_size
  • to_csv: --to_csv (default=True)
  • save_name: --save_name (This will override --discriminative_lr and --cosine_schedule)
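
For example, an evaluation run combining these flags might look like this (the values are illustrative):

    python3 eval.py --batch_size 64 --to_csv True --save_name baseline_run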

Seeds from our models

Baseline

  • 94664
  • 16538
  • 36677
  • 39377
  • 85712
  • 99578
  • 78252
  • 97696
  • 77020
  • 79002

Discriminative learning rate

  • 36916
  • 32320
  • 22986
  • 66448
  • 68125
  • 3837
  • 9756
  • 3168
  • 70121
  • 57808
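
To reproduce every run of a variant, the seeds above can be looped over, e.g. with a small driver script (a sketch; it simply re-invokes train.py with the --seed flag described above):

    # Sketch: launch one baseline training run per seed listed above.
    import subprocess

    baseline_seeds = [94664, 16538, 36677, 39377, 85712,
                      99578, 78252, 97696, 77020, 79002]

    for seed in baseline_seeds:
        subprocess.run(["python3", "train.py", "--seed", str(seed)], check=True)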
