
thanks for your great work. i have some question about training dataset.
- Are these datasets (as shown in the summary image) simply concatenated into a single file (viscot_mixed_2m.json) and then directly used for the second stage of training ?
- The Visual CoT dataset contains 438k samples,which utilized in any time?

thanks for your great work. i have some question about training dataset.