Train dataset

![Image](https://github.com/user-attachments/assets/c50990be-faed-4c1d-9ac4-4a44bb058af8)
thanks for your great work. i have some question about training dataset.
1. Are these datasets (as shown in the summary image) simply concatenated into a single file (viscot_mixed_2m.json) and then directly used for the second stage of training ?
2. The Visual CoT dataset contains 438k samples，which utilized in any time?

![Image](https://github.com/user-attachments/assets/e13eae11-86c8-4ca1-a115-c77b64fa80a7)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train dataset #28

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Train dataset #28

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions