-
Notifications
You must be signed in to change notification settings - Fork 112
Open
Description
Data preparation involves downloading reddit comment and submission data form https://files.pushshift.io/reddit/ and it is written that total data is around 700GB. However, the actual size of the data is around ~2TB, for training GODEL unitl which YYYY-MM reddit data you've used?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels