-
Notifications
You must be signed in to change notification settings - Fork 8
Open
Description
Hi Phil, I tested it in my private project 2 days ago, and it seems to speed up learning quite significantly, not sure that final val/train losses are better, more like very similar to original but it got there much faster. Also i did not do different tasks/architects to compare, but my project contains few different nets one including tiny transformer, another using RNN cells and last one simple shallow convolutions
lucidrains
Metadata
Metadata
Assignees
Labels
No labels