AdaScale SGD: A User-Friendly Algorithm for Distributed Training

## 一言でいうと

分散学習時の学習率チューニングを不要にするようなSGDの拡張

### 論文リンク
https://arxiv.org/abs/2007.05105

### 著者/所属機関
Tyler B. Johnson, Pulkit Agrawal, Haijie Gu, Carlos Guestrin (Apple)

### 投稿日付(yyyy/MM/dd)
2020/07/09

## 概要

<img width="987" alt="Screen Shot 2021-01-04 at 14 00 54" src="https://user-images.githubusercontent.com/10952293/103502698-69f9bc00-4e95-11eb-9d58-d438ac6bd5bd.png">


## 新規性・差分

既存のスケジューリングルールであるIdentity scaling ruleとlinear scaling ruleを適応的にした．
## 手法

<img width="992" alt="Screen Shot 2021-01-04 at 14 01 01" src="https://user-images.githubusercontent.com/10952293/103502704-6ebe7000-4e95-11eb-9003-8f10434b708b.png">


## 結果

<img width="838" alt="Screen Shot 2021-01-04 at 14 01 34" src="https://user-images.githubusercontent.com/10952293/103502706-7251f700-4e95-11eb-895e-8690b675f3d7.png">

<img width="839" alt="Screen Shot 2021-01-04 at 14 01 44" src="https://user-images.githubusercontent.com/10952293/103502712-75e57e00-4e95-11eb-8109-d16a11cbecb2.png">


## コメント


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AdaScale SGD: A User-Friendly Algorithm for Distributed Training #13

一言でいうと

論文リンク

著者/所属機関

投稿日付(yyyy/MM/dd)

概要

新規性・差分

手法

結果

コメント

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

AdaScale SGD: A User-Friendly Algorithm for Distributed Training #13

Description

一言でいうと

論文リンク

著者/所属機関

投稿日付(yyyy/MM/dd)

概要

新規性・差分

手法

結果

コメント

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions