optimizer 1d -- EMA in place (fbgemm part) #3402

Open
wants to merge 2 commits into base: main

zhihao-cen

Summary:
Implement the ema_within_table_rowwise_adagrad optimizer.

        Emb_table:
         -------------------------------------------------
         -                        --                     -
         -        Fast part       --      Slow part      -
         -    (RL) main part      --      target part    -
         -                        --                     -
         -------------------------------------------------

         In every "step_ema" step, we perform
            slow_part += coef_ema * (fast_part - slow_part)

Its main use is to maintain the target network in reinforcement learning: the slow part tracks the fast part as an exponential moving average (EMA).
Design doc: https://fburl.com/gdoc/qyfv7tyi

Differential Revision: D66015331
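
For readers without access to the design doc, the update rule in the summary can be sketched in plain Python. This is only an illustration, not the FBGEMM kernel: the function name `ema_within_table_step`, the equal-width `[fast | slow]` row layout, the `fast_dim` parameter, and the reading of "every step_ema step" as `step % step_ema == 0` are all assumptions made for the example.

```python
from typing import List


def ema_within_table_step(
    table: List[List[float]],
    fast_dim: int,
    coef_ema: float,
    step: int,
    step_ema: int,
) -> None:
    """Hypothetical helper: in-place EMA of the slow half toward the fast half.

    Assumes each row is laid out as [fast_part | slow_part], both of width
    `fast_dim`, and that the EMA fires when `step` is a multiple of `step_ema`.
    """
    if step % step_ema != 0:
        return  # not an EMA step; leave the slow part untouched
    for row in table:
        for j in range(fast_dim):
            fast = row[j]
            slow = row[fast_dim + j]
            # slow_part += coef_ema * (fast_part - slow_part)
            row[fast_dim + j] = slow + coef_ema * (fast - slow)


# One row: fast part [1.0, 2.0], slow part [0.0, 0.0].
table = [[1.0, 2.0, 0.0, 0.0]]
ema_within_table_step(table, fast_dim=2, coef_ema=0.5, step=10, step_ema=5)
print(table)  # slow half has moved halfway toward the fast half
```

The sketch updates every row for simplicity; it does not model the rowwise Adagrad update of the fast part or which rows the real kernel touches.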

@facebook-github-bot
Contributor

This pull request was exported from Phabricator. Differential Revision: D66015331

netlify bot commented Nov 21, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | 0ed0365 |
| 🔍 Latest deploy log | https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/67412cc7d8c30000081acf77 |
| 😎 Deploy Preview | https://deploy-preview-3402--pytorch-fbgemm-docs.netlify.app |

zhihao-cen pushed a commit to zhihao-cen/FBGEMM that referenced this pull request Nov 21, 2024
Summary:
X-link: facebookresearch/FBGEMM#490


Implement the ema_within_table_rowwise_adagrad
```
        Emb_table:
         -------------------------------------------------
         -                        --                     -
         -        Fast part       --      Slow part      -
         -    (RL) main part      --      target part    -
         -                        --                     -
         -------------------------------------------------

         In every "step_ema" step, we perform
            slow_part += coef_ema * (fast_part - slow_part)
```

It mainly serves the target network purpose in the reinforcement learning framework.
Design doc https://fburl.com/gdoc/qyfv7tyi

Differential Revision: D66015331

zhihao-cen pushed a commit to zhihao-cen/FBGEMM that referenced this pull request Nov 22, 2024

Supadchaya Puangpontip and others added 2 commits November 22, 2024 12:34
Differential Revision: D66346179
Summary:
X-link: facebookresearch/FBGEMM#490

Pull Request resolved: pytorch#3402

Differential Revision: D66015331