Skip to content

Actions: microsoft/DeepSpeed

Build and publish DeepSpeed release

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
31 workflow runs
31 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Enabled Qwen2-MoE Tensor Parallelism (TP) inference (#6551)
Build and publish DeepSpeed release #31: Commit 474a328 pushed by jomayeri
October 9, 2024 17:46 2m 4s v0.15.2
October 9, 2024 17:46 2m 4s
Handle an edge case where CUDA_HOME is not defined on ROCm systems …
Build and publish DeepSpeed release #30: Commit 10ba3dd pushed by loadams
September 5, 2024 01:30 3m 17s v0.15.1
September 5, 2024 01:30 3m 17s
Fix torch check (#6402)
Build and publish DeepSpeed release #29: Commit 55b4cae pushed by loadams
August 22, 2024 22:46 1m 24s v0.15.0
August 22, 2024 22:46 1m 24s
Pydantic v2 migration (#5167)
Build and publish DeepSpeed release #28: Commit 0a4457c pushed by loadams
August 22, 2024 22:39 1m 1s v0.15.0
August 22, 2024 22:39 1m 1s
Allow accelerator to instantiate the device (#5255)
Build and publish DeepSpeed release #27: Commit eb07d41 pushed by loadams
August 15, 2024 18:04 1m 51s v0.14.5
August 15, 2024 18:04 1m 51s
[XPU] support op builder from intel_extension_for_pytorch kernel path…
Build and publish DeepSpeed release #26: Commit d254d75 pushed by loadams
June 21, 2024 17:33 1m 41s v0.14.4
June 21, 2024 17:33 1m 41s
Monitor was always enabled causing performance degradation (#5633)
Build and publish DeepSpeed release #25: Commit 54f98fd pushed by mrwyattii
June 12, 2024 18:14 1m 49s v0.14.3
June 12, 2024 18:14 1m 49s
Update PyTest torch version to match PyTorch latest official (2.3.0) …
Build and publish DeepSpeed release #24: Commit 5f631ab pushed by loadams
April 23, 2024 23:25 1m 35s v0.14.2
April 23, 2024 23:25 1m 35s
Fix the FP6 kernels compilation problem on non-Ampere GPUs. (#5333)
Build and publish DeepSpeed release #23: Commit e3d873a pushed by loadams
April 15, 2024 19:51 1m 21s v0.14.1
April 15, 2024 19:51 1m 21s
Update version.txt
Build and publish DeepSpeed release #22: Commit ce78a63 pushed by loadams
March 8, 2024 01:10 1m 16s v0.14.0
March 8, 2024 01:10 1m 16s
FP6 blog (#5235)
Build and publish DeepSpeed release #21: Commit 0a979f8 pushed by loadams
March 8, 2024 01:02 1m 17s v0.14.0
March 8, 2024 01:02 1m 17s
fix fused_qkv model accuracy issue (#5217)
Build and publish DeepSpeed release #20: Commit bc0d246 pushed by mrwyattii
March 6, 2024 01:16 1m 5s v0.13.5
March 6, 2024 01:16 1m 5s
Add script to check for --extra-index-url (#5184)
Build and publish DeepSpeed release #19: Commit 5115df3 pushed by lekurile
February 26, 2024 23:52 1m 36s v0.13.4
February 26, 2024 23:52 1m 36s
Switch cpu-inference workflow from --extra-index-url to --index-url (…
Build and publish DeepSpeed release #18: Commit afdf028 pushed by mrwyattii
February 23, 2024 22:58 3m 47s v0.13.3
February 23, 2024 22:58 3m 47s
Remove optimizer step on initialization (#5104)
Build and publish DeepSpeed release #17: Commit 1817980 pushed by mrwyattii
February 12, 2024 17:19 1m 12s v0.13.2
February 12, 2024 17:19 1m 12s
Refactor the Qwen positional emebdding config code (#4955)
Build and publish DeepSpeed release #16: Commit 1d35db7 pushed by mrwyattii
January 23, 2024 23:01 1m 17s v0.13.1
January 23, 2024 23:01 1m 17s
Update release.yml
Build and publish DeepSpeed release #15: Commit 1c8b8f3 pushed by mrwyattii
January 19, 2024 23:25 2m 34s v0.13.0
January 19, 2024 23:25 2m 34s
Update index.md
Build and publish DeepSpeed release #14: Commit 9144b17 pushed by mrwyattii
January 19, 2024 23:16 1m 23s v0.13.0
January 19, 2024 23:16 1m 23s
Update README.md
Build and publish DeepSpeed release #13: Commit 1ac843a pushed by mrwyattii
January 19, 2024 23:04 1m 26s v0.13.0
January 19, 2024 23:04 1m 26s
Mixtral FastGen Support (#4828)
Build and publish DeepSpeed release #12: Commit c00388a pushed by mrwyattii
December 21, 2023 00:46 1m 22s v0.12.6
December 21, 2023 00:46 1m 22s
Fix 4649 (#4650)
Build and publish DeepSpeed release #11: Commit 65b7727 pushed by mrwyattii
December 16, 2023 01:00 1m 22s v0.12.5
December 16, 2023 01:00 1m 22s
Add safetensors support (#4659)
Build and publish DeepSpeed release #10: Commit 7122362 pushed by mrwyattii
December 1, 2023 19:32 1m 33s v0.12.4
December 1, 2023 19:32 1m 33s
fix num_kv_heads sharding in autoTP for the new in-repo Falcon-40B (#…
Build and publish DeepSpeed release #9: Commit 6ea44d0 pushed by lekurile
November 13, 2023 17:51 2m 37s v0.12.3
November 13, 2023 17:51 2m 37s
allow cuda mismatch exceptions to be triggered
Build and publish DeepSpeed release #8: Commit 4f7dd72 pushed by jeffra
November 4, 2023 04:08 1m 20s v0.12.2
November 4, 2023 04:08 1m 20s
Update minor CUDA version compatibility. (#4613)
Build and publish DeepSpeed release #7: Commit 3437a5b pushed by jeffra
November 4, 2023 03:24 1m 32s v0.12.1
November 4, 2023 03:24 1m 32s