Actions: microsoft/DeepSpeed

nv-torch-latest-v100

5,022 workflow runs

nv-torch-latest-v100
nv-torch-latest-v100 #12743: Scheduled
December 26, 2024 00:20 1h 24m 46s master
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-torch-latest-v100 #12739: Pull request #6909 synchronize by hj-wei
December 25, 2024 02:18 Action required hj-wei:dev_hjwei
Add the missing view operations from sequence parallel(async).
nv-torch-latest-v100 #12738: Pull request #6750 synchronize by inkcherry
December 25, 2024 01:50 Action required inkcherry:ds_overlap_fix
nv-torch-latest-v100
nv-torch-latest-v100 #12737: Scheduled
December 25, 2024 00:20 1h 34m 19s master
[BUG FIX]:fix get torch.version.cuda error when cuda is None in rocm
nv-torch-latest-v100 #12736: Pull request #6909 opened by hj-wei
December 24, 2024 07:38 Action required hj-wei:dev_hjwei
[inf] Add config var to enable keeping module on host
nv-torch-latest-v100 #12735: Pull request #6846 synchronize by oelayan7
December 24, 2024 06:49 6h 0m 24s oelayan7:keep_module_on_host
nv-torch-latest-v100
nv-torch-latest-v100 #12734: Scheduled
December 24, 2024 00:20 1h 29m 22s master
Tecorigin sdaa accelerator
nv-torch-latest-v100 #12733: Pull request #6903 synchronize by tjruwase
December 23, 2024 23:13 Action required siqi654321:Tecorigin-SDAA-accelerator
Tecorigin sdaa accelerator
nv-torch-latest-v100 #12730: Pull request #6903 opened by siqi654321
December 23, 2024 02:21 1h 31m 52s siqi654321:Tecorigin-SDAA-accelerator
nv-torch-latest-v100
nv-torch-latest-v100 #12729: Scheduled
December 23, 2024 00:21 1h 32m 30s master
nv-torch-latest-v100
nv-torch-latest-v100 #12728: Scheduled
December 22, 2024 00:22 1h 31m 0s master
nv-torch-latest-v100
nv-torch-latest-v100 #12727: Scheduled
December 21, 2024 00:20 1h 37m 34s master
Adds ignore_index to sequence parallel cross entropy
nv-torch-latest-v100 #12726: Pull request #6882 synchronize by hwchen2017
December 20, 2024 20:06 1h 34m 11s ronald-d-rogers:add-ignore-index-sp-loss
Stage3: Use new torch grad accumulation hooks API
nv-torch-latest-v100 #12725: Pull request #6773 synchronize by tohtana
December 20, 2024 18:16 6h 0m 25s deepcharm:stage3-use-new-grad-acc-api
Fix error caused by all_reduce call in domino
nv-torch-latest-v100 #12723: Pull request #6880 synchronize by tjruwase
December 20, 2024 02:22 1h 31m 15s hongwei/fix_domino_allreduce
Change compile for pipeline module torch.compile
nv-torch-latest-v100 #12722: Pull request #6478 synchronize by loadams
December 20, 2024 00:56 3h 12m 47s NirSonnenschein:torch_compile_micro_offset_fix
Fix checkpointable_layers Logic
nv-torch-latest-v100 #12721: Pull request #6881 synchronize by loadams
December 20, 2024 00:55 6h 30m 58s Quentin-Anthony:qanthony/fix-act-recomp
Stage3: Use new torch grad accumulation hooks API
nv-torch-latest-v100 #12720: Pull request #6773 synchronize by loadams
December 20, 2024 00:55 6h 12m 2s deepcharm:stage3-use-new-grad-acc-api
nv-torch-latest-v100
nv-torch-latest-v100 #12719: Scheduled
December 20, 2024 00:20 1h 49m 31s master
Fix error caused by all_reduce call in domino
nv-torch-latest-v100 #12718: Pull request #6880 synchronize by loadams
December 19, 2024 23:23 1h 29m 51s hongwei/fix_domino_allreduce
Fix checkpointable_layers Logic
nv-torch-latest-v100 #12717: Pull request #6881 synchronize by loadams
December 19, 2024 20:32 4h 21m 9s Quentin-Anthony:qanthony/fix-act-recomp