v0.17.2 Patch Release
What's Changed
- Update version after 0.17.1 release by @loadams in #7345
- s/UlyssesPlus/Arctic Long Sequence Training (ALST)/ by @stas00 in #7348
- Don't break set_start_method by @tjruwase in #7349
- Fix error of <glog/logging.h> by @Freed-Wu in #7351
- Improve padding util for compile by @tohtana in #7355
- Fix 404s by @tjruwase in #7363
- Fix tutorial title by @stas00 in #7365
- Restore real inputs for recompilation by @tohtana in #7356
- Fix(scheduler): WarmupLR inherits optimizer lr when not specified by @Flink-ddd in #7360
- sequence parallel default dtype by @stas00 in #7364
- Enable torch.autocast with ZeRO by @tohtana in #6993
- add Arctic Long Sequence Training paper reference by @stas00 in #7372
- Flops profiler support for F.interpolate by @sfc-gh-truwase in #7353
- Relax tolerances for FP8 unit test only for ROCm + FP16 by @rraminen in #7373
- Update latest news with DeepNVMe by @loadams in #7375
- Fix release of IPG buffer by @tohtana in #7376
- fix wandb.log() call by removing
sync
kwarg by @ned2 in #7383 - Fix dtype mismatch in
TestParamPartitioningSkipInit
by @tohtana in #7377 - Add support for ws=1 scenario by @NirSonnenschein in #7379
- fix(inference): Add missing dtype attribute to ParameterBase setter by @Flink-ddd in #7378
- add blog link by @stas00 in #7385
- fix broken url by @stas00 in #7390
- add support for CUDAtk12.9 by @loscrossos in #7394
- Fix unbound local error for
return_val
by @HollowMan6 in #7395 - Fix ZeRO stage 1 and add stage 2 support with DeepCompile by @tohtana in #7366
- Improve coverage of DeepCompile by @tohtana in #7386
- Added device detection to communication logging by @alexk101 in #7398
- fix: Add
csrc/compile
to include paths for DeepCompile builder by @HollowMan6 in #7401 - fix: DeepCompile for torch 2.8 by @HollowMan6 in #7402
- fix(comm): Expose GradBucket in deepspeed.comm API by @Flink-ddd in #7400
- fix: fix FileNotFoundError for build_win.bat by @gjj2828 in #7399
- fix: engine initializes optimizer attributes at the beginning by @HollowMan6 in #7410
New Contributors
- @Freed-Wu made their first contribution in #7351
- @Flink-ddd made their first contribution in #7360
- @ned2 made their first contribution in #7383
- @alexk101 made their first contribution in #7398
- @gjj2828 made their first contribution in #7399
Full Changelog: v0.17.1...v0.17.2