-
-
Notifications
You must be signed in to change notification settings - Fork 1.1k
Pull requests: axolotl-ai-cloud/axolotl
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat: upgrade cce commit to include smollm3, granite, granitemoe
#2993
opened Jul 31, 2025 by
NanoCode012
Loading…
feat(doc): add links to new features on README
hold
don't merge this yet
#2980
opened Jul 25, 2025 by
NanoCode012
Loading…
set torchao quant config on config.json of saved model
#2942
opened Jul 17, 2025 by
winglian
Loading…
fix: pass model to plugin trainer_cls for rl trainer builder
#2883
opened Jul 8, 2025 by
NanoCode012
•
Draft
Add venv to shell prompt in dockerfiles
hold
don't merge this yet
#2857
opened Jul 2, 2025 by
SalmanMohammadi
Loading…
fix: remove unnecessary movement of eval logits to cpu
#2824
opened Jun 23, 2025 by
NanoCode012
Loading…
Enable Memory Efficient Loading when using Deepspeed 3 for Mistral
#2804
opened Jun 18, 2025 by
benHeid
Loading…
[Draft] Token-weighted datasets: Control up/down-sampling of multiple datasets
#2794
opened Jun 16, 2025 by
casper-hansen
•
Draft
feat(mm_chat): enhance multimodal chat collator for audio/text suppor…
hold
don't merge this yet
#2765
opened Jun 5, 2025 by
voidful
Loading…
6 of 9 tasks
Add StableMax integration to enable grokking and prevent Softmax Collapse
#2761
opened Jun 5, 2025 by
ehartford
Loading…
Make De-duplication Multi-threaded and Happen Only During Pre-processing
#2747
opened Jun 1, 2025 by
xzuyn
Loading…
Create base docker images for CUDA 12.8 with custom FlashAttention 3 installed
#2685
opened May 16, 2025 by
winglian
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-07-28.