Pull requests: unslothai/unsloth-zoo
Pull requests list
Add Gemma-4 float16 UNSLOTH_FORCE_FLOAT32 patches for GRPO stability
#600
opened Apr 17, 2026 by
danielhanchen
Contributor
Add per-model logit_matmul_upcast for Gemma-4 fp16 RL
#599
opened Apr 16, 2026 by
danielhanchen
Contributor
Fix CSM depth decoder generate: preserve forward signature on wrapper
#590
opened Apr 10, 2026 by
danielhanchen
Contributor
5 tasks done
[Qwen 3.5][gemma4] Qwen35 and Gemma 4 fast inference
#588
opened Apr 10, 2026 by
Datta0
Collaborator
Add GraniteMoeHybridForCausalLM compiler support
#562
opened Mar 24, 2026 by
Maxusmusti
2 tasks done
Fix FP8 MoE scale patching for compressed-tensors models
#551
opened Mar 16, 2026 by
danielhanchen
Contributor
Fix dead-code VLM layer count branches and missing state dict exclusion
#550
opened Mar 16, 2026 by
danielhanchen
Contributor
6 tasks done
[MoE] FP8 support for MoE, specifically GLM 4.7 flash
#548
opened Mar 16, 2026 by
Datta0
Collaborator
Add Idefics3 fast_inference support
#540
opened Mar 12, 2026 by
danielhanchen
Contributor
6 tasks done
Double-buffer GPU activations for overlapping H2D copy with backward compute
#534
opened Mar 6, 2026 by
ruixiang63
Contributor
Fix _get_vllm_state_dict for LFM2 models
#531
opened Mar 3, 2026 by
danielhanchen
Contributor
4 tasks done
Add Bnb4bit support for MoE models on transformers v5 - #4032
#527
opened Mar 2, 2026 by
sensai99
Guard GPT-OSS allocator warmup on low-memory 4-bit loads
#521
opened Feb 26, 2026 by
danielhanchen
Contributor
Fix vLLM vision GRPO compatibility for issue #4081
#520
opened Feb 26, 2026 by
danielhanchen
Contributor
Fix missing ParameterModule export in GPT-OSS compiler path
#519
opened Feb 25, 2026 by
danielhanchen
Contributor
Enable ROCm GPU acceleration for llama.cpp GGUF export
#512
opened Feb 24, 2026 by
GoldenGrapeGentleman
Contributor
Fix transformers 5.x compat: GRPO token_type_ids, gpt_oss BlockMask, compiler decorators
#511
opened Feb 24, 2026 by
danielhanchen
Contributor
5 tasks done
fix: skip non-attention layers in _get_vllm_state_dict (fixes unslothai/unsloth#4073)
#510
opened Feb 23, 2026 by
stakeswky
fix: handle LFM2/Mamba hybrid layers in _get_vllm_state_dict for fast_inference
#504
opened Feb 18, 2026 by
devchilll
Fix MoE target_parameters module_count alignment (#3405, #3701)
#499
opened Feb 14, 2026 by
GoldenGrapeGentleman
Contributor
Handle missing CSM depth decoder loss during loss aggregation
#496
opened Feb 11, 2026 by
danielhanchen
Contributor
ROCm: disable cache in generate and fix GPT-OSS dtype
#494
opened Feb 10, 2026 by
danielhanchen
Contributor
Fix Gemma3 Vision + Gemma3N audio inference on transformers 5.x
#492
opened Feb 10, 2026 by
danielhanchen
Contributor
5 tasks done