unslothai / unsloth-zoo Public

Notifications You must be signed in to change notification settings
Fork 237
Star 235

Code
Issues 32
Pull requests 71
Actions
Security and quality
Insights

Additional navigation options

Code
Issues
Pull requests
Actions
Security and quality
Insights

Pull requests: unslothai/unsloth-zoo

Labels 10 Milestones 0

New pull request New

71 Open 465 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

Add Gemma-4 float16 UNSLOTH_FORCE_FLOAT32 patches for GRPO stability

#600 opened Apr 17, 2026 by danielhanchen Contributor

Loading…

Add per-model logit_matmul_upcast for Gemma-4 fp16 RL

#599 opened Apr 16, 2026 by danielhanchen Contributor

Loading…

Fix CSM depth decoder generate: preserve forward signature on wrapper

#590 opened Apr 10, 2026 by danielhanchen Contributor

Loading…

5 tasks done

[Qwen 3.5][gemma4] Qwen35 and Gemma 4 fast inference

#588 opened Apr 10, 2026 by Datta0 Collaborator

Loading…

Add GraniteMoeHybridForCausalLM compiler support

#562 opened Mar 24, 2026 by Maxusmusti

Loading…

2 tasks done

Fix bugs in FP8 MoE support

#554 opened Mar 17, 2026 by danielhanchen Contributor

Loading…

5 tasks

Fix FP8 MoE scale patching for compressed-tensors models

#551 opened Mar 16, 2026 by danielhanchen Contributor

Loading…

Fix dead-code VLM layer count branches and missing state dict exclusion

#550 opened Mar 16, 2026 by danielhanchen Contributor

Loading…

6 tasks done

[MoE] FP8 support for MoE, specifically GLM 4.7 flash

#548 opened Mar 16, 2026 by Datta0 Collaborator

Loading…

Add Idefics3 fast_inference support

#540 opened Mar 12, 2026 by danielhanchen Contributor

Loading…

6 tasks done

Double-buffer GPU activations for overlapping H2D copy with backward compute

#534 opened Mar 6, 2026 by ruixiang63 Contributor

Loading…

Fix _get_vllm_state_dict for LFM2 models

#531 opened Mar 3, 2026 by danielhanchen Contributor

Loading…

4 tasks done

Moe kernels refactor

#529 opened Mar 3, 2026 by Datta0 Collaborator

Loading…

Add Bnb4bit support for MoE models on transformers v5 - #4032

#527 opened Mar 2, 2026 by sensai99

Loading…

Guard GPT-OSS allocator warmup on low-memory 4-bit loads

#521 opened Feb 26, 2026 by danielhanchen Contributor

Loading…

Fix vLLM vision GRPO compatibility for issue #4081

#520 opened Feb 26, 2026 by danielhanchen Contributor

Loading…

Fix missing ParameterModule export in GPT-OSS compiler path

#519 opened Feb 25, 2026 by danielhanchen Contributor

Loading…

Enable ROCm GPU acceleration for llama.cpp GGUF export

#512 opened Feb 24, 2026 by GoldenGrapeGentleman Contributor

Loading…

Fix transformers 5.x compat: GRPO token_type_ids, gpt_oss BlockMask, compiler decorators

#511 opened Feb 24, 2026 by danielhanchen Contributor

Loading…

5 tasks done

fix: skip non-attention layers in _get_vllm_state_dict (fixes unslothai/unsloth#4073)

#510 opened Feb 23, 2026 by stakeswky

Loading…

fix: handle LFM2/Mamba hybrid layers in _get_vllm_state_dict for fast_inference

#504 opened Feb 18, 2026 by devchilll

Loading…

Fix MoE target_parameters module_count alignment (#3405, #3701)

#499 opened Feb 14, 2026 by GoldenGrapeGentleman Contributor

Loading…

Handle missing CSM depth decoder loss during loss aggregation

#496 opened Feb 11, 2026 by danielhanchen Contributor

Loading…

ROCm: disable cache in generate and fix GPT-OSS dtype

#494 opened Feb 10, 2026 by danielhanchen Contributor

Loading…

Fix Gemma3 Vision + Gemma3N audio inference on transformers 5.x

#492 opened Feb 10, 2026 by danielhanchen Contributor

Loading…

5 tasks done

Previous 1 2 3 Next

Previous Next

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!