Code and models for the paper: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts by Yingfa Chen, Zhen Leng Thai, Zihan Zhou, Zhu Zhang, Xingyu Shen, Shuo Wang, Chaojun Xiao, Xu Han, and Zhiyuan Liu.
Links:
- Models: https://huggingface.co/collections/chen-yingfa/hypenet
- Paper: https://arxiv.org/pdf/2601.22156
The main contributions of the paper are:
- 🔥 Superior performance-efficiency tradeoff: A better performance-efficiency tradeoff than homogeneous Transformer models such as Qwen3.
- 🚀 Training-efficient distillation: Converts pre-trained Transformer models into hybrid linear-attention models with just 2.3B.
- 🚄 Extremely long contexts: A novel position-encoding method designed specifically for hybrid models, plus various architectural improvements, yielding exceptional length-generalization abilities.
The distillation procedure, HALO, and the model architecture of HypeNet are described in detail in the paper.
To cite this work, please use the following BibTeX entry:
```bibtex
@misc{hybrid-linear-attention-done-right,
  title={Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts},
  author={Yingfa Chen and Zhen Leng Thai and Zihan Zhou and Zhu Zhang and Xingyu Shen and Shuo Wang and Chaojun Xiao and Xu Han and Zhiyuan Liu},
  year={2026},
  eprint={2601.22156},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2601.22156},
}
```