KaihuaTang/Qwen-Tokenizer-Pruner

KaihuaTang

Fetched on 2026/06/23 05:44

Due to the huge vocaburary size (151,936) of Qwen models, the Embedding and LM Head weights are excessively heavy. Therefore, this project provides a Tokenizer vocabulary shearing solution for Qwen and Qwen-VL. - View it on GitHub

Star

Rank

613560

KaihuaTang

KaihuaTang / Qwen-Tokenizer-Pruner