This repository is the official implementation of the ACL 2024 paper: Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations. - View it on GitHub
Star
2
Rank
3675031