Code from the collaboration work between Intel and UKP. Implements a dynamic layer skipping based on Gumbel Softmax (for llama models). - View it on GitHub
Star
0
Rank
12090359