Training small GPT-2 style models using Kolmogorov-Arnold networks. - View it on GitHub
Star
1
Rank
5919397