An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library. - View it on GitHub
Star
0
Rank
11265897