A DistilBERT model pre-trained on 131 GB of Japanese web text. The teacher model is a BERT-base model built in-house at LINE.
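A minimal sketch of loading the model with Hugging Face `transformers`, assuming it is published on the Hugging Face Hub; the model id `line-corporation/line-distilbert-base-japanese` and the `trust_remote_code` requirement are assumptions here, so check the repository for the actual id and loading instructions:

```python
from transformers import AutoModel, AutoTokenizer

# Assumed Hub id; verify against the repository's README.
MODEL_ID = "line-corporation/line-distilbert-base-japanese"

# trust_remote_code=True in case the repo ships a custom Japanese tokenizer class.
tokenizer = AutoTokenizer.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModel.from_pretrained(MODEL_ID)

# Encode a Japanese sentence and run a forward pass.
inputs = tokenizer("こんにちは、世界!", return_tensors="pt")
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (batch, seq_len, hidden_size)
```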