microsoft/deep-language-networks

microsoft

Fetched on 2026/07/23 05:42

We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN - View it on GitHub

Star

Rank

321299

microsoft

microsoft / deep-language-networks