# Language modeling

Tags: NLP

## LM with RNNs

Several architectures have been studied, starting with the original recurrent neural network based language model (Mikolov et al. 2011).

LSTMs were then used with more success than earlier models (Zaremba et al. 2015).

More recently, transformers have come to dominate language modeling. However, it is not clear whether this is due to a real superiority over RNNs or to their practical scalability (Merity 2019).

Existing models:

## Text generation

Language models can be used to generate text from a prompt or starting sentence: the model repeatedly predicts a next token given the text so far and appends it. Examples of this kind made models like GPT-2 and GPT-3 famous, because of their ability to generate long sequences of apparently coherent text (Radford et al. 2019; Brown et al. 2020).
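As a rough illustration of generating from a prompt, here is a minimal sketch that uses a toy bigram count model in place of a neural network. The corpus and the function names `train_bigram` and `generate` are hypothetical, chosen only for this example. Models such as GPT-2 condition on the whole preceding context rather than on the last token alone, but the generation loop has the same shape.

```python
import random
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count, for each token, which tokens follow it in the training text."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def generate(counts, prompt, max_new_tokens=10, seed=0):
    """Extend a non-empty `prompt` one token at a time.

    Each new token is sampled in proportion to how often it followed
    the current last token in the training text.
    """
    rng = random.Random(seed)
    out = list(prompt)
    for _ in range(max_new_tokens):
        followers = counts.get(out[-1])
        if not followers:  # last token was never followed by anything: stop
            break
        candidates = list(followers)
        weights = [followers[t] for t in candidates]
        out.append(rng.choices(candidates, weights=weights, k=1)[0])
    return out


# Toy corpus, an assumption made purely for illustration.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = train_bigram(corpus)
print(" ".join(generate(model, ["the", "cat"], max_new_tokens=8)))
```

Sampling from the counts, rather than always taking the most frequent follower, is one simple way to get varied output. Changing `seed` gives a different continuation of the same prompt.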

## Bibliography

1. Mikolov et al. 2011. "Recurrent Neural Network Based Language Model".
2. Zaremba et al. 2015. "Recurrent Neural Network Regularization". arXiv:1409.2329 [cs]. http://arxiv.org/abs/1409.2329.