Jurassic-1

tags: Transformers, GPT, NLP
blog post: AI21Labs blog

Architecture

This model is similar to GPT-3 with an improved tokenizer that increases the learning efficiency. It also has more parameters.

Parameter count

178B

Bibliography

Last changed 2022.07.26 | authored by Hugo Cisneros

Comments

Loading comments...

Back to Notes