Jurassic-1

tags
Transformers, GPT, NLP
blog post
AI21Labs blog

Architecture

This model is similar to GPT-3 with an improved tokenizer that increases the learning efficiency. It also has more parameters.

Parameter count

178B

Bibliography

    Last changed | authored by

    Comments


    ← Back to Notes