Not-so-large language models: a concise guide

Meta, Mistral, Cohere, and Databricks all have impressive open-weight models.

May 09, 2024

∙ Paid

When OpenAI released GPT-4 in March 2023, it set a new standard for large language model performance. Since then, two companies—Google and Anthropic—have created models in the same league as GPT-4.

Many other companies have focused on building smaller models. Small models are useful because a model’s parameter count drives inference costs. GPT-4 is more …

Keep reading with a 7-day free trial

Subscribe to Understanding AI to keep reading this post and get 7 days of free access to the full post archives.