Understanding AI

Understanding AI

Share this post

Understanding AI
Understanding AI
Not-so-large language models: a concise guide
Copy link
Facebook
Email
Notes
More

Not-so-large language models: a concise guide

Meta, Mistral, Cohere, and Databricks all have impressive open-weight models.

Timothy B. Lee's avatar
Timothy B. Lee
May 09, 2024
∙ Paid
11

Share this post

Understanding AI
Understanding AI
Not-so-large language models: a concise guide
Copy link
Facebook
Email
Notes
More
1
Share

When OpenAI released GPT-4 in March 2023, it set a new standard for large language model performance. Since then, two companies—Google and Anthropic—have created models in the same league as GPT-4.

Many other companies have focused on building smaller models. Small models are useful because a model’s parameter count drives inference costs. GPT-4 is more …

Keep reading with a 7-day free trial

Subscribe to Understanding AI to keep reading this post and get 7 days of free access to the full post archives.

Already a paid subscriber? Sign in
© 2025 Timothy B Lee
Privacy ∙ Terms ∙ Collection notice
Start writingGet the app
Substack is the home for great culture

Share

Copy link
Facebook
Email
Notes
More