AI UpdatesToday
Track AI model updates and LLM releases in real time. Version launches, API changes, and notable improvements across GPT, Claude, Gemini, Llama, and 500+ language models.
Track AI model updates and LLM releases in real time. Version launches, API changes, and notable improvements across GPT, Claude, Gemini, Llama, and 500+ language models.
Track all LLM releases and version updates
Sigma-normalized vs. each model's baseline
For each model we reconstruct daily TrueSkill conservative ratings per arena from match-level vote outcomes, then compute a baseline from the first 21 days of activity (after a 3-day warm-up). The Quality Index is the sigma-normalized deviation from that baseline, weighted across arenas. Change shown is the difference between today and 30 days ago. A swing of ±0.5σ is noticeable; ±1σ is significant.
Recent open-weight model releases with permissive licenses
Open source LLM news has become increasingly important as open-weight models transform the AI landscape. Stay updated with open source LLM updates today covering models like Llama 3, Mistral, Qwen, and DeepSeek—now rivaling proprietary alternatives on many benchmarks while providing flexibility to fine-tune, self-host, and customize for specific domains.
Our open-source LLM coverage includes licensing terms (Apache 2.0, MIT, or custom licenses), parameter count affecting LLM inference costs, quantization support for efficient deployment, and the community ecosystem of fine-tuned variants and LLM tools.
AI model versioning follows patterns that help developers understand capabilities and stability. Major versions (GPT-3 → GPT-4, Claude 2 → Claude 3) indicate significant capability improvements and may require prompt adjustments. Minor updates (GPT-4 → GPT-4 Turbo) offer performance optimizations, cost reductions, or context window expansions while maintaining compatibility.
Organizations use various naming conventions: OpenAI uses dated snapshots (gpt-4-0613), Anthropic uses descriptive tiers (Claude 3.5 Sonnet), and Google uses generation markers (Gemini 1.5 Pro). Understanding these patterns helps you make informed decisions about when to upgrade and how to manage deprecations.
Track model releases from leading AI labs
Free head-to-head playgrounds across image, video, website, game and chat modalities.
The AI industry is releasing new models at an unprecedented rate. We track 316+ model releases across major organizations. Capabilities that seemed cutting-edge months ago are now baseline expectations.
Key trends include reasoning models (OpenAI o1, DeepSeek-R1) trading speed for accuracy, multimodal capabilities becoming standard across frontier models, and efficiency improvements delivering GPT-4-level performance at dramatically lower costs.
Pricing, latency, and feature updates from inference providers
Key factors for selecting an inference provider
Providers charge per-token (input/output priced separately), per-request, or offer committed use discounts. For high-volume apps, $0.50/M token differences translate to thousands in monthly savings.
First-token latency matters for interactive apps; total generation time for batch processing. Throughput (tokens/sec) is critical for real-time applications and agent workflows.
First-party providers (OpenAI, Anthropic) offer latest models first. Third-party providers (Together, Fireworks, Groq) often provide same quality at lower cost plus open-source alternatives.
Uptime, rate limits, and SLAs vary significantly. For production workloads, consider multi-provider strategies with automatic failover. Check our provider rankings.
Common questions about LLM updates, version releases, and API changes
Dive deeper into LLM data, benchmarks, and comparisons