2024

2024 has been a year of rapid advancements in AI, with major releases from leading AI companies. The focus has been on improving model capabilities, efficiency, and accessibility, while also pushing the boundaries of what AI can do.

August

Google introduces Gemini Live; mobile conversational experience for free-flowing dialogues with Gemini

Introduces 10 new voice options for more natural interactions
Features mid-conversation interruptions and hands-free operation
Expanded app integrations including Keep, Tasks, YouTube Music, and upcoming Calendar extension

Mistral AI introduces La Plateforme, a toolkit for creating custom AI agents, catering to both developers and non-technical users

Agents API: For deep integration of Mistral’s AI models into applications
La Plateforme Agent Builder: No-code solution with an intuitive interface for easy AI agent creation

July

OpenAI begins rolling out Advanced Voice Mode for ChatGPT

Offers hyper-realistic audio interactions powered by the GPT-4o model
Features include real-time conversations, ability to interrupt mid-sentence, and detection of emotional intonations

Google DeepMind's AlphaProof and AlphaGeometry 2 achieve new milestone

Achieve a silver medal-level score at the 2024 International Mathematical Olympiad
Solve complex algebra, number theory, and geometry problems
Demonstrate AI's growing capability in advanced mathematical reasoning

OpenAI unveils SearchGPT
- AI-powered search engine prototype
- Offers conversational answers with source attribution
- Challenges traditional search paradigms
Mistral AI unveils Mistral Large 2
- 123B parameter model with 128K token context
- Excels in multilingual tasks, reasoning, and coding
- Offers competitive pricing
Meta releases Llama 3.1 405B
- Open-source language model with 405 billion parameters
- 128K token context window
- Performance rivaling leading proprietary models
Mistral AI and NVIDIA release Mistral NeMo
- Compact 12B parameter model with 128K token context
- Designed for efficient local deployment
- Excels in reasoning and coding tasks
OpenAI replaces GPT-3.5 with GPT-4o mini
- Features 128,000 token context
- 82% performance on MMLU
- Multimodal support and reduced operational costs
- Available for free to ChatGPT users

June

Anthropic debuts Claude 3.5 Sonnet
- Advanced AI model with superior reasoning, knowledge, and coding skills
- Enhanced speed and new Artifacts feature for improved collaboration

May

OpenAI releases GPT-4o
- Capable of processing and generating text, images, and audio
Google releases Gemini 1.5
- Multimodal AI model with a 1 million token context window

April

Meta releases LLaMA 3
- 70B parameter language model with improved multilingual capabilities
Udio releases their AI text-to-music platform

Generative artificial intelligence model designed to produce music from simple text prompts.
Capable of generating both vocals and instrumentation.
Opens new avenues for creativity in music production, allowing users to compose complex pieces with ease.

Mistral AI releases Mixtral 8x22B
- 141B parameter mixture-of-experts model
- Actively uses 39B parameters per inference for enhanced efficiency and performance

March

Anthropic releases Claude 3
- Outperforms GPT-4 in reasoning and coding benchmarks

February

Google rebrands Bard as "Gemini"
- Aligns the consumer-facing product name with the underlying model

Home 2023

AI Development Milestones

Search This Blog

2024