Introducing GPT-4o

Hi everyone, it's great to have you here today. I'm going to talk about three things. We'll start with why it's so important to us to have a product that we can make freely available and broadly available to everyone. We're always trying to find out ways to reduce friction so everyone can use ChatGPT wherever they are. Today, we'll be releasing the desktop version of ChatGPT and the refreshed UI that makes it simpler and more natural to use. But the big news today is that we are launching our new flagship model, and we are calling it GPT-4o.

Making AI Freely Accessible

A very important part of our mission is to be able to make our advanced AI tools available to everyone for free. We think it's very important for people to have an intuitive feel for what the technology can do. We're always finding ways to reduce that friction, and recently, we've made ChatGPT available without the signup flow. Today, we're also bringing the desktop app to ChatGPT because we want you to be able to use it wherever you are. As you can see, it's easy, it's simple, and it integrates very easily into your workflow.

New UI and User Experience

Along with it, we have also refreshed the UI. We know that these models get more and more complex, but we want the experience of interaction to become more natural and easy. We want you to focus on the collaboration with ChatGPT and not on the UI. Now, the big news today: we are releasing our newest flagship model, GPT-4o.

Introducing GPT-40

GPT-40 provides GPT-4 level intelligence but is much faster and improves its capabilities across text, vision, and audio. For the past couple of years, we've been very focused on improving the intelligence of these models, and they've gotten pretty good. But this is the first time we're making a huge step forward when it comes to ease of use.

This is incredibly important because we're looking at the future of interaction between ourselves and the machine, and we think GPT-40 is shifting that paradigm into the future of collaboration. Making this happen is actually quite complex because when we interact with one another, there is a lot of stuff we take for granted: the ease of our dialogue, the background noises, the multiple voices in a conversation, or understanding the tone of voice.

Advanced Features and Real-Time Capabilities

Until now, with Voice Mode, we had three models that come together to deliver this experience: transcription, intelligence, and text-to-speech. This brings a lot of latency and breaks that immersion. But now, with GPT-40, this all happens natively. GPT-40 reasons across voice, text, and vision and brings GPT-4 class intelligence to our free users.

We have 100 million people, more than 100 million in fact, who use ChatGPT to create, work, and learn, but we had only made these advanced tools available to paid users until now.

Broader Availability and GPT Store

Starting today, you can use GPTs and the GPT Store. So far, we've had more than a million users create amazing experiences with GPTs. These are custom ChatGPTs for specific use cases and are available in the store. Now, our builders have a much bigger audience, where university professors can create content for their students, podcasters can create content for their listeners, and you can use photos, documents containing both text and images, and start conversations with ChatGPT about all of this content. You can also use memory and browse real-time information in your conversation.

Improved Multi-Language Support and API Access

We have improved the quality and speed in 50 different languages for ChatGPT. This is important because we want to bring this experience to as many people as possible.

For the paid users, they will continue to have up to five times the capacity limits of our free users. GPT-40 is also available in the API, allowing developers to start building today with GPT-40, making amazing AI applications and deploying them at scale.

GPT-40 presents new challenges for us when it comes to safety because we're dealing with real-time audio and vision. Our team has been working diligently to build in mitigations against misuse. We continue to work with various stakeholders to figure out how to best bring these technologies into the world. Over the next few weeks, we'll continue our iterative deployment to bring out all the capabilities to you.

Live Demos of GPT-40

We had several live demos showcasing real-time conversational speech, enhanced voice modes, and even vision capabilities where the model can interact with video and images. We also demonstrated real-time translation and emotion detection from selfies.

Concluding Remarks

Today has been focused on the free users, new modalities, and new products, but we also care about the next frontier. Soon, we'll update you on our progress towards the next big thing.

Thanks to the incredible OpenAI team and Jensen and the Nvidia team for bringing us the most advanced GPUs to make this demo possible. Thank you all for being part of this today.

Keywords

GPT-40
ChatGPT
Free AI tools
Desktop app
Enhanced UI
Real-time capabilities
Voice mode
Vision features
Multi-language support
API access
Safety measures
Live demos

FAQ

Q: What is GPT-40? A: GPT-40 is the latest flagship model that provides GPT-4 level intelligence, offering improved speed and capabilities across text, vision, and audio.

Q: Is GPT-40 available for free users? A: Yes, GPT-40's advanced features are available to all users, including free users, making advanced AI more accessible.

Q: What new features are included in GPT-40? A: GPT-40 includes real-time conversational speech, improved voice modes, vision capabilities for interacting with multimedia content, and enhanced language support.

Q: Can developers access GPT-40 via API? A: Yes, GPT-40 is available through the API, allowing developers to build and deploy AI applications at scale.

Q: How does GPT-40 handle safety concerns? A: The team has implemented various safety measures and continues to work with stakeholders to mitigate misuse, especially given the real-time audio and vision features.

Q: Are there live demos available for GPT-40 features? A: Yes, during the launch event, several live demos showcased GPT-40's capabilities in real-time conversational speech, vision, and translation.

Q: What improvements have been made to the UI? A: The UI has been refreshed to be more intuitive and natural, allowing users to focus on collaboration rather than navigating the interface.