OpenAI Reveals GPT-4o And A New Desktop App At The Spring Update Event

By Shikhar Mehrotra • Updated On May 14, 2024


Until now, ChatGPT users have been able to talk to the chatbot through Voice Mode, but with a noticeable delay: the average latency is around 2.8 seconds with GPT-3.5 and around 5.4 seconds with GPT-4. Voice Mode works, but it doesn’t feel as natural or intuitive as a regular conversation, something that OpenAI has improved with its latest GPT-4o model.

What Is OpenAI’s GPT-4o?

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN

Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx
— OpenAI (@OpenAI) May 13, 2024

OpenAI’s GPT-4o is a multimodal model that can work with text, visuals, and audio. According to the official release, the model can respond to audio inputs in as little as around 0.2 seconds, with an average response time of around 0.3 seconds, which is similar to human response time in a conversation. It matches GPT-4 Turbo’s performance on English text and code, with significant improvements in non-English languages.
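To put the latency figures quoted in this article side by side, here is a rough back-of-the-envelope calculation. The numbers are the approximate averages cited above (2.8 seconds, 5.4 seconds, and 0.3 seconds); the variable and function names are illustrative only, and the ratios are not an official benchmark.

```python
# Approximate average Voice Mode latencies quoted in the article, in seconds.
voice_mode_latency_s = {"GPT-3.5": 2.8, "GPT-4": 5.4}

# Approximate average GPT-4o audio response time quoted in the article.
gpt4o_average_s = 0.3


def slowdown(old_latency_s: float, new_latency_s: float) -> float:
    """Return how many times longer the old latency is than the new one."""
    return old_latency_s / new_latency_s


for model, latency in voice_mode_latency_s.items():
    ratio = slowdown(latency, gpt4o_average_s)
    print(f"{model} Voice Mode: roughly {ratio:.0f}x GPT-4o's average response time")
```

On these rounded figures, Voice Mode with GPT-3.5 takes roughly 9 times as long as GPT-4o's average response, and Voice Mode with GPT-4 roughly 18 times as long.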

The current Voice Mode is a pipeline of three separate models. The first transcribes audio to text, the second (GPT-3.5 or GPT-4) answers the query in text, and the third converts that text back to audio. In the process, information is lost: GPT-4 can’t observe the user’s tone, multiple speakers, or background noise, and it can’t express emotion, either.

While one might argue whether this is a genuine problem, OpenAI seems to have solved it with GPT-4o. Instead of a pipeline, GPT-4o is a single model that takes the input as text or audio, answers the query, and responds in the desired output format. That’s how GPT-4o differs from the current Voice Mode.
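The difference between the two designs can be sketched in a few lines of Python. This is a toy illustration only, not OpenAI's implementation: every class and function name here is hypothetical, and the "models" are stubs that simply show which information survives each stage.

```python
from dataclasses import dataclass


@dataclass
class AudioInput:
    words: str       # what the user said
    tone: str        # how they said it, e.g. "excited"
    background: str  # ambient sound, e.g. "traffic"


def transcribe(audio: AudioInput) -> str:
    """Stage 1 (stub): speech-to-text keeps only the words."""
    return audio.words


def answer(text: str) -> str:
    """Stage 2 (stub): a text-only model sees nothing but the transcript."""
    return f"Reply to: {text}"


def synthesize(text: str) -> dict:
    """Stage 3 (stub): text-to-speech reads the reply in a fixed, neutral voice."""
    return {"speech": text, "tone": "neutral"}


def cascaded_voice_mode(audio: AudioInput) -> dict:
    """Three separate models chained together, as in the older Voice Mode."""
    return synthesize(answer(transcribe(audio)))


def single_model_voice_mode(audio: AudioInput) -> dict:
    """One model (stub) receives the whole input, so tone and background remain available.

    Mirroring the user's tone is an arbitrary choice made for illustration.
    """
    return {
        "speech": f"Reply to: {audio.words}",
        "tone": audio.tone,
        "heard_background": audio.background,
    }


clip = AudioInput(words="hello", tone="excited", background="traffic")
print(cascaded_voice_mode(clip))
print(single_model_voice_mode(clip))
```

In the cascaded version, tone and background are discarded at the first stage, so nothing downstream can react to them. In the single-model version, the same information is still present when the reply is produced.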

GPT-4o New Features

@BeMyEyes with GPT-4o pic.twitter.com/nWb6sEWZlo
— OpenAI (@OpenAI) May 13, 2024

With GPT-4o, for example, you could upload an image and discuss it with the AI model. Alternatively, you could ask it to recognize something on the screen and provide more information about it. Here’s a list of the features that GPT-4o will provide:

GPT-4 level intelligence
Responses from the model and the web
Analyze data and create charts
Chat about photos
Upload files for assistance in summarizing, writing, or analyzing
Discover and use GPTs
Building a more helpful experience with Memory

During the Spring Update launch event, the company showcased GPT-4o in several demo videos. In these videos, the model, running on a smartphone, recognized real-world objects, people, and their surroundings while answering users’ queries. However, not all of GPT-4o’s abilities will reach users’ phones immediately. For now, OpenAI is rolling out the upgraded text and image abilities.

OpenAI will release the audio and video capabilities in the coming weeks. What’s important is that, unlike GPT-4, GPT-4o will be available to all ChatGPT users without a subscription fee. Even so, ChatGPT Plus users will get a message limit up to five times higher than free users.

ChatGPT Gets A New Desktop App For Simplified Usage

Apart from GPT-4o, OpenAI also released a new ChatGPT desktop app for macOS. Per Mira Murati, the interface contains refreshed UI elements that aim to make interactions more natural. The app supports a keyboard shortcut (Option + Space) that lets users ask ChatGPT a question immediately. “You can now have voice conversations with ChatGPT directly from your computer, starting with Voice Mode that has been available in ChatGPT at launch,” reads the official blog post.
