Tech company unveils new flagship model that signifies the future of human-machine interaction
OpenAI announced the launch of its new flagship AI model, GPT-4o, along with updates including a new desktop service and enhancements to its voice assistant capabilities, on Monday.
Mira Murati, the Chief Technology Officer, was greeted with applause as she took the stage at the OpenAI offices. She hailed the new model as an advancement in AI, noting that it would make the faster and more accurate GPT-4 AI model available to free users, a privilege previously reserved for paid customers.
“We are envisioning the future of human-machine interaction,” Murati stated. “We believe that GPT-4o is truly transforming that paradigm.”
OpenAI’s announcements on Monday included enhancements to ChatGPT’s international language capabilities, with improved quality and speed. Additionally, ChatGPT now has the ability to analyze uploaded images, audio, and text documents. The company stated that these features would be introduced gradually to ensure safe usage.
The event showcased the model’s new voice capabilities through a live demonstration. Two OpenAI research leads engaged in a conversation with an AI voice model, which generated a bedtime story about love and robots. Researchers instructed the model to narrate using various emotional and vocal tones.
Another demonstration involved using a phone camera to show the AI model a math problem, and then ChatGPT’s voice mode guided them through solving it.
During the demonstration, a researcher asked the AI model to interpret their facial expression and emotions. The ChatGPT voice assistant described the researcher as appearing “happy and cheerful, with a big smile and perhaps even a touch of excitement.”
“Seems like you’re in a great mood,” ChatGPT chirped in a lively, feminine voice. “Care to share the source of those good vibes?”
Rumors last week suggested that OpenAI would be launching a search product to compete with Google. However, Reuters reported that the company had postponed revealing that project. CEO Sam Altman refuted the notion that Monday’s announcement was related to a search engine, but he did tweet on Friday, saying, “We’ve been hard at work on some new stuff we think people will love! Feels like magic to me.”
With the exception of minor glitches or unintentional responses, the demonstration proceeded smoothly, and the features operated as intended. While Murati mentioned that OpenAI was implementing measures to prevent misuse of its new voice capabilities, the event offered limited information on how the company plans to address safeguards related to facial recognition and audio generation.
Reportedly, OpenAI is nearing a deal to integrate its generative AI capabilities into Apple’s iPhone operating system, Bloomberg reports. Apple has been focusing on AI, aiming to catch up with competitors like Google, which has its flagship Gemini model. OpenAI’s voice mode seemed to offer potentially more advanced features than Apple’s Siri voice assistant.
While OpenAI expands its products and seeks partnerships, it faces several lawsuits from publishers and media outlets over alleged copyright violations. Organizations like the New York Times accuse OpenAI of unlawfully training its AI models on their content without consent or compensation, potentially leading to billions in damages.