I find the below extract from an article dated 6 Nov 2023 very interesting.
View attachment 56338
We all know OpenAI and Mercedes are collaborating.
And remember that the Mercedes EQXX featured AKIDA-powered neuromorphic AI voice control technology which Mercedes engineers said was 5 to 10 times more efficient than conventional voice control.
Sooo...I wonder....
View attachment 56337
Is OpenAI about to take on Alexa and Siri? ChatGPT maker files trademark for Voice Engine
News
By
Ryan Morrison
published 27 March 2024
Trademark covers voice assistants
(Image credit: Shutterstock)
OpenAI may have Apple, Amazon, and Google in its sights for its next big artificial intelligence push, taking on the voice assistant market with a new Voice Engine tool.
While
ChatGPT does have a
voice-friendly interface on mobile — and recently introduced a way to have it speak its responses on desktop — a
new trademark application from OpenAI for the words Voice Engine relates specifically to building digital voice assistants.
It is now possible to swap out the
default voice assistant on Android. Apple seems to be in talks with a range of AI companies over the future of artificial intelligence on the iPhone, so this could be a preemptive move from OpenAI building on a
potential new market.
Apple is also rumored to be opening a dedicated AI App Store with the next major upgrade to iOS, which would create a new market for AI-powered assistants.
Sam Altman, OpenAI CEO said there are "many different things" being released this year. While it is expected this will include
Sora, the AI video tool it could include a new AI voice system.
What do we know about Voice Engine
(Image credit: Getty)
We don’t know much about Voice Engine or whether it will even be a product. OpenAI hasn’t commented publicly on it, so all we have is rumor and the trademark filing.
While Voice Engine could be a new model built specifically for speech applications, it is also likely this is part of an enterprise play for OpenAI. It could be building a high-quality speech system that would let companies build out more efficient call center bots.
Sign up to get the BEST of Tom’s Guide direct to your inbox.
Upgrade your life with a daily dose of the biggest tech news, lifestyle hacks and our curated analysis. Be the first to know about cutting-edge gadgets and the hottest deals.
Contact me with news and offers from other Future brandsReceive email from us on behalf of our trusted partners or sponsorsBy submitting your information you agree to the
Terms & Conditions and
Privacy Policy and are aged 16 or over.
It sounds a lot like all the pieces you'd need for a fully functional, fully interactive AI voice assistant that can not only handle complex tasks but chat naturally and even take phone calls on your behalf.
The new trademark application was filed with the U.S. Patent and Trademark Office last week. While an application doesn't necessarily mean it will result in a product, this does line up with the wider market shifting more to voice and OpenAI's direction to targeted models.
The filing covers the creation of software used for building digital voice assistants, audio generation from text prompts, voice command processing, and voice service delivery.
The full application covers the development of voice service delivery, using AI for text or-voice and text-to-audio, natural language, and speech processing, generating audio and voice from a prompt (text, speech, visual, image), processing voice commands, speech recognition, and building digital voice assistants.
That sounds a lot like all the pieces you'd need for a fully functional, fully interactive AI voice assistant that can handle complex tasks, chat naturally, and even take phone calls on your behalf.
Where does GPT-5 fit in this?
(Image credit: OpenAI)
OpenAI released GPT-4 a year ago. At the time this was a groundbreaking generative AI model that powers
ChatGPT and Microsoft Copilot.
The company also started training GPT-5 late last year, resulting in speculation over its release date. Altman told
podcaster Lex Fridman, "We will release an amazing new model this year," but wouldn't confirm whether this was GPT-5 or some precursor.
He also said there would be "many different things" released over the coming months. According to OpenAI CTO Mira Murati, this will include the AI video platform Sora.
There is some speculation on social media that Sora and this new Voice Engine are different modal interfaces for GPT-5.
It is very likely that GPT-5 will be a true multimodal model, able to understand video, images, speech, text, and code — as well as generate all those content types.
Voice Engine could be a new Assistant
Given the trademark's description, it is also possible that Voice Engine could be a new voice assistant, merging
Siri,
Alexa, or Google Assistant's wider capabilities with ChatGPT's reasoning and natural language capabilities.
Google has already started upgrading Gemini to work in that way, Apple is rumored to be building a new version of Siri with large language model functionality, and Amazon is already testing Alexa Plus with similar underlying skills.
OpenAI may offer Voice Engine to power such systems in the future or as an alternative interface to ChatGPT that can run on smart speakers, phones, or even headphones.
Or it could just be OpenAI playing it cautious with trademarks. It had a bid to protect GPT rejected, so it now has filed trademark applications for GPT-5, 6, and even GPT-7. The latter includes music generation, converting text and data to code, and writing code from scratch.