Definitely not us.
Cerence:
US2022415318A1 VOICE ASSISTANT ACTIVATION SYSTEM WITH CONTEXT DETERMINATION BASED ON MULTIMODAL DATA
View attachment 38000
A vehicle system for classifying spoken utterance within a vehicle cabin as one of system-directed and non-system directed may include at least one microphone to detect at least one acoustic utterance from at least one occupant of the vehicle, at least one camera to detect occupant data indicative of occupant behavior within the vehicle corresponding to the acoustic utterance, and a processor programmed to receive the acoustic utterance, receive the occupant data, determine whether the occupant data is indicative of a vehicle feature, classify the acoustic utterance as a system-directed utterance in response to the occupant data being indicative of a vehicle feature, and process the acoustic utterance.
[0016] The vehicle 104 may be configured to include various types of components, processors, and memory, and may communicate with a communication network 110 . The communication network 110 may be referred to as a “cloud” and may involve data transfer via wide area and/or local area networks, such as the Internet, Global Positioning System (GPS), cellular networks, Wi-Fi, Bluetooth, etc. The communication network 110 may provide for communication between the vehicle 104 and an external or remote server 112 and/or database 114 , as well as other external applications, systems, vehicles, etc. This communication network 110 may provide navigation, music or other audio, program content, marketing content, internet access, speech recognition, cognitive computing,
artificial intelligence, to the vehicle 104 .
US2021043195A1 AUTOMATED SPEECH RECOGNITION SYSTEM
View attachment 38001
There is provided an automated speech recognition system that applies weights to grapheme-to-phoneme models, and interpolates pronunciations from combinations of the models, to recognize utterances of foreign named entities for naive, informed, and in-between pronunciations.
[0013] Memory
125 is a tangible, non-transitory, computer-readable storage device
encoded with a computer program. In this regard, memory
125 stores data and instructions, i.e., program code, that are readable and
executable by processor 120 for controlling operation of processor
120 ...
[0014] Program module
130 contains instructions for controlling processor
120 to execute methods described herein. For example, under control of program module
130 ,
processor 120 will receive and analyze audio signals from microphone 110 , and in particular speech from users
101 ,
102 and
103 , and produce an output
135 . For example, in a case where system
100 is
employed in an automobile (not shown), output
135 could be a signal that controls an air conditioner or navigation device in the automobile.
[0043] … This method is also not constrained to only two models, or any particular kind of model (e.g., classical n-gram,
Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM), . . . ).