BRN Discussion Ongoing

Hi Damo,

Glad you asked. I was just about to post that, 10 years ago, Cerence was an entirely cloud-based system.

I'm pretty certain that they had no in-house digital or analog SNN SoC expertise, at least until they met BRN, which they must have done over at Markus Schäfer's place. Their patents talk about NNs as a known quantity but show no evidence of deep knowledge of the circuit configuration.

The Cerence patent has a priority date of 2021-04-26.

Just harking back to Fig. 1 above, there is no provision for on-line learning in the acoustic classifier:

View attachment 38364

[0038] The configuration data 160 is generally or largely determined before operation of the system. For example, an offline training/configuration component 180 uses an audio or transcribed text corpus of commands for one or more assistants to determine the structure of valid commands for those assistants, and a language model may be derived from the corpus for use in processing audio input containing speech. For example, the language model may be a statistical language model, meaning that different word sequences are associated with different probability or scores indicating their likelihood. In some cases, such a statistical language model is an “n-gram” model in which statistics on n-long sequences of words are used to build the model. In some cases, some or all of the language model may have a finite-state form, for example, specifying a finite set of well-structured commands. In some cases, such a finite-state form is statistical in that different commands may have different probabilities or scores. In some examples, a combination of approaches, such as n-gram and finite-state forms, are combined in a finite-state transducer. It should be understood that there are a number of alternatives to the form of the language model. In some examples, the occurrences of wake words and/or names given to assistants, or generic subjects that may not be explicitly defined (e.g., “computer,” “automobile,” “you”), are identified in the corpus and are essentially replaced with placeholders to permit configuration of different words or names for assistants, including coined names that may not actually occur in the corpus, for use in runtime systems without having to derive new language models. In some examples, the language model used to automatically transcribe an audio input may be biased to avoid missing true occurrences of wake words that are part of commands in the process of automated transcription of audio input. In some examples, the language model provides a way to tag output words, for example, to indicate that the word occurred in the position of a subject or assistant name in the language model, and such tags are used in further processing of the output text.

[0039] In some examples, the configuration data 160 may be determined in part during operation or shortly before operation of the system. For example, an optional online training/configuration component 170 may receive, from the user, a name they have given to an assistant. For example, the user may say or type the word “Sophie” to name the assistant. The configuration data 160 is then amended to modify the language model to permit the word “Sophie” in the name position in the language model and/or modify the pronunciation of a placeholder for the name with a pronunciation of the word “Sophie,” for example, determined by an automated text-to-phoneme converter (e.g., as is used in a speech synthesis system). In some examples, the online training/configuration component 170 may receive other information that is used in configuring components of the system, for example, the names of family members of the user and/or names of passengers in a vehicle. These names may be used to configure the language model to replace a placeholder for non-system names that may be present in utterances spoken by the user but that are not directed to the system.
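To make the placeholder idea in [0038]/[0039] a bit more concrete, here is a minimal sketch (my own illustration, not from the patent; the bigram model, the <ASSISTANT_NAME> token and the smoothing constant are all assumptions) of how a language model trained with a name placeholder could later score utterances containing a user-coined name like "Sophie":

```python
import math
from collections import defaultdict

# Hypothetical illustration of [0038]/[0039]: a bigram language model trained on a
# command corpus in which assistant names are replaced by a placeholder, so a new
# name can be configured later without re-deriving the model.
PLACEHOLDER = "<ASSISTANT_NAME>"

class BigramLM:
    def __init__(self, corpus, known_names):
        self.counts = defaultdict(lambda: defaultdict(int))
        for sentence in corpus:
            # Replace occurrences of known assistant names with the placeholder.
            tokens = ["<s>"] + [PLACEHOLDER if w in known_names else w
                                for w in sentence.lower().split()] + ["</s>"]
            for a, b in zip(tokens, tokens[1:]):
                self.counts[a][b] += 1

    def score(self, sentence, assistant_name):
        """Log-probability of a sentence, mapping the configured name to the placeholder."""
        tokens = ["<s>"] + [PLACEHOLDER if w == assistant_name.lower() else w
                            for w in sentence.lower().split()] + ["</s>"]
        logp = 0.0
        for a, b in zip(tokens, tokens[1:]):
            total = sum(self.counts[a].values())
            # Crude smoothing with a fixed pseudo-vocabulary so unseen bigrams
            # still get a small, non-zero probability.
            logp += math.log((self.counts[a][b] + 1) / (total + 1000))
        return logp

corpus = ["hey computer play music", "computer turn on the lights"]
lm = BigramLM(corpus, known_names={"computer"})
# A coined name ("sophie") scores like any trained name, via the placeholder.
print(lm.score("hey sophie play music", assistant_name="sophie"))
```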

[0042] As introduced above, the reasoner 150 processes both text output 115 and the audio signal 105. The audio signal 105 is processed by an acoustic classifier 152. In some implementations, this classifier is a machine learning classifier that is configured with data (i.e., from configuration data 160) that was trained on examples of system-directed and of non-system directed utterance by an offline training system 180. In some examples, the machine-learning component of the acoustic classifier 152 receives a fixed-length representation of the utterance (or at least the part of the utterance received to that point) and outputs a score (e.g., probability, log likelihood, etc.) that represents a confidence that the utterance is a command. For example, the machine-learning component can be a deep neural network. Note that such processing does not in general depend on any particular words in the input, and may instead be based on features such as duration, amplitude, or pitch variation (e.g., rising or falling pitch). In some implementations, the machine-learning component processes a sequence, for example, processing a sequence of signal processing features (e.g., corresponding to fixed-length frames) that represent time-local characteristics of the signal, such as amplitude, spectral, and/or pitch, and the machine-learning component processes the sequence to provide the output score. For example, the machine learning component can implement a convolutional or recurrent neural network.
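For anyone who wants to picture what that acoustic classifier might look like, here's a minimal, hypothetical sketch (the feature count, sizes and the GRU choice are my assumptions, not Cerence's) of a recurrent network mapping a sequence of per-frame features to a single "system-directed" score:

```python
import torch
import torch.nn as nn

# Hypothetical sketch of the acoustic classifier in [0042]: a recurrent network
# consumes a sequence of per-frame signal features (e.g. amplitude, spectral and
# pitch measures) and emits one score that the utterance is system-directed.
class AcousticClassifier(nn.Module):
    def __init__(self, n_features=16, hidden=64):
        super().__init__()
        self.rnn = nn.GRU(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)

    def forward(self, frames):                   # frames: (batch, time, n_features)
        _, h = self.rnn(frames)                  # h: (1, batch, hidden), final state
        return torch.sigmoid(self.head(h[-1]))   # score in [0, 1] per utterance

clf = AcousticClassifier()
dummy = torch.randn(2, 100, 16)                  # two utterances of 100 frames each
print(clf(dummy).shape)                          # torch.Size([2, 1])
```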
So, reading the paragraph after & relating to 180, it seems to me they have essentially replaced the phrase "wake word" and embedded, or left open placeholders to later embed, words or phrases with weightings so the assistant recognises them... just like a wake word, without specifically saying "hey... whoever" :rolleyes:
 
  • Fire
  • Like
  • Thinking
Reactions: 5 users

Deleted member 118

Guest
  • Like
Reactions: 2 users

Diogenese

Top 20
So, reading the paragraph after & relating to 180, it seems to me they have essentially replaced the phrase "wake word" and embedded, or left open placeholders to later embed, words or phrases with weightings so the assistant recognises them... just like a wake word, without specifically saying "hey... whoever" :rolleyes:
Just a note of caution about patents - they are usually not published until 18 months after filing, so they may have a revised version in the pipeline.
So, reading the paragraph after & relating to 180, it seems to me they have essentially replaced the phrase "wake word" and embedded, or left open placeholders to later embed, words or phrases with weightings so the assistant recognises them... just like a wake word, without specifically saying "hey... whoever" :rolleyes:
... but they also have the option of using the video input to determine the speaker's intention:

[0045] In some examples, the reasoner implements a classifier that takes advantage of yet other input to make the classification. For example, video from the camera 103 monitoring the user's facial expressions, lip movement, etc., is provided to a visual classifier 157 of the reasoner 150 to aid in the classification. In a machine learning approach, such a video signal may be processed using a machine learning approach that is trained on video (i.e., image sequences) for users making system-directed and non-system-directed utterances. In some examples, the video is processed in conjunction with the audio, essentially combining functions of the acoustic classifier 152 and visual classifier 157 (e.g., to permit taking advantage of relationships between visual and audio cues that help determine when the user intends for an utterance to be system directed).
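A trivial sketch of the late-fusion idea (the weights and threshold are invented for illustration; the patent does not say how the acoustic and visual scores are combined):

```python
# Hypothetical late-fusion sketch for [0045]: combine the acoustic classifier's
# score with a visual classifier's score (e.g. from lip movement) into a single
# system-directed confidence. Weights and threshold are assumptions.
def fuse_scores(acoustic_score, visual_score, w_audio=0.6, w_video=0.4,
                threshold=0.5):
    combined = w_audio * acoustic_score + w_video * visual_score
    return combined, combined >= threshold

score, is_system_directed = fuse_scores(acoustic_score=0.7, visual_score=0.3)
print(score, is_system_directed)   # 0.54 True
```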
 
  • Like
Reactions: 6 users

TECH

Regular
I stand by my comment: until we have a signed deal, it’s nothing. Chat, partnerships, discussions, articles, and tweets mean diddly squat unless they sign, start using Akida, and revenue starts flowing. Until then they have zero commitment towards Brainchip. That’s just the way it is.

What you have stated is 100% how many would feel, due mainly to the lack of IP signings and revenue streams established as yet. To counter your view, I have been told that Mercedes Benz is a "client" and will do what they will do with our technology, as is their right... read that as you wish.

Sean is certainly moving around the globe now, or appears to be. I personally think the name Brainchip, and more importantly AKIDA 2.0, is starting to be more freely talked about in the tech world. Success can take many forms, and to this point we have had some tremendous success, but ultimately we all know that a business can't survive without revenue; have faith, it's coming.

What makes me laugh somewhat is persons who bag our company but still feel comfortable holding; is that the modern definition of a "fence sitter", or of a politician?

Tech in Melbourne 💎 AKD 3.0
 
  • Like
  • Love
  • Fire
Reactions: 46 users

Iseki

Regular
Ahhhhh, I see, you're looking for an argument.
That room is three doors on the left down the hall. 🤣
Not at all.
When you say that there are wonderful things happening, only we can't know about them, you are creating a distrust of the board that will bring about an outcome that neither you nor I want. Please desist. And let's not fight.
Instead let's find a new way that BRN can spread the word AND give shareholders something to hang on to.

e.g. let's give the Akida IP away free for the first three medical devices that can utilize it. Something like that - win, win, win.
 
  • Like
  • Love
Reactions: 2 users

Diogenese

Top 20
Not sure of the date on this article, but it suggests DNN both embedded and in the cloud?


CERENCE INTRODUCES NEW FEATURES IN CERENCE DRIVE, THE WORLD’S LEADING TECHNOLOGY AND SOLUTIONS PORTFOLIO FOR AUTOMAKERS AND CONNECTED CARS​

New capabilities such as enhanced voice recognition and synthetic speech serve as the foundation for a safer, more enjoyable journey for everyone
BURLINGTON, Mass. – Cerence Inc., AI for a world in motion, today introduced new innovations in Cerence Drive, its technology and solutions portfolio for automakers and IoT providers to build high-quality, intelligent voice assistant experiences and speech-enabled applications. Cerence Drive today powers AI-based, voice-enabled assistants in approximately 300 million cars from nearly every major automaker in the world, including Audi, BMW, Daimler, Ford, Geely, GM, SAIC, Toyota, and many more.
The Cerence Drive portfolio offers a distinct, hybrid approach with both on-board and cloud-based technologies that include voice recognition, natural language understanding (NLU), text-to-speech (TTS), speech signal enhancement (SSE), and more. These technologies can be deployed and tightly integrated with the wide variety of systems, sensors and interfaces found in today’s connected cars. The latest version of Cerence Drive includes a variety of new features to elevate the in-car experience:
> Enhanced, active voice recognition and assistant activation that goes beyond the standard push-to-talk buttons and wake-up words. The voice assistant is always listening for a relevant utterance, question or command, much like a personal assistant would, creating a more natural experience. In addition, Cerence’s voice recognition can run throughout the car, both embedded and in the cloud, distributing the technical load and delivering a faster user experience for drivers.
> New, deep neural net (DNN)-based NLU engine built on one central technology stack with 23 languages available both embedded and in the cloud. This streamlined approach creates new standards for scalability and flexibility between embedded and cloud applications and domains for simpler integration, faster innovation, and a more seamless in-car experience, regardless of connectivity.
> TTS and synthetic voice advancements that deliver new customizations, including a non-gender-specific voice for the voice assistant, and emotional output, which enables automakers to adjust an assistant’s speaking style based on the information delivered or tailored to a specific situation. In addition, the introduction of deep learning delivers a more natural and human-like voice with an affordable computational footprint.
> Improved, more intelligent speech signal enhancement that includes multi-zone processing with quick and simple speaker identification; passenger interference cancelation that blocks out background noise as well as voices from others in the car; and a deep neural net-based approach for greater noise suppression and better communication.
“Improving the experience for drivers and creating curated technology that feels unique and harmonious with our partners’ brands have been true motivators since we started our new journey as Cerence, and that extends to our latest innovations in Cerence Drive,” said Sanjay Dhawan, CEO, Cerence. “Cerence Drive, our flagship offering, is the driving force behind our promise of a truly moving in-car experience for our customers and their drivers, and our new innovations announced today are core to making that mission a reality. ”
Cerence Drive’s newest features are available now for automakers worldwide. To learn more about Cerence Drive, visit www.cerence.com/solutions.

Also a 2022 pdf spiel on their overall solutions package.

HERE

Guess we have to remember we also have a patent, granted in 2018, on a neuromorphic application via PVDM.


US-10157629-B2 - Low Power Neuromorphic Voice Activation System and Method​


Abstract
The present invention provides a system and method for controlling a device by recognizing voice commands through a spiking neural network. The system comprises a spiking neural adaptive processor receiving an input stream that is being forwarded from a microphone, a decimation filter and then an artificial cochlea. The spiking neural adaptive processor further comprises a first spiking neural network and a second spiking neural network. The first spiking neural network checks for voice activities in output spikes received from artificial cochlea. If any voice activity is detected, it activates the second spiking neural network and passes the output spike of the artificial cochlea to the second spiking neural network that is further configured to recognize spike patterns indicative of specific voice commands. If the first spiking neural network does not detect any voice activity, it halts the second spiking neural network.
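For what it's worth, the two-stage arrangement in the abstract boils down to a simple gating scheme. Here is a minimal, hypothetical sketch of that control flow (the stub classes are stand-ins for the two spiking networks, not BrainChip APIs):

```python
# Hypothetical control-flow sketch of US10157629B2: a small always-on network
# checks for voice activity and only then wakes the larger command recogniser.
class StubVAD:
    def detects_voice(self, spikes):
        return any(spikes)                 # placeholder for the first SNN

class StubRecogniser:
    awake = False
    def wake(self): self.awake = True
    def halt(self): self.awake = False
    def classify(self, spikes):
        return "command detected"          # placeholder for the second SNN

def process(vad, recogniser, cochlea_spikes):
    if vad.detects_voice(cochlea_spikes):
        recogniser.wake()                  # only now does the larger network run
        return recogniser.classify(cochlea_spikes)
    recogniser.halt()                      # keep the power-hungry stage off
    return None

print(process(StubVAD(), StubRecogniser(), [0, 1, 0, 1]))   # "command detected"
print(process(StubVAD(), StubRecogniser(), [0, 0, 0, 0]))   # None
```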

This Cerence patent uses hybrid SoC/Cloud:

View attachment 38365


US11462216B2 Hybrid arbitration system
A method for selecting a speech recognition result on a computing device includes receiving a first speech recognition result determined by the computing device, receiving first features, at least some of the features being determined using the first speech recognition result, determining whether to select the first speech recognition result or to wait for a second speech recognition result determined by a cloud computing service based at least in part on the first speech recognition result and the first features.
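The claim reads like a latency/confidence trade-off. A minimal sketch of what such arbitration logic might look like (the confidence feature, in-grammar flag and threshold are my assumptions, not from the patent):

```python
# Hypothetical sketch of the arbitration in US11462216B2: accept the on-device
# speech recognition result immediately, or wait for the cloud result, based on
# the device result and features derived from it.
def arbitrate(device_result, device_confidence, in_device_grammar,
              confidence_threshold=0.85):
    if in_device_grammar and device_confidence >= confidence_threshold:
        return device_result               # good enough: answer with low latency
    return None                            # defer: wait for the cloud recogniser

print(arbitrate("navigate home", 0.93, True))    # 'navigate home'
print(arbitrate("play some jazz", 0.60, False))  # None -> wait for cloud
```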
Hi Dr E,

I too share your confusion about Cerence.

However, there are some points which mitigate my concerns somewhat.

A few weeks ago, Mercedes started talking about MBUX not needing the "Hey Mercedes" wake up when there was only one person in the car, the corollary being that they still need it when there are two or more people.

Another thing is I've looked at Cerence patents, and while they discuss the use of NNs, they do not describe or claim any NN circuitry.

As you say, Mercedes found Akida to be 5 to 10 times better than other systems for "Hey Mercedes". They also used "Hey Mercedes" as an example of what Akida could do, and appeared to make reference to plural uses of Akida.

On top of that, Mercedes also stated their desire to standardize on the chips they use. Akida is sensor agnostic.

Then there's the Valeo Scala 3 lidar due out shortly, which I think may contain Akida, leaving aside Luminar with their foveated lidar, who have stated that they expect to expand their cooperation with Mercedes from mid-decade. MB used Scala 2 to obtain Level 3 ADAS certification (sub-60 kph), while Scala 3 is rated to 160 kph.

Luminar, like Cerence, talk about using AI, but do not describe its construction.

Standardizing on Akida would improve the efficiency of the MB design office, as their engineers would all be singing from the same hymn sheet in close harmony.


https://www.bing.com/videos/search?&q=Comedian+Harmonists+Songs&view=detail&mid=44C7F5D2E5E5041230E244C7F5D2E5E5041230E2&FORM=VDRVRV&ru=/videos/search?&q=Comedian+Harmonists+Songs&FORM=VDRSCL&ajaxhist=0


https://www.bing.com/videos/search?...edian+Harmonists+Songs&FORM=VDRSCL&ajaxhist=0
 
  • Like
  • Haha
Reactions: 7 users

robsmark

Regular
What you have stated is 100% how many would feel, due mainly to the lack of IP signings and revenue streams established as yet. To counter your view, I have been told that Mercedes Benz is a "client" and will do what they will do with our technology, as is their right... read that as you wish.

Sean is certainly moving around the globe now, or appears to be. I personally think the name Brainchip, and more importantly AKIDA 2.0, is starting to be more freely talked about in the tech world. Success can take many forms, and to this point we have had some tremendous success, but ultimately we all know that a business can't survive without revenue; have faith, it's coming.

What makes me laugh somewhat is persons who bag our company but still feel comfortable holding; is that the modern definition of a "fence sitter", or of a politician?

Tech in Melbourne 💎 AKD 3.0
Hey Tech,

For what it’s worth, I think that Mercedes will likely be a future customer, but as I have repeatedly mentioned in my last few posts, I find it very unlikely that they currently are, as it hasn’t been announced and the company has a continuous disclosure obligation as per ASX requirements to release this information as soon as it becomes available.

I have no issues with Sean travelling overseas to promote the product, in fact as a CEO I would expect it.

Cheers.
 
  • Like
  • Thinking
Reactions: 12 users

manny100

Regular
All the NDA talk means diddly squat too. It may or may not be true.
That is the risk. If it all comes off as expected, these prices will seem cheap. If not, well...
It's just patience from here. There will be some who cannot stand the wait and sell. There will be others who maintain a hold, or hold and accumulate like myself.
BRN has at least a two-plus-year first-mover advantage, so we need it to gather momentum before others catch up.
I note AKIDA 3 will be on the drawing board later this year.
 
  • Like
  • Fire
Reactions: 15 users
Hi Dio,

This is more a question than a statement. When Mercedes mentioned their relationship with Brainchip, a few of us did a deep dive into Cerence. Promising, but nothing definitive. Bravo was one ringleader, if I remember correctly. I have heard you state that it takes about 18 months for patents to become public. It is now about 18 months since the Mercedes connection became known, and they would have been working on this for some time. It seems to me that if Cerence were involved we should be seeing new patents coming from Cerence. Or are the current patents, as written, enough for them to be working with us?

SC
 
  • Like
Reactions: 8 users
So, reading the paragraph after & relating to 180, it seems to me they have essentially replaced the phrase "wake word" and embedded, or left open placeholders to later embed, words or phrases with weightings so the assistant recognises them... just like a wake word, without specifically saying "hey... whoever" :rolleyes:

Thanks guys for your input re the Cerence/Akida matter.

I simply thought it was Akida the hardware; and Cerence the software.

Akida on all the sensors either sending signals to act on the info directly or feeding relevant info to Nvidia’s main control unit.

Akida on Valeo’s Lidar interpreting the info.

Thanks for the deep dive.

I have found this Brainchip journey to be quite intriguing and exciting!

Many chapters still to be written in this adventure; can’t wait for the next instalment.

:)
 
  • Like
  • Love
  • Fire
Reactions: 36 users

manny100

Regular
What happened to the chips (AKIDA 1000) we received from Socionext in August 2021?
 
  • Like
  • Thinking
  • Haha
Reactions: 7 users

Getupthere

Regular
  • Like
  • Thinking
Reactions: 3 users
What happened to the chips (AKIDA 1000) we received from Socionext in August 2021?

Hi @manny100

It’s my understanding they were reference chips to prove they worked in silicon.

View attachment 38368

You could buy them to test via Development Kits etc as per below.

View attachment 38369



I imagine the same will happen with the Gen 2 variations, although there shouldn’t be as much doubt over them working, as essentially they are the same tech, just in different sizes (large, medium and small) depending on use case.

I’m confident that Brainchip has listened to the customer and is providing them what they have asked for. As Sean said, they will be more easily consumed/used and be a better fit depending on the application.

Edit: in saying that, TENNs sounds like a huge technological advancement which raises the bar significantly, so I shouldn’t ignore that new feature!

:)
 
  • Like
  • Love
Reactions: 33 users

manny100

Regular
Hi @manny100

It’s my understanding they were reference chips to prove they worked in silicon.

View attachment 38368
You could buy them to test via Development Kits etc as per below.

View attachment 38369


I imagine the same will happen with the Gen 2 variations, although there shouldn’t be as much doubt over them working, as essentially they are the same tech, just in different sizes (large, medium and small) depending on use case.

I’m confident that Brainchip has listened to the customer and is providing them what they have asked for. As Sean said, they will be more easily consumed/used and be a better fit depending on the application.

Edit: in saying that, TENNs sounds like a huge technological advancement which raises the bar significantly, so I shouldn’t ignore that new feature!

:)
Thanks, cheers
 
  • Like
Reactions: 4 users
This Cerence patent uses hybrid SoC/Cloud:

View attachment 38365

US11462216B2 Hybrid arbitration system
A method for selecting a speech recognition result on a computing device includes receiving a first speech recognition result determined by the computing device, receiving first features, at least some of the features being determined using the first speech recognition result, determining whether to select the first speech recognition result or to wait for a second speech recognition result determined by a cloud computing service based at least in part on the first speech recognition result and the first features.



https://www.bing.com/videos/search?&q=Comedian+Harmonists+Songs&view=detail&mid=44C7F5D2E5E5041230E244C7F5D2E5E5041230E2&FORM=VDRVRV&ru=/videos/search?&q=Comedian+Harmonists+Songs&FORM=VDRSCL&ajaxhist=0


https://www.bing.com/videos/search?&q=Comedian+Harmonists+Songs&view=detail&mid=252FACB077B6AE44C21A252FACB077B6AE44C21A&FORM=VDRVRV&ru=/videos/search?&q=Comedian+Harmonists+Songs&FORM=VDRSCL&ajaxhist=0
@Diogenese

Recent job vacancy at Cerence.

An indication of some of the AI they're probably using?

I note that TDNN was first developed in the late 1980s apparently :oops:



Product R&D Engineer - Acoustic Modelling

Cerence
Posted 11d ago


Job highlights
Strong background knowledge and hands-on experience in speech processing technologies, with particular familiarity in speech signal processing and speech enhancement techniques. Good understanding or experience in acoustic echo cancellation and noise reduction technologies


Who you are:

  • Strong background knowledge and hands-on experience in speech processing technologies, with particular familiarity in speech signal processing and speech enhancement techniques
  • Good understanding or experience in acoustic echo cancellation and noise reduction technologies
  • Familiarity with one or more deep learning technologies, such as TDNN, Deep Feed Forward Neural Network, and LSTM, would be an advantage
  • Proven industry or academic experience in developing large vocabulary speech recognition systems
  • Extensive knowledge and hands-on experience in statistical language modeling, including estimation methods, smoothing, pruning, efficient representation, and interpolation techniques
  • Experience in conducting accuracy experiments and systematically improving performance
  • Self-driven and diligent in solving real-world problems, with the ability to manage tasks and deliverables for multiple projects simultaneously
  • An understanding of information and application cyber security standards (secure coding, securing SDLCs, etc.)
 
  • Like
  • Thinking
Reactions: 12 users

Diogenese

Top 20
@Diogenese

Recent job vacancy at Cerence.

An indication of some of the AI they're probably using?

I note that TDNN was first developed in the late 1980s apparently :oops:



Product R&D Engineer - Acoustic Modelling

Cerence
Posted 11d ago


Job highlights
Strong background knowledge and hands-on experience in speech processing technologies, with particular familiarity in speech signal processing and speech enhancement techniques. Good understanding or experience in acoustic echo cancellation and noise reduction technologies


Who you are:

  • Strong background knowledge and hands-on experience in speech processing technologies, with particular familiarity in speech signal processing and speech enhancement techniques
  • Good understanding or experience in acoustic echo cancellation and noise reduction technologies
  • Familiarity with one or more deep learning technologies, such as TDNN, Deep Feed Forward Neural Network, and LSTM, would be an advantage
  • Proven industry or academic experience in developing large vocabulary speech recognition systems
  • Extensive knowledge and hands-on experience in statistical language modeling, including estimation methods, smoothing, pruning, efficient representation, and interpolation techniques
  • Experience in conducting accuracy experiments and systematically improving performance
  • Self-driven and diligent in solving real-world problems, with the ability to manage tasks and deliverables for multiple projects simultaneously
  • An understanding of information and application cyber security standards (secure coding, securing SDLCs, etc.)
TDNN sounds a bit like LSTM, only different.

https://www.isca-speech.org/archive/pdfs/odyssey_2020/huang20_odyssey.pdf
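For anyone curious about the difference: a TDNN has no recurrent state at all; each layer just looks at a fixed window of neighbouring frames, which amounts to a dilated 1-D convolution over time. A minimal, hypothetical sketch (sizes and dilations are illustrative only, not from Cerence or the linked paper):

```python
import torch
import torch.nn as nn

# Hypothetical sketch of a TDNN layer: unlike an LSTM, which carries a recurrent
# state, a TDNN looks at a fixed context of neighbouring frames via a dilated
# 1-D convolution over time.
class TDNNLayer(nn.Module):
    def __init__(self, in_dim=40, out_dim=64, context=3, dilation=2):
        super().__init__()
        self.conv = nn.Conv1d(in_dim, out_dim, kernel_size=context,
                              dilation=dilation)
        self.act = nn.ReLU()

    def forward(self, x):        # x: (batch, time, features)
        x = x.transpose(1, 2)    # Conv1d expects (batch, features, time)
        return self.act(self.conv(x)).transpose(1, 2)

frames = torch.randn(1, 100, 40)     # 100 frames of 40-dim acoustic features
print(TDNNLayer()(frames).shape)     # torch.Size([1, 96, 64])
```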
 
  • Like
Reactions: 5 users

cassip

Regular
Mercedes One-Eleven

 
  • Like
Reactions: 4 users

GStocks123

Regular


Worth the watch: Sailesh from Renesas presenting on AI at the endpoint/edge (specifically digital). I like those figures!
 
  • Like
  • Fire
Reactions: 10 users

GStocks123

Regular
  • Like
  • Love
  • Fire
Reactions: 22 users