Hi Bravo,Not on my PC ATM and can’t drive my i-Pad very well.
Could we be involved in the new Arm Cortex-A320???
Check out the Arm blog - the link at the bottom of the post. It reads like does everything Pico can do???
The Ethos-U85 driver has now been updated so that Ethos-U85 can be driven directly by a Cortex-A320, without the need for a Cortex-M based ML island. This update improves latency and allows Arm partners to remove the cost and complexity of using a Cortex-M to drive the NPU.
Arm to support more intelligent applications at the edge with Armv9 Edge AI Platform
![]()
BY MIKE WHEATLEY
FEBRUARY 26 2025
Chipmaker Arm Holdings Plc is looking to strengthen its grip on artificial intelligence at the network edge with the debut of a powerful new lightweight processor designed to sit at the heart of intelligent internet of things applications.
????
The company unveiled the Arm Cortex-A320 central processing unit today, saying it’s the centerpiece of its all-new Armv9 Edge AI Platform, which provides all of the hardware needed to run lightweight AI workloads independently of the cloud.
???
In its pitch, Arm says the increasingly connected world we live in means that we cannot just rely on the cloud to continue processing AI workloads anymore. Use cases such as smart cities and industrial automation demand that AI applications live at the edge, and there’s an urgent need for them to process data locally to eliminate latency, but to do that we need to have the right infrastructure to run them, the company says.
That’s what it’s providing with the Armv9 Edge AI Platform, which combines the Cortex-A320 CPU with a new AI accelerator chip, the Arm Ethos-U85 neural processing unit, to run powerful AI models with up to 1 billion parameters locally on any device.
Arm said the edge platform is equipped to handle workloads such as autonomous vehicles that can navigate busy factory floors, smart cameras that must be able to process what they’re seeing, drones that carry out autonomous operations, and human-machine interfaces that drive natural, conversational interactions.
The Cortex-A320 is based on the company’s most advanced CPU architecture, Armv9, and delivers key features such as SVE2 for enhanced machine learning performance of up to 10 times its predecessor edge CPU, the Cortex-A35. It also benefits from improved security, with new capabilities such as Pointer Authentication, Branch Target Identification and Memory Tagging Extension, which enable edge devices to handle sensitive data in the most exposed locations, the company said. At the same time, Armv9 provides greater efficiency, meaning lower running costs for edge AI workloads.
Arm KleidiAI comes to the network edge
It’s one thing to provide the infrastructure for edge AI applications, and another thing to build them, but Arm has this covered too. Alongside the Armv9 Edge AI Platform, it’s also extending its Arm KleidiAI software development platform to the edge. It provides a powerful set of compute libraries to support the development of AI frameworks that can optimize AI and machine learning workloads to run on the new Armv9 Edge AI Platform, the company said.
KleidiAI is a popular platform that has already been widely integrated into IoT AI software frameworks such as Llama.cpp and ExecuTorch to accelerate the performance of lightweight large language models such as Meta Platform’s Llama 3 and Microsoft Corp.’s Phi-3. According to Arm, KleidiAI can help to boost the performance of the new Cortex-A320 CPUs by up to 70% in some scenarios.
By using KleidiAI, developers can also accelerate the time-to-market for new edge AI applications, meaning they can quickly build new solutions that grow and adapt as their requirements evolve.
The launch of the Armv9 Edge AI Platform has been warmly welcomed by customers including Amazon Web Services Inc. and the edge server manufacturer Eurotech S.p.A. For instance, AWS has already integrated the hardware into the nucleus lite runtime environment within its AWS IoT Greengrass platform for edge devices.
“This seamless integration between the two technologies provides an optimized solution for developers to build modern edge AI applications, like anomaly detection in precision agriculture, smart manufacturing and autonomous vehicles,” said AWS Vice President of IoT Yasser Alsaied.
Meanwhile, Eurotech has been quick to install Arm’s new hardware at the foundation of its latest edge computing hardware.
“Arm’s new edge AI platform provides us with the foundation to build the next generation of rich IoT devices, with Armv9 giving us access to new levels of secure performance, energy-efficiency and software flexibility,” said Eurotech Chief Technology Officer Marco Carrer.
Introducing Cortex-A320: Ultra-efficient Armv9 CPU Optimized for IoT
Unlock ultra-efficient performance, advanced AI processing, and robust security with the Cortex-A320—designed to power the future of IoT and edge AI innovation.
![]()
Introducing Cortex-A320: Ultra-efficient Armv9 CPU Optimized for IoT
Arm Cortex-A320, the first ultra-efficient Armv9 CPU, delivers advanced AI, security, and efficiency to power-constrained devices in the evolving IoT and edge AI landscape.newsroom.arm.com
In your orange highlighted bit it talks about running lightweight AI workloads independently of the cloud.
The word lightweight in my mind rules out Akida ... and I don't class TENNs as lightweight because it adds the time factor for image tracking, and, with long skip, speech processing.
Here's one I prepared earlier:
https://www.cnx-software.com/2025/0...rmv9-core-optimized-for-edge-ai-and-iot-socs/
February 27, 2025 by Jean-Luc Aufranc (CNXSoft) - 2 Commentson Arm Cortex-A320 low-power CPU is the smallest Armv9 core, optimized for Edge AI and IoT SoCs
Arm Cortex-A320 low-power CPU is the smallest Armv9 core, optimized for Edge AI and IoT SoCs
Arm Cortex-A320 is a low-power Armv9 CPU core optimized for Edge AI and IoT applications, with up to 50% efficiency improvements over the Cortex-A520 CPU core. It is the smallest Armv9 core unveiled so far.The Armv9 architecture was first introduced in 2021 with a focus on AI and specialized cores, followed by the first Armv9 cores – Cortex-A510, Cortex-A710, Cortex-X2 – unveiled later that year and targeting flagship mobile devices. Since then we’ve seen Armv9 cores on a wider range of smartphones, high-end Armv9 motherboards, and TV boxes, The upcoming Rockchip RK3688 AIoT SoC also features Armv9 but targets high-end applications. The new Arm Cortex-A320 will expand Armv9 usage to a much wider range of IoT devices including power-constrained Edge AI devices.
Arm Cortex-A320 highlights:
- Architecture – Armv9.2-A (Harvard)
- Extensions
- Up to Armv8.7 extensions
- QARMA3 extensions
- SVE2 extensions
- Memory Tagging Extensions (MTE) (including Asymmetric MTE)
- Cryptography extensions
- RAS extensions
- Microarchitecture
- In-order pipeline
- Partial superscalar support
- NEON/Floating Point Unit
- Optional Cryptography Unit
- Up to 4x CPUs in cluster
- 40-bit Physical Addressing (PA)
- Memory system and external interfaces
- 32KB or 64KB L1 I-Cache / D-Cache
- Optional L2 Cache – 128KB, 192KB, 256KB, 384KB, or 512KB
- No L3 Cache
- ECC Support
- Bus interfaces – AMBA AXI5
- No ACP, No Peripheral Port
- Security – TrustZone, Secure EL2, MTE, PAC/BTI
- Debugging
- Debug – Armv9.2-A features
- CoreSightv3
- Embedded Trace Extension (ETEv1.1)
- Trace Buffer Extension
- Misc
- Interrupts – GIC interface, GICv4.1
- Generic timer – Armv9.2-A
- PMUv3.7
The Cortex-A320 can be combined with the Ethos-U85 NPU for Edge AI, providing an upgrade path to Cortex-M85+Ethos-U85-based Endpoint AI devices, with support for LLMs with up to one billion parameters, and Linux or Android operating systems, besides RTOSes like FreeRTOS or Zephyr OS. We’re also told a quad-core Cortex-A320 can execute up to 256 GOPS, measured in 8-bit MACs/cycle when running at 2GHz.
Besides the 50% efficiency improvements over the Cortex-A520, Arm says the performance of the Cortex-A320 has improved by more than 30% in SPECINT2K6, compared to its Armv8 predecessor, the Cortex-A35 thanks to efficient branch predictors and pre-fetchers, and memory system improvements.
The Cortex-A320 also makes use of NEON and SVE2 improvements in the Armv9 architecture to deliver up to 10x better machine learning (ML) performance compared to Cortex-A35, or up to 6x higher ML performance than the Cortex-A53. With these ML improvements and high area and energy efficiencies, Arm claims that the Arm Cortex-A320 is the most efficient core in ML applications across all Arm Cortex-A CPUs.
Consider yourself ogred.
Last edited: