Add AI Acceleration with This Tiny Coprocessor
What you’ll learn:
· How to add artificial intelligence acceleration to a host microcontroller.
· How to add always-listening keyword detection using less than 100 mW.
Artificial-intelligence (AI) developers are improving the performance of AI and machine-learning (ML) models through a range of techniques, as models like DeepSeek have demonstrated. Exploiting model sparsity is one of these methods. While much of this focus is on high-end, cloud-based solutions, the approach is equally applicable to low-power embedded solutions.
I recently talked with Sam Fok, CEO at Femtosense, about how sparsity and other techniques (Fig. 1) enable the company to provide very low-power hardware for AI edge computing. Sparse matrices are common in machine-learning models because many of the weights are zero or close to zero. Skipping the arithmetic operations for those weights can reduce computational overhead by a factor of 100 or more.
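To illustrate why sparsity helps, here's a minimal sketch (not Femtosense's implementation) of a compressed-sparse-row (CSR) matrix-vector product in Python. Only the nonzero weights are stored and multiplied, so the work scales with the number of nonzeros rather than the full matrix size:

```python
import numpy as np

def sparse_matvec(values, col_idx, row_ptr, x):
    """Multiply a CSR-encoded sparse weight matrix by vector x.

    Only nonzero weights are stored and multiplied, so a 95%-sparse
    layer does roughly 1/20th the work of a dense matrix-vector product.
    """
    y = np.zeros(len(row_ptr) - 1)
    for row in range(len(y)):
        for k in range(row_ptr[row], row_ptr[row + 1]):
            y[row] += values[k] * x[col_idx[k]]
    return y

# A 4x4 weight matrix with one nonzero per row, in CSR form.
values  = np.array([0.5, -1.2, 0.8, 2.0])  # nonzero weights only
col_idx = np.array([2, 0, 3, 1])           # column of each nonzero
row_ptr = np.array([0, 1, 2, 3, 4])        # start of each row in values
x = np.array([1.0, 2.0, 3.0, 4.0])
print(sparse_matvec(values, col_idx, row_ptr, x))  # [ 1.5 -1.2  3.2  4. ]
```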
Femtosense’s hardware is based on a sparse processing unit (SPU). This neural processing unit (NPU) is optimized to handle sparse data, often consuming under 1 mW. The SPU-001 (Fig. 2) uses an SPI interface to connect to a host processor. The evaluation board contains an SPU-001 and plugs into the PMOD connector found on many processor evaluation boards. Femtosense also offers a single-chip solution: the AI-ADAM100, which incorporates a Cortex-M0+ core and an SPU.
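As a rough sketch of what driving such a coprocessor over SPI could look like from a Linux-class host, the following Python uses the spidev library. The command bytes and transaction framing are hypothetical placeholders, not the actual SPU-001 protocol, which is defined in Femtosense's documentation:

```python
import spidev  # Linux userspace SPI driver (e.g., on a Raspberry Pi host)

# Hypothetical command bytes -- placeholders for illustration only.
CMD_WRITE_INPUT   = 0x01
CMD_RUN_INFERENCE = 0x02
CMD_READ_OUTPUT   = 0x03

spi = spidev.SpiDev()
spi.open(0, 0)               # SPI bus 0, chip-select 0
spi.max_speed_hz = 8_000_000
spi.mode = 0

def run_inference(samples: bytes, out_len: int) -> list[int]:
    """Push one frame of input data to the coprocessor, trigger the
    model, and clock out the result, all over the 4-wire SPI link."""
    spi.xfer2([CMD_WRITE_INPUT] + list(samples))
    spi.xfer2([CMD_RUN_INFERENCE])
    # First returned byte is clocked out during the command byte; drop it.
    return spi.xfer2([CMD_READ_OUTPUT] + [0x00] * out_len)[1:]
```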
The company’s software tools accept models from popular AI/ML frameworks like PyTorch and TensorFlow. The tools include a software simulator that reports power requirements, latency, and memory footprint. The SPU-001 includes 1 MB of on-chip SRAM.
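Femtosense's own toolchain isn't shown here, but a sparse model of the kind such tools ingest can be produced with PyTorch's built-in pruning utilities. A minimal sketch, with illustrative layer sizes:

```python
import torch.nn as nn
import torch.nn.utils.prune as prune

# A small keyword-spotting-sized network; layer sizes are illustrative.
model = nn.Sequential(
    nn.Linear(40, 128), nn.ReLU(),
    nn.Linear(128, 128), nn.ReLU(),
    nn.Linear(128, 10),
)

# Zero out 90% of each layer's weights by magnitude, then make the
# pruning permanent so the zeros survive export.
for module in model:
    if isinstance(module, nn.Linear):
        prune.l1_unstructured(module, name="weight", amount=0.9)
        prune.remove(module, "weight")

total = sum(p.numel() for p in model.parameters())
zeros = sum((p == 0).sum().item() for p in model.parameters())
print(f"sparsity: {zeros / total:.1%}")  # roughly 90% of weights are zero
```

The fraction of zero weights, together with the compressed encoding, determines whether a model fits within the SPU-001's 1 MB of SRAM.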
The SPU can handle a range of applications, but the focus is on audio applications such as keyword detection. Its low power requirements make an always-listening mode practical even on battery power. Currently, the SPU can be found in some earbud applications.
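To put the power budget in perspective, a back-of-the-envelope calculation (the battery capacity and draw below are assumptions for illustration, not vendor figures):

```python
# Assume a typical 60-mAh earbud cell at 3.7 V and a 1-mW
# always-listening budget for the keyword-detection path.
capacity_mwh = 60 * 3.7            # ~222 mWh of stored energy
listen_power_mw = 1.0              # SPU-class draw
print(capacity_mwh / listen_power_mw, "hours")  # ~222 hours of listening
```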