Tiny AI Cube Recognizes Audio Commands and Camera Images

Oct. 11, 2022
Maxim's AI Cube packs dual MAX78000 AI accelerators into a miniature reference design kit.

Check out more Kit Close-Up videos.

In this video, I take a look at Maxim Integrated's AI cube camera, the MAXREFDES178 (Fig. 1). It packs an amazing amount of computing power in a tiny package, which is great for developers looking to take advantage of the company's MAX78000 system-on-chip (SoC).

The MAX78000 has a Cortex-M4F core, but the artificial-intelligence (AI) work is normally done by the convolutional-neural-network (CNN) accelerator, which can hold 442k 8-bit weights. It supports 1-, 2-, 4-, and 8-bit weights with a programmable input image size up to 1024 × 1024 pixels, and it handles one- and two-dimensional convolutions in networks up to 64 layers deep. The cube has a built-in microphone in addition to input jacks, allowing it to process voice commands.
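To get a feel for what those accelerator limits mean when sizing a model, here is a minimal Python sketch that checks a network description against the weight widths, layer count, and input size mentioned above. It is not part of Maxim's SDK: the function names, the example layer list, and the capacity value passed in are all hypothetical, and the sketch assumes that narrower weights pack densely into the same storage, which the real tools may handle differently.

```python
# Illustrative sketch only; not Maxim's tooling. Limits mirror the text above.
SUPPORTED_WEIGHT_BITS = (1, 2, 4, 8)  # weight widths named in the text
MAX_LAYERS = 64                       # layer limit named in the text
MAX_INPUT_SIDE = 1024                 # input limit in pixels per side


def weight_storage_bits(layers):
    """Total storage in bits for a list of (weight_count, bits_per_weight)."""
    total = 0
    for weight_count, bits in layers:
        if bits not in SUPPORTED_WEIGHT_BITS:
            raise ValueError(f"unsupported weight width: {bits} bits")
        total += weight_count * bits
    return total


def check_model(layers, input_width, input_height, capacity_8bit_weights):
    """Return a list of problems; an empty list means these checks passed.

    capacity_8bit_weights is the accelerator's weight capacity expressed as
    a count of 8-bit weights. Dense packing of narrower weights is assumed.
    """
    problems = []
    if len(layers) > MAX_LAYERS:
        problems.append(f"{len(layers)} layers exceeds {MAX_LAYERS}")
    if input_width > MAX_INPUT_SIDE or input_height > MAX_INPUT_SIDE:
        problems.append(f"input {input_width}x{input_height} is too large")
    budget_bits = capacity_8bit_weights * 8
    used_bits = weight_storage_bits(layers)
    if used_bits > budget_bits:
        problems.append(f"weights need {used_bits} bits, budget is {budget_bits}")
    return problems


# Hypothetical three-layer network; the counts are invented for illustration.
example_layers = [(1_728, 8), (36_864, 4), (73_728, 2)]

# The capacity below is a placeholder, not the device figure; substitute the
# weight capacity quoted for the part you are targeting.
print(check_model(example_layers, 128, 128, capacity_8bit_weights=100_000))
```

Dropping a layer from 8-bit to 2-bit weights cuts its storage to a quarter in this model, which is why the mixed widths matter on a part with a fixed weight memory.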

Inside, the system consists of two boards plus a battery (Fig. 2). The connectivity board includes a MAX32666 microcontroller with Bluetooth 5 support. A matching set of apps for Android and Apple smartphones can be used to configure the system as well as download trained models to the built-in applications. These applications work right out of the box, but you can use the app to add more images to be recognized without any further software modifications.

The two boards are connected by a ribbon cable (Fig. 3) that carries QSPI and I2C interfaces. The dual-core MAX32666 manages wireless communication as well as the USB interface. A debug module plugs into the USB Type-C socket so that program changes can be made. A microSD card slot also can be used for software and data. In addition, the microcontroller handles the touchscreen and the BMI160 accelerometer and gyro.

The MAX78000 dedicated to video takes input directly from the Omnivision Technologies OVM7692 camera, which has 640 × 480 resolution. It also can control the two LEDs directly. The other MAX78000 handles audio input from a digital MEMS microphone (SPH0645LM4H) on the front panel. The MAX34417 is an SMBus four-channel, high-dynamic-range power accumulator with a multiplexed ADC. It can be used to track the power utilization of the MAX78000 chips.

The MAX78000 chips can work with multiple machine-learning (ML) models. They're powerful enough to support real-time image recognition, such as face identification, along with voice analysis. The demos include voice-command recognition and image recognition of faces and animals.

All software and schematics are available as well. The software development kit (SDK) works with Mac, Windows, and Linux. 

Overall, this is a reference kit with lots of potential. The demos highlight the capabilities of the system and there's plenty to test simply by adjusting the recognition models. The training is done in the cloud, although the models are strictly for the demonstration programs. Training models for a specific application will take a bit more work. 
