Multicore Mobile GPU Handles Computation Chores

Nov. 22, 2011
Arm's latest Mali-T658 architecture pushes GPGPU computational features into the embedded and mobile spaces.

Arm Mali-T658 architecture

Arm GPU delivery timeline

Mali-T658 core architecture

Mali-T658 job scheduling

Arm provides two families of GPUs, Utgard and Midgard. The Utgard provides graphics display support while the Midgard is the more advanced GPU with user computational capability as well. Arm's latest Mali-T658 architecture (Fig. 1) is the new top end for the Midgard family. It targets the superphone and midrange smartphone platforms (Fig. 2) as well as embedded multimedia mobile devices.

The Mali-T658 core doubles the number of cores, versus the Mali-T604, to eight and doubles the number of arithmetic units to four (Fig. 3). It also employs hardware-based job scheduling (Fig. 4) that has the capability to turn cores on and off thereby reducing power requirements. This is normally handled in software.

The new 8-core family meshes well with Arm's big.LITTLE announcement (see Little Core Shares Big Core Architecture). The Mali-T658 can be combined with the low power Cortex-A7 and the powerful Cortex-A15 using Arm's CoreLink Interconnect. There is cache coherency between the GPU and CPU. Likewise, the MMU page table setup the same for both platforms. The Mali-T658 is also compatible with 64-bit ARMv8 architecture.

The arithmetic units can handle double precision floating point values. This is key for API support of OpenCL, Google RenderScript, and Microsoft DirectCompute. More applications are staring to take advantage of this type of computational capability.

The Mali-T658 along with some combination of Cortex cores will compete with platforms like NVidia's Tegra 3. The Tegra 3 has a ULP (ultra low power) GeForce GPU with up to 12 cores. Four Cortex-A9's are on the CPU side of the Tegra 3.

AMD's Fusion APU (Accelerated Processing Unit) does not target smartphones but it does combine CPU and GPU. The GPU supports OpenCL in addition to providing display graphics support.

Now the challenge for developers is how to balance graphics acceleration with computation acceleration.

About the Author

William G. Wong | Senior Content Director - Electronic Design and Microwaves & RF

I am Editor of Electronic Design focusing on embedded, software, and systems. As Senior Content Director, I also manage Microwaves & RF and I work with a great team of editors to provide engineers, programmers, developers and technical managers with interesting and useful articles and videos on a regular basis. Check out our free newsletters to see the latest content.

You can send press releases for new products for possible coverage on the website. I am also interested in receiving contributed articles for publishing on our website. Use our template and send to me along with a signed release form. 

Check out my blog, AltEmbedded on Electronic Design, as well as his latest articles on this site that are listed below. 

You can visit my social media via these links:

I earned a Bachelor of Electrical Engineering at the Georgia Institute of Technology and a Masters in Computer Science from Rutgers University. I still do a bit of programming using everything from C and C++ to Rust and Ada/SPARK. I do a bit of PHP programming for Drupal websites. I have posted a few Drupal modules.  

I still get a hand on software and electronic hardware. Some of this can be found on our Kit Close-Up video series. You can also see me on many of our TechXchange Talk videos. I am interested in a range of projects from robotics to artificial intelligence. 

Sponsored Recommendations

Comments

To join the conversation, and become an exclusive member of Electronic Design, create an account today!