Elon Musk's Research Company Develops a “Training Gym” for Artificial Intelligence

OpenAI Gym, a toolkit for developing and comparing machine learning algorithms, was recently opened to the public.

May 3, 2016

4 min read

Elon Musk, the founder of Tesla Motors and Space X, has openly shared his belief that artificial intelligence could be the “biggest existential threat” to humanity if not developed carefully. He has advocated for artificial intelligence that benefits everyone and not only a small group of companies using it to crunch piles of data.

His concerns were perhaps not entirely clear until he helped found OpenAI, an artificial intelligence research company that has promised to release its research to the public. Last December, Musk and other investors pledged $1 billion to the non-profit, which has hired several high-profile researchers over the last few months. Now, the company has revealed its first toolkit, OpenAI Gym, for developing and comparing so-called reinforcement learning algorithms.

Reinforcement-learning programs are unique in that they teach themselves abstract concepts by cycling through huge amounts of data. Usually, the program performs a specific task over and over again, learning through trial and error, without human supervision. Reinforcement learning is one of the strongest candidates for new programs that enable anything from autonomous cars to personal assistants in our smartphones.

The new tool consists of several basic environments where users can experiment with different reinforcement learning programs. In one environment, users can control virtual robots, which can train themselves to do things like walk. Users can also build programs that teach themselves to solve basic math problems or play anything from simple Atari video games to more complex board games like Go.

Reinforcement learning is turning into a major trend in artificial intelligence. The research team from DeepMind, a machine learning company under Google parent Alphabet, applied the training regimen to its AlphaGo program, which taught itself to play the ancient board game Go. The program developed new strategies and an almost instinctual feel for the game, defeating one of the world’s top players earlier this year.

The technique could eventually factor into programs built by Facebook and IBM, which are using machine learning to help chatbots have conversations with users on social media and custom programs make sense of electronic medical records. Other companies, like Geometric Intelligence, have rejected the reinforcement learning model, saying that it is too simple to teach computers the abstraction necessary to interact with humans.

OpenAI Gym was first developed for the internal research team, but the company decided to release the toolkit to the public. The company hopes that research projects inside OpenAI Gym will help forge new standards for testing machine learning programs, so that researchers can easily duplicate results published in journals.

The ability to recreate test results could help the companies investing in OpenAI to close the gap with more experienced companies like Facebook and Google. The non-profit is also supported by Paypal founder Peter Thiel and startup incubator Y-Combinator founder Sam Altman. Amazon Web Services, the online retailer’s cloud computing division, has also invested in OpenAI and partnered on the toolkit launch.

The virtual training gym is compatible with algorithms written in any framework, including Google’s Tensorflow software library, an open-source resource for building machine learning programs. It can also be used with programs built in Theano, another open-source machine learning library.

The toolkit also provides a way for users to upload and share their projects with the wider community. Rather than creating a leaderboard for users to compare results of different programs, OpenAI researchers will curate a list of algorithms that work in interesting ways.

The company eventually plans to hand control of that list to the community, with the intent of building a more collaborative space for research. “What matters for research isn’t your score,” the company writes in a blog post, “but instead the generality of your technique.”

Looking for parts? Go to SourceESB.

About the Author

James Morra

Senior Editor

James Morra is the senior editor for Electronic Design, covering the semiconductor industry and new technology trends, with a focus on power electronics and power management. He also reports on the business behind electrical engineering, including the electronics supply chain. He joined Electronic Design in 2015 and is based in Chicago, Illinois.

What’s the Difference Between DIMM and CAMM?

NVIDIA to Supply Millions of GPUs and CPUs to Meta in New AI Deal

Sponsored

Smarter Sunroof Control with MPS Power ICs

Sponsored

Elon Musk's Research Company Develops a “Training Gym” for Artificial Intelligence

About the Author

James Morra

Senior Editor

Related

What’s the Difference Between DIMM and CAMM?

NVIDIA to Supply Millions of GPUs and CPUs to Meta in New AI Deal

Smarter Sunroof Control with MPS Power ICs

Optimized Power for Wiper Control Systems

Voice Your Opinion!

To join the conversation, and become an exclusive member of Electronic Design, create an account today!

Trending

Real-Time Signal-Integrity Analysis on the 7 Series Oscilloscope

Can Digital Twins Recreate Everything Down to the Chip?

VTT Report on Donut Lab’s Solid-State Battery is a Nothingburger

Sponsored Picks

LT8645/LT8646 Synchronous Step-Down Regulators

Designing Accurate Gas Monitoring Systems with Chemiresistive Devices

Faster Timing Design and Accurate Performance Testing with Live Bench Measurement Tool