NVIDIA Jarvis: Speech Recognition, Real-Time Machine Translation, and Controllable Text-to-Speech

April 12, 2021
NVIDIA Jarvis is a framework for building multimodal conversational AI apps with state-of-the-art models optimized to run in real time. Watch to see Jarvis' automatic speech recognition (ASR) accuracy when fine-tuned on medical jargon, its real-time neural machine translation from English to Spanish and Japanese, and its powerful controllability of neural text-to-speech.
Sign up for Electronic Design Newsletters
Get the latest news and updates.

Voice Your Opinion!

To join the conversation, and become an exclusive member of Electronic Design, create an account today!