NVIDIA Jarvis: Speech Recognition, Real-Time Machine Translation, and Controllable Text-to-Speech

April 12, 2021
NVIDIA Jarvis is a framework for building multimodal conversational AI apps with state-of-the-art models optimized to run in real time. Watch to see Jarvis' automatic speech recognition (ASR) accuracy when fine-tuned on medical jargon, its real-time neural machine translation from English to Spanish and Japanese, and its powerful controllability of neural text-to-speech.

Sponsored Recommendations

Comments

To join the conversation, and become an exclusive member of Electronic Design, create an account today!