NVIDIA Jarvis is a framework for building multimodal conversational AI apps with state-of-the-art models optimized to run in real time. Watch to see the accuracy of Jarvis' automatic speech recognition (ASR) when fine-tuned on medical jargon, its real-time neural machine translation from English to Spanish and Japanese, and its powerful control over neural text-to-speech output.