New Standard for Speech Recognition and Translation from the NVIDIA NeMo Canary Model

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team…

NVIDIA NeMo is an end-to-end platform for the development of multimodal generative AI models at scale anywhere—on any cloud and on-premises. The NeMo team just released Canary, a multilingual model that transcribes speech in English, Spanish, German, and French with punctuation and capitalization. Canary also provides bi-directional translation, between English and the three other supported…

Source

Leave a Reply

Your email address will not be published.

Previous post Pushing the Boundaries of Speech Recognition with NVIDIA NeMo Parakeet ASR Models
Next post A site literally called ‘Spy.pet’ claims to have scraped billions of public Discord messages and wants to sell them