Vosk is an offline open source speech recognition toolkit. It enables speech recognition models for 16 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi.
Vosk models are small...
More information
Roham AI platform is a collection of Speech to Text, Text to Speech, Optical Charachter Recognition, Face Recognition and Natural Language Understanding.
Roham AI platform is a collection of Speech to Text, Text to Speech, Optical Charachter Recognition, Face Recognition and Natural Language Understanding.