An Android app that automatically generates subtitles for videos locally, without needing an internet connection.
Updated May 18, 2025 · Java
Curated list of open-source speech-to-text and voice typing tools for Linux, macOS, Windows, Android, and iOS. Offline, local, and cloud.
A Flask API that converts speech to text using offline transcription methods: CMU Sphinx and DeepSpeech.
Offline speech recognition library for Android
Chinese-language vosk-android-demo
ROBOKIDS is a smart educational robot for kids, connected to an educational app that uses technology to make learning fun. Its features include AI and deep learning, levels for basic concepts, and parental controls for safety and progress monitoring.
Voice assistant using Whisper in Python 3
Automatic video translator and dubber using Whisper, XTTS v2 for voice cloning, and Ollama for local LLM translation. Supports 100+ languages.
An offline video & audio transcription tool powered by OpenAI Whisper. Convert your tutorials, lectures, and podcasts into accurate text transcripts and use AI to generate summaries, notes, and mind maps, saving hours of time and boosting productivity.
Offline speech recognition for roboy
Use Vosk speech recognition toolkit to transcribe real-time audio from your microphone.
A Python-based offline voice assistant leveraging Vosk and Pyttsx3 to provide accessible emergency support, voice commands, and reminders for elderly users.
A Capacitor plugin that provides offline speech-to-text functionality for Android and iOS platforms. The plugin offers true offline recognition for Android with multiple languages, while iOS provides offline support for English with online fallback for other languages.
efronic-voice-assistant is a voice-controlled assistant platform that runs on a Raspberry Pi.
Control your PC using the fastest speech recognition in the world.
📝 Notes AI Bot: an intelligent note-taking assistant for Telegram. It accepts text and voice messages, converts speech to text with Vosk, classifies notes by type, and formats them from templates, extracting the gist of long messages.
Real-time transcription of Windows system audio to text via a floating, always-on-top GUI. Utilizes offline Vosk models for privacy.
A curated list of open-source speech-to-text tools for voice typing and dictation on desktop, mobile, and command-line interfaces.