A super tiny speech-to-text tool.
It's written in a few lines of Bash, and built with common tools you likely already have installed.
supertinywhisper records your audio using ffmpeg (with settings optimized for speech), transcribes it using the OpenAI API, and outputs the resulting transcription to standard output. You can then pipe that into other tools and compose it however you like!
- libnotify - desktop notifications
-
Get your API key from OpenAI Platform.
-
Create the config directory and add your API key:
mkdir -p ~/.config/supertinywhisper
echo "your-api-key-here" > ~/.config/supertinywhisper/openai_api_keyFor best experience, supertinywhisper should be bound to a keyboard shortcut and composed with other tools.
# X11
supertinywhisper | xdotool type --clearmodifiers --file -
# Wayland
supertinywhisper | wtype -# X11
supertinywhisper | xclip -selection clipboard
# Wayland
supertinywhisper | wl-copySpecify the language using the ISO 639 code. This improves model accuracy and latency. Defaults to English.
# Spanish
supertinywhisper --language es | wtype -
# French
supertinywhisper --language fr | wtype -supertinywhisper --cancel