Note
A mini macOS SwiftUI app that lets you select a video, then transcribes its audio locally with OpenAI's Whisper to detect and list filler words such as “uh” and “um”.
- Install Python and local Whisper (small model):
brew install ffmpeg
pip install --upgrade pip
pip install git+https://github.com/openai/whisper.git
Make sure your Terminal’s Python environment is accessible by your app, or adjust the script to use absolute paths if needed.
- Open the Xcode project SilenceTrimmer.xcodeproj or .xcworkspace
- Add the whisper_local.sh script to the app bundle (Resources)
- Note: Only add it to the main SilenceTrimmer target, not the test targets
- Build and run the app on macOS 12+
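The install step asks that the Python environment be reachable from the app, or that absolute paths be used instead. Apps launched from Xcode or Finder typically do not inherit your shell's PATH, so it can help to look up the absolute paths first. The following is a minimal sketch of such a check, not part of the project; the function name `check_environment` is hypothetical.

```python
import importlib.util
import shutil
import sys


def check_environment():
    """Report where the tools installed in the setup step can be found.

    Hypothetical helper for illustration; it is not shipped with the app.
    """
    return {
        # Absolute path of the interpreter running this script.
        "python": sys.executable,
        # Absolute path to ffmpeg, or None if it is not on this process's PATH.
        "ffmpeg": shutil.which("ffmpeg"),
        # True if the `whisper` package is importable by this interpreter.
        "whisper_installed": importlib.util.find_spec("whisper") is not None,
    }


if __name__ == "__main__":
    for name, value in check_environment().items():
        print(f"{name}: {value}")
```

Run it from the same Terminal where you ran `pip install`. If you need absolute paths, the values it prints for `python` and `ffmpeg` are the ones to substitute into the script.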
- Select a video (mp4, mov)
- Click “Start Processing”
- The app converts the video into WAV audio
- Transcribes locally with Whisper
- Displays occurrences of detected words like “uh”, “um”
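The processing steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not the contents of whisper_local.sh or the app's Swift code. The function names, the 16 kHz mono WAV settings, and the filler list (limited to the two words the README names) are all assumptions. The `whisper.load_model` and `model.transcribe` calls come from the openai-whisper package installed earlier.

```python
import re
import subprocess
from collections import Counter

# Assumed filler list: only the two words the README names explicitly.
FILLERS = ("uh", "um")


def extract_wav(video_path, wav_path):
    """Convert a video (mp4, mov) to WAV audio with ffmpeg.

    16 kHz mono is an assumption; the README does not state the format.
    """
    subprocess.run(
        ["ffmpeg", "-y", "-i", video_path, "-vn", "-ac", "1", "-ar", "16000", wav_path],
        check=True,  # raise CalledProcessError if ffmpeg fails
    )


def transcribe(wav_path, model_name="small"):
    """Transcribe the audio locally and return the transcript text."""
    import whisper  # imported lazily so count_fillers works without it installed

    model = whisper.load_model(model_name)
    return model.transcribe(wav_path)["text"]


def count_fillers(text, fillers=FILLERS):
    """Count whole-word, case-insensitive occurrences of each filler word."""
    pattern = r"\b(" + "|".join(map(re.escape, fillers)) + r")\b"
    counts = Counter(
        match.group(1).lower() for match in re.finditer(pattern, text, re.IGNORECASE)
    )
    return {filler: counts.get(filler, 0) for filler in fillers}
```

For example, `count_fillers("Um, so, uh, I think um")` returns `{"uh": 1, "um": 2}`. The word-boundary match keeps words like "umbrella" from being counted as "um".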
Made with ❤️ by Pierre-Henry Soria. A super passionate & enthusiastic Problem-Solver Engineer. Also, a true cheese 🧀, ristretto ☕️ and dark chocolate lover! 😋
This Silence Trimmer is licensed under the MIT License.