
srt-maker

CLI tool to generate SRT subtitles from video audio using speech recognition.

Features

  • Automatic speech recognition using OpenAI Whisper (local, offline)
  • Language detection support
  • Progress indicators for transcription
  • Configurable output parameters
  • Timestamp precision control
  • Minimum subtitle display duration for better readability

Requirements

  • Python 3.9+
  • ffmpeg (installed on system)

Installation

Install System Dependencies

Ubuntu/Debian:

sudo apt-get install ffmpeg

macOS:

brew install ffmpeg

Windows: Download from https://ffmpeg.org/download.html

Install Python Package

pip install -e .

For development:

pip install -e ".[dev]"

Usage

Basic Usage

srt-maker video.mp4

This generates video.srt in the same directory.
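SRT is a plain-text format: numbered cues, each with a start/end timestamp pair and one or more lines of text. An illustrative excerpt of what a generated file looks like (timestamps and dialogue here are made up):

```
1
00:00:01,000 --> 00:00:03,500
Hello, and welcome to the video.

2
00:00:03,700 --> 00:00:06,200
Let's get started.
```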

Options

srt-maker video.mp4 [OPTIONS]

Options:
  video_file                  Path to the input video file (required)
  -o, --output OUTPUT         Output SRT file path (default: <video_name>.srt)
  -m, --model MODEL           Whisper model size: tiny, base, small, medium, large, large-v1, large-v2, large-v3 (default: base)
  -l, --language LANG         Language code (e.g., en, es, fr). Auto-detect if not specified
  -p, --precision N           Timestamp precision in milliseconds (default: 0)
  -d, --device DEVICE         Device to run the model on: cpu, cuda, auto (default: auto)
  --min-display-duration N    Minimum display duration for subtitles in seconds (default: 0.0, i.e. use the actual speech duration)
  --no-speech-threshold N     Filter segments with no_speech_prob above this value (default: 0.6)
  --logprob-threshold N       Filter segments with avg_logprob below this value (default: -1.0)
  --min-duration N            Minimum segment duration in seconds (default: 0.1)
  --max-repetitions N         Max consecutive repetitions of same text (default: 2)
  --offset N                  Time offset in seconds to add to all timestamps (default: 0.0)
  -v, --verbose               Enable verbose logging
  --help                      Show help message
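The --no-speech-threshold and --logprob-threshold options mirror per-segment fields that Whisper reports (no_speech_prob and avg_logprob). A minimal sketch of how such filtering could work; the function name and exact behavior are illustrative, not this tool's actual implementation:

```python
def keep_segment(segment, no_speech_threshold=0.6, logprob_threshold=-1.0):
    """Drop segments Whisper considers likely silence or low confidence."""
    return (segment["no_speech_prob"] <= no_speech_threshold
            and segment["avg_logprob"] >= logprob_threshold)

# Hypothetical Whisper output: one confident segment, one likely noise.
segments = [
    {"text": "Hello there", "no_speech_prob": 0.05, "avg_logprob": -0.3},
    {"text": "[noise]", "no_speech_prob": 0.92, "avg_logprob": -0.4},
]
kept = [s for s in segments if keep_segment(s)]
```

Raising --no-speech-threshold keeps more borderline segments; lowering --logprob-threshold keeps lower-confidence transcriptions.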

Examples

Generate subtitles with custom output path:

srt-maker video.mp4 -o subtitles.srt

Use tiny model for faster transcription (less accurate):

srt-maker video.mp4 -m tiny

Use large model for better accuracy (slower):

srt-maker video.mp4 -m large

Specify language for better accuracy:

srt-maker video.mp4 -l en

Force CPU usage:

srt-maker video.mp4 -d cpu

Extend short subtitles for better readability (2 second minimum display duration):

srt-maker video.mp4 --min-display-duration 2.0
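To illustrate what --min-display-duration does, here is a rough sketch of the idea in Python (a hypothetical helper, not the tool's actual code). It assumes an extended subtitle should never overlap the start of the next one:

```python
def extend_durations(segments, min_display=2.0):
    """Stretch short (start, end, text) segments to a minimum duration,
    without overlapping the following subtitle."""
    result = []
    for i, (start, end, text) in enumerate(segments):
        if end - start < min_display:
            end = start + min_display
            if i + 1 < len(segments):
                end = min(end, segments[i + 1][0])  # clamp to next cue's start
        result.append((start, end, text))
    return result

subs = [(0.0, 0.4, "Hi."), (5.0, 8.0, "Long line of dialogue.")]
stretched = extend_durations(subs, 2.0)
```

Here the first cue is stretched from 0.4 s to the 2.0 s minimum, while the second already exceeds it and is left alone.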

Development

Run Tests

Run all tests:

pytest

Run with coverage:

pytest --cov=srt_maker

Run specific test file:

pytest tests/test_audio_extractor.py

Watch Mode

Run tests in watch mode for continuous feedback during development:

./test_runner.sh --watch

Linting

Run linting checks:

pyflakes srt_maker/**/*.py

Running the Test Runner

The test_runner.sh script provides automated testing with continuous feedback:

# Run all tests with coverage
./test_runner.sh

# Skip slow tests
./test_runner.sh --skip-slow

# Watch mode (re-runs on file changes)
./test_runner.sh --watch

Project Structure

srt-maker/
├── srt_maker/
│   ├── __init__.py
│   ├── audio_extractor.py   # Audio extraction from video
│   ├── transcriber.py       # Whisper speech recognition
│   ├── srt_generator.py     # SRT file formatting
│   └── cli.py               # CLI interface
├── tests/
│   ├── conftest.py          # Test fixtures
│   ├── test_audio_extractor.py
│   ├── test_transcriber.py
│   ├── test_srt_generator.py
│   └── test_cli.py
├── test_runner.sh           # Automated test runner
└── pyproject.toml

How It Works

  1. Audio Extraction: Extract audio track from video using ffmpeg
  2. Speech Recognition: Use OpenAI Whisper to transcribe audio segments
  3. Language Detection: Automatically detect the spoken language, or use the one specified with -l
  4. SRT Generation: Format segments into SRT subtitle format
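The SRT generation step boils down to rendering each segment's start and end times as HH:MM:SS,mmm timestamps. A self-contained sketch of that step, assuming segments carry start/end in seconds (not the project's actual srt_generator code):

```python
def srt_timestamp(seconds):
    """Render seconds as an SRT timestamp, e.g. 3661.042 -> 01:01:01,042."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"

def srt_block(index, start, end, text):
    """Format one numbered SRT cue."""
    return f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
```

Note the comma as the milliseconds separator; SRT uses a comma where most other timestamp formats use a period.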

License

MIT
