RTSP Face Recognition System

A real-time face recognition system for RTSP video streams

Features • Installation • Usage • Documentation • Troubleshooting

📋 Overview

A real-time face recognition system that processes RTSP video streams using YuNet for face detection and ArcFace for face recognition. The system can identify enrolled individuals and log detections with automatic image capture.

🛠️ Technologies

Python 3.8+ - Core programming language
OpenCV - Computer vision and face detection (YuNet)
ONNX Runtime - Efficient model inference
NumPy - Numerical operations and embeddings
ArcFace - Deep learning face recognition model

Features

Real-time Processing: Monitors RTSP video streams continuously
Face Detection: Uses YuNet (OpenCV) for robust face detection
Face Recognition: Employs ArcFace embeddings for accurate face matching
Auto-enrollment: Simple image-based enrollment system
Smart Logging: Captures detected faces with configurable cooldown periods
Unknown Face Detection: Optional logging of unrecognized faces
Performance Optimized: Configurable frame sampling and downscaling

Prerequisites

Python 3.8+
RTSP camera stream access
ONNX models (see Model Setup)

Installation

Clone the repository:

git clone <repository-url>
cd <repository-name>

Install required dependencies:

pip install opencv-python numpy onnxruntime python-dotenv

Create required directories:

mkdir -p enroll models output/matches output/unknown

Model Setup

Download the required ONNX models and place them in the models/ directory:

YuNet Face Detector: face_detection_yunet_2023mar.onnx
- Download from OpenCV Zoo
ArcFace Recognition Model: w600k_r50.onnx
- Download from InsightFace

Configuration

Create a .env file in the project root:

RTSP_URL=rtsp://username:password@camera-ip:port/stream

Configurable Parameters

Edit the following constants in cam.py:

Parameter	Default	Description
`DETECT_EVERY_N_FRAMES`	10	Process every Nth frame for performance
`DOWNSCALE_DETECT`	0.5	Scale factor for detection (0.5 = 50% size)
`MIN_FACE_SIZE`	40	Minimum face size in pixels to process
`RECOG_THRESHOLD`	0.45	Recognition threshold (lower = stricter)
`SAVE_COOLDOWN_SEC`	3.0	Seconds between saves for same person
`SAVE_UNKNOWN`	True	Whether to save unknown faces
`PAD_FACTOR`	0.1	Padding around detected faces (10%)

Usage

1. Enroll Faces

Add images to the enroll/ directory with the naming convention:

enroll/
├── john_1.jpg
├── john_2.jpeg
├── jane_001.png
└── bob_photo.jpg

Naming Rules:

The person's name is everything before the first underscore
Example: john_1.jpg → Person name: "john"
Supported formats: .jpg, .jpeg, .png
Include multiple photos per person for better accuracy

2. Run the System

python cam.py

The system will:

Load and process enrollment images
Connect to the RTSP stream
Detect and recognize faces in real-time
Save matched and unknown faces to output/

3. Review Results

Output images are saved in:

output/matches/ - Recognized faces with names
output/unknown/ - Unrecognized faces

File naming format:

{name}_{timestamp}_d{distance}.jpg

How It Works

Detection Pipeline

Stream Capture: Connects to RTSP stream with minimal buffering
Frame Sampling: Processes every Nth frame to optimize performance
Downscaling: Reduces frame size for faster detection
Face Detection: YuNet identifies faces and bounding boxes
Face Extraction: Crops and pads detected faces
Embedding: ArcFace generates 512-dimensional embeddings
Matching: Compares embeddings with enrolled gallery
Logging: Saves annotated frames and face crops

Recognition Process

Uses cosine distance between normalized embeddings
Distance threshold determines match/unknown classification
Lower distance = higher similarity (0 = identical, 2 = opposite)
Cooldown prevents duplicate saves of the same person

Troubleshooting

No Faces Detected in Enrollment

Symptoms: "Sem rosto" messages during enrollment

Solutions:

Ensure faces are clearly visible and well-lit
Use higher resolution images (recommended: 640px minimum)
Check that faces occupy at least 20% of the image
Verify image files are not corrupted

RTSP Connection Failed

Symptoms: "Não abriu RTSP" error

Solutions:

Verify RTSP URL format and credentials
Test stream with VLC or ffplay first
Check network connectivity to camera
Ensure camera supports RTSP protocol

Poor Recognition Accuracy

Symptoms: Wrong matches or too many unknowns

Solutions:

Adjust RECOG_THRESHOLD (lower = stricter, higher = looser)
Add more enrollment photos per person (5-10 recommended)
Use varied angles and lighting in enrollment photos
Increase MIN_FACE_SIZE to filter distant faces

Performance Issues

Symptoms: Lag or high CPU usage

Solutions:

Increase DETECT_EVERY_N_FRAMES to process fewer frames
Reduce DOWNSCALE_DETECT further (try 0.3 or 0.25)
Use GPU acceleration with CUDA providers in ONNXRuntime
Lower camera stream resolution at source

Advanced Configuration

GPU Acceleration

To use GPU acceleration, modify the ONNX Runtime provider:

arc_sess = ort.InferenceSession(
    ARCFACE_PATH, 
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"]
)

Requires: onnxruntime-gpu and NVIDIA GPU with CUDA support

Custom Detection Parameters

Adjust YuNet detector settings:

detector = cv2.FaceDetectorYN.create(
    YUNET_PATH,
    "",
    (320, 320),
    score_threshold=0.6,  # Higher = fewer false positives
    nms_threshold=0.3,    # Non-maximum suppression
    top_k=5000            # Max faces per frame
)

Output Format

Saved Images

Each detection saves two images:

Annotated Frame: Full frame with bounding box and label
Face Crop: Extracted face region (commented out by default)

Console Output

[ENROLL] OK: john <- john_1.jpg
[GALLERY] embeddings: 3
RTSP aberto com sucesso!
[SAVE] output/matches/john_20260115_143022_123456_d0.234.jpg | ... (name=john, 0, d=0.234, bbox=120,80,150,180)

Project Structure

.
├── cam.py                  # Main application
├── .env                    # Configuration (not in git)
├── enroll/                 # Enrollment images
├── models/                 # ONNX model files
│   ├── face_detection_yunet_2023mar.onnx
│   └── w600k_r50.onnx
└── output/                 # Detection results
    ├── matches/           # Recognized faces
    └── unknown/           # Unrecognized faces

Security Considerations

Store .env file securely (never commit to git)
Use strong RTSP credentials
Implement access controls for output directory
Consider data retention policies for saved images
Ensure compliance with local privacy regulations

Acknowledgments

OpenCV YuNet face detection model
InsightFace ArcFace recognition model
ONNXRuntime for efficient inference

Support

For issues and questions:

Open an issue on GitHub
Check troubleshooting section above
Review debug logs in console output

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
cam.py		cam.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RTSP Face Recognition System

📋 Overview

🛠️ Technologies

Features

Prerequisites

Installation

Model Setup

Configuration

Configurable Parameters

Usage

1. Enroll Faces

2. Run the System

3. Review Results

How It Works

Detection Pipeline

Recognition Process

Troubleshooting

No Faces Detected in Enrollment

RTSP Connection Failed

Poor Recognition Accuracy

Performance Issues

Advanced Configuration

GPU Acceleration

Custom Detection Parameters

Output Format

Saved Images

Console Output

Project Structure

Security Considerations

Acknowledgments

Support

About

Uh oh!

Releases

Packages

Languages

maubaum/face-recognition

Folders and files

Latest commit

History

Repository files navigation

RTSP Face Recognition System

📋 Overview

🛠️ Technologies

Features

Prerequisites

Installation

Model Setup

Configuration

Configurable Parameters

Usage

1. Enroll Faces

2. Run the System

3. Review Results

How It Works

Detection Pipeline

Recognition Process

Troubleshooting

No Faces Detected in Enrollment

RTSP Connection Failed

Poor Recognition Accuracy

Performance Issues

Advanced Configuration

GPU Acceleration

Custom Detection Parameters

Output Format

Saved Images

Console Output

Project Structure

Security Considerations

Acknowledgments

Support

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages