Fix: Add dynamic resolution scaling to crop_video.py to support various video resolutions by cvjekim · Pull Request #3 · etri/kmsav

cvjekim · 2026-03-13T07:11:47Z

Description

This PR addresses an issue in utils/crop_video.py where the facial cropping region becomes incorrect if the downloaded video's resolution does not exactly match the base resolution (e.g., 1280x720) specified in the ASD info file.

Why is this necessary?

The original implementation assumes that the input video has the exact same dimensions as the # ImageSize defined in the annotation texts. However, videos downloaded from YouTube (via yt-dlp or similar tools) often come in 1080p or other arbitrary resolutions. When a 1080p video is processed using 720p coordinates without scaling, it results in cropping the wrong background region instead of the speaker's mouth.

Changes Made

Added a dynamic scaling logic that compares the actual resolution of the input video frame with the # ImageSize from the ASD info file.
Calculated the scale factors (sw, sh) and multiplied them by the original X, Y, W, H coordinates to correctly adjust the bounding box regardless of the video's original resolution.

How to Test

Download a 1080p or 4K video from the dataset list.
Run the modified crop_video.py.
Verify that the output .mp4 files in the utts/ directory successfully contain the cropped lip/face regions.

Fixed dynamic resolution scaling for video cropping

89590af

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Add dynamic resolution scaling to crop_video.py to support various video resolutions#3

Fix: Add dynamic resolution scaling to crop_video.py to support various video resolutions#3
cvjekim wants to merge 1 commit intoetri:mainfrom
cvjekim:main

cvjekim commented Mar 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

cvjekim commented Mar 13, 2026

Description

Why is this necessary?

Changes Made

How to Test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant