Fix #41: Add explicit binary detection for PDF files by Dreamstick9 · Pull Request #48 · aboutcode-org/typecode

Dreamstick9 · 2026-01-16T17:38:54Z

Some PDF files were being incorrectly identified as text because they start with a text header (%PDF-)
This change adds a check in src/typecode/contenttype.py
if a file is initially detected as text, it now looks for a %PDF- signature and correctly sets it to binary if found.
i also added a regression test in tests/test_testcontenttype.py to cover this case
Fixes #41

Signed-off-by: Kushagar Garg <dreamstick909@gmail.com>

Dreamstick9 · 2026-01-21T15:13:14Z

@AyanSinhaMahapatra If you have some free time Could you please review this pr

Fix aboutcode-org#41: Fix PDF binary detection and apply formatting

1cea9bb

Signed-off-by: Kushagar Garg <dreamstick909@gmail.com>

Dreamstick9 force-pushed the main branch from 878d260 to 1cea9bb Compare January 16, 2026 17:46

Add Kushagar Garg to the list of authors

67733ce

Dreamstick9 closed this Apr 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix #41: Add explicit binary detection for PDF files#48

Fix #41: Add explicit binary detection for PDF files#48
Dreamstick9 wants to merge 2 commits into
aboutcode-org:mainfrom
Dreamstick9:main

Dreamstick9 commented Jan 16, 2026

Uh oh!

Dreamstick9 commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

Dreamstick9 commented Jan 16, 2026

Uh oh!

Dreamstick9 commented Jan 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant