FaceDuplicationFilter

Content

FaceDuplicationFilter

Project Overview

This project is based on the eDifFIQA model, combining its face quality assessment capabilities to develop a face quality-enhanced duplicate data filtering system for identifying and removing duplicate face images.

Face recognition technology has long been a hot topic in cutting-edge academic research. With the advancement of AI technology, various face recognition techniques heavily rely on large-scale training datasets. However, these datasets often contain significant redundant data which, if not properly filtered, may become high-frequency noise that affects model training. This could lead to suboptimal training results, slower convergence, or even abnormal gradient changes during training. Therefore, establishing a reliable face data cleaning system is crucial.

Our system addresses this need by utilizing a carefully trained face quality assessment model to effectively extract the most distinctive face data while filtering out low-distinctiveness redundant data. This process not only improves training efficiency but also significantly accelerates model convergence, thereby enhancing overall system performance and reliability.

Technical Innovations

K-Fold Cross Validation for Optimal Face Selection:
- We employ k-fold cross validation to obtain optimal face combinations
- The evaluation criteria is based on distances between faces
- Particularly effective for our product's scenario of small-batch, high-duplication applications
Enhanced Face Representation with eDifFIQA:
- Compared to traditional average distance methods, we introduce the eDifFIQA model for quality assessment
- Use quality-weighted averages as face representation
- Considers factors like face angle, noise, brightness, and camera distortion
Advanced Quality Assessment Methodology:
- Diffusion process using a custom UNet model for generating noisy and reconstructed images
- Process repeated on horizontally flipped images to capture pose impact
- Quality score calculation through embedding comparison
- Enhanced with knowledge distillation and label optimization:
  - Quality label optimization using relative position information from FR model embedding space
  - Representation consistency loss (Lrc) and quality loss (Lq) for improved prediction

References

@article{babnikTBIOM2024,
  title={{eDifFIQA: Towards Efficient Face Image Quality Assessment based on Denoising Diffusion Probabilistic Models}},
  author={Babnik, {\v{Z}}iga and Peer, Peter and {\v{S}}truc, Vitomir},
  journal={IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM)},
  year={2024},
  publisher={IEEE}
}

System Requirements

Windows environment (Python embedded package)
Linux environment (theoretically supported, requires Python configuration)

Environment Setup and Usage

Method 1: Complete Package

Download the complete package via HuggingFace linkDownload
Extract the package
Run start.bat on Windows

Method 2: Conda Installation

Create a Python 3.10 environment
Install packages from requirements.txt in the project root
Download model weights according to instructions from eDifFIQA:
- Recommended weights: r100.pth and ediffiqaL.pth
- Place them in the weights folder
Run allmain.py

License

Follow the original project eDifFIQA, open source licenses.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.vscode		.vscode
configs		configs
face/vggface		face/vggface
log_csv		log_csv
model		model
python-3.10.10-embed-amd64		python-3.10.10-embed-amd64
selected		selected
weights		weights
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
Ui_test.py		Ui_test.py
allmain.py		allmain.py
dataset.py		dataset.py
inference_sigle_function.py		inference_sigle_function.py
loss.py		loss.py
main_meng_1.py		main_meng_1.py
main_meng_2.py		main_meng_2.py
main_wy_1.py		main_wy_1.py
prepare_data.py		prepare_data.py
requirements.txt		requirements.txt
start.bat		start.bat
test.ui		test.ui
train.py		train.py
utils.py		utils.py
video_gen.py		video_gen.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FaceDuplicationFilter

Content

Project Overview

Technical Innovations

References

System Requirements

Environment Setup and Usage

Method 1: Complete Package

Method 2: Conda Installation

License

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FaceDuplicationFilter

Content

Project Overview

Technical Innovations

References

System Requirements

Environment Setup and Usage

Method 1: Complete Package

Method 2: Conda Installation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages