Biological Database Search Engine is a Python-based bioinformatics project designed especially for BSc / MSc Biotechnology and Bioinformatics students.
This tool allows a user to input a human gene name and automatically fetches relevant gene and protein information from trusted biological databases. The fetched data is then presented in a simple, student-friendly format.
The main goal of this project is to help students understand:
- How biological databases work
- How gene and protein data are connected
- How to use APIs in bioinformatics using Python
Most biological databases like NCBI and UniProt provide highly technical data which is difficult for beginners to understand.
This project:
- Simplifies complex biological information
- Presents data in an easy-to-read format
- Helps students learn bioinformatics practically
- User enters a human gene name (example:
TP53) - The program searches the gene in NCBI Gene database
- Gene-related information is fetched:
- Gene symbol
- Chromosome location
- Organism name
- The same gene name is searched in UniProt
- Protein-related information is fetched:
- Protein name
- Protein length
- Function
- The program generates a student-friendly explanation explaining the gene and protein in simple language
- Gene Symbol: TP53
- Chromosome: 17
- Organism: Homo sapiens
- Protein Name: Cellular tumor antigen p53
- Protein Function: Prevents uncontrolled cell growth
- Simple explanation for students
- Python
- Biopython
- Requests
- NCBI Entrez API
- UniProt REST API
- Git & GitHub
pip install -r requirements.txt
python src/main_pipeline.py
Enter human gene name: TP53