Skip to content

shubham-bioai/BioGeneXplorer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

🧬 Biological Database Search Engine (Student-Focused)

📌 Project Description

Biological Database Search Engine is a Python-based bioinformatics project designed especially for BSc / MSc Biotechnology and Bioinformatics students.

This tool allows a user to input a human gene name and automatically fetches relevant gene and protein information from trusted biological databases. The fetched data is then presented in a simple, student-friendly format.

The main goal of this project is to help students understand:

  • How biological databases work
  • How gene and protein data are connected
  • How to use APIs in bioinformatics using Python

🎯 What Problem Does This Project Solve?

Most biological databases like NCBI and UniProt provide highly technical data which is difficult for beginners to understand.

This project:

  • Simplifies complex biological information
  • Presents data in an easy-to-read format
  • Helps students learn bioinformatics practically

⚙️ How This Project Works (Step-by-Step)

  1. User enters a human gene name (example: TP53)
  2. The program searches the gene in NCBI Gene database
  3. Gene-related information is fetched:
    • Gene symbol
    • Chromosome location
    • Organism name
  4. The same gene name is searched in UniProt
  5. Protein-related information is fetched:
    • Protein name
    • Protein length
    • Function
  6. The program generates a student-friendly explanation explaining the gene and protein in simple language

🧪 Example Use Case

Input:

Output Includes:

  • Gene Symbol: TP53
  • Chromosome: 17
  • Organism: Homo sapiens
  • Protein Name: Cellular tumor antigen p53
  • Protein Function: Prevents uncontrolled cell growth
  • Simple explanation for students

🛠 Tools & Technologies Used

  • Python
  • Biopython
  • Requests
  • NCBI Entrez API
  • UniProt REST API
  • Git & GitHub

🗂 Project Structure


▶️ How to Run This Project

1️⃣ Install required libraries

pip install -r requirements.txt


python src/main_pipeline.py

Enter human gene name: TP53

Releases

No releases published

Packages

 
 
 

Contributors

Languages