GitHub - AGR114molecularBreeding/scripts: Common scripts in the team

tidy_names : Read a FASTA file containing isoforms, where the headers (previously taken from a GFF file) include the IDs for the protein, mRNA, and gene. The script generates a new FASTA file with cleaned headers, where only the gene ID is shown. In cases where there are isoforms, only the longest sequence is included.
family_expansion : Read a FASTA file containing CDS, tidy the names for pairwise alignments (e.g., BLAST), and identify family members at different identity and coverage thresholds.
read_coverage : Compute the read coverage at each position in the genome based on aligned reads (from a .BAM file). Plot the read coverage and gene annotations within a specified genomic region of interest. Obtains the read counts (i.e., the number of reads assigned to each gene) for all genes in the provided GFF file.

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
family_expansion		family_expansion
read_coverage		read_coverage
tidy_names		tidy_names
README.md		README.md

Provide feedback