VIS/docs/tutorial/tutorial_before.md at main · liscruk/VIS

Before running the pipeline

Prepare config.yml

To inform the pipeline about the location of our samples and other dependencies, we need to open and edit the configuration file. Open or create a new config.yml file and fill in the missing dependencies as outlined below:

# tutorial config
experiment: "simulation_tutorial"
samples:
  S1: "tutorial/simulated/S1.bam"
  S2: "tutorial/simulated/S2.bam"
processing_dir: "tutorial/out"
threads: 2
insertion_fasta: "tutorial/references/vectorseq.fa"
splitmode: "Buffer"
fragment_size: 100
bridging_size: 300
MinLength: 1
MAPQ: 10
MinInsertionLength: 500
ref_genome_ctrl: "tutorial/references/chr1_1_50000_ref.fa"
annotate_ucsc_genes: "tutorial/references/UCSC_genes_chr1_0_500000_processed.bed"
detection: "rules/detection.smk"
quality_control: "rules/qc.smk"
functional_genomics: "rules/functional_genomics.smk"

Check setup

Before we begin, make sure you have all the necessary files ready. You can quickly verify this by running:

    snakemake -n

If you see a list of jobs waiting for execution, you're all set for the next steps.

!!! info

For this tutorial, we will run the workflow using the recommended setup. If you have chosen the alternative version (i.e., one virtual environment (`VIS_full`)), simply remove `--use-conda` from following `Snakemake` commands.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Before running the pipeline

Prepare config.yml

Check setup

FilesExpand file tree

tutorial_before.md

Latest commit

History

tutorial_before.md

File metadata and controls

Before running the pipeline

Prepare config.yml

Check setup