Before starting the Core, complete all of the following:

Checklist
Access Volac's genomics server
Secure connection set up
Raw data saved in a sub-folder inside input_core
Sample sheet made and saved inside input_core

Starting the Core

First enter this command:

cd ~/genome_tools/Core

Then enter one of the following commands, depending on the file extension of your raw read data.

If the file extension of your raw read data is .fastq.gz, enter this command, replacing YYYY-MM-DD with a date:

nextflow run core.nf --indexpath 'index_YYYY-MM-DD.csv' --inputfq 'YYYY-MM-DD/*_R{1,2}_001.fastq.gz'

If the file extension of your raw read data is .fq.gz, enter this command, replacing YYYY-MM-DD with a date:

nextflow run core.nf --indexpath 'index_YYYY-MM-DD.csv' --inputfq 'YYYY-MM-DD/*_R{1,2}_001.fq.gz'

The sample sheet

Use --indexpath 'index_YYYY-MM-DD.csv' to tell the Core which sample sheet to use.

For example,

--indexpath 'index_2022-04-29.csv'

The raw data

Use --inputfq 'YYYY-MM-DD/*_R{1,2}_001.fastq.gz' to tell the Core which raw data to use.

For example,

--inputfq '2022-04-29/*_R{1,2}_001.fastq.gz'

Example

The full entry will be:

nextflow run core.nf --indexpath 'index_2022-04-29.csv' --inputfq '2022-04-29/*_R{1,2}_001.fastq.gz'


Exception (for MiSeq raw data)

If your samples were sequenced using MiSeq technology (rather than NextSeq) an additional 5 characters are required to start the Core, _L001.

Use --inputfq 'YYYY-MM-DD/*_L001_R{1,2}_001.fastq.gz' to tell the Core which raw data to use.

Everything else remains the same.

MiSeq example

The full entry will be:

nextflow run core.nf --indexpath 'index_2022-04-29.csv' --inputfq '2022-04-29/*_L001_R{1,2}_001.fastq.gz'