Pipeline: sanger-tol/ascc (0.1.0)

Launch ID: 1740254441_4275e8a9e48e

Go through the pipeline inputs below, setting them to the values that you would like. When you're done, click Launch and your parameters will be saved.

The page shown will show a command that you can use to directly launch the workflow. For those running on a system with no internet connection, you can copy the parameters JSON to a file and use the supplied command to launch.

Nextflow command-line flags
Nextflow command-line flags

General Nextflow flags to control how the pipeline runs.

These are not specific to the pipeline and will not be saved in any parameter file. They are just used when building the `nextflow run` launch command.
Must match pattern ^[a-zA-Z0-9-_]+$

Unique name for this nextflow run

Configuration profile

Work directory for intermediate files

Resume previous run, if found

Execute the script using the cached results, useful to continue executions that was stopped by an error

Input/output options

Define where the pipeline should find input data and save output data.

Must match pattern ^\S+\.csv$

Path to comma-separated file containing information about the samples in the experiment.

You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row.

There maybe occasions where the end user only has the Primary Genomic Assembly, in this case this value should be --genomic_only true. The value --exclude organellar_blast should also be included in the CLI

Scientific name of sample according to NCBI

Must be an integer

TaxID according to NCBI

folder containing long read data files in FASTA.GZ format

Type of read (hifi)

Must match pattern ^\S+\.fa$

Fasta format file containing barcode data

barcodes to use in run

Length of kmer to use in run

dimensionality_reduction_methods to use in runs

folder containing nt database

Prefix of the NT database

folder containing Kraken2 database

folder containing accession2taxid file

folder containing dmp files

Must match pattern ^\S+\.dmp$

Path to rankedlineage.dmp

BUSCO folder containing busco Lineages folder

CSV list of lineages to use in run

FCS_GX folder containing database

Vecscreen folder containing database

Must match pattern ^\S+\.dmnd$

Path to Uniprot Diamond DB

Must match pattern ^\S+\.dmnd$

Path to NR Diamond DB

Length of slide for Seqkit

Length of the window for Seqkit

Nearest Neighbour value - This is for the dimensional reduction of kmer counts (UMAP and t-SNE).

Must match pattern ^\S+\.yaml$

Path to BTK yaml

This parameter is required

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

Must match pattern ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Email address for completion summary.

Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

Select steps to run.

Deselect steps to run.

Select steps to run - specific to the organallar workflow.

Deselect steps to run - specific to the organallar workflow.

Should btk be run as mandatory or conditional on levels of contamination.

Generic options

Less common options for the pipeline, typically set in a config file.

These options are common to all nf-core pipelines and allow you to customise some of the core preferences for how the pipeline runs. Typically these options would be set in a Nextflow config file loaded for all pipeline runs, such as `~/.nextflow/config`.

Custom MultiQC yaml file containing HTML including a methods description.