Pipeline: sanger-tol/blobtoolkit (0.6.0)

Launch ID: 1732349399_164798c85968

Go through the pipeline inputs below, setting them to the values that you would like. When you're done, click Launch and your parameters will be saved.

The page shown will show a command that you can use to directly launch the workflow. For those running on a system with no internet connection, you can copy the parameters JSON to a file and use the supplied command to launch.

Nextflow command-line flags
Nextflow command-line flags

General Nextflow flags to control how the pipeline runs.

These are not specific to the pipeline and will not be saved in any parameter file. They are just used when building the `nextflow run` launch command.
Must match pattern ^[a-zA-Z0-9-_]+$

Unique name for this nextflow run

Configuration profile

Work directory for intermediate files

Resume previous run, if found

Execute the script using the cached results, useful to continue executions that was stopped by an error

Input/output options

Define where the pipeline should find input data and save output data.

Must match pattern ^\S+\.csv$

Path to comma-separated file containing information about the samples in the experiment.

You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row.

Turn on optional alignment before running the rest of the pipeline.

Turn on optional genome masking if needed.

Skip filtering out hits from the same species in blast* searches.

Turn on the conversion from a nf-core/fetchngs samplesheet.

Must match pattern ([^,]+_odb10,)*[^,]+_odb10

Name of the ODB10 lineages to run BUSCO on. By default, the pipeline will automatically selecy the correct lineage based on the taxonomy.

Select the format of the output images.

This parameter is required

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

Must match pattern ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Email address for completion summary.

Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

Reference genome options

Reference genome related files and options required for the workflow.

This parameter is required

Name or taxonomy ID for the genome species

Genome accession where available or an identifier for draft assemblies

Must match pattern ^\S+\.fn?a(sta)?(\.gz)?$

Path to FASTA genome file.

Databases

Define the location and parameters to work with databases.

Local directory where clade-specific BUSCO lineage datasets are stored

This parameter is required

Local file that holds a mapping between BUSCO lineages and taxon IDs.

Must match pattern ^\S+\.dmnd$

Path to the Diamond species-specific buscogenes database

Must match pattern ^\S+\.dmnd$

Path to the Diamond species-specific buscoregions database

This parameter is required

Path to the nucleotide BLAST database

This parameter is required

Path to the new NCBI tax dump database

Generic options

Less common options for the pipeline, typically set in a config file.

These options are common to all nf-core pipelines and allow you to customise some of the core preferences for how the pipeline runs. Typically these options would be set in a Nextflow config file loaded for all pipeline runs, such as `~/.nextflow/config`.

Custom MultiQC yaml file containing HTML including a methods description.