sanger-tol/
genomenote

Nextflow DSL2 pipeline to generate a Genome Note, including assembly statistics, quality metrics, and Hi-C contact maps. This workflow is part of the Tree of Life production suite.

Parameters

Input/output options

Define where the pipeline should find input data and save output data.

`--input`

type: 'string'

Path to comma-separated file containing information about the samples in the experiment.

required

You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row. See usage docs.

pattern: ^\S+\.csv$

`--ancestral_table`

type: 'string'

TSV containing mappings for ancestral elements to busco genes.

`--ancestral_busco_lineage`

type: 'string'

The busco lineage used for ancestral painting (must match --ancestral_table).

`--binsize`

type: 'integer'

Bin size in base pairs for cooler cload

default: 1000

required

`--kmer_size`

type: 'integer'

Size for Fastk to create the k-mer library

default: 31

required

`--assembly`

type: 'string'

The Genbank assembly accession for the assembly, for example: GCA_922984935.2.

required

`--biosample_wgs`

type: 'string'

The biosample accesion(s) linked to the WGS samples in the experiment, for example: SAMEA7520803.

`--biosample_rna`

type: 'string'

The biosample accesion(s) linked to the RNA samples in the experiment, for example: SAMEA7521081.

`--biosample_hic`

type: 'string'

The biosample accesion(s) linked to the Hi-C samples in the experiment, for example: SAMEA7520846.

`--busco_lineage`

type: 'string'

Override the BUSCO database used for the assembly assessment.

`--select_contact_map`

type: 'string'

Select which contact maps should be generated, higlass, pretext or both

`--btk_location`

type: 'string'

Location of the local blobtoolkit "blobdir"

`--btk_online_location`

type: 'string'

Location of an online blobtoolkit "blobdir"

`--outdir`

type: 'string'

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

default: 'results'

required

`--email`

type: 'string'

Email address for completion summary.

Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

`--multiqc_title`

type: 'string'

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

`--cool_order`

type: 'string'

Path to a file that contains the sequence names in the order you want to represent them on the contact-map.

If not specified, the pipeline will use the sequences in the same order they are listed by the NCI, which follows the karyotype.

`--write_to_portal`

type: 'boolean'

Flag to control if results are written to genome notes portal database .

hidden

`--genome_notes_api`

type: 'string'

URL for Genome Notes Portal API .

`--note_template`

type: 'string'

The path to a genome note template file.

Set this parameter if you have a genome note template file that you wish to populate. Templates may be docx or xml files

`--annotation_set`

type: 'string'

Path to annotation file :gff.

You will need to specify the path to the annotation file as eith gff or gff.gz.

pattern: ^(?:[\w/]+/)?\S+\.gff3?(?:\.gz)?$

HiGlass options

Define if and how to upload the contact map to a HiGlass server.

hidden

`--upload_higlass_data`

type: 'boolean'

flag to control if HiGlass server should be updated to add new files

hidden

`--higlass_url`

type: 'string'

URL for the HiGlass server

hidden

`--higlass_upload_directory`

type: 'string'

The ingress directory for the kubernetes cluster running the HiGlass server.

hidden

`--higlass_data_project_dir`

type: 'string'

Subdirectory struture to use for organising HiGlass data, suggested format is / e.g. '/asg/algae'

hidden

`--higlass_kubeconfig`

type: 'string'

The path to the kubeconfig for the kubernetes cluster running the HiGlass server.

hidden

`--higlass_deployment_name`

type: 'string'

Name of the kubernetes deployment for the HiGlass server.

hidden

`--higlass_namespace`

type: 'string'

The name for the namespace used in the Kubernetes cluster running the HiGlass server.

hidden

Reference genome options

Reference genome related files and options required for the workflow.

`--fasta`

type: 'string'

Path to FASTA genome file.

required

Databases

Define where the pipeline should find databases.

`--lineage_db`

type: 'string'

Local directory where clade-specific BUSCO lineage datasets are stored.

`--lineage_tax_ids`

type: 'string'

Local file that holds a mapping between BUSCO lineages and taxon IDs.

required

Initialised from https://busco-data.ezlab.org/v5/data/placement_files/mapping_taxids-busco_dataset_name.eukaryota_odb10.2019-12-16.txt.tar.gz

Execution

Control the execution of the pipeline.

hidden

`--use_work_dir_as_temp`

type: 'boolean'

Set to true to make tools (e.g. sort, FastK, MerquryFK) use the work directory for their temporary files, rather than the system default.

hidden

Institutional config options

Parameters used to describe centralised config profiles. These should not be edited.

hidden

The centralised nf-core configuration profiles use a handful of pipeline parameters to describe themselves. This information is then printed to the Nextflow log when you run a pipeline. You should not need to change these values when you run a pipeline.

`--custom_config_version`

type: 'string'

Git commit id for Institutional configs.

default: 'master'

hidden

`--custom_config_base`

type: 'string'

Base directory for Institutional configs.

default: 'https://raw.githubusercontent.com/nf-core/configs/master'

hidden

If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.

`--config_profile_name`

type: 'string'

Institutional config name.

hidden

`--config_profile_description`

type: 'string'

Institutional config description.

hidden

`--config_profile_contact`

type: 'string'

Institutional config contact information.

hidden

`--config_profile_url`

type: 'string'

Institutional config URL link.

hidden

Generic options

Less common options for the pipeline, typically set in a config file.

These options are common to all nf-core pipelines and allow you to customise some of the core preferences for how the pipeline runs.

Typically these options would be set in a Nextflow config file loaded for all pipeline runs, such as ~/.nextflow/config.

`--version`

type: 'boolean'

Display version and exit.

hidden

`--publish_dir_mode`

type: 'string'

Method used to save pipeline results to output directory.

hidden

The Nextflow publishDir option specifies which intermediate files should be saved to the output directory. This option tells the pipeline what method should be used to move these files. See Nextflow docs for details.

`--email_on_fail`

type: 'string'

Email address for completion summary, only when pipeline fails.

hidden

An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

`--plaintext_email`

type: 'boolean'

Send plain-text email instead of HTML.

hidden

`--max_multiqc_email_size`

type: 'string'

File size limit when attaching MultiQC reports to summary emails.

default: '25.MB'

hidden

`--monochrome_logs`

type: 'boolean'

Do not use coloured log outputs.

hidden

`--hook_url`

type: 'string'

Incoming hook URL for messaging service

hidden

Incoming hook URL for messaging service. Currently, MS Teams and Slack are supported.

`--multiqc_config`

type: 'string'

Custom config file to supply to MultiQC.

hidden

`--multiqc_logo`

type: 'string'

Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file

hidden

`--multiqc_methods_description`

type: 'string'

Custom MultiQC yaml file containing HTML including a methods description.

hidden

`--validate_params`

type: 'boolean'

Boolean whether to validate parameters against the schema at runtime

default: 1

hidden

`--pipelines_testdata_base_path`

type: 'string'

Base URL or local path to location of pipeline test dataset files

default: 'https://raw.githubusercontent.com/nf-core/test-datasets/'

hidden

`--trace_report_suffix`

type: 'string'

Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.

hidden

`--help`

Display the help message.

`--help_full`

type: 'boolean'

Display the full detailed help message.

`--show_hidden`

type: 'boolean'

Display hidden parameters in the help message (only works when --help or --help_full are provided).

The following uncommon parameters have been hidden: --write_to_portal, --upload_higlass_data, --higlass_url, --higlass_upload_directory, --higlass_data_project_dir, --higlass_kubeconfig, --higlass_deployment_name, --higlass_namespace, --use_work_dir_as_temp, --custom_config_version, --custom_config_base, --config_profile_name, --config_profile_description, --config_profile_contact, --config_profile_url, --version, --publish_dir_mode, --email_on_fail, --plaintext_email, --max_multiqc_email_size, --monochrome_logs, --hook_url, --multiqc_config, --multiqc_logo, --multiqc_methods_description, --validate_params, --pipelines_testdata_base_path, --trace_report_suffix

Click here to show all hidden params.

sanger-tol/genomenote

Parameters

Input/output options

--input

--ancestral_table

--ancestral_busco_lineage

--binsize

--kmer_size

--assembly

--biosample_wgs

--biosample_rna

--biosample_hic

--busco_lineage

--select_contact_map

--btk_location

--btk_online_location

--outdir

--email

--multiqc_title

--cool_order

--write_to_portal

--genome_notes_api

--note_template

--annotation_set

HiGlass options

--upload_higlass_data

--higlass_url

--higlass_upload_directory

--higlass_data_project_dir

--higlass_kubeconfig

--higlass_deployment_name

--higlass_namespace

Reference genome options

--fasta

Databases

--lineage_db

--lineage_tax_ids

Execution

--use_work_dir_as_temp

Institutional config options

--custom_config_version

--custom_config_base

--config_profile_name

--config_profile_description

--config_profile_contact

--config_profile_url

Generic options

--version

--publish_dir_mode

--email_on_fail

--plaintext_email

--max_multiqc_email_size

--monochrome_logs

--hook_url

--multiqc_config

--multiqc_logo

--multiqc_methods_description

--validate_params

--pipelines_testdata_base_path

--trace_report_suffix

--help

--help_full

--show_hidden

sanger-tol/
genomenote

`--input`

`--ancestral_table`

`--ancestral_busco_lineage`

`--binsize`

`--kmer_size`

`--assembly`

`--biosample_wgs`

`--biosample_rna`

`--biosample_hic`

`--busco_lineage`

`--select_contact_map`

`--btk_location`

`--btk_online_location`

`--outdir`

`--email`

`--multiqc_title`

`--cool_order`

`--write_to_portal`

`--genome_notes_api`

`--note_template`

`--annotation_set`

`--upload_higlass_data`

`--higlass_url`

`--higlass_upload_directory`

`--higlass_data_project_dir`

`--higlass_kubeconfig`

`--higlass_deployment_name`

`--higlass_namespace`

`--fasta`

`--lineage_db`

`--lineage_tax_ids`

`--use_work_dir_as_temp`

`--custom_config_version`

`--custom_config_base`

`--config_profile_name`

`--config_profile_description`

`--config_profile_contact`

`--config_profile_url`

`--version`

`--publish_dir_mode`

`--email_on_fail`

`--plaintext_email`

`--max_multiqc_email_size`

`--monochrome_logs`

`--hook_url`

`--multiqc_config`

`--multiqc_logo`

`--multiqc_methods_description`

`--validate_params`

`--pipelines_testdata_base_path`

`--trace_report_suffix`

`--help`

`--help_full`

`--show_hidden`