Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • fasta 5
  • vsearch 5
  • cut 4
  • binning 3
  • contigs 3
  • mags 3
  • fragment 3
  • cut up 3
  • genomics 2
  • vcf 2
  • metagenomics 2
  • merge 2
  • population genetics 2
  • amplicon sequences 2
  • notebook 2
  • reports 2
  • bam 1
  • alignment 1
  • assembly 1
  • gatk4 1
  • sort 1
  • variant calling 1
  • coverage 1
  • nanopore 1
  • split 1
  • taxonomy 1
  • sentieon 1
  • somatic 1
  • clustering 1
  • trimming 1
  • sv 1
  • wgs 1
  • bins 1
  • peaks 1
  • segmentation 1
  • splicing 1
  • clipping 1
  • wxs 1
  • microbiome 1
  • adapters 1
  • bedgraph 1
  • seqtk 1
  • RNA-seq 1
  • adapter trimming 1
  • quality trimming 1
  • varcal 1
  • trim 1
  • Read depth 1
  • Duplication purging 1
  • purge duplications 1
  • structural-variant calling 1
  • leafcutter 1
  • vsearch/fastqfilter 1
  • fastqfilter 1
  • Read coverage histogram 1
  • vsearch/sort 1
  • sintax 1
  • usearch 1
  • regtools 1
  • python 1
  • r 1
  • generic 1
  • coreutils 1
  • gnu 1
  • quarto 1
  • vqsr 1
  • variant quality score recalibration 1
  • trimBam 1
  • bamUtil 1
  • cutesv 1
  • cutoff 1
  • longread 1
  • de-novo 1
  • applyvarcal 1
  • VQSR 1
  • seacr 1
  • chromatin 1
  • cut&run 1
  • cut&tag 1
  • peak-caller 1
  • multicut 1
  • pixel classification 1
  • papermill 1
  • jupytext 1
  • Jupyter 1
  • Python 1
  • fastq 0
  • genome 0
  • index 0
  • reference 0
  • bed 0
  • cram 0
  • sam 0
  • annotation 0
  • structural variants 0
  • database 0
  • align 0
  • filter 0
  • gff 0
  • bacteria 0
  • map 0
  • statistics 0
  • variants 0
  • qc 0
  • classification 0
  • quality control 0
  • download 0
  • gtf 0
  • classify 0
  • cnv 0
  • MSA 0
  • k-mer 0
  • variant 0
  • taxonomic profiling 0
  • contamination 0
  • gfa 0
  • pacbio 0
  • convert 0
  • conversion 0
  • count 0
  • proteomics 0
  • quality 0
  • VCF 0
  • single-cell 0
  • ancient DNA 0
  • copy number 0
  • long reads 0
  • imputation 0
  • bedtools 0
  • phylogeny 0
  • rnaseq 0
  • graph 0
  • bisulfite 0
  • isoseq 0
  • reporting 0
  • variation graph 0
  • gvcf 0
  • bcftools 0
  • protein 0
  • build 0
  • consensus 0
  • kmer 0
  • bisulphite 0
  • imaging 0
  • methylseq 0
  • indexing 0
  • illumina 0
  • methylation 0
  • databases 0
  • table 0
  • picard 0
  • QC 0
  • visualisation 0
  • bqsr 0
  • cna 0
  • compression 0
  • long-read 0
  • openms 0
  • demultiplex 0
  • stats 0
  • antimicrobial resistance 0
  • serotype 0
  • depth 0
  • metrics 0
  • phage 0
  • mapping 0
  • plink2 0
  • sequences 0
  • taxonomic classification 0
  • tsv 0
  • 5mC 0
  • DNA methylation 0
  • haplotype 0
  • searching 0
  • WGBS 0
  • structure 0
  • protein sequence 0
  • plot 0
  • cluster 0
  • samtools 0
  • base quality score recalibration 0
  • aDNA 0
  • filtering 0
  • pangenome graph 0
  • repeat 0
  • histogram 0
  • neural network 0
  • pairs 0
  • scWGBS 0
  • example 0
  • markduplicates 0
  • expression 0
  • matrix 0
  • amr 0
  • machine learning 0
  • mappability 0
  • cooler 0
  • gzip 0
  • transcriptome 0
  • mmseqs2 0
  • low-coverage 0
  • annotate 0
  • iCLIP 0
  • virus 0
  • validation 0
  • db 0
  • bcf 0
  • phasing 0
  • completeness 0
  • bwa 0
  • aligner 0
  • metagenome 0
  • checkm 0
  • bisulfite sequencing 0
  • biscuit 0
  • sequence 0
  • LAST 0
  • damage 0
  • palaeogenomics 0
  • germline 0
  • seqkit 0
  • gene 0
  • transcript 0
  • archaeogenomics 0
  • genotype 0
  • msa 0
  • evaluation 0
  • bismark 0
  • kraken2 0
  • ucsc 0
  • prediction 0
  • blast 0
  • hmmsearch 0
  • hmmer 0
  • decompression 0
  • genotyping 0
  • spatial 0
  • glimpse 0
  • mag 0
  • umi 0
  • mkref 0
  • newick 0
  • ncbi 0
  • dedup 0
  • complexity 0
  • gff3 0
  • feature 0
  • sketch 0
  • report 0
  • json 0
  • prokaryote 0
  • scRNA-seq 0
  • bedGraph 0
  • kmers 0
  • short-read 0
  • rna 0
  • pangenome 0
  • plasmid 0
  • multiple sequence alignment 0
  • cnvkit 0
  • single 0
  • tumor-only 0
  • antimicrobial peptides 0
  • csv 0
  • NCBI 0
  • deduplication 0
  • antimicrobial resistance genes 0
  • mitochondria 0
  • snp 0
  • profile 0
  • low frequency variant calling 0
  • differential 0
  • demultiplexing 0
  • extract 0
  • reads 0
  • mirna 0
  • duplicates 0
  • mem 0
  • ptr 0
  • arg 0
  • HMM 0
  • reference-free 0
  • benchmark 0
  • sourmash 0
  • indels 0
  • detection 0
  • merging 0
  • coptr 0
  • diversity 0
  • concatenate 0
  • deamination 0
  • cat 0
  • compare 0
  • FASTQ 0
  • de novo 0
  • single cell 0
  • text 0
  • mpileup 0
  • antibiotic resistance 0
  • idXML 0
  • gridss 0
  • isolates 0
  • tabular 0
  • 3-letter genome 0
  • interval 0
  • mutect2 0
  • de novo assembly 0
  • structural 0
  • distance 0
  • profiling 0
  • MAF 0
  • amps 0
  • visualization 0
  • riboseq 0
  • svtk 0
  • kallisto 0
  • query 0
  • fastx 0
  • ont 0
  • call 0
  • counts 0
  • summary 0
  • view 0
  • fgbio 0
  • add 0
  • propr 0
  • haplotypecaller 0
  • malt 0
  • gsea 0
  • STR 0
  • compress 0
  • parsing 0
  • microarray 0
  • hic 0
  • redundancy 0
  • family 0
  • ganon 0
  • phylogenetic placement 0
  • bedpe 0
  • ranking 0
  • logratio 0
  • clean 0
  • genome assembler 0
  • transcriptomics 0
  • CLIP 0
  • read depth 0
  • genmod 0
  • circrna 0
  • pypgx 0
  • interval_list 0
  • peak-calling 0
  • ampir 0
  • enrichment 0
  • bgzip 0
  • union 0
  • isomir 0
  • microsatellite 0
  • normalization 0
  • umitools 0
  • DNA sequencing 0
  • fusion 0
  • abundance 0
  • dna 0
  • DNA sequence 0
  • ccs 0
  • quantification 0
  • sample 0
  • sequencing 0
  • mtDNA 0
  • snps 0
  • ATAC-seq 0
  • targeted sequencing 0
  • resistance 0
  • hybrid capture sequencing 0
  • bin 0
  • chunk 0
  • copy number alteration calling 0
  • xeniumranger 0
  • retrotransposon 0
  • containment 0
  • chromosome 0
  • bigwig 0
  • diamond 0
  • preprocessing 0
  • fai 0
  • telomere 0
  • SV 0
  • sylph 0
  • ngscheckmate 0
  • archaeogenetics 0
  • ancestry 0
  • bcl2fastq 0
  • happy 0
  • deep learning 0
  • image 0
  • nucleotide 0
  • fungi 0
  • miscoding lesions 0
  • public datasets 0
  • HiFi 0
  • skani 0
  • BGC 0
  • matching 0
  • biosynthetic gene cluster 0
  • palaeogenetics 0
  • paf 0
  • hmmcopy 0
  • gatk4spark 0
  • somatic variants 0
  • dist 0
  • SNP 0
  • comparison 0
  • lossless 0
  • bacterial 0
  • mzml 0
  • identity 0
  • typing 0
  • pairsam 0
  • relatedness 0
  • subsample 0
  • entrez 0
  • fastk 0
  • structural_variants 0
  • pan-genome 0
  • pangolin 0
  • spaceranger 0
  • lineage 0
  • anndata 0
  • covid 0
  • UMI 0
  • observations 0
  • survivor 0
  • panel 0
  • wastewater 0
  • mapper 0
  • benchmarking 0
  • bim 0
  • duplication 0
  • PacBio 0
  • fam 0
  • rsem 0
  • mask 0
  • hidden Markov model 0
  • cfDNA 0
  • polishing 0
  • population genomics 0
  • vrhyme 0
  • scaffold 0
  • amplicon sequencing 0
  • mlst 0
  • prokka 0
  • krona chart 0
  • pseudoalignment 0
  • transposons 0
  • khmer 0
  • windowmasker 0
  • npz 0
  • krona 0
  • html 0
  • small indels 0
  • popscle 0
  • genotype-based deconvoltion 0
  • indel 0
  • kinship 0
  • shapeit 0
  • spark 0
  • miRNA 0
  • tabix 0
  • dictionary 0
  • ambient RNA removal 0
  • informative sites 0
  • rna_structure 0
  • RNA 0
  • fusions 0
  • replace 0
  • score 0
  • genome assembly 0
  • scaffolding 0
  • transcripts 0
  • uLTRA 0
  • insert 0
  • variant_calling 0
  • ligate 0
  • minimap2 0
  • long_read 0
  • guide tree 0
  • untar 0
  • uncompress 0
  • chimeras 0
  • unzip 0
  • zip 0
  • archiving 0
  • organelle 0
  • cellranger 0
  • kraken 0
  • angsd 0
  • genome mining 0
  • bamtools 0
  • pileup 0
  • cool 0
  • png 0
  • proteome 0
  • repeat expansion 0
  • bracken 0
  • aln 0
  • bwameth 0
  • das tool 0
  • das_tool 0
  • wig 0
  • prefetch 0
  • prokaryotes 0
  • chip-seq 0
  • comparisons 0
  • ataqv 0
  • image_analysis 0
  • mcmicro 0
  • highly_multiplexed_imaging 0
  • CRISPR 0
  • dump 0
  • arriba 0
  • eukaryotes 0
  • combine 0
  • bakta 0
  • intervals 0
  • host 0
  • converter 0
  • deeparg 0
  • C to T 0
  • roh 0
  • remove 0
  • virulence 0
  • fingerprint 0
  • macrel 0
  • amplify 0
  • neubi 0
  • fcs-gx 0
  • scores 0
  • gene expression 0
  • regions 0
  • mkfastq 0
  • checkv 0
  • hi-c 0
  • complement 0
  • atac-seq 0
  • genomes 0
  • PCA 0
  • DRAMP 0
  • microbes 0
  • minhash 0
  • windows 0
  • immunoinformatics 0
  • intersect 0
  • norm 0
  • long terminal repeat 0
  • normalize 0
  • intersection 0
  • mash 0
  • long terminal retrotransposon 0
  • kma 0
  • retrotransposons 0
  • checksum 0
  • scatter 0
  • megan 0
  • assembly evaluation 0
  • GC content 0
  • k-mer frequency 0
  • k-mer index 0
  • archive 0
  • lofreq 0
  • bloom filter 0
  • pharokka 0
  • reheader 0
  • xz 0
  • function 0
  • profiles 0
  • COBS 0
  • resolve_bioscience 0
  • spatial_transcriptomics 0
  • tree 0
  • salmon 0
  • BAM 0
  • rna-seq 0
  • regression 0
  • reformat 0
  • haplotypes 0
  • functional analysis 0
  • mapcounter 0
  • hlala_typing 0
  • hla_typing 0
  • hlala 0
  • hla 0
  • haplogroups 0
  • interactions 0
  • taxids 0
  • ichorcna 0
  • immunoprofiling 0
  • taxon name 0
  • zlib 0
  • pigz 0
  • vdj 0
  • find 0
  • differential expression 0
  • trancriptome 0
  • tama 0
  • translation 0
  • amino acid 0
  • genetics 0
  • barcode 0
  • orf 0
  • primer 0
  • pair 0
  • region 0
  • interactive 0
  • krakenuniq 0
  • sizes 0
  • bases 0
  • homologs 0
  • krakentools 0
  • screen 0
  • bustools 0
  • metamaps 0
  • awk 0
  • tbi 0
  • polyA_tail 0
  • blastn 0
  • refine 0
  • maximum likelihood 0
  • iphop 0
  • instrain 0
  • leviosam2 0
  • lift 0
  • homoploymer 0
  • deseq2 0
  • MSI 0
  • dict 0
  • MaltExtract 0
  • HOPS 0
  • authentication 0
  • soft-clipped clusters 0
  • edit distance 0
  • ragtag 0
  • qualty 0
  • samples 0
  • fixmate 0
  • collate 0
  • taxon tables 0
  • secondary metabolites 0
  • bam2fq 0
  • NRPS 0
  • RiPP 0
  • antibiotics 0
  • antismash 0
  • rtgtools 0
  • vcflib 0
  • junctions 0
  • vg 0
  • salmonella 0
  • rename 0
  • allele 0
  • FracMinHash sketch 0
  • join 0
  • signature 0
  • cancer genomics 0
  • snpsift 0
  • snpeff 0
  • effect prediction 0
  • small genome 0
  • de novo assembler 0
  • gwas 0
  • shigella 0
  • otu tables 0
  • svdb 0
  • switch 0
  • ancient dna 0
  • Streptococcus pneumoniae 0
  • standardization 0
  • sequenzautils 0
  • taxonomic profile 0
  • standardise 0
  • transformation 0
  • standardisation 0
  • runs_of_homozygosity 0
  • polish 0
  • instability 0
  • microscopy 0
  • nucleotides 0
  • GPU-accelerated 0
  • multiallelic 0
  • small variants 0
  • rgfa 0
  • tnhaplotyper2 0
  • gstama 0
  • reformatting 0
  • graph layout 0
  • nextclade 0
  • graft 0
  • orthology 0
  • parallelized 0
  • removal 0
  • transcriptomic 0
  • mudskipper 0
  • concat 0
  • msisensor-pro 0
  • micro-satellite-scan 0
  • tumor 0
  • msi 0
  • cnvnator 0
  • proportionality 0
  • RNA-Seq 0
  • preseq 0
  • contig 0
  • simulate 0
  • artic 0
  • duplicate 0
  • aggregate 0
  • demultiplexed reads 0
  • library 0
  • adapter 0
  • ped 0
  • import 0
  • variant pruning 0
  • bfiles 0
  • subset 0
  • SimpleAF 0
  • copyratios 0
  • image_processing 0
  • registration 0
  • mitochondrion 0
  • read-group 0
  • xenograft 0
  • rrna 0
  • serogroup 0
  • nacho 0
  • metagenomic 0
  • cgMLST 0
  • unaligned 0
  • mass spectrometry 0
  • UMIs 0
  • version 0
  • orthologs 0
  • duplex 0
  • trgt 0
  • nanostring 0
  • fetch 0
  • GEO 0
  • sra-tools 0
  • fasterq-dump 0
  • identifier 0
  • sequence analysis 0
  • baf 0
  • pharmacogenetics 0
  • estimation 0
  • expansionhunterdenovo 0
  • repeat_expansions 0
  • cleaning 0
  • metadata 0
  • screening 0
  • tab 0
  • recombination 0
  • gem 0
  • metagenomes 0
  • eCLIP 0
  • WGS 0
  • long-read sequencing 0
  • doublets 0
  • corrupted 0
  • mRNA 0
  • realignment 0
  • microbial 0
  • deconvolution 0
  • allele-specific 0
  • smrnaseq 0
  • bayesian 0
  • interval list 0
  • RNA sequencing 0
  • filtermutectcalls 0
  • mirdeep2 0
  • MCMICRO 0
  • calling 0
  • ome-tif 0
  • Pharmacogenetics 0
  • cvnkit 0
  • split_kmers 0
  • evidence 0
  • repeats 0
  • panelofnormals 0
  • cnv calling 0
  • CNV 0
  • dereplicate 0
  • joint genotyping 0
  • gatk 0
  • merge mate pairs 0
  • reads merging 0
  • short reads 0
  • correction 0
  • frame-shift correction 0
  • splice 0
  • settings 0
  • random forest 0
  • amptransformer 0
  • gene set 0
  • gene set analysis 0
  • eigenstrat 0
  • variation 0
  • samplesheet 0
  • human removal 0
  • validate 0
  • format 0
  • genome bins 0
  • blastp 0
  • phase 0
  • decontamination 0
  • ChIP-seq 0
  • gene labels 0
  • genomad 0
  • single cells 0
  • hostile 0
  • emboss 0
  • parse 0
  • heatmap 0
  • ampgram 0
  • eido 0
  • spatial_omics 0
  • concordance 0
  • spatialdata 0
  • melon 0
  • c to t 0
  • proteus 0
  • plant 0
  • mapad 0
  • hash sketch 0
  • signatures 0
  • setgt 0
  • readproteingroups 0
  • metabolomics 0
  • cell segmentation 0
  • SINE 0
  • adna 0
  • copy-number 0
  • jvarkit 0
  • remove samples 0
  • gender determination 0
  • scanner 0
  • copy number alterations 0
  • helitron 0
  • tar 0
  • unmarkduplicates 0
  • covariance models 0
  • translate 0
  • copy number analysis 0
  • trna 0
  • wham 0
  • fracminhash sketch 0
  • genome annotation 0
  • mobile genetic elements 0
  • tarball 0
  • copy number variation 0
  • yahs 0
  • geo 0
  • recovery 0
  • relabel 0
  • bedcov 0
  • genome polishing 0
  • assembly polishing 0
  • chloroplast 0
  • confidence 0
  • blat 0
  • alr 0
  • clr 0
  • boxcox 0
  • tnscope 0
  • vsearch/dereplicate 0
  • telseq 0
  • rRNA 0
  • Escherichia coli 0
  • stardist 0
  • propd 0
  • immunology 0
  • BCR 0
  • bgen 0
  • groupby 0
  • eigenvectors 0
  • secondary structure 0
  • network 0
  • resegment 0
  • wget 0
  • wavefront 0
  • hicPCA 0
  • sliding 0
  • mgi 0
  • snakemake 0
  • workflow 0
  • morphology 0
  • ATACseq 0
  • workflow_mode 0
  • ATACshift 0
  • createreadcountpanelofnormals 0
  • shift 0
  • denoisereadcounts 0
  • readwriter 0
  • ribosomal RNA 0
  • dnamodelapply 0
  • dnascope 0
  • comp 0
  • whamg 0
  • mashmap 0
  • source tracking 0
  • decompress 0
  • vcf2bed 0
  • significance statistic 0
  • scanpy 0
  • rdtest 0
  • hwe 0
  • emoji 0
  • umicollapse 0
  • data-download 0
  • scRNA-Seq 0
  • gtftogenepred 0
  • controlstatistics 0
  • rdtest2vcf 0
  • countsvtypes 0
  • p-value 0
  • scvi 0
  • elprep 0
  • files 0
  • baftest 0
  • elfasta 0
  • ucsc/liftover 0
  • refflat 0
  • upd 0
  • eucaryotes 0
  • doublet_detection 0
  • subsetting 0
  • fast5 0
  • references 0
  • modelsegments 0
  • polya tail 0
  • Mycobacterium tuberculosis 0
  • metagenome assembler 0
  • chromosomal rearrangements 0
  • coding 0
  • genepred 0
  • missingness 0
  • cds 0
  • transcroder 0
  • quality_control 0
  • sequencing adapters 0
  • patch 0
  • logFC 0
  • bedgraphtobigwig 0
  • bigbed 0
  • bedtobigbed 0
  • nucleotide content 0
  • uniparental 0
  • all versus all 0
  • spa 0
  • graph projection to vcf 0
  • nucBed 0
  • long-reads 0
  • bclconvert 0
  • plotting 0
  • variantcalling 0
  • sccmec 0
  • streptococcus 0
  • extractunbinned 0
  • linkbins 0
  • integron 0
  • targz 0
  • iterative model refinement 0
  • nuclear segmentation 0
  • spatype 0
  • barcodes 0
  • long read alignment 0
  • pangenome-scale 0
  • construct 0
  • DNA contamination estimation 0
  • disomy 0
  • metabolite annotation 0
  • snv 0
  • downsample 0
  • svtk/baftest 0
  • downsample bam 0
  • subsample bam 0
  • vcf2db 0
  • AT content 0
  • gemini 0
  • maf 0
  • lua 0
  • detecting svs 0
  • toml 0
  • solo 0
  • import segmentation 0
  • short-read sequencing 0
  • vcfbreakmulti 0
  • uniq 0
  • deduplicate 0
  • VCFtools 0
  • verifybamid 0
  • metaspace 0
  • check 0
  • decoy 0
  • genotype dosages 0
  • impute 0
  • 10x 0
  • hwe statistics 0
  • ribosomal 0
  • grabix 0
  • SNV 0
  • hwe equilibrium 0
  • reference-independent 0
  • Indel 0
  • bwameme 0
  • host removal 0
  • haploype 0
  • genotype likelihood 0
  • patterns 0
  • collapse 0
  • liftover 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • bwamem2 0
  • guidetree 0
  • Pacbio 0
  • doublet 0
  • cell_type_identification 0
  • cell_phenotyping 0
  • machine_learning 0
  • hardy-weinberg 0
  • regex 0
  • AC/NS/AF 0
  • distance-based 0
  • circular 0
  • realign 0
  • quality check 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • low-complexity 0
  • assay 0
  • phylogenetics 0
  • minimum_evolution 0
  • nucleotide sequence 0
  • shuffleBed 0
  • GFF/GTF 0
  • size 0
  • trio binning 0
  • cram-size 0
  • selector 0
  • paraphase 0
  • transcription factors 0
  • regulatory network 0
  • tandem repeats 0
  • multi-tool 0
  • long read 0
  • predict 0
  • reference compression 0
  • vcflib/vcffixup 0
  • spot 0
  • nanopore sequencing 0
  • cell_barcodes 0
  • hhsuite 0
  • 16S 0
  • mygene 0
  • go 0
  • retrieval 0
  • CRISPRi 0
  • pile up 0
  • catpack 0
  • prepare 0
  • transposable element 0
  • hmmpress 0
  • rna velocity 0
  • cobra 0
  • extension 0
  • grea 0
  • hashing-based deconvoltion 0
  • hamming-distance 0
  • functional enrichment 0
  • paired reads merging 0
  • overlap-based merging 0
  • taxonomic composition 0
  • tag 0
  • Computational Immunology 0
  • trimfq 0
  • omics 0
  • clahe 0
  • refresh 0
  • association 0
  • GWAS 0
  • case/control 0
  • associations 0
  • reference panel 0
  • spatial_neighborhoods 0
  • scimap 0
  • cellsnp 0
  • Bayesian 0
  • structural-variants 0
  • donor deconvolution 0
  • hmmscan 0
  • genotype-based demultiplexing 0
  • lexogen 0
  • biological activity 0
  • droplet based single cells 0
  • junction 0
  • Immune Deconvolution 0
  • Bioinformatics Tools 0
  • prior knowledge 0
  • phylogenies 0
  • busco 0
  • InterProScan 0
  • MMseqs2 0
  • masking 0
  • variant-calling 0
  • staging 0
  • derived alleles 0
  • tnfilter 0
  • heterozygous genotypes 0
  • inbreeding 0
  • array_cgh 0
  • cytosure 0
  • Staging 0
  • vector 0
  • gprofiler2 0
  • gost 0
  • rad 0
  • block substitutions 0
  • covariance model 0
  • haplotag 0
  • standard 0
  • svg 0
  • structural variant 0
  • xml 0
  • run 0
  • script 0
  • bam2fastx 0
  • bam2fastq 0
  • dereplication 0
  • java 0
  • pdb 0
  • ancestral alleles 0
  • immcantation 0
  • mass_error 0
  • reverse complement 0
  • vcf file 0
  • poolseq 0
  • bgen file 0
  • plink2_pca 0
  • search engine 0
  • simulation 0
  • hmmfetch 0
  • decompose 0
  • identity-by-descent 0
  • decomposeblocksub 0
  • transmembrane 0
  • genome graph 0
  • site frequency spectrum 0
  • pca 0
  • tnseq 0
  • multiqc 0
  • mzML 0
  • pruning 0
  • htseq 0
  • linkage equilibrium 0
  • sompy 0
  • f coefficient 0
  • peak picking 0
  • homozygous genotypes 0
  • microRNA 0
  • rank 0
  • airrseq 0
  • orthogroup 0
  • uq 0
  • isoform 0
  • joint-genotyping 0
  • variancepartition 0
  • genotypegvcf 0
  • dream 0
  • redundant 0
  • fix 0
  • extraction 0
  • featuretable 0
  • md 0
  • nm 0
  • parallel 0
  • plastid 0
  • malformed 0
  • paired reads re-pairing 0
  • short 0
  • sage 0
  • resfinder 0
  • resistance genes 0
  • raw 0
  • mgf 0
  • parquet 0
  • intron 0
  • parser 0
  • dbsnp 0
  • standardize 0
  • install 0
  • nanoq 0
  • hashing-based deconvolution 0
  • deep variant 0
  • co-orthology 0
  • updatedata 0
  • homology 0
  • microbial genomics 0
  • chip 0
  • tag2tag 0
  • sequence similarity 0
  • spectral clustering 0
  • tags 0
  • comparative genomics 0
  • partitioning 0
  • functional 0
  • Illumina 0
  • Read filters 0
  • uniques 0
  • mutect 0
  • idx 0
  • drep 0
  • drug categorization 0
  • Read report 0
  • agat 0
  • Read trimming 0
  • transform 0
  • gaps 0
  • introns 0
  • longest 0
  • impute-info 0
  • assembler 0
  • constant 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • UShER 0
  • bootstrapping 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Imputation 0
  • gene model 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Haplotypes 0
  • gstama/merge 0
  • Sample 0
  • low coverage 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • TAMA 0
  • gstama/polyacleanup 0
  • Mykrobe 0
  • abricate 0
  • beagle 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • extractvariants 0
  • GTDB taxonomy 0
  • extract_variants 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • gfastats 0
  • Salmonella Typhi 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • repeat content 0
  • file parsing 0
  • genome heterozygosity 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • txt 0
  • splitcram 0
  • gawk 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • svannotate 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • Haemophilus influenzae 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • element 0
  • biallelic 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • update header 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • atlas 0
  • mkarv 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • Segmentation 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • cooler/balance 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • mitochondrial 0
  • haplotype resolution 0
  • invariant 0
  • False duplications 0
  • Haplotype purging 0
  • assembly curation 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • panel of normals 0
  • purging 0
  • normal database 0
  • genomic intervals 0
  • intervals coverage 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • Assembly curation 0
  • quast 0
  • pretext 0
  • read_pairs 0
  • integrity 0
  • mapping-based 0
  • sequence-based 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • experiment 0
  • neighbour-joining 0
  • strandedness 0
  • bamstat 0
  • R 0
  • rhocall 0
  • long uncorrected reads 0
  • subsampling 0
  • jpg 0
  • contact 0
  • pedfilter 0
  • sortvcf 0
  • PRO-cap 0
  • GRO-cap 0
  • CoPRO 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • picard/renamesampleinvcf 0
  • NETCAGE 0
  • pcr 0
  • liftovervcf 0
  • mate-pair 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • CAGE 0
  • RAMPAGE 0
  • porechop_abi 0
  • indep pairwise 0
  • pmdtools 0
  • variant genetic 0
  • scoring 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • indep 0
  • csRNA-seq 0
  • variant identifiers 0
  • exclude 0
  • genetic 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • rtg 0
  • rocplot 0
  • prophage 0
  • sex determination 0
  • error 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • induce 0
  • 256 bit 0
  • gc_wiggle 0
  • bam2seqz 0
  • freqsum 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • sha256 0
  • shinyngs 0
  • seq 0
  • POA 0
  • SNPs 0
  • predictions 0
  • dbnsfp 0
  • snippy 0
  • core 0
  • sniffles 0
  • SMN2 0
  • exploratory 0
  • SMN1 0
  • CRAM 0
  • sliding window 0
  • features 0
  • density 0
  • boxplot 0
  • selection 0
  • header 0
  • rtg-tools 0
  • duplicate marking 0
  • repair 0
  • insert size 0
  • faidx 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • sambamba 0
  • read pairs 0
  • flagstat 0
  • multimapper 0
  • Ancestor 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • paired 0
  • readgroup 0
  • interleave 0
  • sertotype 0
  • sequence headers 0
  • grep 0
  • subseq 0
  • variant recalibration 0
  • assembly-binning 0
  • scramble 0
  • clusteridentifier 0
  • cluster analysis 0
  • identification 0
  • phantom peaks 0
  • gccounter 0
  • limma 0
  • peptide prediction 0
  • AMP 0
  • qualities 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • pneumophila 0
  • sgRNA 0
  • clinical 0
  • legionella 0
  • collapsing 0
  • adapter removal 0
  • train 0
  • spliced 0
  • functional genomics 0
  • CRISPR-Cas9 0
  • combining 0
  • reduced 0
  • MD5 0
  • mcr-1 0
  • mass-spectroscopy 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • mash/sketch 0
  • maximum-likelihood 0
  • taxonomic assignment 0
  • estimate 0
  • damage patterns 0
  • NGS 0
  • DNA damage 0
  • rra 0
  • reorder 0
  • kofamscan 0
  • megahit 0
  • panel_of_normals 0
  • genome browser 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • haemophilus 0
  • pixel_classification 0
  • pos 0
  • annotations 0
  • hmtnote 0
  • Hidden Markov Model 0
  • HMMER 0
  • readcounter 0
  • probability_maps 0
  • kegg 0
  • kallisto/index 0
  • pneumoniae 0
  • Klebsiella 0
  • effective genome size 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • interproscan 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • 128 bit 0
  • denovo 0
  • ChIP-Seq 0
  • graph formats 0
  • block-compressed 0
  • HLA-I 0
  • ILP 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph unchopping 0
  • flip 0
  • graph stats 0
  • combine graphs 0
  • odgi 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • PCR/optical duplicates 0
  • upper-triangular matrix 0
  • Neisseria gonorrhoeae 0
  • pbmerge 0
  • motif 0
  • pedigrees 0
  • read 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbbam 0
  • ligation junctions 0
  • graphs 0
  • paragraph 0
  • select 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • gender 0
  • ngm 0
  • debruijn 0
  • ploidy 0
  • microrna 0
  • de Bruijn 0
  • mbias 0
  • methylation bias 0
  • metaphlan 0
  • unionsum 0
  • smudgeplot 0
  • mitochondrial genome 0
  • Merqury 0
  • contour map 0
  • 3D heat map 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • target prediction 0
  • reference genome 0
  • NextGenMap 0
  • GATK UnifiedGenotyper 0
  • sequencing summary 0
  • mobile element insertions 0
  • somatic structural variations 0
  • cancer genome 0
  • contaminant 0
  • SNP table 0
  • Beautiful stand-alone HTML report 0
  • mosdepth 0
  • bioinformatics tools 0
  • mitochondrial to nuclear ratio 0
  • ratio 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • otu table 0
  • TCR 0

trims the end of reads in a SAM/BAM file, changing read ends to โ€˜Nโ€™ and quality to โ€˜!โ€™, or by soft clipping

0123

bam versions

bamutil:

Programs that perform operations on SAM/BAM files, all built into a single executable, bam.

Cut up fasta file in non-overlapping or overlapping parts of equal length.

010

fasta bed versions

concoct:

Clustering cONtigs with COverage and ComposiTion

Creates a FASTA file for each new cluster assigned by CONCOCT

012

fasta versions

concoct:

Clustering cONtigs with COverage and ComposiTion

Merge consecutive parts of the original contigs original cut up by cut_up_fasta.py

01

csv versions

concoct:

Clustering cONtigs with COverage and ComposiTion

Perform adapter/quality trimming on sequencing reads

01

reads log versions

cuatadapt:

Cutadapt finds and removes adapter sequences, primers, poly-A tails and other types of unwanted sequence from your high-throughput sequencing reads.

structural-variant calling with cutesv

01201

vcf versions

Apply a score cutoff to filter variants based on a recalibration table. AplyVQSR performs the second pass in a two-stage process called Variant Quality Score Recalibration (VQSR). Specifically, it applies filtering to the input variants based on the recalibration table produced in the first step by VariantRecalibrator and a target sensitivity value.

012345000

vcf tbi versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Split a file into consecutive or interleaved sections

01

split versions

gnu:

The GNU Core Utilities are the basic file, shell and text manipulation utilities of the GNU operating system. These are the core utilities which are expected to exist on every operating system.

Ilastik is a tool that utilizes machine learning algorithms to classify pixels, segment, track and count cells in images. Ilastik contains a graphical user interface to interactively label pixels. However, this nextflow module will implement the --headless mode, to apply pixel classification using a pre-trained .ilp file on an input image.

0120

mask versions

ilastik:

Ilastik is a user friendly tool that enables pixel classification, segmentation and analysis.

Render jupyter (or jupytext) notebooks to HTML reports. Supports parametrization through papermill.

0100

report artifacts versions

jupytext:

Jupyter notebooks as plain text scripts or markdown documents

papermill:

Parameterize, execute, and analyze notebooks

nbconvert:

Parameterize, execute, and analyze notebooks

Computes tier-based cutoffs from a sample-specific error model which is generated by muse/call and reports the finalized variants

01012

vcf tbi versions

MuSE:

Somatic point mutation caller based on Markov substitution model for molecular evolution

Calculate coverage cutoffs to determine when to purge duplicated sequence.

01

cutoff log versions

purgedups:

Purge_dups is a package used to purge haplotigs and overlaps in an assembly based on read depth

Plots the read coverage from a purge dups statistics file and cutoffs.

012

png versions

purgedups:

Purge_dups is a package used to purge haplotigs and overlaps in an assembly based on read depth

Render a Quarto notebook, including parametrization.

01000

html notebook artifacts params_yaml extensions versions

papermill:

Parameterize, execute, and analyze notebooks

Extract exon-exon junctions from an RNAseq BAM file. The output is a BED file in the BED12 format.

012

junc versions

regtools:

RegTools is a set of tools that integrate DNA-seq and RNA-seq data to help interpret mutations in a regulatory and splicing context.

Call peaks using SEACR on sequenced reads in bedgraph format

0120

bed versions

seacr:

SEACR is intended to call peaks and enriched regions from sparse CUT&RUN or chromatin profiling data in which background is dominated by "zeroes" (i.e. regions with no read coverage).

Apply a score cutoff to filter variants based on a recalibration table. Sentieon's Aplyvarcal performs the second pass in a two-stage process called Variant Quality Score Recalibration (VQSR). Specifically, it applies filtering to the input variants based on the recalibration table produced in the previous step VarCal and a target sensitivity value. https://support.sentieon.com/manual/usages/general/#applyvarcal-algorithm

0123450101

vcf tbi versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Generates a BED file containing genomic locations of lengths of N.

01

bed versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. Seqtk mergepe command merges pair-end reads into one interleaved file.

The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using DNA reads generated by Oxford Nanopore flow cells as input. Please note Assembler is design to focus on speed, so assembly may be considered somewhat non-deterministic as final assembly may vary across executions. See https://github.com/chanzuckerberg/shasta/issues/296.

01

assembly gfa results versions

Cluster sequences using a single-pass, greedy centroid-based clustering algorithm.

01

aln biom mothur otu bam out blast uc centroids clusters profile msa versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Performs quality filtering and / or conversion of a FASTQ file to FASTA format.

01

fasta log versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Taxonomic classification using the sintax algorithm.

010

tsv versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Sort fasta entries by decreasing abundance (--sortbysize) or sequence length (--sortbylength).

010

fasta versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Compare target sequences to fasta-formatted query sequences using global pairwise alignment.

010000

aln biom lca mothur otu sam tsv txt uc versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Click here to trigger an update.