Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • fastq 11
  • assembly 9
  • genomics 7
  • bacteria 7
  • bam 6
  • quality control 6
  • qc 5
  • genome 4
  • contamination 4
  • imputation 4
  • iCLIP 4
  • fasta 3
  • metagenomics 3
  • alignment 3
  • rnaseq 3
  • haplotype 3
  • differential 3
  • adapters 3
  • CLIP 3
  • roh 3
  • bed 2
  • cram 2
  • sort 2
  • sam 2
  • database 2
  • align 2
  • map 2
  • nanopore 2
  • k-mer 2
  • variant 2
  • sentieon 2
  • ancient DNA 2
  • demultiplex 2
  • expression 2
  • low-coverage 2
  • cooler 2
  • glimpse 2
  • hmmsearch 2
  • report 2
  • NCBI 2
  • mitochondria 2
  • antibiotic resistance 2
  • enrichment 2
  • chunk 2
  • DNA sequencing 2
  • targeted sequencing 2
  • hybrid capture sequencing 2
  • copy number alteration calling 2
  • ligate 2
  • fam 2
  • bim 2
  • fcs-gx 2
  • runs_of_homozygosity 2
  • gene set 2
  • gene set analysis 2
  • variant pruning 2
  • screening 2
  • cleaning 2
  • vcf 1
  • gatk4 1
  • annotation 1
  • variant calling 1
  • gff 1
  • classification 1
  • download 1
  • cnv 1
  • taxonomy 1
  • convert 1
  • count 1
  • proteomics 1
  • copy number 1
  • single-cell 1
  • contigs 1
  • trimming 1
  • reporting 1
  • consensus 1
  • bcftools 1
  • bisulfite 1
  • bisulphite 1
  • table 1
  • methylseq 1
  • cna 1
  • wgs 1
  • methylation 1
  • plink2 1
  • 5mC 1
  • serotype 1
  • antimicrobial resistance 1
  • neural network 1
  • example 1
  • aDNA 1
  • machine learning 1
  • validation 1
  • palaeogenomics 1
  • archaeogenomics 1
  • genotyping 1
  • peaks 1
  • hmmer 1
  • segmentation 1
  • spatial 1
  • umi 1
  • bismark 1
  • dedup 1
  • prokaryote 1
  • antimicrobial peptides 1
  • profiling 1
  • mem 1
  • detection 1
  • riboseq 1
  • summary 1
  • benchmark 1
  • merging 1
  • amps 1
  • microbiome 1
  • gsea 1
  • haplotypecaller 1
  • quantification 1
  • xeniumranger 1
  • bedgraph 1
  • happy 1
  • ganon 1
  • pypgx 1
  • resistance 1
  • BGC 1
  • biosynthetic gene cluster 1
  • malt 1
  • abundance 1
  • fungi 1
  • ampir 1
  • microarray 1
  • parsing 1
  • typing 1
  • chimeras 1
  • DRAMP 1
  • neubi 1
  • amplify 1
  • macrel 1
  • virulence 1
  • shapeit 1
  • eukaryotes 1
  • prokaryotes 1
  • cfDNA 1
  • genome mining 1
  • cool 1
  • lineage 1
  • microbes 1
  • pangolin 1
  • covid 1
  • pan-genome 1
  • gene expression 1
  • mlst 1
  • organelle 1
  • ampgram 1
  • amptransformer 1
  • SimpleAF 1
  • kma 1
  • ichorcna 1
  • Streptococcus pneumoniae 1
  • nextclade 1
  • metagenomic 1
  • pharmacogenetics 1
  • eCLIP 1
  • splice 1
  • BAM 1
  • deseq2 1
  • rna-seq 1
  • blastn 1
  • NRPS 1
  • secondary metabolites 1
  • RiPP 1
  • demultiplexed reads 1
  • antibiotics 1
  • aggregate 1
  • artic 1
  • antismash 1
  • junction 1
  • workflow_mode 1
  • snakemake 1
  • workflow 1
  • bgen 1
  • pruning 1
  • variancepartition 1
  • dream 1
  • linkage equilibrium 1
  • scRNA-Seq 1
  • rank 1
  • 10x 1
  • translation 1
  • gprofiler2 1
  • gost 1
  • morphology 1
  • resegment 1
  • run 1
  • resistance genes 1
  • resfinder 1
  • low coverage 1
  • bgc 1
  • variantrecalibrator 1
  • recalibration model 1
  • element 1
  • biallelic 1
  • homozygosity 1
  • autozygosity 1
  • cooler/balance 1
  • normal database 1
  • panel of normals 1
  • rhocall 1
  • indep pairwise 1
  • indep 1
  • SMN1 1
  • SMN2 1
  • CRAM 1
  • seacr 1
  • chromatin 1
  • cut&run 1
  • cut&tag 1
  • peak-caller 1
  • motif 1
  • limma 1
  • mass-spectroscopy 1
  • damage patterns 1
  • NGS 1
  • DNA damage 1
  • hmtnote 1
  • NextGenMap 1
  • ngm 1
  • sequencing summary 1
  • index 0
  • reference 0
  • structural variants 0
  • filter 0
  • merge 0
  • coverage 0
  • statistics 0
  • variants 0
  • gtf 0
  • classify 0
  • split 0
  • MSA 0
  • taxonomic profiling 0
  • gfa 0
  • somatic 0
  • pacbio 0
  • conversion 0
  • clustering 0
  • quality 0
  • binning 0
  • VCF 0
  • bedtools 0
  • phylogeny 0
  • long reads 0
  • build 0
  • gvcf 0
  • isoseq 0
  • sv 0
  • mags 0
  • kmer 0
  • variation graph 0
  • graph 0
  • picard 0
  • compression 0
  • indexing 0
  • long-read 0
  • visualisation 0
  • illumina 0
  • protein 0
  • bqsr 0
  • databases 0
  • QC 0
  • stats 0
  • phage 0
  • sequences 0
  • mapping 0
  • openms 0
  • imaging 0
  • taxonomic classification 0
  • metrics 0
  • depth 0
  • tsv 0
  • pangenome graph 0
  • markduplicates 0
  • histogram 0
  • structure 0
  • cluster 0
  • base quality score recalibration 0
  • scWGBS 0
  • samtools 0
  • matrix 0
  • WGBS 0
  • plot 0
  • amr 0
  • protein sequence 0
  • pairs 0
  • searching 0
  • DNA methylation 0
  • filtering 0
  • repeat 0
  • bins 0
  • mmseqs2 0
  • bcf 0
  • completeness 0
  • mappability 0
  • phasing 0
  • biscuit 0
  • transcript 0
  • annotate 0
  • checkm 0
  • virus 0
  • metagenome 0
  • gzip 0
  • aligner 0
  • sequence 0
  • LAST 0
  • transcriptome 0
  • gene 0
  • bwa 0
  • genotype 0
  • seqkit 0
  • damage 0
  • germline 0
  • bisulfite sequencing 0
  • db 0
  • complexity 0
  • evaluation 0
  • feature 0
  • gff3 0
  • kraken2 0
  • mkref 0
  • blast 0
  • decompression 0
  • ncbi 0
  • population genetics 0
  • msa 0
  • newick 0
  • ucsc 0
  • mag 0
  • sketch 0
  • vsearch 0
  • reads 0
  • demultiplexing 0
  • antimicrobial resistance genes 0
  • rna 0
  • csv 0
  • extract 0
  • bedGraph 0
  • multiple sequence alignment 0
  • short-read 0
  • tumor-only 0
  • deduplication 0
  • single 0
  • cnvkit 0
  • duplicates 0
  • mirna 0
  • snp 0
  • plasmid 0
  • pangenome 0
  • prediction 0
  • splicing 0
  • scRNA-seq 0
  • json 0
  • low frequency variant calling 0
  • kmers 0
  • profile 0
  • mpileup 0
  • idXML 0
  • concatenate 0
  • fragment 0
  • diversity 0
  • svtk 0
  • cat 0
  • kallisto 0
  • fastx 0
  • text 0
  • counts 0
  • MAF 0
  • gridss 0
  • isolates 0
  • arg 0
  • compare 0
  • indels 0
  • interval 0
  • sourmash 0
  • mutect2 0
  • call 0
  • FASTQ 0
  • visualization 0
  • ptr 0
  • ont 0
  • de novo assembly 0
  • query 0
  • distance 0
  • tabular 0
  • view 0
  • reference-free 0
  • wxs 0
  • de novo 0
  • clipping 0
  • structural 0
  • single cell 0
  • deamination 0
  • 3-letter genome 0
  • coptr 0
  • microsatellite 0
  • deep learning 0
  • snps 0
  • mtDNA 0
  • fgbio 0
  • redundancy 0
  • read depth 0
  • transcriptomics 0
  • peak-calling 0
  • diamond 0
  • circrna 0
  • miscoding lesions 0
  • palaeogenetics 0
  • ranking 0
  • interval_list 0
  • HiFi 0
  • public datasets 0
  • preprocessing 0
  • genome assembler 0
  • hic 0
  • bin 0
  • bigwig 0
  • retrotransposon 0
  • STR 0
  • archaeogenetics 0
  • cut 0
  • phylogenetic placement 0
  • containment 0
  • SV 0
  • sylph 0
  • isomir 0
  • bedpe 0
  • dna 0
  • ngscheckmate 0
  • HMM 0
  • hmmcopy 0
  • paf 0
  • telomere 0
  • compress 0
  • matching 0
  • ccs 0
  • genmod 0
  • propr 0
  • fai 0
  • image 0
  • bgzip 0
  • clean 0
  • chromosome 0
  • DNA sequence 0
  • fusion 0
  • ATAC-seq 0
  • umitools 0
  • bcl2fastq 0
  • normalization 0
  • logratio 0
  • union 0
  • ancestry 0
  • add 0
  • sample 0
  • sequencing 0
  • skani 0
  • family 0
  • untar 0
  • transposons 0
  • highly_multiplexed_imaging 0
  • unzip 0
  • fastk 0
  • mcmicro 0
  • image_analysis 0
  • duplication 0
  • fusions 0
  • UMI 0
  • uncompress 0
  • html 0
  • ataqv 0
  • krona 0
  • bacterial 0
  • bakta 0
  • benchmarking 0
  • minimap2 0
  • pileup 0
  • tabix 0
  • quality trimming 0
  • zip 0
  • archiving 0
  • polishing 0
  • remove 0
  • entrez 0
  • panel 0
  • adapter trimming 0
  • uLTRA 0
  • scaffolding 0
  • small indels 0
  • host 0
  • bamtools 0
  • checkv 0
  • khmer 0
  • informative sites 0
  • spaceranger 0
  • popscle 0
  • genotype-based deconvoltion 0
  • observations 0
  • lossless 0
  • kinship 0
  • PacBio 0
  • rna_structure 0
  • RNA 0
  • identity 0
  • dist 0
  • transcripts 0
  • genome assembly 0
  • relatedness 0
  • score 0
  • angsd 0
  • seqtk 0
  • RNA-seq 0
  • subsample 0
  • pseudoalignment 0
  • SNP 0
  • arriba 0
  • krona chart 0
  • rsem 0
  • reports 0
  • notebook 0
  • wastewater 0
  • amplicon sequencing 0
  • indel 0
  • dictionary 0
  • miRNA 0
  • spark 0
  • survivor 0
  • population genomics 0
  • hidden Markov model 0
  • mask 0
  • ambient RNA removal 0
  • complement 0
  • long_read 0
  • atac-seq 0
  • somatic variants 0
  • aln 0
  • cut up 0
  • proteome 0
  • bracken 0
  • mzml 0
  • gatk4spark 0
  • mapper 0
  • repeat expansion 0
  • CRISPR 0
  • npz 0
  • combine 0
  • comparisons 0
  • prefetch 0
  • windowmasker 0
  • prokka 0
  • bwameth 0
  • guide tree 0
  • amplicon sequences 0
  • kraken 0
  • structural_variants 0
  • chip-seq 0
  • wig 0
  • png 0
  • hi-c 0
  • pairsam 0
  • comparison 0
  • variant_calling 0
  • cellranger 0
  • replace 0
  • mkfastq 0
  • nucleotide 0
  • insert 0
  • C to T 0
  • dump 0
  • das tool 0
  • regions 0
  • intervals 0
  • fingerprint 0
  • genomes 0
  • scaffold 0
  • converter 0
  • PCA 0
  • vrhyme 0
  • deeparg 0
  • scores 0
  • das_tool 0
  • graph layout 0
  • shigella 0
  • small genome 0
  • haplogroups 0
  • genetics 0
  • duplicate 0
  • functional analysis 0
  • copyratios 0
  • k-mer frequency 0
  • tnhaplotyper2 0
  • signature 0
  • interactions 0
  • rrna 0
  • de novo assembler 0
  • ancient dna 0
  • switch 0
  • xz 0
  • hla 0
  • reformat 0
  • megan 0
  • regression 0
  • COBS 0
  • hlala 0
  • hla_typing 0
  • hlala_typing 0
  • read-group 0
  • Read depth 0
  • archive 0
  • mapcounter 0
  • rgfa 0
  • zlib 0
  • taxids 0
  • ChIP-seq 0
  • concordance 0
  • variation 0
  • mitochondrion 0
  • contig 0
  • resolve_bioscience 0
  • effect prediction 0
  • snpeff 0
  • GPU-accelerated 0
  • assembly evaluation 0
  • snpsift 0
  • cancer genomics 0
  • spatial_transcriptomics 0
  • genomad 0
  • profiles 0
  • junctions 0
  • small variants 0
  • gstama 0
  • taxon name 0
  • trancriptome 0
  • multiallelic 0
  • FracMinHash sketch 0
  • tama 0
  • image_processing 0
  • nucleotides 0
  • ped 0
  • cnvnator 0
  • registration 0
  • GC content 0
  • proportionality 0
  • differential expression 0
  • phase 0
  • checksum 0
  • leviosam2 0
  • metamaps 0
  • salmon 0
  • barcode 0
  • primer 0
  • soft-clipped clusters 0
  • pharokka 0
  • taxon tables 0
  • otu tables 0
  • varcal 0
  • instability 0
  • pair 0
  • standardisation 0
  • msi 0
  • minhash 0
  • interactive 0
  • krakenuniq 0
  • standardise 0
  • serogroup 0
  • lofreq 0
  • salmonella 0
  • homoploymer 0
  • purge duplications 0
  • library 0
  • bam2fq 0
  • preseq 0
  • collate 0
  • adapter 0
  • function 0
  • retrotransposons 0
  • MSI 0
  • long terminal repeat 0
  • dict 0
  • fixmate 0
  • long terminal retrotransposon 0
  • import 0
  • mash 0
  • taxonomic profile 0
  • tumor 0
  • maximum likelihood 0
  • polyA_tail 0
  • sequenzautils 0
  • refine 0
  • svdb 0
  • mudskipper 0
  • reformatting 0
  • iphop 0
  • orf 0
  • vg 0
  • bloom filter 0
  • rtgtools 0
  • instrain 0
  • lift 0
  • k-mer index 0
  • transformation 0
  • micro-satellite-scan 0
  • rename 0
  • krakentools 0
  • tree 0
  • screen 0
  • msisensor-pro 0
  • bustools 0
  • bfiles 0
  • transcriptomic 0
  • parallelized 0
  • standardization 0
  • orthology 0
  • subset 0
  • Duplication purging 0
  • vcflib 0
  • removal 0
  • polish 0
  • immunoprofiling 0
  • join 0
  • repeat_expansions 0
  • duplex 0
  • fetch 0
  • GEO 0
  • frame-shift correction 0
  • long-read sequencing 0
  • identifier 0
  • sequence analysis 0
  • expansionhunterdenovo 0
  • metadata 0
  • reheader 0
  • tab 0
  • intersection 0
  • windows 0
  • emboss 0
  • doublets 0
  • eigenstrat 0
  • anndata 0
  • validate 0
  • UMIs 0
  • unaligned 0
  • samplesheet 0
  • smrnaseq 0
  • xenograft 0
  • MCMICRO 0
  • graft 0
  • trim 0
  • allele-specific 0
  • mirdeep2 0
  • RNA sequencing 0
  • realignment 0
  • microbial 0
  • microscopy 0
  • Pharmacogenetics 0
  • deconvolution 0
  • bayesian 0
  • concat 0
  • tbi 0
  • intersect 0
  • normalize 0
  • norm 0
  • merge mate pairs 0
  • reads merging 0
  • short reads 0
  • region 0
  • sizes 0
  • ome-tif 0
  • nanostring 0
  • trgt 0
  • pigz 0
  • find 0
  • split_kmers 0
  • corrupted 0
  • calling 0
  • nacho 0
  • cnv calling 0
  • CNV 0
  • mRNA 0
  • vdj 0
  • cvnkit 0
  • single cells 0
  • estimation 0
  • genome bins 0
  • recombination 0
  • parse 0
  • correction 0
  • bases 0
  • heatmap 0
  • format 0
  • eido 0
  • haplotypes 0
  • awk 0
  • blastp 0
  • spatial_omics 0
  • human removal 0
  • random forest 0
  • metagenomes 0
  • gene labels 0
  • structural-variant calling 0
  • hostile 0
  • fasterq-dump 0
  • sra-tools 0
  • settings 0
  • decontamination 0
  • version 0
  • interval list 0
  • scatter 0
  • gatk 0
  • evidence 0
  • MaltExtract 0
  • HOPS 0
  • panelofnormals 0
  • baf 0
  • authentication 0
  • edit distance 0
  • dereplicate 0
  • allele 0
  • simulate 0
  • RNA-Seq 0
  • WGS 0
  • joint genotyping 0
  • cgMLST 0
  • samples 0
  • orthologs 0
  • ragtag 0
  • repeats 0
  • filtermutectcalls 0
  • qualty 0
  • gem 0
  • gwas 0
  • hmmscan 0
  • short-read sequencing 0
  • alr 0
  • blat 0
  • yahs 0
  • detecting svs 0
  • Bioinformatics Tools 0
  • confidence 0
  • phylogenies 0
  • geo 0
  • chloroplast 0
  • hmmpress 0
  • patch 0
  • hhsuite 0
  • mapad 0
  • covariance models 0
  • trna 0
  • clr 0
  • copy number variation 0
  • missingness 0
  • reference compression 0
  • baftest 0
  • svtk/baftest 0
  • regex 0
  • impute 0
  • scanner 0
  • whamg 0
  • constant 0
  • wham 0
  • reference panel 0
  • modelsegments 0
  • copy-number 0
  • copy number analysis 0
  • unmarkduplicates 0
  • gender determination 0
  • references 0
  • copy number alterations 0
  • sccmec 0
  • variantcalling 0
  • c to t 0
  • adna 0
  • dnamodelapply 0
  • groupby 0
  • createreadcountpanelofnormals 0
  • taxonomic composition 0
  • denoisereadcounts 0
  • metaspace 0
  • metabolite annotation 0
  • readwriter 0
  • mzML 0
  • data-download 0
  • Immune Deconvolution 0
  • ribosomal RNA 0
  • rRNA 0
  • prepare 0
  • hwe 0
  • catpack 0
  • Computational Immunology 0
  • tnscope 0
  • genome annotation 0
  • readproteingroups 0
  • dnascope 0
  • 16S 0
  • proteus 0
  • streptococcus 0
  • spa 0
  • spatype 0
  • mobile genetic elements 0
  • integron 0
  • patterns 0
  • signatures 0
  • doublet 0
  • countsvtypes 0
  • eigenvectors 0
  • hicPCA 0
  • fracminhash sketch 0
  • hash sketch 0
  • sliding 0
  • CRISPRi 0
  • rdtest2vcf 0
  • downsample 0
  • longest 0
  • isoform 0
  • upd 0
  • transcroder 0
  • cds 0
  • uniparental 0
  • disomy 0
  • snv 0
  • coding 0
  • sequencing adapters 0
  • downsample bam 0
  • subsample bam 0
  • vcf2db 0
  • gemini 0
  • maf 0
  • eucaryotes 0
  • lua 0
  • toml 0
  • chromosomal rearrangements 0
  • agat 0
  • drep 0
  • vcfbreakmulti 0
  • pca 0
  • refflat 0
  • genepred 0
  • bedtobigbed 0
  • ucsc/liftover 0
  • bigbed 0
  • f coefficient 0
  • homozygous genotypes 0
  • heterozygous genotypes 0
  • inbreeding 0
  • umicollapse 0
  • microbial genomics 0
  • bedgraphtobigwig 0
  • plink2_pca 0
  • bgen file 0
  • covariance model 0
  • dereplication 0
  • files 0
  • vcf file 0
  • genotype dosages 0
  • Mycobacterium tuberculosis 0
  • assembly polishing 0
  • rdtest 0
  • SNV 0
  • remove samples 0
  • extractunbinned 0
  • tandem repeats 0
  • linkbins 0
  • long read 0
  • decompress 0
  • sintax 0
  • vsearch/sort 0
  • vcf2bed 0
  • shuffleBed 0
  • Indel 0
  • trio binning 0
  • host removal 0
  • usearch 0
  • long read alignment 0
  • pangenome-scale 0
  • all versus all 0
  • mashmap 0
  • gtftogenepred 0
  • wavefront 0
  • haploype 0
  • helitron 0
  • polya tail 0
  • fast5 0
  • genome polishing 0
  • network 0
  • bedcov 0
  • uniq 0
  • deduplicate 0
  • paired reads re-pairing 0
  • comp 0
  • md 0
  • VCFtools 0
  • nm 0
  • wget 0
  • uq 0
  • verifybamid 0
  • GFF/GTF 0
  • short 0
  • intron 0
  • DNA contamination estimation 0
  • SINE 0
  • masking 0
  • low-complexity 0
  • plant 0
  • construct 0
  • melon 0
  • graph projection to vcf 0
  • boxcox 0
  • busco 0
  • fix 0
  • tag2tag 0
  • association 0
  • GWAS 0
  • svg 0
  • case/control 0
  • xml 0
  • script 0
  • java 0
  • associations 0
  • spatial_neighborhoods 0
  • hashing-based deconvolution 0
  • tags 0
  • standard 0
  • impute-info 0
  • functional 0
  • Illumina 0
  • scimap 0
  • Bayesian 0
  • uniques 0
  • invariant 0
  • structural-variants 0
  • omics 0
  • biological activity 0
  • drug categorization 0
  • prior knowledge 0
  • refresh 0
  • clahe 0
  • cell_barcodes 0
  • microRNA 0
  • telseq 0
  • stardist 0
  • variant-calling 0
  • poolseq 0
  • multi-tool 0
  • predict 0
  • search engine 0
  • mass_error 0
  • hardy-weinberg 0
  • hwe statistics 0
  • multiqc 0
  • hwe equilibrium 0
  • haplotag 0
  • reference-independent 0
  • genotype likelihood 0
  • Staging 0
  • collapse 0
  • liftover 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • cell_type_identification 0
  • cell_phenotyping 0
  • machine_learning 0
  • staging 0
  • tag 0
  • mygene 0
  • vsearch/dereplicate 0
  • coreutils 0
  • transcription factors 0
  • regulatory network 0
  • ribosomal 0
  • grabix 0
  • hamming-distance 0
  • bwameme 0
  • bwamem2 0
  • guidetree 0
  • hashing-based deconvoltion 0
  • gnu 0
  • Pacbio 0
  • overlap-based merging 0
  • generic 0
  • AC/NS/AF 0
  • vcflib/vcffixup 0
  • trimfq 0
  • cellsnp 0
  • transposable element 0
  • retrieval 0
  • donor deconvolution 0
  • genotype-based demultiplexing 0
  • MMseqs2 0
  • lexogen 0
  • droplet based single cells 0
  • check 0
  • paired reads merging 0
  • Read report 0
  • orthogroup 0
  • go 0
  • Read trimming 0
  • Read filters 0
  • nanoq 0
  • redundant 0
  • pile up 0
  • extraction 0
  • featuretable 0
  • mass spectrometry 0
  • sage 0
  • nanopore sequencing 0
  • rna velocity 0
  • cobra 0
  • spot 0
  • circular 0
  • extension 0
  • realign 0
  • quality check 0
  • size 0
  • cram-size 0
  • selector 0
  • grea 0
  • paraphase 0
  • functional enrichment 0
  • homologs 0
  • vsearch/fastqfilter 0
  • malformed 0
  • rad 0
  • tnfilter 0
  • plotting 0
  • scanpy 0
  • array_cgh 0
  • cytosure 0
  • metagenome assembler 0
  • vector 0
  • relabel 0
  • regtools 0
  • cell segmentation 0
  • nuclear segmentation 0
  • structural variant 0
  • bam2fastx 0
  • import segmentation 0
  • bam2fastq 0
  • immcantation 0
  • airrseq 0
  • immunoinformatics 0
  • solo 0
  • scvi 0
  • co-orthology 0
  • derived alleles 0
  • InterProScan 0
  • sequence similarity 0
  • decompose 0
  • partitioning 0
  • Escherichia coli 0
  • chip 0
  • propd 0
  • Read coverage histogram 0
  • updatedata 0
  • reverse complement 0
  • pdb 0
  • simulation 0
  • hmmfetch 0
  • block substitutions 0
  • site frequency spectrum 0
  • transmembrane 0
  • decomposeblocksub 0
  • genome graph 0
  • tnseq 0
  • identity-by-descent 0
  • decoy 0
  • htseq 0
  • mgi 0
  • sompy 0
  • recovery 0
  • peak picking 0
  • leafcutter 0
  • homology 0
  • p-value 0
  • fastqfilter 0
  • translate 0
  • raw 0
  • mgf 0
  • tarball 0
  • parquet 0
  • parser 0
  • dbsnp 0
  • standardize 0
  • quarto 0
  • python 0
  • r 0
  • tar 0
  • jvarkit 0
  • setgt 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • ATACshift 0
  • assay 0
  • phylogenetics 0
  • shift 0
  • minimum_evolution 0
  • distance-based 0
  • ATACseq 0
  • nucleotide sequence 0
  • targz 0
  • significance statistic 0
  • gaps 0
  • logFC 0
  • spectral clustering 0
  • comparative genomics 0
  • subsetting 0
  • deep variant 0
  • mutect 0
  • idx 0
  • barcodes 0
  • doublet_detection 0
  • quality_control 0
  • transform 0
  • emoji 0
  • introns 0
  • plastid 0
  • source tracking 0
  • controlstatistics 0
  • elprep 0
  • elfasta 0
  • install 0
  • nucleotide content 0
  • joint-genotyping 0
  • genotypegvcf 0
  • AT content 0
  • nucBed 0
  • bclconvert 0
  • parallel 0
  • ancestral alleles 0
  • methylation bias 0
  • SNPs 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • UShER 0
  • bootstrapping 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Imputation 0
  • gene model 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Haplotypes 0
  • gstama/merge 0
  • Sample 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • TAMA 0
  • gstama/polyacleanup 0
  • Mykrobe 0
  • abricate 0
  • beagle 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • extractvariants 0
  • GTDB taxonomy 0
  • extract_variants 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • gfastats 0
  • Salmonella Typhi 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • repeat content 0
  • file parsing 0
  • genome heterozygosity 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • txt 0
  • splitcram 0
  • gawk 0
  • variantfiltration 0
  • svcluster 0
  • svannotate 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • Haemophilus influenzae 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • update header 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • atlas 0
  • mkarv 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • Segmentation 0
  • cutesv 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • mitochondrial 0
  • haplotype resolution 0
  • predictions 0
  • assembly curation 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • cutoff 0
  • genomic intervals 0
  • False duplications 0
  • intervals coverage 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • jpg 0
  • pretext 0
  • Haplotype purging 0
  • Assembly curation 0
  • porechop_abi 0
  • strandedness 0
  • sequence-based 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • read_pairs 0
  • experiment 0
  • bamstat 0
  • purging 0
  • R 0
  • long uncorrected reads 0
  • subsampling 0
  • neighbour-joining 0
  • quast 0
  • contact 0
  • pmdtools 0
  • integrity 0
  • pcr 0
  • CoPRO 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • sortvcf 0
  • picard/renamesampleinvcf 0
  • liftovervcf 0
  • PRO-cap 0
  • mate-pair 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • identification 0
  • prophage 0
  • GRO-cap 0
  • CAGE 0
  • variant genetic 0
  • variant identifiers 0
  • scoring 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • exclude 0
  • NETCAGE 0
  • genetic 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • csRNA-seq 0
  • RAMPAGE 0
  • mapping-based 0
  • rtg 0
  • ChIP-Seq 0
  • gc_wiggle 0
  • error 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • sex determination 0
  • induce 0
  • bam2seqz 0
  • longread 0
  • freqsum 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • selection 0
  • seq 0
  • de-novo 0
  • sha256 0
  • interleave 0
  • dbnsfp 0
  • snippy 0
  • core 0
  • sniffles 0
  • POA 0
  • 256 bit 0
  • sliding window 0
  • features 0
  • density 0
  • boxplot 0
  • exploratory 0
  • shinyngs 0
  • header 0
  • sertotype 0
  • pedfilter 0
  • flagstat 0
  • faidx 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • duplicate marking 0
  • sambamba 0
  • multimapper 0
  • repair 0
  • Ancestor 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • rtg-tools 0
  • rocplot 0
  • insert size 0
  • paired 0
  • sequence headers 0
  • grep 0
  • subseq 0
  • variant recalibration 0
  • VQSR 0
  • applyvarcal 0
  • assembly-binning 0
  • read pairs 0
  • clusteridentifier 0
  • cluster analysis 0
  • scramble 0
  • readgroup 0
  • phantom peaks 0
  • gccounter 0
  • clinical 0
  • qualities 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • pneumophila 0
  • legionella 0
  • peptide prediction 0
  • collapsing 0
  • adapter removal 0
  • train 0
  • spliced 0
  • reorder 0
  • combining 0
  • AMP 0
  • functional genomics 0
  • kegg 0
  • taxonomic assignment 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • reduced 0
  • mash/sketch 0
  • estimate 0
  • sgRNA 0
  • rra 0
  • maximum-likelihood 0
  • CRISPR-Cas9 0
  • kofamscan 0
  • pneumoniae 0
  • MD5 0
  • haemophilus 0
  • genome browser 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • panel_of_normals 0
  • pos 0
  • pixel classification 0
  • annotations 0
  • Hidden Markov Model 0
  • amino acid 0
  • HMMER 0
  • readcounter 0
  • multicut 0
  • pixel_classification 0
  • Klebsiella 0
  • jupytext 0
  • effective genome size 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • kallisto/index 0
  • papermill 0
  • Jupyter 0
  • probability_maps 0
  • Python 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • interproscan 0
  • mcr-1 0
  • 128 bit 0
  • pedigrees 0
  • graph stats 0
  • ILP 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph formats 0
  • graph unchopping 0
  • combine graphs 0
  • block-compressed 0
  • odgi 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • gender 0
  • Neisseria gonorrhoeae 0
  • HLA-I 0
  • PCR/optical duplicates 0
  • graphs 0
  • read 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbmerge 0
  • pbbam 0
  • paragraph 0
  • flip 0
  • select 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • ligation junctions 0
  • upper-triangular matrix 0
  • megahit 0
  • Merqury 0
  • assembler 0
  • mbias 0
  • metaphlan 0
  • unionsum 0
  • ploidy 0
  • smudgeplot 0
  • contour map 0
  • microrna 0
  • 3D heat map 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • debruijn 0
  • denovo 0
  • de Bruijn 0
  • target prediction 0
  • mobile element insertions 0
  • bioinformatics tools 0
  • somatic structural variations 0
  • cancer genome 0
  • contaminant 0
  • SNP table 0
  • GATK UnifiedGenotyper 0
  • Beautiful stand-alone HTML report 0
  • mitochondrial to nuclear ratio 0
  • mitochondrial genome 0
  • ratio 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • otu table 0
  • mosdepth 0
  • reference genome 0
  • long-reads 0

Screen assemblies for antimicrobial resistance against multiple databases

010

report versions

abricate:

Mass screening of contigs for antibiotic resistance genes

A NATA accredited tool for reporting the presence of antimicrobial resistance genes in bacterial genomes

01

matches partials virulence out txt versions

abritamr:

A pipeline for running AMRfinderPlus and collating results into functional classes

Trim sequencing adapters and collapse overlapping reads

010

singles_truncated discarded paired_truncated collapsed collapsed_truncated paired_interleaved settings versions

A submodule that merges all output summary tables from ampcombi/parsetables in one summary file.

0

tsv log versions

ampcombi2/complete:

This merges the per sample AMPcombi summaries generated by running 'ampcombi2/parsetables'.

Identify antimicrobial resistance in gene or protein sequences

010

report mutation_report versions tool_version db_version

amrfinderplus:

AMRFinderPlus finds antimicrobial resistance and other genes in protein or nucleotide sequences.

Generally applicable transcriptome-wide analysis of translational efficiency using anota2seq

0123012

translated_mrna total_mrna translation buffering mrna_abundance rdata fold_change_plot interaction_p_distribution_plot residual_distribution_summary_plot residual_vs_fitted_plot rvm_fit_for_all_contrasts_group_plot rvm_fit_for_interactions_plot rvm_fit_for_omnibus_group_plot simulated_vs_obt_dfbetas_without_interaction_plot session_info versions

anota2seq:

Generally applicable transcriptome-wide analysis of translational efficiency using anota2seq

antiSMASH allows the rapid genome-wide identification, annotation and analysis of secondary metabolite biosynthesis gene clusters. This module downloads the antiSMASH databases for conda and docker/singularity runs.

000

database antismash_dir versions

antismash:

antiSMASH - the antibiotics and Secondary Metabolite Analysis SHell

Query input FASTQs against Ariba formatted databases

0101

results versions

ariba:

ARIBA: Antibiotic Resistance Identification By Assembly

Run the alignment/variant-call/consensus logic of the artic pipeline

01012012

results bam bai bam_trimmed bai_trimmed bam_primertrimmed bai_primertrimmed fasta vcf tbi json versions

artic:

ARTIC pipeline - a bioinformatics pipeline for working with virus sequencing data sequenced with nanopore

Tool for converting 10x BAMs produced by Cell Ranger, Space Ranger, Cell Ranger ATAC, Cell Ranger DNA, and Long Ranger back to FASTQ files that can be used as inputs to re-run analysis

01

fastq versions

Demultiplex Element Biosciences bases files

012

sample_fastq sample_json qc_report run_stats generated_run_manifest metrics unassigned versions

A program for detecting runs of homo/autozygosity. Only bi-allelic sites are considered.

012010000

roh versions

roh:

A program for detecting runs of homo/autozygosity. Only bi-allelic sites are considered.

Uses Bismark report files of several samples in a run folder to generate a graphical summary HTML report.

00000

summary versions

bismark:

Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.

Runs the Clippy CLIP peak caller

0100

peaks summits intergenic_gtf versions

Run matrix balancing on a cool file

012

cool versions

cooler:

Sparse binary format for genomic interaction matrices

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data

012301010101

vcf vcf_tbi gvcf gvcf_tbi versions

deepvariant:

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data

runs a differential expression analysis with DESeq2

01230120101

results dispersion_plot rdata size_factors normalised_counts rlog_counts vst_counts model session_info versions

deseq2:

Differential gene expression analysis based on the negative binomial distribution

Run falco on sequenced reads

01

html txt versions

fastqc:

falco is a drop-in C++ implementation of FastQC to assess the quality of sequence reads.

Run FastQC on sequenced reads

01

html zip versions

Run NCBI's FCS adaptor on assembled genomes

01

cleaned_assembly adaptor_report log pipeline_args skipped_trims versions

fcs:

The Foreign Contamination Screening (FCS) tool rapidly detects contaminants from foreign organisms in genome assemblies to prepare your data for submission. Therefore, the submission process to NCBI is faster and fewer contaminated genomes are submitted. This reduces errors in analyses and conclusions, not just for the original data submitter but for all subsequent users of the assembly.

Run FCS-GX on assembled genomes. The contigs of the assembly are searched against a reference database excluding the given taxid.

010

fcs_gx_report taxonomy_report versions

fcs:

"The Foreign Contamination Screening (FCS) tool rapidly detects contaminants from foreign organisms in genome assemblies to prepare your data for submission. Therefore, the submission process to NCBI is faster and fewer contaminated genomes are submitted. This reduces errors in analyses and conclusions, not just for the original data submitter but for all subsequent users of the assembly."

Runs FCS-GX (Foreign Contamination Screen - Genome eXtractor) to remove foreign contamination from genome assemblies

012

cleaned contaminants versions

fcsgx:

The NCBI Foreign Contamination Screen. Genomic cross-species aligner, for contamination detection.

Runs FCS-GX (Foreign Contamination Screen - Genome eXtractor) to screen and remove foreign contamination from genome assemblies

01200

fcsgx_report taxonomy_report log hits versions

fcsgx:

The NCBI Foreign Contamination Screen. Genomic cross-species aligner, for contamination detection.

Generate a multi-sample report file from the output of ganon report runs

01

txt versions

ganon:

ganon classifies short DNA sequences against large sets of genomic reference sequences efficiently

Build a recalibration model to score variant quality for filtering purposes. It is highly recommended to follow GATK best practices when using this module, the gaussian mixture model requires a large number of samples to be used for the tool to produce optimal results. For example, 30 samples for exome data. For more details see https://gatk.broadinstitute.org/hc/en-us/articles/4402736812443-Which-training-sets-arguments-should-I-use-for-running-VQSR-

012000000

recal idx tranches plots versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

GECCO is a fast and scalable method for identifying putative novel Biosynthetic Gene Clusters (BGCs) in genomic and metagenomic data using Conditional Random Fields (CRFs).

0120

genes features clusters gbk json versions

gecco:

Biosynthetic Gene Cluster prediction with Conditional Random Fields.

Downloads databases needed for running getorganelle

0

db versions

getorganelle:

Get organelle genomes from genome skimming data

Defines chunks where to run imputation

0123

chunk_chr versions

glimpse:

GLIMPSE is a phasing and imputation method for large-scale low-coverage sequencing studies.

Defines chunks where to run imputation

012340

chunk_chr versions

glimpse2:

GLIMPSE2 is a phasing and imputation method for large-scale low-coverage sequencing studies.

Ligatation of multiple phased BCF/VCF files into a single whole chromosome file. GLIMPSE2 is run in chunks that are ligated into chromosome-wide files maintaining the phasing.

012

merged_variants versions

glimpse2:

GLIMPSE2 is a phasing and imputation method for large-scale low-coverage sequencing studies.

runs a functional enrichment analysis with gprofiler2

010101

all_enrich rds plot_png plot_html sub_enrich sub_plot filtered_gmt session_info versions

gprofiler2:

An R interface corresponding to the 2019 update of g:Profiler web tool.

run the Broad Gene Set Enrichment tool in GSEA mode

01230101

rpt index_html heat_map_corr_plot report_tsvs_ref report_htmls_ref report_tsvs_target report_htmls_target ranked_gene_list gene_set_sizes histogram heatmap pvalues_vs_nes_plot ranked_list_corr butterfly_plot gene_set_tsv gene_set_html gene_set_heatmap snapshot gene_set_enplot gene_set_dist archive versions

gsea:

Gene Set Enrichment Analysis (GSEA)

Detection of Chimerism and Contamination in Prokaryotic Genomes

010

maxcss_level_tsv all_levels_tsv versions

gunc:

Python package for detection of chimerism and contamination in prokaryotic genomes.

Hap.py is a tool to compare diploid genotypes at haplotype level. Rather than comparing VCF records row by row, hap.py will generate and match alternate sequences in a superlocus. A superlocus is a small region of the genome (sized between 1 and around 1000 bp) that contains one or more variants.

012340101010101

summary_csv roc_all_csv roc_indel_locations_csv roc_indel_locations_pass_csv roc_snp_locations_csv roc_snp_locations_pass_csv extended_csv runinfo metrics_json vcf tbi versions

happy:

Haplotype VCF comparison tools

R script that scores output from multiple runs of hmmer/hmmsearch

01

hmmrank versions

hmmer:

Biosequence analysis using profile hidden Markov models

R:

A Language and Environment for Statistical Computing

Tidyverse:

Tidyverse: R packages for data science

Human mitochondrial variants annotation using HmtVar. Contains .plk file with annotation, so can be run offline

01

vcf versions

hmtnote:

Human mitochondrial variants annotation using HmtVar.

ichorCNA is an R package for calculating copy number alteration from (low-pass) whole genome sequencing, particularly for use in cell-free DNA

010000000

rdata seg cna_seg seg_txt corrected_depth ichorcna_params plots genome_plot versions

ichorcna:

Estimating tumor fraction in cell-free DNA from ultra-low-pass whole genome sequencing.

Runs iCount peaks on a BED file of crosslinks

012

peaks versions

icount:

Computational pipeline for analysis of iCLIP data

Runs iCount sigxls on a BED file of crosslinks

010

sigxls scores versions

icount:

Computational pipeline for analysis of iCLIP data

runs a differential expression analysis with Limma

0123012

results md_plot rdata model session_info normalised_counts versions

limma:

Linear Models for Microarray Data

MALT, an acronym for MEGAN alignment tool, is a sequence alignment and analysis tool designed for processing high-throughput sequencing data, especially in the context of metagenomics.

010

rma6 alignments log versions

malt:

A tool for mapping metagenomic data

Computational framework for tracking and quantifying DNA damage patterns among ancient DNA sequencing reads generated by Next-Generation Sequencing platforms.

010

runtime_log fragmisincorporation_plot length_plot misincorporation lgdistribution dnacomp stats_out_mcmc_hist stats_out_mcmc_iter stats_out_mcmc_trace stats_out_mcmc_iter_summ_stat stats_out_mcmc_post_pred stats_out_mcmc_correct_prob dnacomp_genome rescaled pctot_freq pgtoa_freq fasta folder versions

Run standard proteomics data analysis with MaxQuant, mostly dedicated to label-free. Paths to fasta and raw files needs to be marked by "PLACEHOLDER"

0120

maxquant_txt versions

maxquant:

MaxQuant is a quantitative proteomics software package designed for analyzing large mass-spectrometric data sets. License restricted.

A tool to estimate bacterial species abundance

0100

results versions

midas:

An integrated pipeline for estimating strain-level genomic variation from metagenomic data

Run Torsten Seemann's classic MLST on a genome assembly

01

tsv versions

Compare multiple runs of long read sequencing data and alignments

01

report_html lengths_violin_html log_length_violin_html n50_html number_of_reads_html overlay_histogram_html overlay_histogram_normalized_html overlay_log_histogram_html overlay_log_histogram_normalized_html total_throughput_html quals_violin_html overlay_histogram_identity_html overlay_histogram_phredscore_html percent_identity_violin_html active_pores_over_time_html cumulative_yield_plot_gigabases_html sequencing_speed_over_time_html stats_txt versions

Run NanoPlot on nanopore-sequenced reads

01

html png txt log versions

SARS-CoV-2 genome clade assignment, mutation calling, and sequence quality checks (C++ implementation)

010

csv csv_errors csv_insertions tsv json json_auspice ndjson fasta_aligned fasta_translation nwk versions

nextclade:

SARS-CoV-2 genome clade assignment, mutation calling, and sequence quality checks

Performs fastq alignment to a fasta reference using NextGenMap

010

bam versions

bwa:

NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime

A fast and scalable tool for bacterial pangenome analysis

01

results aln versions

panaroo:

panaroo - an updated pipeline for pangenome investigation

Phylogenetic Assignment of Named Global Outbreak LINeages

010

report versions

pangolin:

Phylogenetic Assignment of Named Global Outbreak LINeages

Runs PEKA CLIP peak k-mer analysis

0101000

cluster distribution rtxn pdf tsites oxn clust versions

Produce a pruned subset of markers that are in approximate linkage equilibrium with each other.

0123000

prunein pruneout versions

plink:

Whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner.

Produce a pruned subset of markers that are in approximate linkage equilibrium with each other. Pairs of variants in the current window with squared correlation greater than the threshold are noted and variants are greedily pruned from the window until no such pairs remain.

0123000

prunein pruneout versions

plink:

Whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner.

Produce pruned set of variants in approximatelinkage equilibrium

0123000

prune_in prune_out versions

plink2:

Whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner

Run all Portcullis steps in one go

010101

log pass_junctions_bed pass_junctions_tab intron_gff exon_gff spliced_bam spliced_bai versions

portcullis:

Portcullis is a tool that filters out invalid splice junctions from RNA-seq alignment data. It accepts BAM files from various RNA-seq mappers, analyzes splice junctions and removes likely false positives, outputting filtered results in multiple formats for downstream analysis.

Build a normal database for coverage normalization from all the (GC-normalized) normal coverage files. N.B. as reported in https://www.bioconductor.org/packages/devel/bioc/vignettes/PureCN/inst/doc/Quick.html, it is advised to provide a normal panel (VCF format) to precompute mapping bias for faster runtimes.

012300

rds png bias_rds bias_bed low_cov_bed versions

purecn:

Copy number calling and SNV classification using targeted short read sequencing

Run PureCN workflow to normalize, segment and determine purity and ploidy

01200

pdf local_optima_pdf seg genes_csv amplification_pvalues_csv vcf_gz variants_csv loh_csv chr_pdf segmentation_pdf multisample_seg versions

purecn:

Copy number calling and SNV classification using targeted short read sequencing

PyPGx pharmacogenomics genotyping pipeline for NGS data.

012345010

results cnv_calls consolidated_variants versions

pypgx:

A Python package for pharmacogenomics research

ResFinder identifies acquired antimicrobial resistance genes in total or partial sequenced isolates of bacteria

01200

json disinfinder_kma pheno_table_species pheno_table pointfinder_kma pointfinder_prediction pointfinder_results pointfinder_table resfinder_hit_in_genome_seq resfinder_blast resfinder_kma resfinder_resistance_gene_seq resfinder_results_table resfinder_results_tab resfinder_results versions

resfinder:

ResFinder identifies acquired antimicrobial resistance genes in total or partial sequenced isolates of bacteria

Markup VCF file using rho-calls.

012010

vcf versions

rhocall:

Call regions of homozygosity and make tentative UPD calls.

Call regions of homozygosity and make tentative UPD calls

0101

bed wig versions

rhocall:

Call regions of homozygosity and make tentative UPD calls.

Call peaks using SEACR on sequenced reads in bedgraph format

0120

bed versions

seacr:

SEACR is intended to call peaks and enriched regions from sparse CUT&RUN or chromatin profiling data in which background is dominated by "zeroes" (i.e. regions with no read coverage).

Runs the sentieon tool LocusCollector followed by Dedup. LocusCollector collects read information that is used by Dedup which in turn marks or removes duplicate reads.

0120101

cram crai bam bai score metrics metrics_multiqc_tsv versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Runs Sentieon's haplotyper for germline variant calling.

012340101010100

vcf vcf_tbi gvcf gvcf_tbi versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Determine Streptococcus pneumoniae serotype from Illumina paired-end reads

01

tsv txt versions

seroba:

SeroBA is a k-mer based pipeline to identify the Serotype from Illumina NGS reads for given references.

Ligate multiple phased BCF/VCF files into a single whole chromosome file. Typically run to ligate multiple chunks of phased common variants.

012

merged_variants versions

shapeit5:

Fast and accurate method for estimation of haplotypes (phasing)

simpleaf is a program to simplify and customize the running and configuration of single-cell processing with alevin-fry.

0120120123001

map quant versions

simpleaf:

SimpleAF is a program to simplify and customize the running and configuration of single-cell processing with alevin-fry.

tool to call the copy number of full-length SMN1, full-length SMN2, as well as SMN2ฮ”7โ€“8 (SMN2 with a deletion of Exon7-8) from a whole-genome sequencing (WGS) BAM file.

012

smncopynumber run_metrics versions

The Snakemake workflow management system is a tool to create reproducible and scalable data analyses. This module runs a simple Snakemake pipeline based on input snakefile. Expect many limitations."

0101

outputs snakemake_dir versions

Rapid haploid variant calling

010

tab csv html vcf bed gff bam bai log aligned_fa consensus_fa consensus_subs_fa raw_vcf filt_vcf vcf_gz vcf_csi txt versions

snippy:

Rapid bacterial SNP calling and core genome alignments

STITCH is an R program for reference panel free, read aware, low coverage sequencing genotype imputation. STITCH runs on a set of samples with sequencing reads in BAM format, as well as a list of positions to genotype, and outputs imputed genotypes in VCF format.

0123456789100120

input rdata plots vcf bgen versions

Run TRUST4 on RNA-seq data

01201010101

tsv airr_files airr_tsv report_tsv fasta out fq outs versions

Module to run UniverSC an open-source pipeline to demultiplex and process single-cell RNA-Seq data

010

outs versions

Runs a differential expression analysis with dream() from variancePartition R package

01234012

results model versions

dream:

Differential expression for repeated measures

The xeniumranger resegment module allows you to generate a new segmentation of the morphology image space by rerunning the Xenium Onboard Analysis (XOA) segmentation algorithms with modified parameters.

010000

outs versions

xeniumranger:

Xenium Ranger is a set of analysis pipelines that process Xenium In Situ Gene Expression data to relabel, resegment, or import new segmentation results from community-developed tools. Xenium Ranger provides flexible off-instrument reanalysis of Xenium In Situ data. Relabel transcripts, resegment cells with the latest 10x segmentation algorithms, or import your own segmentation data to assign transcripts to cells.

Click here to trigger an update.