Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • vcf 29
  • variant calling 12
  • VCF 10
  • structural variants 8
  • merge 5
  • gatk4 4
  • filter 4
  • bed 3
  • conversion 3
  • gvcf 3
  • survivor 3
  • sort 2
  • gff 2
  • variants 2
  • bedtools 2
  • imputation 2
  • sv 2
  • picard 2
  • haplotype 2
  • low-coverage 2
  • glimpse 2
  • indels 2
  • snps 2
  • bedpe 2
  • ligate 2
  • complement 2
  • vcflib 2
  • genomics 1
  • annotation 1
  • statistics 1
  • variant 1
  • split 1
  • sentieon 1
  • quality 1
  • consensus 1
  • bcftools 1
  • table 1
  • stats 1
  • plink2 1
  • neural network 1
  • filtering 1
  • machine learning 1
  • bcf 1
  • phasing 1
  • annotate 1
  • genotype 1
  • population genetics 1
  • json 1
  • mpileup 1
  • benchmark 1
  • query 1
  • structural 1
  • happy 1
  • normalization 1
  • union 1
  • shapeit 1
  • structural_variants 1
  • roh 1
  • intervals 1
  • snpsift 1
  • cancer genomics 1
  • runs_of_homozygosity 1
  • small variants 1
  • multiallelic 1
  • svdb 1
  • standardization 1
  • reheader 1
  • allele-specific 1
  • concat 1
  • intersect 1
  • normalize 1
  • norm 1
  • repeats 1
  • dnamodelapply 1
  • vcfbreakmulti 1
  • bgen file 1
  • vcf file 1
  • genotype dosages 1
  • uniq 1
  • deduplicate 1
  • VCFtools 1
  • AC/NS/AF 1
  • vcflib/vcffixup 1
  • decompose 1
  • simulation 1
  • standardize 1
  • short variant discovery 1
  • combinegvcfs 1
  • joint-variant-calling 1
  • Mykrobe 1
  • extractvariants 1
  • extract_variants 1
  • gvcftools 1
  • Salmonella Typhi 1
  • selectvariants 1
  • BCF 1
  • update header 1
  • multinterval 1
  • sorting 1
  • atlas 1
  • rhocall 1
  • sortvcf 1
  • picard/renamesampleinvcf 1
  • liftovervcf 1
  • bam 0
  • fasta 0
  • fastq 0
  • metagenomics 0
  • genome 0
  • index 0
  • alignment 0
  • assembly 0
  • reference 0
  • cram 0
  • sam 0
  • database 0
  • align 0
  • bacteria 0
  • map 0
  • coverage 0
  • qc 0
  • quality control 0
  • classification 0
  • nanopore 0
  • download 0
  • gtf 0
  • classify 0
  • cnv 0
  • k-mer 0
  • MSA 0
  • taxonomy 0
  • taxonomic profiling 0
  • gfa 0
  • contamination 0
  • somatic 0
  • pacbio 0
  • convert 0
  • clustering 0
  • count 0
  • binning 0
  • proteomics 0
  • copy number 0
  • ancient DNA 0
  • single-cell 0
  • phylogeny 0
  • contigs 0
  • rnaseq 0
  • long reads 0
  • trimming 0
  • build 0
  • reporting 0
  • isoseq 0
  • mags 0
  • kmer 0
  • variation graph 0
  • bisulfite 0
  • graph 0
  • bisulphite 0
  • compression 0
  • methylseq 0
  • cna 0
  • indexing 0
  • long-read 0
  • visualisation 0
  • illumina 0
  • wgs 0
  • protein 0
  • bqsr 0
  • databases 0
  • QC 0
  • methylation 0
  • phage 0
  • sequences 0
  • demultiplex 0
  • mapping 0
  • openms 0
  • imaging 0
  • 5mC 0
  • serotype 0
  • taxonomic classification 0
  • metrics 0
  • antimicrobial resistance 0
  • depth 0
  • tsv 0
  • pangenome graph 0
  • markduplicates 0
  • histogram 0
  • structure 0
  • cluster 0
  • base quality score recalibration 0
  • scWGBS 0
  • samtools 0
  • matrix 0
  • WGBS 0
  • expression 0
  • plot 0
  • amr 0
  • protein sequence 0
  • pairs 0
  • searching 0
  • DNA methylation 0
  • example 0
  • repeat 0
  • aDNA 0
  • bins 0
  • validation 0
  • mmseqs2 0
  • completeness 0
  • mappability 0
  • biscuit 0
  • transcript 0
  • checkm 0
  • virus 0
  • metagenome 0
  • gzip 0
  • aligner 0
  • palaeogenomics 0
  • cooler 0
  • sequence 0
  • LAST 0
  • transcriptome 0
  • gene 0
  • bwa 0
  • seqkit 0
  • archaeogenomics 0
  • damage 0
  • iCLIP 0
  • germline 0
  • bisulfite sequencing 0
  • db 0
  • complexity 0
  • evaluation 0
  • genotyping 0
  • feature 0
  • peaks 0
  • gff3 0
  • kraken2 0
  • hmmer 0
  • mkref 0
  • segmentation 0
  • blast 0
  • decompression 0
  • ncbi 0
  • msa 0
  • spatial 0
  • newick 0
  • ucsc 0
  • umi 0
  • mag 0
  • bismark 0
  • hmmsearch 0
  • dedup 0
  • sketch 0
  • vsearch 0
  • prokaryote 0
  • reads 0
  • demultiplexing 0
  • antimicrobial resistance genes 0
  • rna 0
  • csv 0
  • extract 0
  • bedGraph 0
  • multiple sequence alignment 0
  • short-read 0
  • tumor-only 0
  • deduplication 0
  • report 0
  • NCBI 0
  • single 0
  • cnvkit 0
  • duplicates 0
  • mirna 0
  • snp 0
  • plasmid 0
  • pangenome 0
  • antimicrobial peptides 0
  • prediction 0
  • splicing 0
  • scRNA-seq 0
  • low frequency variant calling 0
  • mitochondria 0
  • kmers 0
  • profile 0
  • differential 0
  • idXML 0
  • concatenate 0
  • fragment 0
  • diversity 0
  • profiling 0
  • svtk 0
  • mem 0
  • cat 0
  • kallisto 0
  • detection 0
  • riboseq 0
  • fastx 0
  • summary 0
  • text 0
  • counts 0
  • MAF 0
  • gridss 0
  • adapters 0
  • merging 0
  • isolates 0
  • arg 0
  • compare 0
  • amps 0
  • antibiotic resistance 0
  • interval 0
  • sourmash 0
  • mutect2 0
  • call 0
  • FASTQ 0
  • microbiome 0
  • visualization 0
  • ptr 0
  • ont 0
  • de novo assembly 0
  • distance 0
  • tabular 0
  • view 0
  • reference-free 0
  • wxs 0
  • de novo 0
  • clipping 0
  • single cell 0
  • deamination 0
  • 3-letter genome 0
  • coptr 0
  • microsatellite 0
  • deep learning 0
  • gsea 0
  • haplotypecaller 0
  • mtDNA 0
  • enrichment 0
  • fgbio 0
  • redundancy 0
  • quantification 0
  • CLIP 0
  • read depth 0
  • transcriptomics 0
  • peak-calling 0
  • xeniumranger 0
  • diamond 0
  • circrna 0
  • miscoding lesions 0
  • palaeogenetics 0
  • bedgraph 0
  • ranking 0
  • interval_list 0
  • HiFi 0
  • public datasets 0
  • preprocessing 0
  • genome assembler 0
  • hic 0
  • bin 0
  • bigwig 0
  • retrotransposon 0
  • STR 0
  • archaeogenetics 0
  • cut 0
  • ganon 0
  • phylogenetic placement 0
  • containment 0
  • SV 0
  • sylph 0
  • isomir 0
  • dna 0
  • ngscheckmate 0
  • HMM 0
  • hmmcopy 0
  • paf 0
  • telomere 0
  • pypgx 0
  • compress 0
  • matching 0
  • ccs 0
  • genmod 0
  • resistance 0
  • BGC 0
  • chunk 0
  • propr 0
  • DNA sequencing 0
  • targeted sequencing 0
  • fai 0
  • hybrid capture sequencing 0
  • image 0
  • bgzip 0
  • biosynthetic gene cluster 0
  • malt 0
  • copy number alteration calling 0
  • clean 0
  • abundance 0
  • chromosome 0
  • fungi 0
  • DNA sequence 0
  • fusion 0
  • ATAC-seq 0
  • umitools 0
  • bcl2fastq 0
  • ampir 0
  • logratio 0
  • ancestry 0
  • add 0
  • sample 0
  • sequencing 0
  • microarray 0
  • skani 0
  • family 0
  • parsing 0
  • untar 0
  • transposons 0
  • highly_multiplexed_imaging 0
  • unzip 0
  • fastk 0
  • mcmicro 0
  • image_analysis 0
  • duplication 0
  • fusions 0
  • UMI 0
  • uncompress 0
  • html 0
  • ataqv 0
  • krona 0
  • bacterial 0
  • bakta 0
  • benchmarking 0
  • minimap2 0
  • pileup 0
  • tabix 0
  • quality trimming 0
  • zip 0
  • archiving 0
  • polishing 0
  • remove 0
  • entrez 0
  • panel 0
  • adapter trimming 0
  • uLTRA 0
  • scaffolding 0
  • small indels 0
  • host 0
  • typing 0
  • bamtools 0
  • checkv 0
  • khmer 0
  • informative sites 0
  • spaceranger 0
  • popscle 0
  • genotype-based deconvoltion 0
  • observations 0
  • chimeras 0
  • lossless 0
  • DRAMP 0
  • neubi 0
  • amplify 0
  • macrel 0
  • kinship 0
  • PacBio 0
  • rna_structure 0
  • RNA 0
  • virulence 0
  • identity 0
  • dist 0
  • transcripts 0
  • genome assembly 0
  • relatedness 0
  • score 0
  • angsd 0
  • seqtk 0
  • RNA-seq 0
  • subsample 0
  • pseudoalignment 0
  • SNP 0
  • arriba 0
  • krona chart 0
  • rsem 0
  • reports 0
  • notebook 0
  • wastewater 0
  • amplicon sequencing 0
  • indel 0
  • dictionary 0
  • miRNA 0
  • eukaryotes 0
  • prokaryotes 0
  • spark 0
  • population genomics 0
  • cfDNA 0
  • genome mining 0
  • hidden Markov model 0
  • mask 0
  • ambient RNA removal 0
  • long_read 0
  • atac-seq 0
  • fam 0
  • somatic variants 0
  • aln 0
  • cut up 0
  • proteome 0
  • bracken 0
  • bim 0
  • cool 0
  • mzml 0
  • gatk4spark 0
  • mapper 0
  • repeat expansion 0
  • CRISPR 0
  • npz 0
  • combine 0
  • comparisons 0
  • prefetch 0
  • windowmasker 0
  • fcs-gx 0
  • prokka 0
  • bwameth 0
  • guide tree 0
  • amplicon sequences 0
  • kraken 0
  • chip-seq 0
  • lineage 0
  • wig 0
  • png 0
  • microbes 0
  • pangolin 0
  • covid 0
  • pan-genome 0
  • hi-c 0
  • pairsam 0
  • comparison 0
  • gene expression 0
  • variant_calling 0
  • cellranger 0
  • replace 0
  • mkfastq 0
  • nucleotide 0
  • insert 0
  • C to T 0
  • dump 0
  • das tool 0
  • regions 0
  • mlst 0
  • fingerprint 0
  • organelle 0
  • genomes 0
  • scaffold 0
  • converter 0
  • PCA 0
  • vrhyme 0
  • deeparg 0
  • scores 0
  • das_tool 0
  • graph layout 0
  • shigella 0
  • small genome 0
  • haplogroups 0
  • genetics 0
  • duplicate 0
  • functional analysis 0
  • copyratios 0
  • k-mer frequency 0
  • tnhaplotyper2 0
  • signature 0
  • interactions 0
  • rrna 0
  • de novo assembler 0
  • ancient dna 0
  • switch 0
  • xz 0
  • hla 0
  • reformat 0
  • megan 0
  • regression 0
  • COBS 0
  • hlala 0
  • hla_typing 0
  • hlala_typing 0
  • read-group 0
  • Read depth 0
  • archive 0
  • mapcounter 0
  • rgfa 0
  • zlib 0
  • taxids 0
  • ChIP-seq 0
  • concordance 0
  • variation 0
  • mitochondrion 0
  • contig 0
  • resolve_bioscience 0
  • effect prediction 0
  • snpeff 0
  • GPU-accelerated 0
  • assembly evaluation 0
  • ampgram 0
  • amptransformer 0
  • spatial_transcriptomics 0
  • genomad 0
  • profiles 0
  • junctions 0
  • gstama 0
  • taxon name 0
  • SimpleAF 0
  • trancriptome 0
  • FracMinHash sketch 0
  • tama 0
  • image_processing 0
  • nucleotides 0
  • ped 0
  • cnvnator 0
  • gene set 0
  • registration 0
  • GC content 0
  • gene set analysis 0
  • proportionality 0
  • differential expression 0
  • phase 0
  • checksum 0
  • leviosam2 0
  • metamaps 0
  • salmon 0
  • barcode 0
  • primer 0
  • soft-clipped clusters 0
  • pharokka 0
  • taxon tables 0
  • otu tables 0
  • varcal 0
  • instability 0
  • pair 0
  • standardisation 0
  • msi 0
  • minhash 0
  • interactive 0
  • krakenuniq 0
  • standardise 0
  • serogroup 0
  • lofreq 0
  • salmonella 0
  • homoploymer 0
  • purge duplications 0
  • library 0
  • bam2fq 0
  • preseq 0
  • collate 0
  • adapter 0
  • function 0
  • retrotransposons 0
  • MSI 0
  • long terminal repeat 0
  • dict 0
  • fixmate 0
  • long terminal retrotransposon 0
  • kma 0
  • import 0
  • mash 0
  • taxonomic profile 0
  • tumor 0
  • ichorcna 0
  • maximum likelihood 0
  • polyA_tail 0
  • sequenzautils 0
  • refine 0
  • mudskipper 0
  • reformatting 0
  • iphop 0
  • orf 0
  • vg 0
  • Streptococcus pneumoniae 0
  • bloom filter 0
  • rtgtools 0
  • instrain 0
  • lift 0
  • k-mer index 0
  • nextclade 0
  • transformation 0
  • micro-satellite-scan 0
  • rename 0
  • krakentools 0
  • tree 0
  • variant pruning 0
  • screen 0
  • msisensor-pro 0
  • bustools 0
  • bfiles 0
  • transcriptomic 0
  • parallelized 0
  • orthology 0
  • subset 0
  • Duplication purging 0
  • removal 0
  • polish 0
  • immunoprofiling 0
  • join 0
  • repeat_expansions 0
  • duplex 0
  • fetch 0
  • GEO 0
  • frame-shift correction 0
  • long-read sequencing 0
  • metagenomic 0
  • identifier 0
  • sequence analysis 0
  • expansionhunterdenovo 0
  • metadata 0
  • tab 0
  • intersection 0
  • windows 0
  • pharmacogenetics 0
  • emboss 0
  • doublets 0
  • eigenstrat 0
  • anndata 0
  • validate 0
  • UMIs 0
  • unaligned 0
  • samplesheet 0
  • smrnaseq 0
  • xenograft 0
  • MCMICRO 0
  • graft 0
  • trim 0
  • mirdeep2 0
  • RNA sequencing 0
  • realignment 0
  • microbial 0
  • microscopy 0
  • Pharmacogenetics 0
  • deconvolution 0
  • bayesian 0
  • tbi 0
  • merge mate pairs 0
  • reads merging 0
  • short reads 0
  • region 0
  • sizes 0
  • ome-tif 0
  • nanostring 0
  • trgt 0
  • pigz 0
  • find 0
  • split_kmers 0
  • corrupted 0
  • calling 0
  • nacho 0
  • cnv calling 0
  • CNV 0
  • mRNA 0
  • screening 0
  • vdj 0
  • cvnkit 0
  • single cells 0
  • estimation 0
  • genome bins 0
  • recombination 0
  • eCLIP 0
  • splice 0
  • parse 0
  • cleaning 0
  • correction 0
  • bases 0
  • heatmap 0
  • format 0
  • eido 0
  • haplotypes 0
  • awk 0
  • BAM 0
  • blastp 0
  • deseq2 0
  • rna-seq 0
  • blastn 0
  • spatial_omics 0
  • human removal 0
  • random forest 0
  • metagenomes 0
  • gene labels 0
  • structural-variant calling 0
  • hostile 0
  • fasterq-dump 0
  • sra-tools 0
  • settings 0
  • decontamination 0
  • version 0
  • interval list 0
  • scatter 0
  • gatk 0
  • NRPS 0
  • evidence 0
  • MaltExtract 0
  • HOPS 0
  • panelofnormals 0
  • baf 0
  • authentication 0
  • edit distance 0
  • dereplicate 0
  • secondary metabolites 0
  • RiPP 0
  • allele 0
  • demultiplexed reads 0
  • antibiotics 0
  • aggregate 0
  • artic 0
  • simulate 0
  • antismash 0
  • RNA-Seq 0
  • WGS 0
  • joint genotyping 0
  • cgMLST 0
  • samples 0
  • orthologs 0
  • ragtag 0
  • filtermutectcalls 0
  • qualty 0
  • gem 0
  • gwas 0
  • hmmscan 0
  • short-read sequencing 0
  • alr 0
  • blat 0
  • yahs 0
  • detecting svs 0
  • Bioinformatics Tools 0
  • confidence 0
  • phylogenies 0
  • geo 0
  • chloroplast 0
  • hmmpress 0
  • patch 0
  • hhsuite 0
  • mapad 0
  • covariance models 0
  • trna 0
  • clr 0
  • copy number variation 0
  • missingness 0
  • reference compression 0
  • baftest 0
  • svtk/baftest 0
  • regex 0
  • impute 0
  • scanner 0
  • whamg 0
  • constant 0
  • wham 0
  • reference panel 0
  • modelsegments 0
  • copy-number 0
  • copy number analysis 0
  • unmarkduplicates 0
  • gender determination 0
  • junction 0
  • references 0
  • copy number alterations 0
  • sccmec 0
  • variantcalling 0
  • c to t 0
  • adna 0
  • workflow_mode 0
  • groupby 0
  • createreadcountpanelofnormals 0
  • taxonomic composition 0
  • denoisereadcounts 0
  • metaspace 0
  • metabolite annotation 0
  • readwriter 0
  • mzML 0
  • snakemake 0
  • data-download 0
  • Immune Deconvolution 0
  • ribosomal RNA 0
  • rRNA 0
  • prepare 0
  • hwe 0
  • catpack 0
  • Computational Immunology 0
  • workflow 0
  • tnscope 0
  • genome annotation 0
  • readproteingroups 0
  • dnascope 0
  • 16S 0
  • proteus 0
  • streptococcus 0
  • spa 0
  • spatype 0
  • mobile genetic elements 0
  • integron 0
  • patterns 0
  • signatures 0
  • doublet 0
  • countsvtypes 0
  • eigenvectors 0
  • hicPCA 0
  • fracminhash sketch 0
  • hash sketch 0
  • sliding 0
  • bgen 0
  • CRISPRi 0
  • pruning 0
  • rdtest2vcf 0
  • downsample 0
  • longest 0
  • isoform 0
  • upd 0
  • transcroder 0
  • cds 0
  • uniparental 0
  • disomy 0
  • snv 0
  • variancepartition 0
  • coding 0
  • sequencing adapters 0
  • downsample bam 0
  • subsample bam 0
  • vcf2db 0
  • gemini 0
  • maf 0
  • eucaryotes 0
  • lua 0
  • dream 0
  • toml 0
  • chromosomal rearrangements 0
  • agat 0
  • drep 0
  • pca 0
  • linkage equilibrium 0
  • refflat 0
  • genepred 0
  • bedtobigbed 0
  • ucsc/liftover 0
  • bigbed 0
  • f coefficient 0
  • homozygous genotypes 0
  • heterozygous genotypes 0
  • inbreeding 0
  • umicollapse 0
  • microbial genomics 0
  • bedgraphtobigwig 0
  • scRNA-Seq 0
  • plink2_pca 0
  • covariance model 0
  • dereplication 0
  • files 0
  • Mycobacterium tuberculosis 0
  • assembly polishing 0
  • rdtest 0
  • SNV 0
  • remove samples 0
  • extractunbinned 0
  • tandem repeats 0
  • linkbins 0
  • long read 0
  • decompress 0
  • sintax 0
  • vsearch/sort 0
  • vcf2bed 0
  • shuffleBed 0
  • Indel 0
  • trio binning 0
  • host removal 0
  • usearch 0
  • long read alignment 0
  • pangenome-scale 0
  • all versus all 0
  • mashmap 0
  • gtftogenepred 0
  • wavefront 0
  • haploype 0
  • helitron 0
  • polya tail 0
  • fast5 0
  • genome polishing 0
  • network 0
  • bedcov 0
  • paired reads re-pairing 0
  • comp 0
  • md 0
  • nm 0
  • wget 0
  • uq 0
  • verifybamid 0
  • GFF/GTF 0
  • short 0
  • intron 0
  • DNA contamination estimation 0
  • SINE 0
  • masking 0
  • low-complexity 0
  • plant 0
  • construct 0
  • melon 0
  • graph projection to vcf 0
  • boxcox 0
  • busco 0
  • fix 0
  • tag2tag 0
  • association 0
  • GWAS 0
  • svg 0
  • case/control 0
  • xml 0
  • script 0
  • java 0
  • associations 0
  • rank 0
  • spatial_neighborhoods 0
  • hashing-based deconvolution 0
  • tags 0
  • standard 0
  • impute-info 0
  • functional 0
  • Illumina 0
  • scimap 0
  • Bayesian 0
  • uniques 0
  • invariant 0
  • structural-variants 0
  • omics 0
  • biological activity 0
  • drug categorization 0
  • prior knowledge 0
  • refresh 0
  • clahe 0
  • cell_barcodes 0
  • microRNA 0
  • telseq 0
  • stardist 0
  • variant-calling 0
  • poolseq 0
  • multi-tool 0
  • predict 0
  • search engine 0
  • mass_error 0
  • hardy-weinberg 0
  • hwe statistics 0
  • multiqc 0
  • hwe equilibrium 0
  • haplotag 0
  • reference-independent 0
  • genotype likelihood 0
  • Staging 0
  • collapse 0
  • liftover 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • cell_type_identification 0
  • cell_phenotyping 0
  • machine_learning 0
  • staging 0
  • tag 0
  • mygene 0
  • vsearch/dereplicate 0
  • coreutils 0
  • transcription factors 0
  • regulatory network 0
  • 10x 0
  • ribosomal 0
  • grabix 0
  • hamming-distance 0
  • bwameme 0
  • bwamem2 0
  • guidetree 0
  • hashing-based deconvoltion 0
  • gnu 0
  • Pacbio 0
  • overlap-based merging 0
  • generic 0
  • trimfq 0
  • cellsnp 0
  • transposable element 0
  • retrieval 0
  • donor deconvolution 0
  • genotype-based demultiplexing 0
  • MMseqs2 0
  • lexogen 0
  • droplet based single cells 0
  • check 0
  • paired reads merging 0
  • Read report 0
  • orthogroup 0
  • go 0
  • Read trimming 0
  • Read filters 0
  • nanoq 0
  • redundant 0
  • pile up 0
  • extraction 0
  • featuretable 0
  • mass spectrometry 0
  • sage 0
  • nanopore sequencing 0
  • rna velocity 0
  • translation 0
  • cobra 0
  • spot 0
  • circular 0
  • extension 0
  • realign 0
  • quality check 0
  • size 0
  • cram-size 0
  • selector 0
  • grea 0
  • paraphase 0
  • functional enrichment 0
  • homologs 0
  • vsearch/fastqfilter 0
  • malformed 0
  • rad 0
  • tnfilter 0
  • plotting 0
  • scanpy 0
  • array_cgh 0
  • cytosure 0
  • metagenome assembler 0
  • vector 0
  • gprofiler2 0
  • gost 0
  • morphology 0
  • resegment 0
  • relabel 0
  • regtools 0
  • cell segmentation 0
  • nuclear segmentation 0
  • structural variant 0
  • bam2fastx 0
  • import segmentation 0
  • bam2fastq 0
  • immcantation 0
  • airrseq 0
  • immunoinformatics 0
  • solo 0
  • scvi 0
  • co-orthology 0
  • derived alleles 0
  • InterProScan 0
  • sequence similarity 0
  • partitioning 0
  • Escherichia coli 0
  • chip 0
  • propd 0
  • Read coverage histogram 0
  • updatedata 0
  • run 0
  • reverse complement 0
  • pdb 0
  • hmmfetch 0
  • block substitutions 0
  • site frequency spectrum 0
  • transmembrane 0
  • decomposeblocksub 0
  • genome graph 0
  • tnseq 0
  • identity-by-descent 0
  • decoy 0
  • htseq 0
  • mgi 0
  • sompy 0
  • recovery 0
  • peak picking 0
  • leafcutter 0
  • homology 0
  • p-value 0
  • fastqfilter 0
  • translate 0
  • raw 0
  • mgf 0
  • tarball 0
  • parquet 0
  • parser 0
  • dbsnp 0
  • quarto 0
  • python 0
  • r 0
  • tar 0
  • jvarkit 0
  • resistance genes 0
  • setgt 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • ATACshift 0
  • assay 0
  • phylogenetics 0
  • shift 0
  • minimum_evolution 0
  • distance-based 0
  • ATACseq 0
  • nucleotide sequence 0
  • targz 0
  • resfinder 0
  • significance statistic 0
  • gaps 0
  • logFC 0
  • spectral clustering 0
  • comparative genomics 0
  • subsetting 0
  • deep variant 0
  • mutect 0
  • idx 0
  • barcodes 0
  • doublet_detection 0
  • quality_control 0
  • transform 0
  • emoji 0
  • introns 0
  • plastid 0
  • source tracking 0
  • controlstatistics 0
  • elprep 0
  • elfasta 0
  • install 0
  • nucleotide content 0
  • joint-genotyping 0
  • genotypegvcf 0
  • AT content 0
  • nucBed 0
  • bclconvert 0
  • parallel 0
  • ancestral alleles 0
  • methylation bias 0
  • SNPs 0
  • getpileupsummaries 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • UShER 0
  • bootstrapping 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Imputation 0
  • gene model 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • Haplotypes 0
  • gstama/merge 0
  • Sample 0
  • low coverage 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • TAMA 0
  • gstama/polyacleanup 0
  • abricate 0
  • beagle 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • GTDB taxonomy 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • gfastats 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • repeat content 0
  • file parsing 0
  • genome heterozygosity 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • txt 0
  • splitcram 0
  • gawk 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • svannotate 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • Haemophilus influenzae 0
  • illumiation_correction 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • element 0
  • biallelic 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • mkarv 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • Segmentation 0
  • cutesv 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • cooler/balance 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • mitochondrial 0
  • haplotype resolution 0
  • predictions 0
  • normal database 0
  • assembly curation 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • cutoff 0
  • panel of normals 0
  • genomic intervals 0
  • False duplications 0
  • intervals coverage 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • jpg 0
  • pretext 0
  • Haplotype purging 0
  • Assembly curation 0
  • porechop_abi 0
  • strandedness 0
  • sequence-based 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • read_pairs 0
  • experiment 0
  • bamstat 0
  • purging 0
  • R 0
  • long uncorrected reads 0
  • subsampling 0
  • neighbour-joining 0
  • quast 0
  • contact 0
  • pmdtools 0
  • integrity 0
  • pcr 0
  • CoPRO 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • PRO-cap 0
  • mate-pair 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • identification 0
  • prophage 0
  • GRO-cap 0
  • CAGE 0
  • variant genetic 0
  • variant identifiers 0
  • scoring 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • indep pairwise 0
  • indep 0
  • exclude 0
  • NETCAGE 0
  • genetic 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • csRNA-seq 0
  • RAMPAGE 0
  • mapping-based 0
  • rtg 0
  • ChIP-Seq 0
  • gc_wiggle 0
  • error 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • sex determination 0
  • induce 0
  • bam2seqz 0
  • longread 0
  • freqsum 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • selection 0
  • seq 0
  • de-novo 0
  • sha256 0
  • interleave 0
  • SMN1 0
  • dbnsfp 0
  • snippy 0
  • core 0
  • sniffles 0
  • POA 0
  • SMN2 0
  • CRAM 0
  • 256 bit 0
  • sliding window 0
  • features 0
  • density 0
  • boxplot 0
  • exploratory 0
  • shinyngs 0
  • header 0
  • sertotype 0
  • pedfilter 0
  • flagstat 0
  • faidx 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • duplicate marking 0
  • sambamba 0
  • multimapper 0
  • repair 0
  • Ancestor 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • rtg-tools 0
  • rocplot 0
  • insert size 0
  • paired 0
  • sequence headers 0
  • seacr 0
  • grep 0
  • subseq 0
  • variant recalibration 0
  • VQSR 0
  • applyvarcal 0
  • assembly-binning 0
  • chromatin 0
  • read pairs 0
  • cut&run 0
  • cut&tag 0
  • peak-caller 0
  • clusteridentifier 0
  • cluster analysis 0
  • scramble 0
  • readgroup 0
  • phantom peaks 0
  • motif 0
  • gccounter 0
  • clinical 0
  • qualities 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • limma 0
  • pneumophila 0
  • legionella 0
  • peptide prediction 0
  • collapsing 0
  • adapter removal 0
  • train 0
  • spliced 0
  • reorder 0
  • combining 0
  • AMP 0
  • functional genomics 0
  • kegg 0
  • taxonomic assignment 0
  • mass-spectroscopy 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • reduced 0
  • mash/sketch 0
  • estimate 0
  • sgRNA 0
  • damage patterns 0
  • NGS 0
  • DNA damage 0
  • rra 0
  • maximum-likelihood 0
  • CRISPR-Cas9 0
  • kofamscan 0
  • pneumoniae 0
  • MD5 0
  • haemophilus 0
  • genome browser 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • panel_of_normals 0
  • pos 0
  • pixel classification 0
  • annotations 0
  • hmtnote 0
  • Hidden Markov Model 0
  • amino acid 0
  • HMMER 0
  • readcounter 0
  • multicut 0
  • pixel_classification 0
  • Klebsiella 0
  • jupytext 0
  • effective genome size 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • kallisto/index 0
  • papermill 0
  • Jupyter 0
  • probability_maps 0
  • Python 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • interproscan 0
  • mcr-1 0
  • 128 bit 0
  • pedigrees 0
  • graph stats 0
  • ILP 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph formats 0
  • graph unchopping 0
  • combine graphs 0
  • block-compressed 0
  • odgi 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • gender 0
  • Neisseria gonorrhoeae 0
  • HLA-I 0
  • PCR/optical duplicates 0
  • NextGenMap 0
  • graphs 0
  • read 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbmerge 0
  • pbbam 0
  • paragraph 0
  • flip 0
  • select 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • ligation junctions 0
  • upper-triangular matrix 0
  • ngm 0
  • sequencing summary 0
  • megahit 0
  • Merqury 0
  • assembler 0
  • mbias 0
  • metaphlan 0
  • unionsum 0
  • ploidy 0
  • smudgeplot 0
  • contour map 0
  • microrna 0
  • 3D heat map 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • debruijn 0
  • denovo 0
  • de Bruijn 0
  • target prediction 0
  • mobile element insertions 0
  • bioinformatics tools 0
  • somatic structural variations 0
  • cancer genome 0
  • contaminant 0
  • SNP table 0
  • GATK UnifiedGenotyper 0
  • Beautiful stand-alone HTML report 0
  • mitochondrial to nuclear ratio 0
  • mitochondrial genome 0
  • ratio 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • otu table 0
  • mosdepth 0
  • reference genome 0
  • long-reads 0

generate VCF file from a BAM file using various calling methods

012340000

vcf versions

atlas:

ATLAS, a suite of methods to accurately genotype and estimate genetic diversity

Concatenate VCF files

012

vcf tbi csi versions

concat:

Concatenate VCF files.

Compresses VCF files

01234

fasta versions

consensus:

Create consensus sequence by applying VCF variants to a reference fasta file.

Filters VCF files

012

vcf tbi csi versions

filter:

Apply fixed-threshold filters to VCF files.

Apply set operations to VCF files

012

results versions

isec:

Computes intersections, unions and complements of VCF files.

Merge VCF files

012010101

vcf index versions

merge:

Merge VCF files.

Compresses VCF files

012010

vcf tbi stats mpileup versions

mpileup:

Generates genotype likelihoods at each genomic position with coverage.

Normalize VCF file

01201

vcf tbi csi versions

norm:

Normalize VCF files.

Reheader a VCF file

012301

vcf index versions

reheader:

Modify header of VCF/BCF files, change sample names.

Sorts VCF files

01

vcf tbi csi versions

sort:

Sort VCF files by coordinates.

Split a vcf file into files per chromosome

012

split_vcf versions

bcftools:

Sort VCF files by coordinates.

Generates stats from VCF files

0120101010101

stats versions

stats:

Parses VCF or BCF and produces text file stats which is suitable for machine processing and can be plotted using plot-vcfstats.

Convert a BED file to a VCF file according to a YAML config

01201

vcf versions

Returns all intervals in a genome that are not covered by at least one interval in the input BED/GFF/VCF file.

010

bed versions

bedtools:

A set of tools for genomic analysis tasks, specifically enabling genome arithmetic (merge, count, complement) on various file types.

Identifies common intervals among multiple (and subsets thereof) sorted BED/GFF/VCF files.

010

bed versions

bedtools:

A set of tools for genomic analysis tasks, specifically enabling genome arithmetic (merge, count, complement) on various file types.

DeepSomatic is an extension of deep learning-based variant caller DeepVariant that takes aligned reads (in BAM or CRAM format) from tumor and normal data, produces pileup image tensors from them, classifies each tensor using a convolutional neural network, and finally reports somatic variants in a standard VCF or gVCF file.

0123401010101

vcf vcf_tbi gvcf gvcf_tbi versions

Combine per-sample gVCF files produced by HaplotypeCaller into a multi-sample gVCF file

012000

combined_gvcf versions

gatk4:

Genome Analysis Toolkit (GATK4). Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Merges several vcf files

0101

vcf tbi versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Select a subset of variants from a VCF file

0123

vcf tbi versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Extract fields from a VCF file to a tab-delimited table

012345010101

table versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Genotype Salmonella Typhi from Mykrobe results

01

tsv versions

genotyphi:

Assign genotypes to Salmonella Typhi genomes based on VCF files (mapped to Typhi CT18 reference genome)

Ligatation of multiple phased BCF/VCF files into a single whole chromosome file. GLIMPSE2 is run in chunks that are ligated into chromosome-wide files maintaining the phasing.

012

merged_variants versions

glimpse2:

GLIMPSE2 is a phasing and imputation method for large-scale low-coverage sequencing studies.

Tool for imputation and phasing from vcf file or directly from bam files.

0123456789012

phased_variants stats_coverage versions

glimpse2:

GLIMPSE2 is a phasing and imputation method for large-scale low-coverage sequencing studies.

merge gVCF files and perform joint variant calling

0101

bcf versions

Removes all non-variant blocks from a gVCF file to produce a smaller variant-only VCF file.

01

vcf versions

gvcftools:

gvcftools is a package of small utilities for creating and analyzing gVCF files

Pre.py is a preprocessing tool made to preprocess VCF files for Hap.py

0120101

preprocessed_vcf versions

happy:

Haplotype VCF comparison tools

annotate VCF files for poly repeats

01010101

vcf tbi csi versions

jvarkit:

Java utilities for Bioinformatics.

bcftools:

View, subset and filter VCF or BCF files by position and filtering expression. Convert between VCF and BCF

Convert a VCF file to a JSON graph

0101

graph versions

paragraph:

Graph realignment tools for structural variants

Lifts over a VCF file from one reference build to another.

01010101

vcf_lifted vcf_unlifted versions

picard:

Move annotations from one assembly to another

changes name of sample in the vcf file

01

vcf versions

picard:

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.

Sorts vcf files

010101

vcf versions

picard:

Java tools for working with NGS data in the BAM/CRAM/SAM and VCF format

Convert from VCF file to BGEN file version 1.2 format preserving dosages.

01234

bgen_file sample_file log_file versions

plink2:

Whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner

Markup VCF file using rho-calls.

012010

vcf versions

rhocall:

Call regions of homozygosity and make tentative UPD calls.

modifies the input VCF file by adding the MLrejected FILTER to the variants

012010101

vcf index versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Ligate multiple phased BCF/VCF files into a single whole chromosome file. Typically run to ligate multiple chunks of phased common variants.

012

merged_variants versions

shapeit5:

Fast and accurate method for estimation of haplotypes (phasing)

Annotate a VCF file with another VCF file

012012

vcf versions

snpsift:

SnpSift is a toolbox that allows you to filter and manipulate annotated files

Converts a bedpe file to a VCF file (beta version)

01

vcf versions

survivor:

Toolset for SV simulation, comparison and filtering

Filter a vcf file based on size and/or regions to ignore

0120000

vcf versions

survivor:

Toolset for SV simulation, comparison and filtering

Compare or merge VCF files to generate a consensus or multi sample VCF files.

01000000

vcf versions

survivor:

Toolset for SV simulation, comparison and filtering

Simulate an SV VCF file based on a reference genome

01010100

parameters vcf bed fasta insertions versions

survivor:

Toolset for SV simulation, comparison and filtering

Report multiple stats over a VCF file

01000

stats versions

survivor:

Toolset for SV simulation, comparison and filtering

The merge module merges structural variants within one or more vcf files.

0100

vcf tbi csi versions

svdb:

structural variant database software

Query a structural variant database, using a vcf file as query

01000000

vcf versions

svdb:

structural variant database software

Convert a VCF file to a BEDPE file.

01

bedpe versions

svtools:

Tools for processing and analyzing structural variants

A tool to standardize VCF files from structural variant callers

0123

vcf tbi versions

If multiple alleles are specified in a single record, break the record into several lines preserving allele-specific INFO fields

012

vcf versions

vcflib:

Command-line tools for manipulating VCF files

Command line tools for parsing and manipulating VCF files.

012

vcf versions

vcflib:

Command line tools for parsing and manipulating VCF files.

Generates a VCF stream where AC and NS have been generated for each record using sample genotypes.

012

vcf versions

vcflib:

Command-line tools for manipulating VCF files

List unique genotypes. Like GNU uniq, but for VCF records. Remove records which have the same position, ref, and alt as the previous record.

012

vcf versions

vcflib:

Command-line tools for manipulating VCF files

A set of tools written in Perl and C++ for working with VCF files

0100

vcf bcf frq frq_count idepth ldepth ldepth_mean gdepth hap_ld geno_ld geno_chisq list_hap_ld list_geno_ld interchrom_hap_ld interchrom_geno_ld tstv tstv_summary tstv_count tstv_qual filter_summary sites_pi windowed_pi weir_fst heterozygosity hwe tajima_d freq_burden lroh relatedness relatedness2 lqual missing_individual missing_site snp_density kept_sites removed_sites singeltons indel_hist hapcount mendel format info genotypes_matrix genotypes_matrix_individual genotypes_matrix_position impute_hap impute_hap_legend impute_hap_indv ldhat_sites ldhat_locs beagle_gl beagle_pl ped map_ tped tfam diff_sites_in_files diff_indv_in_files diff_sites diff_indv diff_discd_matrix diff_switch_error versions

decomposes multiallelic variants into biallelic in a VCF file.

012

vcf versions

vt:

A tool set for short variant discovery in genetic sequence data

normalizes variants in a VCF file

01230101

vcf fai versions

vt:

A tool set for short variant discovery in genetic sequence data

Click here to trigger an update.