Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • bam 8
  • gatk4 7
  • fasta 5
  • counts 5
  • genomics 4
  • cram 4
  • qc 4
  • hmmcopy 4
  • genome 3
  • reference 3
  • sam 3
  • quality control 3
  • cnv 3
  • expression 3
  • vcf 2
  • fastq 2
  • metagenomics 2
  • assembly 2
  • count 2
  • stats 2
  • mapping 2
  • differential 2
  • interval 2
  • windowmasker 2
  • mapcounter 2
  • index 1
  • structural variants 1
  • align 1
  • gff 1
  • statistics 1
  • coverage 1
  • classify 1
  • k-mer 1
  • sentieon 1
  • pacbio 1
  • quality 1
  • binning 1
  • copy number 1
  • ancient DNA 1
  • rnaseq 1
  • contigs 1
  • mags 1
  • isoseq 1
  • QC 1
  • histogram 1
  • transcript 1
  • completeness 1
  • gene 1
  • db 1
  • mappability 1
  • evaluation 1
  • population genetics 1
  • mirna 1
  • kmers 1
  • scRNA-seq 1
  • de novo assembly 1
  • FASTQ 1
  • kallisto 1
  • svtk 1
  • mutect2 1
  • isomir 1
  • ccs 1
  • quantification 1
  • chromosome 1
  • microarray 1
  • transposons 1
  • rsem 1
  • angsd 1
  • pseudoalignment 1
  • miRNA 1
  • salmon 1
  • barcode 1
  • primer 1
  • UMIs 1
  • duplex 1
  • allele-specific 1
  • nacho 1
  • nanostring 1
  • mRNA 1
  • deseq2 1
  • rna-seq 1
  • WGS 1
  • panelofnormals 1
  • cgMLST 1
  • gatk 1
  • repeats 1
  • CRISPRi 1
  • denoisereadcounts 1
  • copyratios 1
  • createreadcountpanelofnormals 1
  • countsvtypes 1
  • collapse 1
  • collectreadcounts 1
  • asereadcounter 1
  • germlinecnvcaller 1
  • germline contig ploidy 1
  • determinegermlinecontigploidy 1
  • getpileupsumaries 1
  • germlinevariantsites 1
  • readcountssummary 1
  • readorientationartifacts 1
  • learnreadorientationmodel 1
  • allele counts 1
  • doCounts 1
  • post mortem damage 1
  • limma 1
  • metagenome-assembled genomes 1
  • maxbin2 1
  • readcounter 1
  • gccounter 1
  • alignment 0
  • bed 0
  • sort 0
  • annotation 0
  • variant calling 0
  • database 0
  • filter 0
  • merge 0
  • bacteria 0
  • map 0
  • variants 0
  • classification 0
  • download 0
  • gtf 0
  • nanopore 0
  • split 0
  • MSA 0
  • variant 0
  • contamination 0
  • taxonomic profiling 0
  • taxonomy 0
  • gfa 0
  • somatic 0
  • convert 0
  • conversion 0
  • clustering 0
  • proteomics 0
  • single-cell 0
  • VCF 0
  • imputation 0
  • trimming 0
  • phylogeny 0
  • long reads 0
  • bedtools 0
  • gvcf 0
  • consensus 0
  • build 0
  • bcftools 0
  • graph 0
  • variation graph 0
  • reporting 0
  • sv 0
  • kmer 0
  • bisulfite 0
  • cna 0
  • visualisation 0
  • long-read 0
  • picard 0
  • table 0
  • databases 0
  • methylation 0
  • illumina 0
  • protein 0
  • compression 0
  • indexing 0
  • bqsr 0
  • wgs 0
  • bisulphite 0
  • methylseq 0
  • taxonomic classification 0
  • metrics 0
  • serotype 0
  • antimicrobial resistance 0
  • depth 0
  • openms 0
  • imaging 0
  • 5mC 0
  • tsv 0
  • demultiplex 0
  • phage 0
  • sequences 0
  • matrix 0
  • WGBS 0
  • scWGBS 0
  • pangenome graph 0
  • repeat 0
  • pairs 0
  • samtools 0
  • DNA methylation 0
  • plot 0
  • searching 0
  • amr 0
  • protein sequence 0
  • cluster 0
  • plink2 0
  • aDNA 0
  • structure 0
  • neural network 0
  • base quality score recalibration 0
  • haplotype 0
  • example 0
  • bins 0
  • markduplicates 0
  • filtering 0
  • annotate 0
  • bisulfite sequencing 0
  • gzip 0
  • mmseqs2 0
  • archaeogenomics 0
  • biscuit 0
  • low-coverage 0
  • machine learning 0
  • phasing 0
  • virus 0
  • bcf 0
  • validation 0
  • palaeogenomics 0
  • aligner 0
  • transcriptome 0
  • seqkit 0
  • sequence 0
  • cooler 0
  • iCLIP 0
  • LAST 0
  • checkm 0
  • genotype 0
  • metagenome 0
  • damage 0
  • bwa 0
  • germline 0
  • complexity 0
  • gff3 0
  • glimpse 0
  • decompression 0
  • mag 0
  • hmmer 0
  • peaks 0
  • genotyping 0
  • dedup 0
  • kraken2 0
  • blast 0
  • segmentation 0
  • feature 0
  • newick 0
  • umi 0
  • spatial 0
  • bismark 0
  • ucsc 0
  • msa 0
  • mkref 0
  • sketch 0
  • hmmsearch 0
  • ncbi 0
  • low frequency variant calling 0
  • prediction 0
  • demultiplexing 0
  • vsearch 0
  • bedGraph 0
  • short-read 0
  • antimicrobial peptides 0
  • antimicrobial resistance genes 0
  • mitochondria 0
  • deduplication 0
  • splicing 0
  • cnvkit 0
  • extract 0
  • csv 0
  • json 0
  • reads 0
  • prokaryote 0
  • multiple sequence alignment 0
  • report 0
  • pangenome 0
  • duplicates 0
  • NCBI 0
  • tumor-only 0
  • snp 0
  • profile 0
  • single 0
  • plasmid 0
  • text 0
  • 3-letter genome 0
  • adapters 0
  • mem 0
  • idXML 0
  • merging 0
  • diversity 0
  • tabular 0
  • detection 0
  • deamination 0
  • MAF 0
  • indels 0
  • visualization 0
  • summary 0
  • fastx 0
  • riboseq 0
  • single cell 0
  • profiling 0
  • sourmash 0
  • isolates 0
  • benchmark 0
  • antibiotic resistance 0
  • microbiome 0
  • concatenate 0
  • gridss 0
  • amps 0
  • view 0
  • cat 0
  • de novo 0
  • arg 0
  • call 0
  • structural 0
  • mpileup 0
  • fragment 0
  • reference-free 0
  • query 0
  • compare 0
  • wxs 0
  • ont 0
  • distance 0
  • clipping 0
  • coptr 0
  • ptr 0
  • circrna 0
  • ngscheckmate 0
  • matching 0
  • read depth 0
  • propr 0
  • CLIP 0
  • logratio 0
  • rna 0
  • sylph 0
  • snps 0
  • cut 0
  • dna 0
  • retrotransposon 0
  • preprocessing 0
  • fgbio 0
  • pypgx 0
  • genome assembler 0
  • ganon 0
  • HMM 0
  • phylogenetic placement 0
  • HiFi 0
  • happy 0
  • haplotypecaller 0
  • enrichment 0
  • transcriptomics 0
  • peak-calling 0
  • bedgraph 0
  • STR 0
  • public datasets 0
  • hic 0
  • deep learning 0
  • bedpe 0
  • compress 0
  • microsatellite 0
  • gsea 0
  • xeniumranger 0
  • miscoding lesions 0
  • palaeogenetics 0
  • archaeogenetics 0
  • paf 0
  • containment 0
  • bin 0
  • redundancy 0
  • bigwig 0
  • diamond 0
  • mtDNA 0
  • SV 0
  • telomere 0
  • genmod 0
  • ranking 0
  • fai 0
  • image 0
  • family 0
  • umitools 0
  • interval_list 0
  • chunk 0
  • bcl2fastq 0
  • clean 0
  • fungi 0
  • ATAC-seq 0
  • DNA sequencing 0
  • bgzip 0
  • abundance 0
  • BGC 0
  • targeted sequencing 0
  • hybrid capture sequencing 0
  • biosynthetic gene cluster 0
  • copy number alteration calling 0
  • malt 0
  • DNA sequence 0
  • ampir 0
  • add 0
  • resistance 0
  • ancestry 0
  • parsing 0
  • sample 0
  • normalization 0
  • union 0
  • skani 0
  • sequencing 0
  • tabix 0
  • uLTRA 0
  • krona 0
  • UMI 0
  • html 0
  • host 0
  • image_analysis 0
  • mcmicro 0
  • fastk 0
  • highly_multiplexed_imaging 0
  • bakta 0
  • bamtools 0
  • checkv 0
  • small indels 0
  • minimap2 0
  • adapter trimming 0
  • bacterial 0
  • duplication 0
  • polishing 0
  • remove 0
  • archiving 0
  • zip 0
  • quality trimming 0
  • unzip 0
  • pileup 0
  • uncompress 0
  • untar 0
  • benchmarking 0
  • scaffolding 0
  • typing 0
  • pangolin 0
  • panel 0
  • long_read 0
  • entrez 0
  • ataqv 0
  • fusion 0
  • khmer 0
  • informative sites 0
  • spaceranger 0
  • chimeras 0
  • popscle 0
  • genotype-based deconvoltion 0
  • observations 0
  • DRAMP 0
  • neubi 0
  • amplify 0
  • macrel 0
  • lossless 0
  • rna_structure 0
  • PacBio 0
  • RNA 0
  • ligate 0
  • virulence 0
  • kinship 0
  • identity 0
  • transcripts 0
  • genome assembly 0
  • relatedness 0
  • dist 0
  • score 0
  • shapeit 0
  • subsample 0
  • seqtk 0
  • krona chart 0
  • SNP 0
  • arriba 0
  • complement 0
  • reports 0
  • notebook 0
  • amplicon sequencing 0
  • wastewater 0
  • RNA-seq 0
  • dictionary 0
  • indel 0
  • eukaryotes 0
  • prokaryotes 0
  • survivor 0
  • spark 0
  • population genomics 0
  • cfDNA 0
  • genome mining 0
  • hidden Markov model 0
  • mask 0
  • ambient RNA removal 0
  • organelle 0
  • covid 0
  • dump 0
  • mapper 0
  • variant_calling 0
  • mkfastq 0
  • cellranger 0
  • combine 0
  • prefetch 0
  • comparisons 0
  • replace 0
  • amplicon sequences 0
  • prokka 0
  • C to T 0
  • das tool 0
  • das_tool 0
  • mlst 0
  • vrhyme 0
  • nucleotide 0
  • CRISPR 0
  • intervals 0
  • bwameth 0
  • cut up 0
  • aln 0
  • bracken 0
  • cool 0
  • somatic variants 0
  • mzml 0
  • repeat expansion 0
  • hi-c 0
  • bim 0
  • fam 0
  • gatk4spark 0
  • npz 0
  • guide tree 0
  • fcs-gx 0
  • insert 0
  • deeparg 0
  • proteome 0
  • gene expression 0
  • genomes 0
  • scores 0
  • lineage 0
  • regions 0
  • png 0
  • microbes 0
  • kraken 0
  • wig 0
  • structural_variants 0
  • pairsam 0
  • fingerprint 0
  • chip-seq 0
  • pan-genome 0
  • roh 0
  • PCA 0
  • atac-seq 0
  • converter 0
  • comparison 0
  • variation 0
  • hla_typing 0
  • hlala_typing 0
  • ancient dna 0
  • sequenzautils 0
  • Streptococcus pneumoniae 0
  • snpsift 0
  • nextclade 0
  • snpeff 0
  • ampgram 0
  • reformat 0
  • effect prediction 0
  • reformatting 0
  • instrain 0
  • SimpleAF 0
  • metamaps 0
  • lift 0
  • hla 0
  • genomad 0
  • ChIP-seq 0
  • leviosam2 0
  • ichorcna 0
  • hlala 0
  • de novo assembler 0
  • rrna 0
  • nucleotides 0
  • taxids 0
  • taxon name 0
  • FracMinHash sketch 0
  • rgfa 0
  • small variants 0
  • multiallelic 0
  • proportionality 0
  • regression 0
  • mitochondrion 0
  • registration 0
  • ped 0
  • cnvnator 0
  • gene set analysis 0
  • zlib 0
  • gstama 0
  • differential expression 0
  • read-group 0
  • functional analysis 0
  • GPU-accelerated 0
  • concordance 0
  • gene set 0
  • genetics 0
  • switch 0
  • haplogroups 0
  • small genome 0
  • trancriptome 0
  • shigella 0
  • signature 0
  • image_processing 0
  • tnhaplotyper2 0
  • graph layout 0
  • phase 0
  • interactions 0
  • tama 0
  • polish 0
  • iphop 0
  • pharokka 0
  • k-mer index 0
  • vg 0
  • bloom filter 0
  • minhash 0
  • cancer genomics 0
  • mash 0
  • purge duplications 0
  • library 0
  • rtgtools 0
  • preseq 0
  • bam2fq 0
  • adapter 0
  • import 0
  • collate 0
  • retrotransposons 0
  • long terminal repeat 0
  • dict 0
  • tree 0
  • COBS 0
  • lofreq 0
  • megan 0
  • runs_of_homozygosity 0
  • scaffold 0
  • contig 0
  • assembly evaluation 0
  • vcflib 0
  • junctions 0
  • GC content 0
  • k-mer frequency 0
  • resolve_bioscience 0
  • Duplication purging 0
  • spatial_transcriptomics 0
  • xz 0
  • archive 0
  • checksum 0
  • mudskipper 0
  • duplicate 0
  • transcriptomic 0
  • Read depth 0
  • long terminal retrotransposon 0
  • fixmate 0
  • maximum likelihood 0
  • msisensor-pro 0
  • subset 0
  • screen 0
  • bustools 0
  • standardization 0
  • salmonella 0
  • parallelized 0
  • tumor 0
  • micro-satellite-scan 0
  • orthology 0
  • krakentools 0
  • profiles 0
  • polyA_tail 0
  • rename 0
  • transformation 0
  • refine 0
  • svdb 0
  • orf 0
  • removal 0
  • bfiles 0
  • homoploymer 0
  • pair 0
  • serogroup 0
  • kma 0
  • soft-clipped clusters 0
  • taxon tables 0
  • otu tables 0
  • standardisation 0
  • standardise 0
  • msi 0
  • MSI 0
  • fusions 0
  • variant pruning 0
  • interactive 0
  • krakenuniq 0
  • taxonomic profile 0
  • instability 0
  • varcal 0
  • function 0
  • immunoprofiling 0
  • amptransformer 0
  • expansionhunterdenovo 0
  • Pharmacogenetics 0
  • fetch 0
  • GEO 0
  • metagenomic 0
  • identifier 0
  • frame-shift correction 0
  • long-read sequencing 0
  • repeat_expansions 0
  • genome bins 0
  • metadata 0
  • tab 0
  • sequence analysis 0
  • intersection 0
  • windows 0
  • emboss 0
  • pharmacogenetics 0
  • haplotypes 0
  • region 0
  • unaligned 0
  • reheader 0
  • eigenstrat 0
  • deconvolution 0
  • graft 0
  • trim 0
  • ome-tif 0
  • MCMICRO 0
  • realignment 0
  • mirdeep2 0
  • microbial 0
  • RNA sequencing 0
  • microscopy 0
  • scatter 0
  • bayesian 0
  • smrnaseq 0
  • concat 0
  • tbi 0
  • intersect 0
  • merge mate pairs 0
  • normalize 0
  • reads merging 0
  • short reads 0
  • norm 0
  • sizes 0
  • bases 0
  • interval list 0
  • cnv calling 0
  • decontamination 0
  • human removal 0
  • screening 0
  • cleaning 0
  • trgt 0
  • gem 0
  • split_kmers 0
  • calling 0
  • corrupted 0
  • CNV 0
  • correction 0
  • cvnkit 0
  • estimation 0
  • vdj 0
  • recombination 0
  • eCLIP 0
  • splice 0
  • parse 0
  • hostile 0
  • version 0
  • validate 0
  • BAM 0
  • samplesheet 0
  • format 0
  • doublets 0
  • eido 0
  • anndata 0
  • awk 0
  • blastp 0
  • blastn 0
  • settings 0
  • pigz 0
  • heatmap 0
  • spatial_omics 0
  • random forest 0
  • metagenomes 0
  • structural-variant calling 0
  • gene labels 0
  • fasterq-dump 0
  • find 0
  • sra-tools 0
  • xenograft 0
  • single cells 0
  • joint genotyping 0
  • allele 0
  • artic 0
  • gwas 0
  • antibiotics 0
  • RiPP 0
  • authentication 0
  • aggregate 0
  • secondary metabolites 0
  • demultiplexed reads 0
  • simulate 0
  • join 0
  • evidence 0
  • dereplicate 0
  • MaltExtract 0
  • antismash 0
  • HOPS 0
  • baf 0
  • NRPS 0
  • edit distance 0
  • filtermutectcalls 0
  • RNA-Seq 0
  • ragtag 0
  • orthologs 0
  • scanner 0
  • geo 0
  • helitron 0
  • mapad 0
  • adna 0
  • spatype 0
  • wavefront 0
  • c to t 0
  • unmarkduplicates 0
  • junction 0
  • mashmap 0
  • covariance models 0
  • proteus 0
  • remove samples 0
  • 16S 0
  • yahs 0
  • hmmscan 0
  • hhsuite 0
  • detecting svs 0
  • copy number analysis 0
  • hmmpress 0
  • copy-number 0
  • short-read sequencing 0
  • variantcalling 0
  • gender determination 0
  • wham 0
  • phylogenies 0
  • whamg 0
  • sccmec 0
  • streptococcus 0
  • copy number alterations 0
  • copy number variation 0
  • spa 0
  • signatures 0
  • readproteingroups 0
  • groupby 0
  • data-download 0
  • dnamodelapply 0
  • constant 0
  • invariant 0
  • dnascope 0
  • doublet 0
  • patterns 0
  • regex 0
  • SNPs 0
  • paired reads re-pairing 0
  • samples 0
  • tnscope 0
  • bgen 0
  • hwe 0
  • fix 0
  • qualty 0
  • chloroplast 0
  • confidence 0
  • malformed 0
  • blat 0
  • alr 0
  • readwriter 0
  • metabolite annotation 0
  • fracminhash sketch 0
  • ribosomal RNA 0
  • taxonomic composition 0
  • hash sketch 0
  • eigenvectors 0
  • trna 0
  • hicPCA 0
  • sliding 0
  • mzML 0
  • snakemake 0
  • workflow 0
  • genome annotation 0
  • workflow_mode 0
  • prepare 0
  • catpack 0
  • mobile genetic elements 0
  • rRNA 0
  • integron 0
  • Computational Immunology 0
  • Bioinformatics Tools 0
  • metaspace 0
  • Immune Deconvolution 0
  • all versus all 0
  • inbreeding 0
  • melon 0
  • disomy 0
  • pca 0
  • dream 0
  • md 0
  • nm 0
  • plink2_pca 0
  • coding 0
  • upd 0
  • uq 0
  • uniparental 0
  • snv 0
  • files 0
  • downsample 0
  • downsample bam 0
  • subsample bam 0
  • eucaryotes 0
  • vcf2db 0
  • short 0
  • gemini 0
  • maf 0
  • lua 0
  • toml 0
  • pruning 0
  • cds 0
  • Mycobacterium tuberculosis 0
  • bigbed 0
  • heterozygous genotypes 0
  • genepred 0
  • refflat 0
  • gtftogenepred 0
  • ucsc/liftover 0
  • covariance model 0
  • dereplication 0
  • microbial genomics 0
  • umicollapse 0
  • drep 0
  • variancepartition 0
  • scRNA-Seq 0
  • homozygous genotypes 0
  • agat 0
  • longest 0
  • bedgraphtobigwig 0
  • f coefficient 0
  • isoform 0
  • sequencing adapters 0
  • transcroder 0
  • linkage equilibrium 0
  • chromosomal rearrangements 0
  • bgen file 0
  • svtk/baftest 0
  • vsearch/sort 0
  • vcf2bed 0
  • extractunbinned 0
  • Indel 0
  • host removal 0
  • rdtest 0
  • linkbins 0
  • haploype 0
  • impute 0
  • sintax 0
  • reference compression 0
  • SNV 0
  • usearch 0
  • rdtest2vcf 0
  • long read alignment 0
  • reference panel 0
  • SINE 0
  • bedtobigbed 0
  • baftest 0
  • pangenome-scale 0
  • plant 0
  • decompress 0
  • shuffleBed 0
  • vcf file 0
  • uniq 0
  • genotype dosages 0
  • assembly polishing 0
  • genome polishing 0
  • bedcov 0
  • comp 0
  • fast5 0
  • masking 0
  • vcfbreakmulti 0
  • low-complexity 0
  • GFF/GTF 0
  • deduplicate 0
  • graph projection to vcf 0
  • trio binning 0
  • VCFtools 0
  • verifybamid 0
  • wget 0
  • polya tail 0
  • DNA contamination estimation 0
  • tandem repeats 0
  • construct 0
  • long read 0
  • network 0
  • intron 0
  • peak picking 0
  • partitioning 0
  • Illumina 0
  • clahe 0
  • refresh 0
  • java 0
  • rank 0
  • hashing-based deconvolution 0
  • association 0
  • tag2tag 0
  • GWAS 0
  • tags 0
  • impute-info 0
  • functional 0
  • uniques 0
  • xml 0
  • case/control 0
  • drug categorization 0
  • Read report 0
  • Read trimming 0
  • Read filters 0
  • associations 0
  • spatial_neighborhoods 0
  • scimap 0
  • Bayesian 0
  • structural-variants 0
  • omics 0
  • biological activity 0
  • script 0
  • svg 0
  • prior knowledge 0
  • staging 0
  • search engine 0
  • mass_error 0
  • multiqc 0
  • distance-based 0
  • nucleotide sequence 0
  • homologs 0
  • multi-tool 0
  • predict 0
  • microRNA 0
  • Staging 0
  • hardy-weinberg 0
  • haplotag 0
  • machine_learning 0
  • hwe statistics 0
  • hwe equilibrium 0
  • reference-independent 0
  • genotype likelihood 0
  • liftover 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • cell_type_identification 0
  • standard 0
  • cell_phenotyping 0
  • nanoq 0
  • tag 0
  • minimum_evolution 0
  • cellsnp 0
  • bwamem2 0
  • guidetree 0
  • translation 0
  • paired reads merging 0
  • Pacbio 0
  • AC/NS/AF 0
  • overlap-based merging 0
  • check 0
  • vcflib/vcffixup 0
  • trimfq 0
  • hamming-distance 0
  • donor deconvolution 0
  • grabix 0
  • genotype-based demultiplexing 0
  • lexogen 0
  • hashing-based deconvoltion 0
  • gnu 0
  • coreutils 0
  • generic 0
  • transposable element 0
  • droplet based single cells 0
  • busco 0
  • InterProScan 0
  • retrieval 0
  • MMseqs2 0
  • bwameme 0
  • ribosomal 0
  • cell_barcodes 0
  • realign 0
  • redundant 0
  • mygene 0
  • go 0
  • extraction 0
  • featuretable 0
  • mass spectrometry 0
  • pile up 0
  • sage 0
  • orthogroup 0
  • spot 0
  • circular 0
  • quality check 0
  • functional enrichment 0
  • size 0
  • cram-size 0
  • selector 0
  • paraphase 0
  • transcription factors 0
  • regulatory network 0
  • nanopore sequencing 0
  • rna velocity 0
  • cobra 0
  • extension 0
  • grea 0
  • 10x 0
  • poolseq 0
  • phylogenetics 0
  • chip 0
  • gost 0
  • tnfilter 0
  • scanpy 0
  • metagenome assembler 0
  • morphology 0
  • resegment 0
  • array_cgh 0
  • cytosure 0
  • relabel 0
  • cell segmentation 0
  • nuclear segmentation 0
  • gprofiler2 0
  • import segmentation 0
  • ancestral alleles 0
  • solo 0
  • scvi 0
  • rad 0
  • p-value 0
  • structural variant 0
  • bam2fastx 0
  • significance statistic 0
  • logFC 0
  • bam2fastq 0
  • immcantation 0
  • airrseq 0
  • subsetting 0
  • derived alleles 0
  • site frequency spectrum 0
  • immunoinformatics 0
  • reverse complement 0
  • updatedata 0
  • run 0
  • pdb 0
  • clr 0
  • boxcox 0
  • Escherichia coli 0
  • propd 0
  • Read coverage histogram 0
  • block substitutions 0
  • decomposeblocksub 0
  • identity-by-descent 0
  • simulation 0
  • plotting 0
  • hmmfetch 0
  • decompose 0
  • transmembrane 0
  • genome graph 0
  • tnseq 0
  • mgi 0
  • recovery 0
  • decoy 0
  • htseq 0
  • sompy 0
  • leafcutter 0
  • regtools 0
  • barcodes 0
  • co-orthology 0
  • variant-calling 0
  • shift 0
  • jvarkit 0
  • resfinder 0
  • resistance genes 0
  • raw 0
  • setgt 0
  • mgf 0
  • parquet 0
  • parser 0
  • dbsnp 0
  • standardize 0
  • ATACshift 0
  • ATACseq 0
  • translate 0
  • fastqfilter 0
  • quarto 0
  • python 0
  • r 0
  • coexpression 0
  • vsearch/fastqfilter 0
  • correlation 0
  • corpcor 0
  • vsearch/dereplicate 0
  • assay 0
  • telseq 0
  • stardist 0
  • plastid 0
  • tar 0
  • homology 0
  • elprep 0
  • doublet_detection 0
  • sequence similarity 0
  • spectral clustering 0
  • comparative genomics 0
  • deep variant 0
  • mutect 0
  • idx 0
  • quality_control 0
  • emoji 0
  • source tracking 0
  • controlstatistics 0
  • elfasta 0
  • parallel 0
  • transform 0
  • nucleotide content 0
  • gaps 0
  • AT content 0
  • introns 0
  • nucBed 0
  • bclconvert 0
  • install 0
  • joint-genotyping 0
  • genotypegvcf 0
  • targz 0
  • tarball 0
  • vector 0
  • metaphlan 0
  • predictions 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • UShER 0
  • bootstrapping 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • embl 0
  • Haplotypes 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Imputation 0
  • Sample 0
  • TAMA 0
  • low coverage 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • gfastats 0
  • gene model 0
  • gstama/merge 0
  • Salmonella Typhi 0
  • extractvariants 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • abricate 0
  • extract_variants 0
  • gstama/polyacleanup 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • GTDB taxonomy 0
  • Mykrobe 0
  • repeat content 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • shiftfasta 0
  • site depth 0
  • genome heterozygosity 0
  • txt 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • file parsing 0
  • gawk 0
  • splitcram 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • svannotate 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • mitochondrial 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • element 0
  • biallelic 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • update header 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • atlas 0
  • mkarv 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • Segmentation 0
  • cutesv 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • cooler/balance 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • beagle 0
  • Haemophilus influenzae 0
  • dbnsfp 0
  • genomic intervals 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • cutoff 0
  • panel of normals 0
  • normal database 0
  • intervals coverage 0
  • Haplotype purging 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • jpg 0
  • pretext 0
  • contact 0
  • assembly curation 0
  • False duplications 0
  • pmdtools 0
  • bamstat 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • read_pairs 0
  • experiment 0
  • strandedness 0
  • R 0
  • Assembly curation 0
  • rhocall 0
  • long uncorrected reads 0
  • subsampling 0
  • neighbour-joining 0
  • quast 0
  • purging 0
  • porechop_abi 0
  • variant genetic 0
  • mapping-based 0
  • liftovervcf 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • sortvcf 0
  • picard/renamesampleinvcf 0
  • pcr 0
  • mate-pair 0
  • GRO-cap 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • identification 0
  • prophage 0
  • phantom peaks 0
  • CoPRO 0
  • PRO-cap 0
  • scoring 0
  • exclude 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • indep pairwise 0
  • indep 0
  • variant identifiers 0
  • genetic 0
  • CAGE 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • csRNA-seq 0
  • RAMPAGE 0
  • NETCAGE 0
  • sequence-based 0
  • integrity 0
  • motif 0
  • bam2seqz 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • sex determination 0
  • induce 0
  • gc_wiggle 0
  • freqsum 0
  • de-novo 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • selection 0
  • seq 0
  • header 0
  • error 0
  • longread 0
  • sertotype 0
  • CRAM 0
  • snippy 0
  • core 0
  • sniffles 0
  • POA 0
  • SMN2 0
  • SMN1 0
  • sliding window 0
  • sha256 0
  • features 0
  • density 0
  • boxplot 0
  • exploratory 0
  • shinyngs 0
  • 256 bit 0
  • interleave 0
  • sequence headers 0
  • rtg 0
  • multimapper 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • duplicate marking 0
  • sambamba 0
  • flagstat 0
  • Ancestor 0
  • insert size 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • rtg-tools 0
  • rocplot 0
  • pedfilter 0
  • faidx 0
  • repair 0
  • grep 0
  • chromatin 0
  • subseq 0
  • variant recalibration 0
  • VQSR 0
  • applyvarcal 0
  • assembly-binning 0
  • seacr 0
  • cut&run 0
  • paired 0
  • cut&tag 0
  • peak-caller 0
  • clusteridentifier 0
  • cluster analysis 0
  • scramble 0
  • readgroup 0
  • read pairs 0
  • ChIP-Seq 0
  • pedigrees 0
  • haplotype resolution 0
  • legionella 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • pneumophila 0
  • clinical 0
  • collapsing 0
  • AMP 0
  • adapter removal 0
  • train 0
  • spliced 0
  • reorder 0
  • combining 0
  • kofamscan 0
  • qualities 0
  • peptide prediction 0
  • pneumoniae 0
  • estimate 0
  • representations 0
  • reduced 0
  • mash/sketch 0
  • taxonomic assignment 0
  • damage patterns 0
  • functional genomics 0
  • NGS 0
  • DNA damage 0
  • rra 0
  • maximum-likelihood 0
  • CRISPR-Cas9 0
  • sgRNA 0
  • kegg 0
  • Klebsiella 0
  • mcr-1 0
  • pos 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • panel_of_normals 0
  • haemophilus 0
  • annotations 0
  • multicut 0
  • hmtnote 0
  • Hidden Markov Model 0
  • amino acid 0
  • HMMER 0
  • genome browser 0
  • pixel classification 0
  • effective genome size 0
  • Jupyter 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • kallisto/index 0
  • papermill 0
  • jupytext 0
  • Python 0
  • pixel_classification 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • interproscan 0
  • probability_maps 0
  • mass-spectroscopy 0
  • MD5 0
  • read 0
  • combine graphs 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph formats 0
  • graph unchopping 0
  • graph stats 0
  • odgi 0
  • HLA-I 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • gender 0
  • Neisseria gonorrhoeae 0
  • ngm 0
  • ILP 0
  • block-compressed 0
  • sequencing summary 0
  • paragraph 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbmerge 0
  • pbbam 0
  • graphs 0
  • select 0
  • PCR/optical duplicates 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • ligation junctions 0
  • upper-triangular matrix 0
  • flip 0
  • NextGenMap 0
  • mobile element insertions 0
  • 128 bit 0
  • contour map 0
  • mbias 0
  • methylation bias 0
  • unionsum 0
  • ploidy 0
  • smudgeplot 0
  • Merqury 0
  • 3D heat map 0
  • de Bruijn 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • debruijn 0
  • denovo 0
  • megahit 0
  • assembler 0
  • microrna 0
  • somatic structural variations 0
  • mitochondrial to nuclear ratio 0
  • cancer genome 0
  • contaminant 0
  • SNP table 0
  • GATK UnifiedGenotyper 0
  • Beautiful stand-alone HTML report 0
  • bioinformatics tools 0
  • ratio 0
  • target prediction 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • otu table 0
  • mosdepth 0
  • reference genome 0
  • mitochondrial genome 0
  • patch 0

Calculates base frequency statistics across reference positions from BAM.

0123

depth_sample depth_global qs pos counts icounts versions

angsd:

ANGSD: Analysis of next generation Sequencing Data

Estimate the post-mortem damage patterns of DNA

012300

empiric exponential counts table versions

atlas:

ATLAS, a suite of methods to accurately genotype and estimate genetic diversity

Determine the allelic profiles of a genome using a pre-defined schema

0101

stats contigs_info alleles log paralogous_counts paralogous_loci cds_coordinates invalid_cds loci_summary_stats versions

chewbbaca:

A complete suite for gene-by-gene schema creation and strain identification.

runs a differential expression analysis with DESeq2

01230120101

results dispersion_plot rdata size_factors normalised_counts rlog_counts vst_counts model session_info versions

deseq2:

Differential gene expression analysis based on the negative binomial distribution

A program that counts sequence occurrences in FASTQ files.

0101

count_matrix stats distribution_plot reads_plot reads_plot_percentage versions

2FAST2Q:

2FAST2Q is ideal for CRISPRi-Seq, and for extracting and counting any kind of information from reads in the fastq format, such as barcodes in Bar-seq experiments. 2FAST2Q can work with sequence mismatches, Phred-score, and be used to find and extract unknown sequences delimited by known sequences. 2FAST2Q can extract multiple features per read using either fixed positions or delimiting search sequences.

Collapses identical sequences in a FASTQ/A file into a single sequence (while maintaining reads counts)

01

fasta versions

fastx:

A collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing

Collects a suite of metrics to QC duplex sequencing data.

010

family_sizes duplex_family_sizes duplex_yield_metrics umi_counts duplex_qc duplex_umi_counts versions

fgbio:

A set of tools for working with genomic and high throughput sequencing data, including UMIs

r-ggplot2:

ggplot2 is a system for declaratively creating graphics, based on The Grammar of Graphics.

Calculates the allele-specific read counts for allele-specific expression analysis of RNAseq data

012340101010

csv versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Collects read counts at specified intervals. The count for each interval is calculated by counting the number of read starts that lie in the interval.

0123010101

hdf5 tsv versions

gatk4:

Genome Analysis Toolkit (GATK4). Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Creates a panel of normals (PoN) for read-count denoising given the read counts for samples in the panel.

01

pon versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Denoises read counts to produce denoised copy ratios

0101

standardized denoised versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Determines the baseline contig ploidy for germline samples given counts data

0123010

calls model versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Calls copy-number variants in germline samples given their counts and the output of DetermineGermlineContigPloidy.

01234

cohortcalls cohortmodel casecalls versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Summarizes counts of reads that support reference, alternate and other alleles for given sites. Results can be used with CalculateContamination. Requires a common germline variant sites file, such as from gnomAD.

012301010100

table versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Uses f1r2 counts collected during mutect2 to Learn the prior probability of read orientation artifacts

01

artifactprior versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

gcCounter function from HMMcopy utilities, used to generate GC content in non-overlapping windows from a fasta reference

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

Perl script (generateMap.pl) generates the mappability of a genome given a certain size of reads, for input to hmmcopy mapcounter. Takes a very long time on large genomes, is not parallelised at all.

01

bigwig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

mapCounter function from HMMcopy utilities, used to generate mappability in non-overlapping windows from a bigwig file

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

readCounter function from HMMcopy utilities, used to generate read in windows

012

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

Quantification of transposable elements expression in scRNA-seq

0100

versions results counts log tmp

Creates a histogram of the number of distinct k-mers having a given frequency.

01

hist json png ps pdf jellyfish_hash versions

kat:

KAT is a suite of tools that analyse jellyfish hashes or sequence files (fasta or fastq) using kmer counts

Classifies metagenomic sequence data using unique k-mer counts

012000000

classified_reads unclassified_reads classified_assignment report versions

krakenuniq:

Metagenomics classifier with unique k-mer counting for more specific results

lima - The PacBio Barcode Demultiplexer and Primer Remover

010

counts report summary versions bam pbi fasta fastagz fastq fastqgz xml json clips guess

runs a differential expression analysis with Limma

0123012

results md_plot rdata model session_info normalised_counts versions

limma:

Linear Models for Microarray Data

MaxBin is a software that is capable of clustering metagenomic contigs

0123

binned_fastas summary abundance log marker_counts unbinned_fasta tooshort_fasta marker_bins marker_genes versions

Compare k-mer frequency in reads and assembly to devise the metrics K and QV

0101000

hist log_stderr versions

merfin:

Merfin (k-mer based finishing tool) is a suite of subtools to variant filtering, assembly evaluation and polishing via k-mer validation. The subtool -hist estimates the QV (quality value of Merqury) for each scaffold/contig and genome-wide averages. In addition, Merfin produces a QV* estimate, which accounts also for kmers that are seen in excess with respect to their expected multiplicity predicted from the reads.

mirtop counts generates a file with the minimal information about each sequence and the count data in columns for each samples.

0101012

tsv versions

mirtop:

Small RNA-seq annotation

NACHO (NAnostring quality Control dasHbOard) is developed for NanoString nCounter data. NanoString nCounter data is a messenger-RNA/micro-RNA (mRNA/miRNA) expression assay and works with fluorescent barcodes. Each barcode is assigned a mRNA/miRNA, which can be counted after bonding with its target. As a result each count of a specific barcode represents the presence of its target mRNA/miRNA.

0101

normalized_counts normalized_counts_wo_HK versions

NACHO:

R package that uses two main functions to summarize and visualize NanoString RCC files, namely: load_rcc() and visualise(). It also includes a function normalise(), which (re)calculates sample specific size factors and normalises the data. For more information vignette("NACHO") and vignette("NACHO-analysis")

Evaluate alignment data

010

results versions

qualimap:

Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

Evaluate alignment data

012000

results versions

qualimap:

Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

Evaluate alignment data

0101

results versions

qualimap:

Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.

Calculate expression with RSEM

010

counts_gene counts_transcript stat logs versions bam_star bam_genome bam_transcript

rseqc:

RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome

gene/transcript quantification with Salmon

0100000

results json_info lib_format_counts versions

salmon:

Salmon is a tool for wicked-fast transcript quantification from RNA-seq data

Counts the number of alignments in a BAM/CRAM/SAM file for each FLAG type

012

flagstat versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Reports alignment summary statistics for a BAM/CRAM/SAM file

012

idxstats versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Produces comprehensive statistics from SAM/BAM/CRAM file

01201

stats versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Accelerated implementation of the GATK DepthOfCoverage tool.

01201010101

per_locus sample_summary statistics coverage_counts coverage_proportions interval_summary versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Create a counts matrix for single-cell data using STARSolo, handling cell barcodes and UMI information.

012001

counts log_final log_out log_progress summary versions

Count reads that map to genomic features

012

counts summary versions

featurecounts:

featureCounts is a highly efficient general-purpose read summarization program that counts mapped reads for genomic features such as genes, exons, promoter, gene bodies, genomic bins and chromosomal locations. It can be used to count both RNA-seq and genomic DNA-seq reads.

Count the instances of each SVTYPE observed in each sample in a VCF.

01

counts versions

svtk:

Utilities for consolidating, filtering, resolving, and annotating structural variants.

Import transcript-level abundances and estimated counts for gene-level analysis packages

01010

tpm_gene counts_gene counts_gene_length_scaled counts_gene_scaled lengths_gene tpm_transcript counts_transcript lengths_transcript versions

tximeta:

Transcript Quantification Import with Automatic Metadata

A program to generate frequency counts of repetitive units.

01

counts versions

windowmasker:

A program to mask highly repetitive and low complexity DNA sequences within a genome.

A program to take a counts file and creates a file of genomic co-ordinates to be masked.

0101

intervals versions

windowmasker:

A program to mask highly repetitive and low complexity DNA sequences within a genome.

Click here to trigger an update.