Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • fastq 125
  • fasta 45
  • bam 25
  • metagenomics 23
  • reference 17
  • map 17
  • classify 17
  • genomics 14
  • sam 14
  • alignment 13
  • align 13
  • taxonomic profiling 12
  • genome 10
  • trimming 9
  • demultiplex 8
  • qc 7
  • quality control 7
  • convert 7
  • illumina 7
  • index 6
  • filter 6
  • kmers 6
  • assembly 5
  • cram 5
  • merge 5
  • serotype 5
  • sequences 5
  • bwa 5
  • vsearch 5
  • FASTQ 5
  • fastx 5
  • classification 4
  • nanopore 4
  • db 4
  • reads 4
  • mem 4
  • adapters 4
  • bcl2fastq 4
  • sort 3
  • k-mer 3
  • taxonomy 3
  • pacbio 3
  • clustering 3
  • count 3
  • single-cell 3
  • filtering 3
  • LAST 3
  • seqkit 3
  • lossless 3
  • seqtk 3
  • mkfastq 3
  • bed 2
  • gatk4 2
  • database 2
  • bacteria 2
  • coverage 2
  • statistics 2
  • download 2
  • split 2
  • consensus 2
  • compression 2
  • mapping 2
  • cluster 2
  • samtools 2
  • mappability 2
  • aligner 2
  • sequence 2
  • umi 2
  • population genetics 2
  • sketch 2
  • report 2
  • short-read 2
  • deduplication 2
  • merging 2
  • text 2
  • tabular 2
  • union 2
  • sequencing 2
  • resistance 2
  • preprocessing 2
  • subsample 2
  • amplicon sequences 2
  • cellranger 2
  • pileup 2
  • virulence 2
  • immunoprofiling 2
  • bam2fq 2
  • shigella 2
  • Streptococcus pneumoniae 2
  • simulate 2
  • unaligned 2
  • corrupted 2
  • variant 1
  • sentieon 1
  • ancient DNA 1
  • bisulfite 1
  • isoseq 1
  • build 1
  • kmer 1
  • bisulphite 1
  • methylseq 1
  • indexing 1
  • methylation 1
  • stats 1
  • antimicrobial resistance 1
  • 5mC 1
  • DNA methylation 1
  • WGBS 1
  • histogram 1
  • scWGBS 1
  • bisulfite sequencing 1
  • biscuit 1
  • damage 1
  • palaeogenomics 1
  • archaeogenomics 1
  • genotype 1
  • decompression 1
  • dedup 1
  • complexity 1
  • json 1
  • scRNA-seq 1
  • rna 1
  • plasmid 1
  • snp 1
  • mirna 1
  • duplicates 1
  • diversity 1
  • concatenate 1
  • deamination 1
  • cat 1
  • microbiome 1
  • 3-letter genome 1
  • profiling 1
  • visualization 1
  • kallisto 1
  • query 1
  • summary 1
  • redundancy 1
  • ganon 1
  • cut 1
  • clean 1
  • interval_list 1
  • bgzip 1
  • normalization 1
  • fusion 1
  • dna 1
  • ccs 1
  • sample 1
  • sylph 1
  • ngscheckmate 1
  • archaeogenetics 1
  • miscoding lesions 1
  • matching 1
  • palaeogenetics 1
  • paf 1
  • mapper 1
  • khmer 1
  • genome assembly 1
  • kraken 1
  • bamtools 1
  • aln 1
  • bwameth 1
  • prefetch 1
  • chip-seq 1
  • dump 1
  • RNA-seq 1
  • adapter trimming 1
  • remove 1
  • quality trimming 1
  • hi-c 1
  • complement 1
  • atac-seq 1
  • mash 1
  • tree 1
  • hlala_typing 1
  • hla_typing 1
  • hlala 1
  • hla 1
  • vdj 1
  • barcode 1
  • primer 1
  • pair 1
  • bustools 1
  • awk 1
  • salmonella 1
  • rename 1
  • de novo assembler 1
  • ancient dna 1
  • transformation 1
  • trim 1
  • removal 1
  • concat 1
  • artic 1
  • aggregate 1
  • demultiplexed reads 1
  • import 1
  • subset 1
  • smrnaseq 1
  • samplesheet 1
  • human removal 1
  • validate 1
  • decontamination 1
  • hostile 1
  • c to t 1
  • mapad 1
  • adna 1
  • recovery 1
  • vsearch/fastqfilter 1
  • fastqfilter 1
  • rRNA 1
  • Escherichia coli 1
  • mgi 1
  • ribosomal RNA 1
  • comp 1
  • vsearch/sort 1
  • emoji 1
  • umicollapse 1
  • quality_control 1
  • sequencing adapters 1
  • bclconvert 1
  • sintax 1
  • usearch 1
  • 10x 1
  • bwameme 1
  • host removal 1
  • collapse 1
  • seqfu 1
  • n50 1
  • bwamem2 1
  • CRISPRi 1
  • hamming-distance 1
  • paired reads merging 1
  • overlap-based merging 1
  • trimfq 1
  • lexogen 1
  • bam2fastx 1
  • bam2fastq 1
  • fix 1
  • malformed 1
  • paired reads re-pairing 1
  • partitioning 1
  • transform 1
  • gaps 1
  • faqcs 1
  • rust 1
  • fq 1
  • lint 1
  • random 1
  • generate 1
  • genome statistics 1
  • genome manipulation 1
  • genome summary 1
  • gfastats 1
  • deduping 1
  • smaller fastqs 1
  • clumping fastqs 1
  • element 1
  • yaml 1
  • bamtools/convert 1
  • bioawk 1
  • adapterremoval 1
  • HLA 1
  • file manipulation 1
  • mkvdjref 1
  • duplicate removal 1
  • chromap 1
  • rtg 1
  • seq 1
  • sliding window 1
  • selection 1
  • header 1
  • interleave 1
  • sertotype 1
  • train 1
  • insertion 1
  • ngm 1
  • NextGenMap 1
  • sequencing summary 1
  • contaminant 1
  • otu table 1
  • vcf 0
  • annotation 0
  • structural variants 0
  • variant calling 0
  • gff 0
  • variants 0
  • gtf 0
  • cnv 0
  • MSA 0
  • contamination 0
  • gfa 0
  • somatic 0
  • conversion 0
  • proteomics 0
  • quality 0
  • binning 0
  • VCF 0
  • copy number 0
  • long reads 0
  • imputation 0
  • contigs 0
  • bedtools 0
  • phylogeny 0
  • rnaseq 0
  • graph 0
  • mags 0
  • reporting 0
  • variation graph 0
  • gvcf 0
  • bcftools 0
  • protein 0
  • sv 0
  • imaging 0
  • databases 0
  • table 0
  • picard 0
  • wgs 0
  • QC 0
  • visualisation 0
  • bqsr 0
  • cna 0
  • long-read 0
  • openms 0
  • depth 0
  • metrics 0
  • phage 0
  • plink2 0
  • taxonomic classification 0
  • tsv 0
  • haplotype 0
  • searching 0
  • structure 0
  • protein sequence 0
  • plot 0
  • bins 0
  • base quality score recalibration 0
  • aDNA 0
  • pangenome graph 0
  • repeat 0
  • neural network 0
  • pairs 0
  • example 0
  • markduplicates 0
  • expression 0
  • matrix 0
  • amr 0
  • machine learning 0
  • cooler 0
  • gzip 0
  • transcriptome 0
  • mmseqs2 0
  • low-coverage 0
  • annotate 0
  • iCLIP 0
  • virus 0
  • validation 0
  • bcf 0
  • phasing 0
  • completeness 0
  • metagenome 0
  • checkm 0
  • germline 0
  • gene 0
  • transcript 0
  • peaks 0
  • msa 0
  • evaluation 0
  • bismark 0
  • kraken2 0
  • ucsc 0
  • prediction 0
  • blast 0
  • hmmsearch 0
  • hmmer 0
  • genotyping 0
  • spatial 0
  • glimpse 0
  • mag 0
  • mkref 0
  • newick 0
  • ncbi 0
  • segmentation 0
  • gff3 0
  • feature 0
  • prokaryote 0
  • bedGraph 0
  • splicing 0
  • pangenome 0
  • multiple sequence alignment 0
  • cnvkit 0
  • single 0
  • tumor-only 0
  • antimicrobial peptides 0
  • csv 0
  • NCBI 0
  • antimicrobial resistance genes 0
  • mitochondria 0
  • profile 0
  • low frequency variant calling 0
  • differential 0
  • demultiplexing 0
  • extract 0
  • clipping 0
  • ptr 0
  • wxs 0
  • arg 0
  • HMM 0
  • reference-free 0
  • benchmark 0
  • sourmash 0
  • indels 0
  • detection 0
  • coptr 0
  • compare 0
  • de novo 0
  • single cell 0
  • mpileup 0
  • antibiotic resistance 0
  • idXML 0
  • gridss 0
  • isolates 0
  • interval 0
  • mutect2 0
  • de novo assembly 0
  • structural 0
  • distance 0
  • MAF 0
  • amps 0
  • riboseq 0
  • svtk 0
  • fragment 0
  • ont 0
  • call 0
  • counts 0
  • view 0
  • fgbio 0
  • add 0
  • propr 0
  • haplotypecaller 0
  • malt 0
  • gsea 0
  • STR 0
  • compress 0
  • parsing 0
  • microarray 0
  • hic 0
  • family 0
  • phylogenetic placement 0
  • bedpe 0
  • bedgraph 0
  • ranking 0
  • logratio 0
  • genome assembler 0
  • transcriptomics 0
  • CLIP 0
  • read depth 0
  • genmod 0
  • circrna 0
  • pypgx 0
  • peak-calling 0
  • ampir 0
  • enrichment 0
  • isomir 0
  • microsatellite 0
  • umitools 0
  • DNA sequencing 0
  • abundance 0
  • DNA sequence 0
  • quantification 0
  • mtDNA 0
  • snps 0
  • ATAC-seq 0
  • targeted sequencing 0
  • hybrid capture sequencing 0
  • bin 0
  • chunk 0
  • copy number alteration calling 0
  • xeniumranger 0
  • retrotransposon 0
  • containment 0
  • chromosome 0
  • bigwig 0
  • diamond 0
  • fai 0
  • telomere 0
  • SV 0
  • ancestry 0
  • happy 0
  • deep learning 0
  • image 0
  • nucleotide 0
  • fungi 0
  • public datasets 0
  • HiFi 0
  • skani 0
  • BGC 0
  • biosynthetic gene cluster 0
  • hmmcopy 0
  • gatk4spark 0
  • somatic variants 0
  • dist 0
  • SNP 0
  • comparison 0
  • bacterial 0
  • mzml 0
  • identity 0
  • typing 0
  • pairsam 0
  • relatedness 0
  • entrez 0
  • fastk 0
  • structural_variants 0
  • pan-genome 0
  • pangolin 0
  • spaceranger 0
  • lineage 0
  • anndata 0
  • covid 0
  • UMI 0
  • observations 0
  • survivor 0
  • panel 0
  • wastewater 0
  • benchmarking 0
  • bim 0
  • duplication 0
  • PacBio 0
  • fam 0
  • rsem 0
  • mask 0
  • hidden Markov model 0
  • cfDNA 0
  • polishing 0
  • population genomics 0
  • vrhyme 0
  • scaffold 0
  • amplicon sequencing 0
  • mlst 0
  • notebook 0
  • reports 0
  • prokka 0
  • krona chart 0
  • pseudoalignment 0
  • transposons 0
  • windowmasker 0
  • npz 0
  • krona 0
  • html 0
  • small indels 0
  • popscle 0
  • genotype-based deconvoltion 0
  • indel 0
  • kinship 0
  • shapeit 0
  • spark 0
  • miRNA 0
  • tabix 0
  • dictionary 0
  • ambient RNA removal 0
  • informative sites 0
  • rna_structure 0
  • RNA 0
  • fusions 0
  • replace 0
  • score 0
  • scaffolding 0
  • transcripts 0
  • uLTRA 0
  • insert 0
  • variant_calling 0
  • ligate 0
  • minimap2 0
  • long_read 0
  • guide tree 0
  • untar 0
  • uncompress 0
  • chimeras 0
  • unzip 0
  • zip 0
  • archiving 0
  • organelle 0
  • angsd 0
  • genome mining 0
  • cool 0
  • png 0
  • proteome 0
  • repeat expansion 0
  • bracken 0
  • cut up 0
  • das tool 0
  • das_tool 0
  • wig 0
  • prokaryotes 0
  • comparisons 0
  • ataqv 0
  • image_analysis 0
  • mcmicro 0
  • highly_multiplexed_imaging 0
  • CRISPR 0
  • arriba 0
  • eukaryotes 0
  • combine 0
  • bakta 0
  • intervals 0
  • host 0
  • converter 0
  • deeparg 0
  • C to T 0
  • roh 0
  • fingerprint 0
  • macrel 0
  • amplify 0
  • neubi 0
  • fcs-gx 0
  • scores 0
  • gene expression 0
  • regions 0
  • checkv 0
  • genomes 0
  • PCA 0
  • DRAMP 0
  • microbes 0
  • minhash 0
  • windows 0
  • immunoinformatics 0
  • intersect 0
  • norm 0
  • long terminal repeat 0
  • normalize 0
  • intersection 0
  • long terminal retrotransposon 0
  • kma 0
  • retrotransposons 0
  • checksum 0
  • scatter 0
  • megan 0
  • assembly evaluation 0
  • GC content 0
  • k-mer frequency 0
  • k-mer index 0
  • archive 0
  • lofreq 0
  • bloom filter 0
  • pharokka 0
  • reheader 0
  • xz 0
  • function 0
  • profiles 0
  • COBS 0
  • resolve_bioscience 0
  • spatial_transcriptomics 0
  • salmon 0
  • BAM 0
  • rna-seq 0
  • regression 0
  • reformat 0
  • haplotypes 0
  • functional analysis 0
  • mapcounter 0
  • haplogroups 0
  • interactions 0
  • taxids 0
  • ichorcna 0
  • taxon name 0
  • zlib 0
  • pigz 0
  • find 0
  • differential expression 0
  • trancriptome 0
  • tama 0
  • translation 0
  • amino acid 0
  • genetics 0
  • orf 0
  • region 0
  • interactive 0
  • krakenuniq 0
  • sizes 0
  • bases 0
  • homologs 0
  • krakentools 0
  • screen 0
  • metamaps 0
  • tbi 0
  • polyA_tail 0
  • blastn 0
  • refine 0
  • maximum likelihood 0
  • iphop 0
  • instrain 0
  • leviosam2 0
  • lift 0
  • homoploymer 0
  • deseq2 0
  • MSI 0
  • dict 0
  • varcal 0
  • MaltExtract 0
  • HOPS 0
  • authentication 0
  • soft-clipped clusters 0
  • edit distance 0
  • ragtag 0
  • qualty 0
  • samples 0
  • fixmate 0
  • collate 0
  • taxon tables 0
  • secondary metabolites 0
  • NRPS 0
  • RiPP 0
  • antibiotics 0
  • antismash 0
  • rtgtools 0
  • vcflib 0
  • junctions 0
  • vg 0
  • allele 0
  • FracMinHash sketch 0
  • join 0
  • signature 0
  • cancer genomics 0
  • snpsift 0
  • snpeff 0
  • effect prediction 0
  • small genome 0
  • gwas 0
  • otu tables 0
  • svdb 0
  • switch 0
  • standardization 0
  • sequenzautils 0
  • taxonomic profile 0
  • standardise 0
  • standardisation 0
  • runs_of_homozygosity 0
  • polish 0
  • instability 0
  • microscopy 0
  • nucleotides 0
  • GPU-accelerated 0
  • multiallelic 0
  • small variants 0
  • rgfa 0
  • tnhaplotyper2 0
  • gstama 0
  • reformatting 0
  • graph layout 0
  • nextclade 0
  • graft 0
  • orthology 0
  • parallelized 0
  • transcriptomic 0
  • mudskipper 0
  • msisensor-pro 0
  • micro-satellite-scan 0
  • tumor 0
  • msi 0
  • cnvnator 0
  • proportionality 0
  • RNA-Seq 0
  • preseq 0
  • contig 0
  • duplicate 0
  • Read depth 0
  • Duplication purging 0
  • purge duplications 0
  • library 0
  • adapter 0
  • ped 0
  • variant pruning 0
  • bfiles 0
  • SimpleAF 0
  • copyratios 0
  • image_processing 0
  • registration 0
  • mitochondrion 0
  • read-group 0
  • xenograft 0
  • rrna 0
  • serogroup 0
  • nacho 0
  • metagenomic 0
  • cgMLST 0
  • mass spectrometry 0
  • UMIs 0
  • version 0
  • orthologs 0
  • duplex 0
  • trgt 0
  • nanostring 0
  • fetch 0
  • GEO 0
  • sra-tools 0
  • fasterq-dump 0
  • identifier 0
  • sequence analysis 0
  • baf 0
  • pharmacogenetics 0
  • estimation 0
  • expansionhunterdenovo 0
  • repeat_expansions 0
  • cleaning 0
  • structural-variant calling 0
  • metadata 0
  • screening 0
  • tab 0
  • recombination 0
  • gem 0
  • metagenomes 0
  • eCLIP 0
  • WGS 0
  • long-read sequencing 0
  • doublets 0
  • mRNA 0
  • realignment 0
  • microbial 0
  • deconvolution 0
  • allele-specific 0
  • bayesian 0
  • interval list 0
  • RNA sequencing 0
  • filtermutectcalls 0
  • mirdeep2 0
  • MCMICRO 0
  • calling 0
  • ome-tif 0
  • Pharmacogenetics 0
  • cvnkit 0
  • split_kmers 0
  • evidence 0
  • repeats 0
  • panelofnormals 0
  • cnv calling 0
  • CNV 0
  • dereplicate 0
  • joint genotyping 0
  • gatk 0
  • merge mate pairs 0
  • reads merging 0
  • short reads 0
  • correction 0
  • frame-shift correction 0
  • splice 0
  • settings 0
  • random forest 0
  • amptransformer 0
  • gene set 0
  • gene set analysis 0
  • eigenstrat 0
  • variation 0
  • format 0
  • genome bins 0
  • blastp 0
  • phase 0
  • ChIP-seq 0
  • gene labels 0
  • genomad 0
  • single cells 0
  • emboss 0
  • parse 0
  • heatmap 0
  • ampgram 0
  • eido 0
  • spatial_omics 0
  • concordance 0
  • spatialdata 0
  • melon 0
  • proteus 0
  • plant 0
  • hash sketch 0
  • signatures 0
  • setgt 0
  • readproteingroups 0
  • metabolomics 0
  • cell segmentation 0
  • SINE 0
  • copy-number 0
  • jvarkit 0
  • remove samples 0
  • gender determination 0
  • scanner 0
  • copy number alterations 0
  • helitron 0
  • tar 0
  • unmarkduplicates 0
  • covariance models 0
  • translate 0
  • leafcutter 0
  • copy number analysis 0
  • trna 0
  • wham 0
  • fracminhash sketch 0
  • genome annotation 0
  • mobile genetic elements 0
  • tarball 0
  • copy number variation 0
  • yahs 0
  • geo 0
  • relabel 0
  • bedcov 0
  • genome polishing 0
  • assembly polishing 0
  • chloroplast 0
  • confidence 0
  • blat 0
  • alr 0
  • clr 0
  • boxcox 0
  • tnscope 0
  • vsearch/dereplicate 0
  • telseq 0
  • stardist 0
  • propd 0
  • Read coverage histogram 0
  • immunology 0
  • BCR 0
  • bgen 0
  • groupby 0
  • eigenvectors 0
  • secondary structure 0
  • network 0
  • resegment 0
  • wget 0
  • wavefront 0
  • hicPCA 0
  • sliding 0
  • snakemake 0
  • workflow 0
  • morphology 0
  • ATACseq 0
  • workflow_mode 0
  • ATACshift 0
  • createreadcountpanelofnormals 0
  • shift 0
  • denoisereadcounts 0
  • readwriter 0
  • dnamodelapply 0
  • dnascope 0
  • whamg 0
  • mashmap 0
  • source tracking 0
  • decompress 0
  • vcf2bed 0
  • significance statistic 0
  • scanpy 0
  • rdtest 0
  • hwe 0
  • data-download 0
  • scRNA-Seq 0
  • gtftogenepred 0
  • controlstatistics 0
  • rdtest2vcf 0
  • countsvtypes 0
  • p-value 0
  • scvi 0
  • elprep 0
  • files 0
  • baftest 0
  • elfasta 0
  • ucsc/liftover 0
  • refflat 0
  • upd 0
  • eucaryotes 0
  • doublet_detection 0
  • subsetting 0
  • fast5 0
  • references 0
  • modelsegments 0
  • polya tail 0
  • Mycobacterium tuberculosis 0
  • metagenome assembler 0
  • chromosomal rearrangements 0
  • coding 0
  • genepred 0
  • missingness 0
  • cds 0
  • transcroder 0
  • patch 0
  • logFC 0
  • bedgraphtobigwig 0
  • bigbed 0
  • bedtobigbed 0
  • nucleotide content 0
  • uniparental 0
  • all versus all 0
  • spa 0
  • graph projection to vcf 0
  • nucBed 0
  • long-reads 0
  • plotting 0
  • variantcalling 0
  • sccmec 0
  • streptococcus 0
  • extractunbinned 0
  • linkbins 0
  • integron 0
  • targz 0
  • iterative model refinement 0
  • nuclear segmentation 0
  • spatype 0
  • barcodes 0
  • long read alignment 0
  • pangenome-scale 0
  • regtools 0
  • construct 0
  • DNA contamination estimation 0
  • disomy 0
  • metabolite annotation 0
  • snv 0
  • downsample 0
  • svtk/baftest 0
  • downsample bam 0
  • subsample bam 0
  • vcf2db 0
  • AT content 0
  • gemini 0
  • maf 0
  • lua 0
  • detecting svs 0
  • toml 0
  • solo 0
  • import segmentation 0
  • short-read sequencing 0
  • vcfbreakmulti 0
  • uniq 0
  • deduplicate 0
  • VCFtools 0
  • verifybamid 0
  • metaspace 0
  • check 0
  • decoy 0
  • genotype dosages 0
  • impute 0
  • hwe statistics 0
  • ribosomal 0
  • grabix 0
  • SNV 0
  • hwe equilibrium 0
  • reference-independent 0
  • Indel 0
  • haploype 0
  • genotype likelihood 0
  • patterns 0
  • liftover 0
  • probabilistic realignment 0
  • guidetree 0
  • Pacbio 0
  • doublet 0
  • cell_type_identification 0
  • cell_phenotyping 0
  • machine_learning 0
  • hardy-weinberg 0
  • regex 0
  • AC/NS/AF 0
  • distance-based 0
  • circular 0
  • python 0
  • r 0
  • realign 0
  • quality check 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • low-complexity 0
  • assay 0
  • phylogenetics 0
  • minimum_evolution 0
  • nucleotide sequence 0
  • shuffleBed 0
  • GFF/GTF 0
  • size 0
  • trio binning 0
  • cram-size 0
  • selector 0
  • paraphase 0
  • transcription factors 0
  • regulatory network 0
  • tandem repeats 0
  • multi-tool 0
  • long read 0
  • predict 0
  • reference compression 0
  • vcflib/vcffixup 0
  • spot 0
  • nanopore sequencing 0
  • cell_barcodes 0
  • hhsuite 0
  • 16S 0
  • mygene 0
  • go 0
  • retrieval 0
  • pile up 0
  • catpack 0
  • prepare 0
  • transposable element 0
  • generic 0
  • hmmpress 0
  • coreutils 0
  • rna velocity 0
  • cobra 0
  • gnu 0
  • extension 0
  • grea 0
  • hashing-based deconvoltion 0
  • functional enrichment 0
  • taxonomic composition 0
  • tag 0
  • Computational Immunology 0
  • omics 0
  • clahe 0
  • refresh 0
  • association 0
  • GWAS 0
  • case/control 0
  • associations 0
  • reference panel 0
  • spatial_neighborhoods 0
  • scimap 0
  • cellsnp 0
  • Bayesian 0
  • structural-variants 0
  • donor deconvolution 0
  • hmmscan 0
  • genotype-based demultiplexing 0
  • biological activity 0
  • droplet based single cells 0
  • junction 0
  • Immune Deconvolution 0
  • Bioinformatics Tools 0
  • prior knowledge 0
  • phylogenies 0
  • busco 0
  • InterProScan 0
  • MMseqs2 0
  • masking 0
  • quarto 0
  • variant-calling 0
  • staging 0
  • derived alleles 0
  • tnfilter 0
  • heterozygous genotypes 0
  • inbreeding 0
  • array_cgh 0
  • cytosure 0
  • Staging 0
  • vector 0
  • gprofiler2 0
  • gost 0
  • rad 0
  • block substitutions 0
  • covariance model 0
  • haplotag 0
  • standard 0
  • svg 0
  • structural variant 0
  • xml 0
  • run 0
  • script 0
  • dereplication 0
  • java 0
  • pdb 0
  • ancestral alleles 0
  • immcantation 0
  • mass_error 0
  • reverse complement 0
  • vcf file 0
  • poolseq 0
  • bgen file 0
  • plink2_pca 0
  • search engine 0
  • simulation 0
  • hmmfetch 0
  • decompose 0
  • identity-by-descent 0
  • decomposeblocksub 0
  • transmembrane 0
  • genome graph 0
  • site frequency spectrum 0
  • pca 0
  • tnseq 0
  • multiqc 0
  • mzML 0
  • pruning 0
  • htseq 0
  • linkage equilibrium 0
  • sompy 0
  • f coefficient 0
  • peak picking 0
  • homozygous genotypes 0
  • microRNA 0
  • rank 0
  • airrseq 0
  • orthogroup 0
  • uq 0
  • isoform 0
  • joint-genotyping 0
  • variancepartition 0
  • genotypegvcf 0
  • dream 0
  • redundant 0
  • extraction 0
  • featuretable 0
  • md 0
  • nm 0
  • parallel 0
  • plastid 0
  • short 0
  • sage 0
  • resfinder 0
  • resistance genes 0
  • raw 0
  • mgf 0
  • parquet 0
  • intron 0
  • parser 0
  • dbsnp 0
  • standardize 0
  • install 0
  • nanoq 0
  • hashing-based deconvolution 0
  • deep variant 0
  • co-orthology 0
  • updatedata 0
  • homology 0
  • microbial genomics 0
  • chip 0
  • tag2tag 0
  • sequence similarity 0
  • spectral clustering 0
  • tags 0
  • comparative genomics 0
  • functional 0
  • Illumina 0
  • Read filters 0
  • uniques 0
  • mutect 0
  • idx 0
  • drep 0
  • drug categorization 0
  • Read report 0
  • agat 0
  • Read trimming 0
  • introns 0
  • longest 0
  • impute-info 0
  • assembler 0
  • constant 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • UShER 0
  • bootstrapping 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • ubam 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Imputation 0
  • gene model 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Haplotypes 0
  • gstama/merge 0
  • Sample 0
  • low coverage 0
  • gget 0
  • TAMA 0
  • gstama/polyacleanup 0
  • Mykrobe 0
  • abricate 0
  • beagle 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • extractvariants 0
  • GTDB taxonomy 0
  • extract_variants 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • Salmonella Typhi 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • repeat content 0
  • file parsing 0
  • genome heterozygosity 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • txt 0
  • splitcram 0
  • gawk 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • svannotate 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • Haemophilus influenzae 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • background_correction 0
  • biallelic 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • mouse 0
  • update header 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • atlas 0
  • mkarv 0
  • http(s) 0
  • unionBedGraphs 0
  • deletion 0
  • Segmentation 0
  • cutesv 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • cooler/balance 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • quality assurnce 0
  • mitochondrial 0
  • haplotype resolution 0
  • invariant 0
  • cutoff 0
  • False duplications 0
  • Haplotype purging 0
  • assembly curation 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • panel of normals 0
  • purging 0
  • normal database 0
  • genomic intervals 0
  • intervals coverage 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • Assembly curation 0
  • quast 0
  • pretext 0
  • read_pairs 0
  • integrity 0
  • mapping-based 0
  • sequence-based 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • experiment 0
  • neighbour-joining 0
  • strandedness 0
  • bamstat 0
  • R 0
  • rhocall 0
  • long uncorrected reads 0
  • subsampling 0
  • jpg 0
  • contact 0
  • pedfilter 0
  • sortvcf 0
  • PRO-cap 0
  • GRO-cap 0
  • CoPRO 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • picard/renamesampleinvcf 0
  • NETCAGE 0
  • pcr 0
  • liftovervcf 0
  • mate-pair 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • CAGE 0
  • RAMPAGE 0
  • porechop_abi 0
  • indep pairwise 0
  • pmdtools 0
  • variant genetic 0
  • scoring 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • indep 0
  • csRNA-seq 0
  • variant identifiers 0
  • exclude 0
  • genetic 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • rocplot 0
  • prophage 0
  • sex determination 0
  • longread 0
  • de-novo 0
  • error 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • induce 0
  • 256 bit 0
  • gc_wiggle 0
  • bam2seqz 0
  • freqsum 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • sha256 0
  • shinyngs 0
  • POA 0
  • SNPs 0
  • predictions 0
  • dbnsfp 0
  • snippy 0
  • core 0
  • sniffles 0
  • SMN2 0
  • exploratory 0
  • SMN1 0
  • CRAM 0
  • features 0
  • density 0
  • boxplot 0
  • rtg-tools 0
  • duplicate marking 0
  • repair 0
  • insert size 0
  • faidx 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • sambamba 0
  • read pairs 0
  • flagstat 0
  • multimapper 0
  • Ancestor 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • paired 0
  • readgroup 0
  • applyvarcal 0
  • sequence headers 0
  • grep 0
  • subseq 0
  • variant recalibration 0
  • VQSR 0
  • assembly-binning 0
  • scramble 0
  • seacr 0
  • chromatin 0
  • cut&run 0
  • cut&tag 0
  • peak-caller 0
  • clusteridentifier 0
  • cluster analysis 0
  • identification 0
  • phantom peaks 0
  • gccounter 0
  • limma 0
  • peptide prediction 0
  • AMP 0
  • qualities 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • pneumophila 0
  • sgRNA 0
  • clinical 0
  • legionella 0
  • collapsing 0
  • adapter removal 0
  • spliced 0
  • functional genomics 0
  • CRISPR-Cas9 0
  • combining 0
  • reduced 0
  • MD5 0
  • mcr-1 0
  • mass-spectroscopy 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • mash/sketch 0
  • maximum-likelihood 0
  • taxonomic assignment 0
  • estimate 0
  • damage patterns 0
  • NGS 0
  • DNA damage 0
  • rra 0
  • reorder 0
  • kofamscan 0
  • megahit 0
  • panel_of_normals 0
  • multicut 0
  • genome browser 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • haemophilus 0
  • pixel_classification 0
  • pos 0
  • annotations 0
  • hmtnote 0
  • Hidden Markov Model 0
  • HMMER 0
  • readcounter 0
  • pixel classification 0
  • probability_maps 0
  • kegg 0
  • kallisto/index 0
  • pneumoniae 0
  • Klebsiella 0
  • effective genome size 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • papermill 0
  • interproscan 0
  • jupytext 0
  • Jupyter 0
  • Python 0
  • jasmine 0
  • jasminesv 0
  • genomic islands 0
  • 128 bit 0
  • denovo 0
  • ChIP-Seq 0
  • graph formats 0
  • block-compressed 0
  • HLA-I 0
  • ILP 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph unchopping 0
  • flip 0
  • graph stats 0
  • combine graphs 0
  • odgi 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • PCR/optical duplicates 0
  • upper-triangular matrix 0
  • Neisseria gonorrhoeae 0
  • pbmerge 0
  • motif 0
  • pedigrees 0
  • read 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbbam 0
  • ligation junctions 0
  • graphs 0
  • paragraph 0
  • select 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • gender 0
  • debruijn 0
  • ploidy 0
  • microrna 0
  • de Bruijn 0
  • mbias 0
  • methylation bias 0
  • metaphlan 0
  • unionsum 0
  • smudgeplot 0
  • mitochondrial genome 0
  • Merqury 0
  • contour map 0
  • 3D heat map 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • target prediction 0
  • reference genome 0
  • GATK UnifiedGenotyper 0
  • mobile element insertions 0
  • somatic structural variations 0
  • cancer genome 0
  • SNP table 0
  • Beautiful stand-alone HTML report 0
  • mosdepth 0
  • bioinformatics tools 0
  • mitochondrial to nuclear ratio 0
  • ratio 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • TCR 0

Trim sequencing adapters and collapse overlapping reads

010

singles_truncated discarded paired_truncated collapsed collapsed_truncated paired_interleaved settings versions

Fixes prefixes from AdapterRemoval2 output to make sure no clashing read names are in the output. For use with DeDup.

01

fixed_fastq versions

Extracts reads mapped to chromosome 6 and any HLA decoys or chromosome 6 alternates.

01

extracted_reads_fastq log intermediate_sam intermediate_bam intermediate_sorted_bam versions

arcashla:

arcasHLA performs high resolution genotyping for HLA class I and class II genes from RNA sequencing, supporting both paired and single-end samples.

Download and prepare database for Ariba analysis

01

db versions

ariba:

ARIBA: Antibiotic Resistance Identification By Assembly

Query input FASTQs against Ariba formatted databases

0101

results versions

ariba:

ARIBA: Antibiotic Resistance Identification By Assembly

Simulation tool to generate synthetic Illumina next-generation sequencing reads

01000

fastq aln sam versions

art:

ART is a set of simulation tools to generate synthetic next-generation sequencing reads. ART simulates sequencing reads by mimicking real sequencing process with empirical error models or quality profiles summarized from large recalibrated sequencing data. ART can also simulate reads using user own read error model or quality profiles.

Aggregates fastq files with demultiplexed reads

01

fastq versions

artic:

ARTIC pipeline - a bioinformatics pipeline for working with virus sequencing data sequenced with nanopore

Conversion of PacBio BAM files into gzipped fastq files, including splitting of barcoded data

012

fastq versions

bam2fastx:

Converting and demultiplexing of PacBio BAM files into gzipped fasta and fastq files

Tool for converting 10x BAMs produced by Cell Ranger, Space Ranger, Cell Ranger ATAC, Cell Ranger DNA, and Long Ranger back to FASTQ files that can be used as inputs to re-run analysis

01

fastq versions

BamTools provides both a programmer's API and an end-user's toolkit for handling BAM files.

01

data versions

bamtools:

C++ API & command-line toolkit for working with BAM data

Demultiplex Element Biosciences bases files

012

sample_fastq sample_json qc_report run_stats generated_run_manifest metrics unassigned versions

Align short or PacBio reads to a reference genome using BBMap

010

bam log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Adapter and quality trimming of sequencing reads

010

reads log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Merging overlapping paired reads into a single read.

010

merged unmerged ihist versions log

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

BBNorm is designed to normalize coverage by down-sampling reads over high-depth areas of a genome, to result in a flat coverage distribution.

01

fastq log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Split sequencing reads by mapping them to multiple references simultaneously

0100010

index primary_fastq all_fastq stats log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Create 30% Smaller, Faster Gzipped Fastq Files. And remove duplicates

01

reads log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Filter out sequences by sequence header name(s)

01000

reads log versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Re-pairs reads that became disordered or had some mates eliminated.

010

repaired singleton versions log

repair:

Repair.sh is a tool that re-pairs reads that became disordered or had some mates eliminated tools.

Compares query sketches to reference sketches hosted on a remote server via the Internet.

01

hits versions

bbmap:

BBMap is a short read aligner, as well as various other bioinformatic tools.

Demultiplex Illumina BCL files

012

fastq fastq_idx undetermined undetermined_idx reports stats interop versions

Demultiplex Illumina BCL files

012

fastq fastq_idx undetermined undetermined_idx reports logs interop versions

Bioawk is an extension to Brian Kernighan's awk, adding the support of several common biological data formats.

01

output versions

A fast, compact one-liner to produce duplicate-marked, sorted, and indexed BAM files using Biscuit

010101

bam bai versions

biscuit:

A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data

samblaster:

samblaster is a fast and flexible program for marking duplicates in read-id grouped paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. By default, samblaster reads SAM input from stdin and writes SAM to stdout.

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Align reads to a reference genome using bowtie

01010

bam log fastq versions

bowtie:

bowtie is a software package for mapping DNA sequences against a large reference genome, such as the human genome.

Align reads to a reference genome using bowtie2

01010100

sam bam cram csi crai log fastq versions

bowtie2:

Bowtie 2 is an ultrafast and memory-efficient tool for aligning sequencing reads to long reference sequences.

Find SA coordinates of the input reads for bwa short-read mapping

0101

sai versions

bwa:

BWA is a software package for mapping DNA sequences against a large reference genome, such as the human genome.

Performs fastq alignment to a fasta reference using BWA

0101010

bam cram csi crai versions

bwa:

BWA is a software package for mapping DNA sequences against a large reference genome, such as the human genome.

Performs fastq alignment to a fasta reference using BWA

0101010

sam bam cram crai csi versions

bwa:

BWA-mem2 is a software package for mapping DNA sequences against a large reference genome, such as the human genome.

Performs fastq alignment to a fasta reference using BWA-MEME

010101000

sam bam cram crai csi versions

bwameme:

Faster BWA-MEM2 using learned-index

Performs alignment of BS-Seq reads using bwameth

010101

bam versions

bwameth:

Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome.

Concatenates fastq files

01

reads versions

cat:

The cat utility reads files sequentially, writing them to the standard output.

Module to create FASTQs needed by the 10x Genomics Cell Ranger tool. Uses the cellranger mkfastq command.

012

fastq undetermined_fastq reports stats interop versions

cellranger:

Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.

Module to build the VDJ reference needed by the 10x Genomics Cell Ranger tool. Uses the cellranger mkvdjref command.

0000

reference versions

cellranger:

Cell Ranger processes data from 10X Genomics Chromium kits. cellranger vdj takes FASTQ files from cellranger mkfastq or bcl2fastq for V(D)J libraries and performs sequence assembly and paired clonotype calling. It uses the Chromium cellular barcodes and UMIs to assemble V(D)J transcripts per cell. Clonotypes and CDR3 sequences are output as a .vloupe file which can be loaded into Loupe V(D)J Browser.

Module to use Cell Ranger's pipelines analyze sequencing data produced from Chromium Single Cell Immune Profiling.

010

outs versions

cellranger:

Cell Ranger processes data from 10X Genomics Chromium kits. cellranger vdj takes FASTQ files from cellranger mkfastq or bcl2fastq for V(D)J libraries and performs sequence assembly and paired clonotype calling. It uses the Chromium cellular barcodes and UMIs to assemble V(D)J transcripts per cell. Clonotypes and CDR3 sequences are output as a .vloupe file which can be loaded into Loupe V(D)J Browser.

Module to create fastqs needed by the 10x Genomics Cell Ranger Arc tool. Uses the cellranger-arc mkfastq command.

00

versions fastq

cellrangerarc:

Cell Ranger Arc by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.

Module to create fastqs needed by the 10x Genomics Cell Ranger ATAC tool. Uses the cellranger-atac mkfastq command.

00

versions fastq

cellranger-atac:

Cell Ranger ATAC by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.

Classifies metagenomic sequence data

01000

report results sam fastq_mapped fastq_unmapped versions

centrifuge:

Centrifuge is a classifier for metagenomic sequences.

Creates Kraken-style reports from centrifuge out files

010

kreport versions

centrifuge:

Centrifuge is a classifier for metagenomic sequences.

Filter and trim long read data.

010

fastq versions

zcat:

zcat uncompresses either a list of files on the command line or its standard input and writes the uncompressed data on standard output.

gzip:

Gzip reduces the size of the named files using Lempel-Ziv coding (LZ77).

Performs preprocessing and alignment of chromatin fastq files to fasta reference files using chromap.

0101010000

bed bam tagAlign pairs versions

chromap:

Fast alignment and preprocessing of chromatin profiles

Performs fastq alignment to a reference using DRAGMAP

0101010

sam bam cram crai csi log versions

dragmap:

Dragmap is the Dragen mapper/aligner Open Source Software.

Run falco on sequenced reads

01

html txt versions

fastqc:

falco is a drop-in C++ implementation of FastQC to assess the quality of sequence reads.

Perform adapter and quality trimming on sequencing reads with reporting

01

reads stats debug statspdf reads_fail reads_unpaired log versions

A program that counts sequence occurrences in FASTQ files.

0101

count_matrix stats distribution_plot reads_plot reads_plot_percentage versions

2FAST2Q:

2FAST2Q is ideal for CRISPRi-Seq, and for extracting and counting any kind of information from reads in the fastq format, such as barcodes in Bar-seq experiments. 2FAST2Q can work with sequence mismatches, Phred-score, and be used to find and extract unknown sequences delimited by known sequences. 2FAST2Q can extract multiple features per read using either fixed positions or delimiting search sequences.

Perform adapter/quality trimming on sequencing reads

010000

reads json html log reads_fail reads_merged versions

Run FastQC on sequenced reads

01

html zip versions

fastqe is a bioinformatics command line tool that uses emojis to represent and analyze genomic data.

01

tsv versions

FASTQ summary statistics in JSON format

01

json versions

Build fastq screen config file from bowtie index files

00

database versions

fastqscreen:

FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.

Align reads to multiple reference genomes using fastq-screen

010

txt png html fastq versions

fastqscreen:

FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.

Collapses identical sequences in a FASTQ/A file into a single sequence (while maintaining reads counts)

01

fasta versions

fastx:

A collection of command line tools for Short-Reads FASTA/FASTQ files preprocessing

Using the fgbio tools, converts FASTQ files sequenced into unaligned BAM or CRAM files possibly moving the UMI barcode into the RX field of the reads

01

bam cram versions

fgbio:

A set of tools for working with genomic and high throughput sequencing data, including UMIs

fq generate is a FASTQ file pair generator. It creates two reads, formatting names as described by Illumina. While generate creates "valid" FASTQ reads, the content of the files are completely random. The sequences do not align to any genome. This requires a seed (--seed) to be supplied in ext.args.

0

fastq versions

fq:

fq is a library to generate and validate FASTQ file pairs.

fq lint is a FASTQ file pair validator.

01

lint versions

fq:

fq is a library to generate and validate FASTQ file pairs.

fq subsample outputs a subset of records from single or paired FASTQ files. This requires a seed (--seed) to be set in ext.args.

01

fastq versions

fq:

fq is a library to generate and validate FASTQ file pairs.

Demultiplex fastq files

0123

sample_fastq metrics most_frequent_unmatched versions

FusionCatcher searches for novel/known somatic fusion genes, translocations, and chimeras in RNA-seq data

0101

fusions summary log versions

fusioncatcher:

FusionCatcher searches for novel/known somatic fusion genes, translocations, and chimeras in RNA-seq data

Classify FASTQ files against ganon database

010

tre report one all unc log versions

ganon:

ganon classifies short DNA sequences against large sets of genomic reference sequences efficiently

Converts FastQ file to SAM/BAM format

01

bam versions

gatk4:

Genome Analysis Toolkit (GATK4) Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Converts BAM/SAM file to FastQ format

01

fastq versions

gatk4:

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

Create a GEM index from a FASTA file

01

index info versions

gem3:

The GEM indexer (v3).

Performs fastq alignment to a fasta reference using using gem3-mapper

01010

bam versions

gem3:

The GEM indexer (v3).

A single fast and exhaustive tool for summary statistics and simultaneous fa (fasta, fastq, gfa [.gz]) genome assembly file manipulation.

0100001010101

assembly_summary assembly versions

A versatile pairwise aligner for genomic and spliced nucleotide sequences

0100

sam versions

graphmap2:

A versatile pairwise aligner for genomic and spliced nucleotide sequences.

gzrecover is a program that will attempt to extract any readable data out of a gzip file that has been corrupted

01

recovered versions

Align RNA-Seq reads to a reference with HISAT2

010101

bam summary fastq versions

hisat2:

HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes as well as to a single reference genome.

Performs HLA typing based on a population reference graph and employs a new linear projection method to align reads to the graph.

0123

results extraction extraction_mapped extraction_unmpapped hla fastq reads_per_level remapped versions

hlala:

HLA typing from short and long reads

write your description here

0100

fastq json versions

hostile:

Hostile: accurate host decontamination

HUMID is a tool to quickly and easily remove duplicate reads from FastQ files, with or without UMIs.

0101

log dedup annotated stats versions

Demultiplex paired-end FASTQ files from QuantSeq-Pool

012

fastq undetermined stats versions

Identify insertion sites positions in bacterial genomes

0123

results versions

Taxonomic classification of metagenomic sequence data using a protein reference database

010

results versions

kaiju:

Fast and sensitive taxonomic classification for metagenomics

Merge two tab-separated output files of Kaiju and Kraken in the column format

0120

merged versions

kaiju:

Fast and sensitive taxonomic classification for metagenomics

Make Kaiju FMI-index file from a protein FASTA file

010

fmi bwt sa versions

kaiju:

Fast and sensitive taxonomic classification for metagenomics

quantifies scRNA-seq data from fastq files using kb-python.

01000000

count versions matrix

kb:

kallisto and bustools are wrapped in an easy-to-use program called kb

Creates a histogram of the number of distinct k-mers having a given frequency.

01

hist json png ps pdf jellyfish_hash versions

kat:

KAT is a suite of tools that analyse jellyfish hashes or sequence files (fasta or fastq) using kmer counts

Removes low abundance k-mers from FASTA/FASTQ files

01

trimmed versions

khmer:

khmer k-mer counting library

Generate k-mers (sketches) from FASTA/Q sequences

01

outdir info versions

kmcp:

Accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

Construct KMCP database from k-mer files

01

kmcp log versions

kmcp:

Accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

Merge search results from multiple databases.

01

result versions

kmcp:

Accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

Generate taxonomic profile from search results

010

profile versions

kmcp:

Accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

Search sequences against database

010

result versions

kmcp:

Accurate metagenomic profiling of both prokaryotic and viral populations by pseudo-mapping

Classifies metagenomic sequence data

01000

classified_reads_fastq unclassified_reads_fastq classified_reads_assignment report versions

kraken2:

Kraken2 is a taxonomic sequence classifier that assigns taxonomic labels to sequence reads

Classifies metagenomic sequence data using unique k-mer counts

012000000

classified_reads unclassified_reads classified_assignment report versions

krakenuniq:

Metagenomics classifier with unique k-mer counting for more specific results

Aligns query sequences to target sequences indexed with lastdb

0120

maf multiqc versions

last:

LAST finds & aligns related regions of sequences.

Prepare sequences for subsequent alignment with lastal.

01

index versions

last:

LAST finds & aligns related regions of sequences.

Find suitable score parameters for sequence alignment

010

param_file multiqc versions

last:

LAST finds & aligns related regions of sequences.

lima - The PacBio Barcode Demultiplexer and Primer Remover

010

counts report summary versions bam pbi fasta fastagz fastq fastqgz xml json clips guess

Map short-reads to an indexed reference genome

01010000000

bam versions

mapad:

An aDNA aware short-read mapper

Quickly create a tree using Mash distances

01

tree matrix versions

MetaPhlAn is a tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data.

010

profile biom bt2out versions

metaphlan:

Identify clades (phyla to species) present in the metagenome obtained from a microbiome sample and their relative abundance

MetaPhlAn is a tool for profiling the composition of microbial communities from metagenomic shotgun sequencing data.

010

profile biom bt2out versions

metaphlan3:

Identify clades (phyla to species) present in the metagenome obtained from a microbiome sample and their relative abundance

Demultiplex MGI fastq files

012

fastq undetermined ambiguous undetermined_reports ambiguous_reports general_info_reports index_reports sample_stat_reports qc_reports versions

mgikit demultiplex:

Demultiplex MGI fastq files

A versatile pairwise aligner for genomic and spliced nucleotide sequences

01010000

paf bam index versions

minimap2:

A versatile pairwise aligner for genomic and spliced nucleotide sequences.

Download the mOTUs database

0

db versions

motus:

The mOTU profiler is a computational tool that estimates relative taxonomic abundance of known and currently unknown microbial community members using metagenomic shotgun sequencing data.

Taxonomic meta-omics profiling using universal marker genes

0100

txt biom versions

motus:

Marker gene-based OTU (mOTU) profiling

Taxonomic meta-omics profiling using universal marker genes

010

out versions

motus:

Marker gene-based operational taxonomic unit (mOTU) profiling

Taxonomic meta-omics profiling using universal marker genes

010

out bam mgc log versions

motus:

Marker gene-based OTU (mOTU) profiling

AMR predictions for supported species

010

csv json versions

mykrobe:

Antibiotic resistance prediction in minutes

Compare multiple runs of long read sequencing data and alignments

01

report_html lengths_violin_html log_length_violin_html n50_html number_of_reads_html overlay_histogram_html overlay_histogram_normalized_html overlay_log_histogram_html overlay_log_histogram_normalized_html total_throughput_html quals_violin_html overlay_histogram_identity_html overlay_histogram_phredscore_html percent_identity_violin_html active_pores_over_time_html cumulative_yield_plot_gigabases_html sequencing_speed_over_time_html stats_txt versions

DNA contaminant removal using NanoLyse

010

fastq log versions

Run NanoPlot on nanopore-sequenced reads

01

html png txt log versions

Performs fastq alignment to a reference using NARFMAP

0101010

bam log versions

narfmap:

narfmap is a fork of the Dragen mapper/aligner Open Source Software.

Performs fastq alignment to a fasta reference using NextGenMap

010

bam versions

bwa:

NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime

Determining whether sequencing data comes from the same individual by using SNP matching. This module generates vaf files for individual fastq file(s), ready for the vafncm module.

0101

vaf versions

ngscheckmate:

NGSCheckMate is a software package for identifying next generation sequencing (NGS) data files from the same individual, including matching between DNA and RNA.

Calculate metagenome redundancy curve from FASTQ files

0100

npa npc npl npo versions

nonpareil:

Estimate average coverage and create curves for metagenomic datasets

converts pacbio bam files to fastq.gz using PacBioToolKit (pbtk) bam2fastq

012

fastq versions

pbtk:

pbtk - PacBio BAM toolkit

Converts a FASTQ file to an unaligned BAM or SAM file.

01

bam versions

picard:

A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.

Identify plasmids in bacterial sequences and assemblies

01

json txt tsv genome_seq plasmid_seq versions

Determine Streptococcus pneumoniae serotype from Illumina paired-end reads

01

xml txt versions

PRINSEQ++ is a C++ implementation of the prinseq-lite.pl program. It can be used to filter, reformat or trim genomic and metagenomic sequence data

01

good_reads single_reads bad_reads log versions

Identify, orient and trim nanopore cDNA reads

01

fastq versions

gzip:

Gzip reduces the size of the named files using Lempel-Ziv coding (LZ77).

Converts the contents of sequence data files (FASTA/FASTQ/SAM/BAM) into the RTG Sequence Data File (SDF) format.

0123

sdf versions

rtgtools:

RealTimeGenomics Tools -- Utilities for accurate VCF comparison and manipulation

Module to validate illuminaยฎ Sample Sheet v2 files.

010

samplesheet versions

The module uses bam2fq method from samtools to convert a SAM, BAM or CRAM file to FASTQ format

010

reads versions

samtools:

Tools for dealing with SAM, BAM and CRAM files

The module uses collate and then fastq methods from samtools to convert a SAM, BAM or CRAM file to FASTQ format

01010

fastq fastq_interleaved fastq_other fastq_singleton versions

samtools:

Tools for dealing with SAM, BAM and CRAM files

Produces a consensus FASTA/FASTQ/PILEUP

01

fasta fastq pileup versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Converts a SAM/BAM/CRAM file to FASTQ

010

fastq interleaved singleton other versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

converts FASTQ files to unmapped SAM/BAM/CRAM

01

sam bam cram versions

samtools:

SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.

Performs fastq alignment to a fasta reference using Sentieon's BWA MEM

01010101

bam_and_bai versions

sentieon:

Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.

Seqcluster collapse reduces computational complexity by collapsing identical sequences in a FASTQ file.

01

fastq versions

seqcluster:

Small RNA analysis from NGS data. Seqcluster generates a list of clusters of small RNA sequences, their genome location, their annotation and the abundance in all the sample of the project.

Statistics for FASTA or FASTQ files

01

stats multiqc versions

seqfu:

Cross-platform compiled suite of tools to manipulate and inspect FASTA and FASTQ files

Concatenating multiple uncompressed sequence files together

01

fastx versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Convert FASTQ to FASTA format

01

fasta versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Convert FASTA/Q to tabular format, and provide various information, like sequence length, GC content/GC skew.

01

text versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Subset FASTA/FASTQ files to some number of sequences

012

subset versions

seqkit:

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

match up paired-end reads from two fastq files

01

reads unpaired_reads versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Transforms sequences (extract ID, filter by length, remove gaps, reverse complement...)

01

fastx log versions

seqkit:

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

Transforms sequences (extract ID, filter by length, remove gaps, reverse complement...)

01

fastx versions

seqkit:

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

Sorts sequences by id/name/sequence/length

01

fastx versions

seqkit:

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

Split single or paired-end fastq.gz files

01

reads versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Convert tabular format (first two/three columns) to FASTA/Q format.

01

fastx versions

seqkit:

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.

Salmonella serotype prediction from reads and assemblies

01

log tsv txt versions

Computes sequence statistics from FASTQ or FASTA files

01

seqtk_stats versions

Generates a BED file containing genomic locations of lengths of N.

01

bed versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. Seqtk mergepe command merges pair-end reads into one interleaved file.

Interleave pair-end reads from FastQ files

01

reads versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. Seqtk mergepe command merges pair-end reads into one interleaved file.

Rename sequence names in FASTQ or FASTA files.

01

sequences versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. The seqtk rename command renames sequence names.

Subsample reads from FASTQ files

012

reads versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. Seqtk sample command subsamples sequences.

Common transformation operations on FASTA or FASTQ files.

01

fastx versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format. The seqtk seq command enables common transformation operations on FASTA or FASTQ files.

Select only sequences that match the filtering condition

010

sequences versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format

Trim low quality bases from FastQ files

01

reads versions

seqtk:

Seqtk is a fast and lightweight tool for processing sequences in the FASTA or FASTQ format

Sequence quality metrics for FASTQ and uBAM files.

01

json html versions

Determine Streptococcus pneumoniae serotype from Illumina paired-end reads

01

tsv txt versions

seroba:

SeroBA is a k-mer based pipeline to identify the Serotype from Illumina NGS reads for given references.

Demultiplex bgzip'd fastq files

012

sample_fastq metrics most_frequent_unmatched per_project_metrics per_sample_metrics sample_barcode_hop_metrics versions

Determine Shigella serotype from Illumina or Oxford Nanopore reads

01

tsv hits versions

Determine Shigella serotype from assemblies or Illumina paired-end reads

01

tsv versions

A windowed adaptive trimming tool for FASTQ files using quality

012

single_trimmed paired_trimmed singleton_trimmed log versions

Fast, efficient, lossless compression of FASTQ files.

01

sfq versions

Performs fastq alignment to a fasta reference using SNAP

0101

bam bai versions

snapaligner:

Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data

Rapid haploid variant calling

010

tab csv html vcf bed gff bam bai log aligned_fa consensus_fa consensus_subs_fa raw_vcf filt_vcf vcf_gz vcf_csi txt versions

snippy:

Rapid bacterial SNP calling and core genome alignments

Local sequence alignment tool for filtering, mapping and clustering.

010101

reads log index versions

SortMeRNA:

The core algorithm is based on approximate seeds and allows for sensitive analysis of NGS reads. The main application of SortMeRNA is filtering rRNA from metatranscriptomic data. SortMeRNA takes as input files of reads (fasta, fastq, fasta.gz, fastq.gz) and one or multiple rRNA database file(s), and sorts apart aligned and rejected reads into two files. Additional applications include clustering and taxonomy assignation available through QIIME v1.9.1. SortMeRNA works with Illumina, Ion Torrent and PacBio data, and can produce SAM and BLAST-like alignments.

Fast, efficient, lossless compression of FASTQ files.

012

spring versions

spring:

SPRING is a compression tool for Fastq files (containing up to 4.29 Billion reads)

Fast, efficient, lossless decompression of FASTQ files.

010

fastq versions

spring:

SPRING is a compression tool for Fastq files (containing up to 4.29 Billion reads)

Extract sequencing reads in FASTQ format from a given NCBI Sequence Read Archive (SRA).

0100

reads versions

sratools:

SRA Toolkit and SDK from NCBI

Download sequencing data from the NCBI Sequence Read Archive (SRA).

0100

sra versions

sratools:

SRA Toolkit and SDK from NCBI

Align reads to a reference genome using STAR

010101000

log_final log_out log_progress versions bam bam_sorted bam_sorted_aligned bam_transcript bam_unsorted fastq tab spl_junc_tab read_per_gene_tab junction sam wig bedgraph

star:

STAR is a software package for mapping DNA sequences against a large reference genome, such as the human genome.

Serotype STEC samples from paired-end reads or assemblies

01

tsv versions

Sketching/indexing sequencing reads

010

sketch_fastq_genome versions

sylph:

Sylph quickly enables querying of genomes against even low-coverage shotgun metagenomes to find nearest neighbour ANI.

Create fasta consensus with TOPAS toolkit with options to penalize substitutions for typical DNA damage present in ancient DNA

010101010

fasta vcf ccf log versions

topas:

This toolkit allows the efficient manipulation of sequence data in various ways. It is organized into modules: The FASTA processing modules, the FASTQ processing modules, the GFF processing modules and the VCF processing modules.

Trim FastQ files using Trim Galore!

01

reads log unpaired html zip versions

Assembles a de novo transcriptome from RNAseq reads

01

transcript_fasta log versions

Cluster contigs from multiple assemblies by similarity

012

cluster_dir versions

trycycler:

Trycycler is a tool for generating consensus long-read assemblies for bacterial genomes

Subsample a long-read sequencing fastq file for multiple assemblies

01

subreads versions

trycycler:

Trycycler is a tool for generating consensus long-read assemblies for bacterial genomes

Ultraplex is an all-in-one software package for processing and demultiplexing fastq files.

010

fastq no_match_fastq report versions

Deduplicate reads based on the mapping co-ordinate and the UMI attached to the read.

0120

bam fastq log versions

Cluster sequences using a single-pass, greedy centroid-based clustering algorithm.

01

aln biom mothur otu bam out blast uc centroids clusters profile msa versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Performs quality filtering and / or conversion of a FASTQ file to FASTA format.

01

fasta log versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Taxonomic classification using the sintax algorithm.

010

tsv versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Sort fasta entries by decreasing abundance (--sortbysize) or sequence length (--sortbylength).

010

fasta versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

Compare target sequences to fasta-formatted query sequences using global pairwise alignment.

010000

aln biom lca mothur otu sam tsv txt uc versions

vsearch:

VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)

simulating sequence reads from a reference genome

01

fastq versions

A tool of the wipertools suite that merges FASTQ chunks produced by wipertools_fastqscatter

01

gathered_fastq versions

fastqgather:

A tool of the wipertools suite that merges FASTQ chunks produced by wipertools_fastqscatter.

A tool of the wipertools suite that splits FASTQ files into chunks

010

fastq_chunks versions

fastqscatter:

A tool of the wipertools suite that splits FASTQ files into chunks.

A tool of the wipertools suite that fixes or wipes out uncompliant reads from FASTQ files

01

wiped_fastq report versions

fastqwiper:

A tool of the wipertools suite that that fixes or wipes out uncompliant reads from FASTQ files.

A tool of the wipertools suite that merges wiping reports generated by wipertools_fastqwiper

01

gathered_report versions

reportgather:

A tool of the wipertools suite that merges wiping reports generated by wipertools_fastqwiper.

a tool to build k-mer hash table for fasta and fastq files

01

yak versions

yak:

Yet another k-mer analyzer

Click here to trigger an update.