Available Modules
Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.
A tool to parse and summarise results from antimicrobial peptides tools and present functional classification.
0
1
0
0
sample_dir
txt
csv
faa
summary_csv
summary_html
log
results_db
results_db_dmnd
results_db_fasta
results_db_tsv
versions
A submodule that clusters the merged AMP hits generated from ampcombi2/parsetables and ampcombi2/complete using MMseqs2 cluster.
0
cluster_tsv
rep_cluster_tsv
log
versions
A tool for clustering all AMP hits found across many samples and supporting many AMP prediction tools.
A submodule that merges all output summary tables from ampcombi/parsetables in one summary file.
0
tsv
log
versions
This merges the per sample AMPcombi summaries generated by running 'ampcombi2/parsetables'.
A submodule that parses and standardizes the results from various antimicrobial peptide identification tools.
0
1
0
0
0
0
0
sample_dir
contig_gbks
db_tsv
tsv
faa
sample_log
full_log
db
db_txt
db_fasta
db_mmseqs
versions
A parsing tool to convert and summarise the outputs from multiple AMP detection tools in a standardized format.
Post-processing script of the MaltExtract component of the HOPS package
0
0
0
json
summary_pdf
tsv
candidate_pdfs
versions
Run the alignment/variant-call/consensus logic of the artic pipeline
0
1
0
1
2
0
1
2
results
bam
bai
bam_trimmed
bai_trimmed
bam_primertrimmed
bai_primertrimmed
fasta
vcf
tbi
json
versions
ARTIC pipeline - a bioinformatics pipeline for working with virus sequencing data sequenced with nanopore
Alignment by Simultaneous Harmonization of Layer/Adjacency Registration
0
1
0
0
tif
versions
removes unused references from header of sorted BAM/CRAM files.
0
1
bam
versions
This module is used to clip primer sequences from your alignments.
0
1
2
3
bam
bai
versions
Adapter and quality trimming of sequencing reads
0
1
0
reads
log
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Merging overlapping paired reads into a single read.
0
1
0
merged
unmerged
ihist
versions
log
BBMap is a short read aligner, as well as various other bioinformatic tools.
BBNorm is designed to normalize coverage by down-sampling reads over high-depth areas of a genome, to result in a flat coverage distribution.
0
1
fastq
log
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Create 30% Smaller, Faster Gzipped Fastq Files. And remove duplicates
0
1
reads
log
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Filter out sequences by sequence header name(s)
0
1
0
0
0
reads
log
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Creates an index from a fasta file, ready to be used by bbmap.sh in mapping mode.
0
index
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Calculates per-scaffold or per-base coverage information from an unsorted sam or bam file.
0
1
covstats
hist
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Compares query sketches to reference sketches hosted on a remote server via the Internet.
0
1
hits
versions
BBMap is a short read aligner, as well as various other bioinformatic tools.
Computes histograms (default), per-base reports (-d) and BEDGRAPH (-bg) summaries of feature coverage (e.g., aligned sequences) for a given genome.
0
1
2
0
0
0
genomecov
versions
A set of tools for genomic analysis tasks, specifically enabling genome arithmetic (merge, count, complement) on various file types.
Locate and tag duplicate reads in a BAM file
0
1
bam
metrics
versions
biobambam is a set of tools for early stage alignment file processing.
Merge a list of sorted bam files
0
1
bam
bam_index
checksum
versions
biobambam is a set of tools for early stage alignment file processing.
Parallel sorting and duplicate marking
0
1
0
1
bam
bam_index
cram
metrics
versions
biobambam is a set of tools for early stage alignment file processing.
Aligns single- or paired-end reads from bisulfite-converted libraries to a reference genome using Biscuit.
0
1
0
1
0
1
bam
bai
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
A fast, compact one-liner to produce duplicate-marked, sorted, and indexed BAM files using Biscuit
0
1
0
1
0
1
bam
bai
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
samblaster is a fast and flexible program for marking duplicates in read-id grouped paired-end SAM files. It can also optionally output discordant read pairs and/or split read mappings to separate SAM files, and/or unmapped/clipped reads to a separate FASTQ file. By default, samblaster reads SAM input from stdin and writes SAM to stdout.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Summarize and/or filter reads based on bisulfite conversion rate
0
1
0
1
0
1
0
1
bam
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
Summarizes read-level methylation (and optionally SNV) information from a Biscuit BAM file in a standard-compliant BED format.
0
1
0
1
0
1
0
1
0
1
bed
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
Merges methylation information for opposite-strand C's in a CpG context
0
1
0
1
0
1
bed
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
Summarizes methylation or SNV information from a Biscuit VCF in a standard-compliant BED file.
0
1
bed
versions
A utility for analyzing sodium bisulfite conversion-based DNA methylation/modification data
Performs alignment of BS-Seq reads using bismark
0
1
0
1
0
1
bam
report
unmapped
versions
Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.
Removes alignments to the same position in the genome from the Bismark mapping output.
0
1
bam
report
versions
Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.
Converts a specified reference genome into two different bisulfite converted versions and indexes them for alignments.
0
1
index
versions
Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.
Extracts methylation information for individual cytosines from alignments.
0
1
0
1
bedgraph
methylation_calls
coverage
report
mbias
versions
Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.
Collects bismark alignment reports
0
1
2
3
4
report
versions
Bismark is a tool to map bisulfite treated sequencing reads and perform methylation calling in a quick and easy-to-use fashion.
BLASTP (Basic Local Alignment Search Tool- Protein) compares an amino acid (protein) query sequence against a protein database
0
1
0
1
0
xml
tsv
csv
versions
BLAST+ is a new suite of BLAST tools that utilizes the NCBI C++ Toolkit.
Construct species phylogenies using BUSCO proteins
0
1
gene_trees
supermatrix
versions
Construct species phylogenies using BUSCO proteins
Performs alignment of BS-Seq reads using bwameth
0
1
0
1
0
1
bam
versions
Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome.
Performs indexing of c2t converted reference genome
0
1
index
versions
Fast and accurate alignment of BS-Seq reads using bwa-mem and a 3-letter genome.
Taxonomic classification plus read-based abundance estimation from long DNA sequences and metagenome assembled genomes (e.g. contigs, MAGs / bins).
0
1
0
1
0
1
0
1
0
1
0
0
1
0
1
0
1
0
1
0
1
0
1
0
1
rat_log
complete_abundance
contig_abundance
read2classification
alignment_diamond
contig2classification
cat_log
orf2lca
faa
gff
unmapped_diamond
unmapped_fasta
unmapped2classification
versions
CAT/BAT: tool for taxonomic classification of contigs and metagenome-assembled genomes (MAGs)
Cluster protein sequences using sequence similarity
0
1
fasta
clusters
versions
Clusters and compares protein or nucleotide sequences
Cluster nucleotide sequences using sequence similarity
0
1
fasta
clusters
versions
Clusters and compares protein or nucleotide sequences
Module to use Cell Ranger's pipelines analyze sequencing data produced from Chromium Single Cell Gene Expression.
0
1
0
outs
versions
Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to create FASTQs needed by the 10x Genomics Cell Ranger tool. Uses the cellranger mkfastq command.
0
1
2
fastq
undetermined_fastq
reports
stats
interop
versions
Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to build a filtered GTF needed by the 10x Genomics Cell Ranger tool. Uses the cellranger mkgtf command.
0
gtf
versions
Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to build the reference needed by the 10x Genomics Cell Ranger tool. Uses the cellranger mkref command.
0
0
0
reference
versions
Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to use Cell Ranger's pipelines to analyze sequencing data produced from various Chromium technologies, including Single Cell Gene Expression, Single Cell Immune Profiling, Feature Barcoding, and Cell Multiplexing.
0
0
1
0
1
0
1
0
1
0
1
0
1
0
0
0
0
0
0
0
0
0
0
0
0
0
config
outs
versions
Cell Ranger by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to use Cell Ranger's pipelines analyze sequencing data produced from Chromium Single Cell Immune Profiling.
0
1
0
outs
versions
Cell Ranger processes data from 10X Genomics Chromium kits. cellranger vdj
takes FASTQ files from cellranger mkfastq
or bcl2fastq
for V(D)J libraries and performs sequence assembly and paired clonotype calling. It uses the Chromium cellular barcodes and UMIs to assemble V(D)J transcripts per cell. Clonotypes and CDR3 sequences are output as a .vloupe
file which can be loaded into Loupe V(D)J Browser.
Module to use Cell Ranger's ARC pipelines analyze sequencing data produced from Chromium Single Cell ARC. Uses the cellranger-arc count command.
0
1
2
3
0
outs
lib
versions
Cell Ranger ARC is a set of analysis pipelines that process Chromium Single Cell ARC data.
Module to create fastqs needed by the 10x Genomics Cell Ranger Arc tool. Uses the cellranger-arc mkfastq command.
0
0
versions
fastq
Cell Ranger Arc by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to build a filtered gtf needed by the 10x Genomics Cell Ranger Arc tool. Uses the cellranger-arc mkgtf command.
0
gtf
versions
Cell Ranger Arc by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Module to use Cell Ranger's ATAC pipelines analyze sequencing data produced from Chromium Single Cell ATAC.
0
1
0
outs
versions
Cell Ranger ATAC is a set of analysis pipelines that process Chromium Single Cell ATAC data.
Module to create fastqs needed by the 10x Genomics Cell Ranger ATAC tool. Uses the cellranger-atac mkfastq command.
0
0
versions
fastq
Cell Ranger ATAC by 10x Genomics is a set of analysis pipelines that process Chromium single-cell data to align reads, generate feature-barcode matrices, perform clustering and other secondary analysis, and more.
Cellsnp-lite is a C/C++ tool for efficient genotyping bi-allelic SNPs on single cells. You can use the mode A of cellsnp-lite after read alignment to obtain the snp x cell pileup UMI or read count matrices for each alleles of given or detected SNPs for droplet based single cell data.
0
1
2
3
4
base
cell
sample
allele_depth
depth_coverage
depth_other
versions
Efficient genotyping bi-allelic SNPs on single cells
Performs preprocessing and alignment of chromatin fastq files to fasta reference files using chromap.
0
1
0
1
0
1
0
0
0
0
bed
bam
tagAlign
pairs
versions
Fast alignment and preprocessing of chromatin profiles
Indexes a fasta reference genome ready for chromatin profiling.
0
1
index
versions
Fast alignment and preprocessing of chromatin profiles
CIRCexplorer2 parses fusion junction files from multiple aligners to prepare them for CIRCexplorer2 annotate.
0
1
junction
versions
Circular RNA analysis toolkit
Realign reads mapped with BWA to elongated reference genome
0
1
0
1
0
1
0
1
bam
versions
A method to improve mappings on circular genomes such as Mitochondria.
Predict recomination events in bacterial genomes
0
1
2
emsim
em
status
newick
fasta
pos_ref
versions
Align sequences using Clustal Omega
0
1
0
1
0
0
0
0
0
alignment
versions
Latest version of Clustal: a multiple sequence alignment program for DNA or proteins
Parallel implementation of the gzip algorithm.
Renders a guidetree in clustalo
0
1
tree
versions
Latest version of Clustal: a multiple sequence alignment program for DNA or proteins
Make a transcript/gene mapping from a GTF and cross-reference with transcript quantifications.
0
1
0
1
0
0
0
tx2gene
versions
"Custom module to create a transcript to gene mapping from a GTF and check it against transcript quantifications"
DeepSomatic is an extension of deep learning-based variant caller DeepVariant that takes aligned reads (in BAM or CRAM format) from tumor and normal data, produces pileup image tensors from them, classifies each tensor using a convolutional neural network, and finally reports somatic variants in a standard VCF or gVCF file.
0
1
2
3
4
0
1
0
1
0
1
0
1
vcf
vcf_tbi
gvcf
gvcf_tbi
versions
This tool filters alignments in a BAM/CRAM file according the the specified parameters.
0
1
2
bam
logs
versions
A set of user-friendly tools for normalization and visualization of deep-sequencing data
This tool takes an alignment of reads or fragments as input (BAM file) and generates a coverage track (bigWig or bedGraph) as output.
0
1
2
0
0
bigwig
bedgraph
versions
A set of user-friendly tools for normalization and visualization of deep-sequencing data
Transforms the input alignments to a format suitable for the deep neural network variant caller
0
1
2
3
0
1
0
1
0
1
0
1
examples
gvcf
small_model_calls
versions
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data
Queries a DIAMOND database using blastp mode
0
1
0
1
0
0
blast
xml
txt
daa
sam
tsv
paf
versions
Accelerated BLAST compatible local sequence aligner
Queries a DIAMOND database using blastx mode
0
1
0
1
0
0
blast
xml
txt
daa
sam
tsv
paf
log
versions
Accelerated BLAST compatible local sequence aligner
calculate clusters of highly similar sequences
0
1
tsv
versions
Accelerated BLAST compatible local sequence aligner
Builds a DIAMOND database
0
1
0
0
0
db
versions
Accelerated BLAST compatible local sequence aligner
Create DRAGEN hashtable for reference genome
0
1
hashmap
versions
Dragmap is the Dragen mapper/aligner Open Source Software.
Export assembly segment sequences in GFA 1.0 format to FASTA format
0
1
fasta
versions
Reads, features, variants, assemblies, alignments, genomic range trees, pangenome graphs, and a bunch of random command line tools for bioinformatics. LGPL version 3 or later.
Filter features in gzipped BED format
0
1
bed
versions
Reads, features, variants, assemblies, alignments, genomic range trees, pangenome graphs, and a bunch of random command line tools for bioinformatics. LGPL version 3 or later.
Filter features in gzipped GFF3 format
0
1
gff3
versions
Reads, features, variants, assemblies, alignments, genomic range trees, pangenome graphs, and a bunch of random command line tools for bioinformatics. LGPL version 3 or later.
Split features in gzipped BED format
0
1
bed
versions
Reads, features, variants, assemblies, alignments, genomic range trees, pangenome graphs, and a bunch of random command line tools for bioinformatics. LGPL version 3 or later.
Split features in gzipped GFF3 format
0
1
gff3
versions
Reads, features, variants, assemblies, alignments, genomic range trees, pangenome graphs, and a bunch of random command line tools for bioinformatics. LGPL version 3 or later.
cons calculates a consensus sequence from a multiple sequence alignment. To obtain the consensus, the sequence weights and a scoring matrix are used to calculate a score for each amino acid residue or nucleotide at each position in the alignment.
0
1
consensus
versions
The European Molecular Biology Open Software Suite
splits an alignment into reference and query parts
0
1
2
query
reference
versions
Massively parallel phylogenetic placement of genetic sequences
Aligns sequences using FAMSA
0
1
0
1
0
alignment
versions
Algorithm for large-scale multiple sequence alignments
Renders a guidetree in famsa
0
1
tree
versions
Algorithm for large-scale multiple sequence alignments
Align reads to multiple reference genomes using fastq-screen
0
1
0
txt
png
html
fastq
versions
FastQ Screen allows you to screen a library of sequences in FastQ format against a set of sequence databases so you can see if the composition of the library matches with what you expect.
Runs FCS-GX (Foreign Contamination Screen - Genome eXtractor) to remove foreign contamination from genome assemblies
0
1
2
cleaned
contaminants
versions
The NCBI Foreign Contamination Screen. Genomic cross-species aligner, for contamination detection.
Fetches the NCBI FCS-GX database using a provided manifest URL
0
database
versions
The NCBI Foreign Contamination Screen. Genomic cross-species aligner, for contamination detection.
Runs FCS-GX (Foreign Contamination Screen - Genome eXtractor) to screen and remove foreign contamination from genome assemblies
0
1
2
0
0
fcsgx_report
taxonomy_report
log
hits
versions
The NCBI Foreign Contamination Screen. Genomic cross-species aligner, for contamination detection.
Using the fgbio tools, converts FASTQ files sequenced into unaligned BAM or CRAM files possibly moving the UMI barcode into the RX field of the reads
0
1
bam
cram
versions
A set of tools for working with genomic and high throughput sequencing data, including UMIs
Creates a database for Foldmason.
0
1
db
versions
Multiple Protein Structure Alignment at Scale with FoldMason
Aligns protein structures using foldmason
0
1
0
1
0
msa_3di
msa_aa
versions
Multiple Protein Structure Alignment at Scale with FoldMason
Renders a visualization report using foldmason
0
1
0
1
0
1
0
1
html
versions
Multiple Protein Structure Alignment at Scale with FoldMason
fq generate is a FASTQ file pair generator. It creates two reads, formatting names as described by Illumina. While generate creates "valid" FASTQ reads, the content of the files are completely random. The sequences do not align to any genome. This requires a seed (--seed) to be supplied in ext.args.
0
fastq
versions
fq is a library to generate and validate FASTQ file pairs.
Performs local realignment around indels to correct for mapping errors
0
1
2
3
0
1
0
1
0
1
0
1
bam
versions
The full Genome Analysis Toolkit (GATK) framework, license restricted.
Generates a list of locations that should be considered for local realignment prior genotyping.
0
1
2
0
1
0
1
0
1
0
1
intervals
versions
The full Genome Analysis Toolkit (GATK) framework, license restricted.
Left align and trim variants using GATK4 LeftAlignAndTrimVariants.
0
1
2
3
0
0
0
vcf
tbi
versions
Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.
Merge unmapped with mapped BAM files
0
1
2
0
1
0
1
bam
versions
Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.
Performs fastq alignment to a fasta reference using using gem3-mapper
0
1
0
1
0
bam
versions
The GEM indexer (v3).
A versatile pairwise aligner for genomic and spliced nucleotide sequences
0
index
versions
A versatile pairwise aligner for genomic and spliced nucleotide sequences.
Gubbins (Genealogies Unbiased By recomBinations In Nucleotide Sequences) is an algorithm that iteratively identifies loci containing elevated densities of base substitutions while concurrently constructing a phylogeny based on the putative point mutations outside of these regions.
0
fasta
gff
vcf
stats
phylip
embl_predicted
embl_branch
tree
tree_labelled
versions
Reformat a Multiple Sequence Alignment (MSA) file
0
1
0
0
msa
versions
HH-suite3 for fast remote homology detection and deep protein annotation
Align RNA-Seq reads to a reference with HISAT2
0
1
0
1
0
1
bam
summary
fastq
versions
HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes as well as to a single reference genome.
Builds HISAT2 index for reference genome
0
1
0
1
0
1
index
versions
HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes as well as to a single reference genome.
Extracts splicing sites from a gtf files
0
1
txt
versions
HISAT2 is a fast and sensitive alignment program for mapping next-generation sequencing reads (both DNA and RNA) to a population of human genomes as well as to a single reference genome.
Performs HLA typing based on a population reference graph and employs a new linear projection method to align reads to the graph.
0
1
2
3
results
extraction
extraction_mapped
extraction_unmpapped
hla
fastq
reads_per_level
remapped
versions
HLA typing from short and long reads
Mask multiple sequence alignments
0
1
2
3
4
5
6
7
0
maskedaln
fmask_rf
fmask_all
gmask_rf
gmask_all
pmask_rf
pmask_all
versions
Biosequence analysis using profile hidden Markov models
hmmalign from the HMMER suite aligns a number of sequences to an HMM profile
0
1
0
sto
versions
Biosequence analysis using profile hidden Markov models
create an hmm profile from a multiple sequence alignment
0
1
0
hmm
hmmbuildout
versions
Biosequence analysis using profile hidden Markov models
search profile(s) against a sequence database
0
1
2
3
4
5
output
alignments
target_summary
domain_summary
versions
Biosequence analysis using profile hidden Markov models
Create a tag directory with the HOMER suite
0
1
0
tagdir
taginfo
versions
HOMER (Hypergeometric Optimization of Motif EnRichment) is a suite of tools for Motif Discovery and next-gen sequencing analysis.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Differential gene expression analysis based on the negative binomial distribution
Empirical Analysis of Digital Gene Expression Data in R
igv.js is an embeddable interactive genome visualization component
0
1
2
browser
align_files
index_files
versions
Create an embeddable interactive genome browser component. Output files are expected to be present in the same directory as the genome browser html file. To visualise it, files have to be served. Check the documentation at: https://github.com/igvteam/igv-webapp for an example and https://github.com/igvteam/igv.js/wiki/Data-Server-Requirements for server requirements
Search covariance models against a sequence database
0
1
2
0
0
output
alignments
target_summary
versions
Infernal is for searching DNA sequence databases for RNA structure and sequence similarities.
Produces a Newick format phylogeny from a multiple sequence alignment using the maximum likelihood algorithm. Capable of bacterial genome size alignments.
0
1
2
0
0
0
0
0
0
0
0
0
0
0
0
phylogeny
report
mldist
lmap_svg
lmap_eps
lmap_quartetlh
sitefreq_out
bootstrap
state
contree
nex
splits
suptree
alninfo
partlh
siteprob
sitelh
treels
rate
mlrate
exch_matrix
log
versions
Aligns sequences using kalign
0
1
0
alignment
versions
Kalign is a fast and accurate multiple sequence alignment algorithm.
Computes equivalence classes for reads and quantifies abundances
0
1
0
1
0
0
0
0
results
json_info
log
versions
Quantifying abundances of transcripts from RNA-Seq data, or more generally of target sequences using high-throughput sequencing reads.
Aligns query sequences to target sequences indexed with lastdb
0
1
2
0
maf
multiqc
versions
LAST finds & aligns related regions of sequences.
Prepare sequences for subsequent alignment with lastal.
0
1
index
versions
LAST finds & aligns related regions of sequences.
Converts MAF alignments in another format.
0
1
2
0
1
0
1
0
1
axt_gz
bam
blast_gz
blasttab_gz
chain_gz
cram
gff_gz
html_gz
psl_gz
sam_gz
tab_gz
versions
LAST finds & aligns related regions of sequences.
Reorder alignments in a MAF file
0
1
maf
versions
LAST finds & aligns related regions of sequences.
Post-alignment masking
0
1
maf
versions
LAST finds & aligns related regions of sequences.
Find suitable score parameters for sequence alignment
0
1
0
param_file
multiqc
versions
LAST finds & aligns related regions of sequences.
Align sequences using learnMSA
0
1
alignment
versions
learnMSA: Learning and Aligning large Protein Families
Converting aligned short and long reads records from one reference to another
0
1
0
1
bam
versions
Fast and accurate coordinate conversion between assemblies
Lofreq subcommand to for insert base and indel alignment qualities
0
1
0
bam
versions
A fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data
Lofreq subcommand to call low frequency variants from alignments
0
1
2
0
vcf
versions
A fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data
Lofreq subcommand to call low frequency variants from alignments when tumor-normal paired samples are available
0
1
2
3
4
5
0
1
0
1
vcf
versions
A fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data
Lofreq subcommand to call low frequency variants from alignments when tumor-normal paired samples are available
0
1
0
1
bam
versions
A fast and sensitive variant-caller for inferring SNVs and indels from next-generation sequencing data
Peak calling of enriched genomic regions of ChIP-seq and ATAC-seq experiments
0
1
2
0
peak
xls
versions
gapped
bed
bdg
Model Based Analysis for ChIP-Seq data
Peak calling of enriched genomic regions of ChIP-seq and ATAC-seq experiments
0
1
2
0
peak
xls
versions
gapped
bed
bdg
Model Based Analysis for ChIP-Seq data
Multiple sequence alignment using MAFFT
0
1
0
1
0
1
0
1
0
1
0
1
0
fas
versions
Parallel implementation of the gzip algorithm.
Multiple sequence alignment using MAFFT
0
1
0
1
0
1
0
1
0
1
0
1
0
fas
versions
Multiple alignment program for amino acid or nucleotide sequences based on fast Fourier transform
Parallel implementation of the gzip algorithm.
Guide tree rendering using MAFFT
0
1
tree
versions
Multiple alignment program for amino acid or nucleotide sequences based on fast Fourier transform
Multiple Sequence Alignment using Graph Clustering
0
1
0
1
0
alignment
versions
Multiple Sequence Alignment using Graph Clustering
Multiple Sequence Alignment using Graph Clustering
0
1
tree
versions
Multiple Sequence Alignment using Graph Clustering
MALT, an acronym for MEGAN alignment tool, is a sequence alignment and analysis tool designed for processing high-throughput sequencing data, especially in the context of metagenomics.
0
0
0
0
index
versions
log
A tool for mapping metagenomic data
MALT, an acronym for MEGAN alignment tool, is a sequence alignment and analysis tool designed for processing high-throughput sequencing data, especially in the context of metagenomics.
0
1
0
rma6
alignments
log
versions
A tool for mapping metagenomic data
Tool for evaluation of MALT results for true positives of ancient metagenomic taxonomic screening
0
1
0
0
results
versions
Map short-reads to an indexed reference genome
0
1
0
1
0
0
0
0
0
0
0
bam
versions
An aDNA aware short-read mapper
Extracts per-base methylation metrics from alignments
0
1
2
0
0
bedgraph
methylkit
versions
Methylation caller from MethylDackel, a (mostly) universal methylation extractor for methyl-seq experiments.
Generates methylation bias plots from alignments
0
1
2
0
0
txt
versions
Read position methylation bias tools from MethylDackel, a (mostly) universal extractor for methyl-seq experiments.
Provides fasta index required by minimap2 alignment.
0
1
index
versions
A versatile pairwise aligner for genomic and spliced nucleotide sequences.
Provides fasta index required by miniprot alignment.
0
1
index
versions
A versatile pairwise aligner for genomic and protein sequences.
Aligns protein structures using mTM-align
0
1
0
alignment
structure
versions
Algorithm for structural multiple sequence alignments
Parallel implementation of the gzip algorithm.
SNP table generator from GATK UnifiedGenotyper with functionality geared for aDNA
0
1
0
1
0
1
0
1
0
0
0
0
0
0
1
full_alignment
info_txt
snp_alignment
snp_genome_alignment
snpstatistics
snptable
snptable_snpeff
snptable_uncertainty
structure_genotypes
structure_genotypes_nomissing
json
versions
MUSCLE is a program for creating multiple alignments of amino acid or nucleotide sequences. A range of options are provided that give you the choice of optimizing accuracy, speed, or some compromise between the two
0
1
aligned_fasta
phyi
phys
clustalw
html
msf
tree
log
versions
Muscle is a program for creating multiple alignments of amino acid or nucleotide sequences. This particular module uses the super5 algorithm for very big alignments. It can permutate the guide tree according to a set of flags.
0
1
0
alignment
versions
Muscle v5 is a major re-write of MUSCLE based on new algorithms.
Parallel implementation of the gzip algorithm.
Compare multiple runs of long read sequencing data and alignments
0
1
report_html
lengths_violin_html
log_length_violin_html
n50_html
number_of_reads_html
overlay_histogram_html
overlay_histogram_normalized_html
overlay_log_histogram_html
overlay_log_histogram_normalized_html
total_throughput_html
quals_violin_html
overlay_histogram_identity_html
overlay_histogram_phredscore_html
percent_identity_violin_html
active_pores_over_time_html
cumulative_yield_plot_gigabases_html
sequencing_speed_over_time_html
stats_txt
versions
Create DRAGEN hashtable for reference genome
0
1
hashmap
versions
narfmap is a fork of the Dragen mapper/aligner Open Source Software.
SARS-CoV-2 genome clade assignment, mutation calling, and sequence quality checks (C++ implementation)
0
1
0
csv
csv_errors
csv_insertions
tsv
json
json_auspice
ndjson
fasta_aligned
fasta_translation
nwk
versions
SARS-CoV-2 genome clade assignment, mutation calling, and sequence quality checks
Performs fastq alignment to a fasta reference using NextGenMap
0
1
0
bam
versions
NextGenMap is a flexible highly sensitive short read mapping tool that handles much higher mismatch rates than comparable algorithms while still outperforming them in terms of runtime
NUCmer is a pipeline for the alignment of multiple closely related nucleotide sequences.
0
1
2
delta
coords
versions
A fast and scalable tool for bacterial pangenome analysis
0
1
results
aln
versions
panaroo - an updated pipeline for pangenome investigation
NVIDIA Clara Parabricks GPU-accelerated alignment, sorting, BQSR calculation, and duplicate marking. Note this nf-core module requires files to be copied into the working directory and not symlinked.
0
1
0
1
0
1
0
1
0
1
0
bam
bai
cram
crai
bqsr_table
qc_metrics
duplicate_metrics
versions
NVIDIA Clara Parabricks GPU-accelerated genomics tools
VIDIA Clara Parabricks GPU-accelerated fast, accurate algorithm for mapping methylated DNA sequence reads to a reference genome, performing local alignment, and producing alignment for different parts of the query sequence
0
1
0
1
0
1
0
bam
bai
qc_metrics
bqsr_table
duplicate_metrics
versions
NVIDIA Clara Parabricks GPU-accelerated genomics tools
Determines the depth in a BAM/CRAM file
0
1
2
0
1
0
1
depth
binned_depth
versions
Graph realignment tools for structural variants
Genotype structural variants using paragraph and grmpy
0
1
2
3
4
5
0
1
0
1
vcf
json
versions
Graph realignment tools for structural variants
Convert a VCF file to a JSON graph
0
1
0
1
graph
versions
Graph realignment tools for structural variants
Alignment with PacBio's minimap2 frontend
0
1
0
1
bam
versions
A minimap2 frontend for PacBio native data formats
Cleans the provided BAM, soft-clipping beyond-end-of-reference alignments and setting MAPQ to 0 for unmapped reads
0
1
bam
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Collects hybrid-selection (HS) metrics for a SAM or BAM file.
0
1
2
3
4
0
1
0
1
0
1
metrics
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Collect metrics about the insert size distribution of a paired-end library.
0
1
metrics
histogram
versions
Java tools for working with NGS data in the BAM format
Collect multiple metrics from a BAM file
0
1
2
0
1
0
1
metrics
pdf
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Collect metrics from a RNAseq BAM file
0
1
0
0
0
metrics
pdf
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Collect metrics about coverage and performance of whole genome sequencing (WGS) experiments.
0
1
2
0
1
0
1
0
metrics
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Checks that all data in the set of input files appear to come from the same individual
0
1
2
3
4
5
0
1
crosscheck_metrics
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Converts a FASTQ file to an unaligned BAM or SAM file.
0
1
bam
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Filters SAM/BAM files to include/exclude either aligned/unaligned reads or based on a read list
0
1
2
0
bam
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Merges multiple BAM files into a single file
0
1
bam
versions
A set of command line tools (in Java) for manipulating high-throughput sequencing (HTS) data and formats such as SAM/BAM/CRAM and VCF.
Run all Portcullis steps in one go
0
1
0
1
0
1
log
bed
tab
versions
Portcullis is a tool that filters out invalid splice junctions from RNA-seq alignment data. It accepts BAM files from various RNA-seq mappers, analyzes splice junctions and removes likely false positives, outputting filtered results in multiple formats for downstream analysis.
Split fasta file by 'N's to aid in self alignment for duplicate purging
0
1
split_fasta
versions
Purge_dups is a package used to purge haplotigs and overlaps in an assembly based on read depth
Evaluate alignment data
0
1
0
results
versions
Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.
Evaluate alignment data
0
1
2
0
0
0
results
versions
Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.
Evaluate alignment data
0
1
0
1
results
versions
Qualimap 2 is a platform-independent application written in Java and R that provides both a Graphical User Interface and a command-line interface to facilitate the quality control of alignment sequencing data and its derivatives like feature counts.
Produces a Newick format phylogeny from a multiple sequence alignment using a Neighbour-Joining algorithm. Capable of bacterial genome size alignments.
0
stockholm_alignment
phylogeny
versions
Calculate pan-genome from annotated bacterial assemblies in GFF3 format
0
1
results
aln
versions
Calling lowest common ancestors from multi-mapped reads in SAM/BAM/CRAM files
0
1
2
0
csv
json
bam
versions
Lowest Common Ancestor on SAM/BAM/CRAM alignment files
Clips read alignments where they match BED file defined regions
0
1
0
0
0
bam
stats
rejects_bam
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
calculates MD and NM tags
0
1
0
1
bam
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Concatenate BAM or CRAM file
0
1
bam
cram
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Produces a consensus FASTA/FASTQ/PILEUP
0
1
fasta
fastq
pileup
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
convert and then index CRAM -> BAM or BAM -> CRAM file
0
1
2
0
1
0
1
bam
cram
bai
crai
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
produces a histogram or table of coverage per chromosome
0
1
2
0
1
0
1
coverage
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
List CRAM Content-ID and Data-Series sizes
0
1
size
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Create a sequence dictionary file from a FASTA file
0
1
dict
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Index FASTA file, and optionally generate a file of chromosome sizes
0
1
0
1
0
fa
fai
sizes
gzi
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Converts a SAM/BAM/CRAM file to FASTQ
0
1
0
fastq
interleaved
singleton
other
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Samtools fixmate is a tool that can fill in information (insert size, cigar, mapq) about paired end reads onto the corresponding other read. Also has options to remove secondary/unmapped alignments and recalculate whether reads are proper pairs.
0
1
bam
cram
sam
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Counts the number of alignments in a BAM/CRAM/SAM file for each FLAG type
0
1
2
flagstat
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
filter/convert SAM/BAM/CRAM file
0
1
readgroup
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Reports alignment summary statistics for a BAM/CRAM/SAM file
0
1
2
idxstats
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
converts FASTQ files to unmapped SAM/BAM/CRAM
0
1
sam
bam
cram
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Index SAM/BAM/CRAM file
0
1
bai
csi
crai
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
mark duplicate alignments in a coordinate sorted file
0
1
0
1
bam
cram
sam
versions
Tools for dealing with SAM, BAM and CRAM files
Merge BAM or CRAM file
0
1
0
1
0
1
bam
cram
csi
crai
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
BAM
0
1
2
0
mpileup
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Replace the header in the bam file with the header generated by the command. This command is much faster than replacing the header with a BAMโSAMโBAM conversion.
0
1
bam
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Collate/Fixmate/Sort/Markdup SAM/BAM/CRAM file
0
1
0
1
bam
cram
csi
crai
metrics
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Sort SAM/BAM/CRAM file
0
1
0
1
bam
cram
crai
csi
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
Produces comprehensive statistics from SAM/BAM/CRAM file
0
1
2
0
1
stats
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
filter/convert SAM/BAM/CRAM file
0
1
2
0
1
0
0
bam
cram
sam
bai
csi
crai
unselected
unselected_index
versions
SAMtools is a set of utilities for interacting with and post-processing short DNA sequence read alignments in the SAM, BAM and CRAM formats, written by Heng Li. These files are generated as output by short read aligners like BWA.
The Cluster Analysis tool of Scramble analyses and interprets the soft-clipped clusters found by cluster_identifier
0
1
0
0
meis_tab
dels_tab
vcf
versions
Soft Clipped Read Alignment Mapper
The cluster_identifier tool of Scramble identifies soft clipped clusters
0
1
2
0
clusters
versions
Soft Clipped Read Alignment Mapper
A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection
0
1
0
0
alignment
trans_alignments
multi_bed
single_bed
versions
A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection
Generate genome indices for segemehl align
0
index
versions
A multi-split mapping algorithm for circular RNA, splicing, trans-splicing and fusion detection
Performs fastq alignment to a fasta reference using Sentieon's BWA MEM
0
1
0
1
0
1
0
1
bam_and_bai
versions
Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.
Generate recalibration table and optionally perform base quality recalibration
0
1
2
0
1
0
1
0
1
0
1
0
1
0
table
table_post
recal_alignment
csv
pdf
versions
Sentieonยฎ provides complete solutions for secondary DNA/RNA analysis for a variety of sequencing platforms, including short and long reads. Our software improves upon BWA, STAR, Minimap2, GATK, HaplotypeCaller, Mutect, and Mutect2 based pipelines and is deployable on any generic-CPU-based computing system.
Induce a variation graph in GFA format from alignments in PAF format
0
1
2
gfa
versions
seqwish implements a lossless conversion from pairwise alignments between sequences to a variation graph encoding the sequences and their alignments.
Severus is a somatic structural variation (SV) caller for long reads (both PacBio and ONT)
0
1
2
3
4
5
0
1
log
read_qual
breakpoints_double
read_alignments
read_ids
collapsed_dup
loh
all_vcf
all_breakpoints_clusters_list
all_breakpoints_clusters
all_plots
somatic_vcf
somatic_breakpoints_clusters_list
somatic_breakpoints_clusters
somatic_plots
versions
Simple ANI calculation between reference and query genomes.
0
1
0
1
dist
versions
skani is a fast and robust tool for calculating ANI between metagenome assembled genomes and contigs.
Memory-efficient ANI database queries with skani.
0
1
0
1
search
versions
skani is a fast and robust tool for calculating ANI between metagenome assembled genomes and contigs.
Storing skani sketches/indices on disk.
0
1
sketch_dir
sketch
markers
versions
skani is a fast and robust tool for calculating ANI between metagenome assembled genomes and contigs.
All-to-all ANI computation.
0
1
triangle
versions
skani is a fast and robust tool for calculating ANI between metagenome assembled genomes and contigs.
smoove simplifies and speeds calling and genotyping SVs for short reads. It also improves specificity by removing many spurious alignment signals that are indicative of low-level noise and often contribute to spurious calls. Developed by Brent Pedersen.
0
1
2
3
0
1
0
1
vcf
versions
structural variant calling and genotyping with existing tools, but, smoothly
Create a SNAP index for reference genome
0
1
2
3
4
index
versions
Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data
Core-SNP alignment from Snippy outputs
0
1
2
0
aln
full_aln
tab
vcf
txt
versions
Rapid bacterial SNP calling and core genome alignments
Rapid haploid variant calling
0
1
0
tab
csv
html
vcf
bed
gff
bam
bai
log
aligned_fa
consensus_fa
consensus_subs_fa
raw_vcf
filt_vcf
vcf_gz
vcf_csi
txt
versions
Rapid bacterial SNP calling and core genome alignments
Local sequence alignment tool for filtering, mapping and clustering.
0
1
0
1
0
1
reads
log
index
versions
The core algorithm is based on approximate seeds and allows for sensitive analysis of NGS reads. The main application of SortMeRNA is filtering rRNA from metatranscriptomic data. SortMeRNA takes as input files of reads (fasta, fastq, fasta.gz, fastq.gz) and one or multiple rRNA database file(s), and sorts apart aligned and rejected reads into two files. Additional applications include clustering and taxonomy assignation available through QIIME v1.9.1. SortMeRNA works with Illumina, Ion Torrent and PacBio data, and can produce SAM and BLAST-like alignments.
Module to use the 10x Space Ranger pipeline to process 10x spatial transcriptomics data
0
1
2
3
4
5
6
7
0
0
outs
versions
Visium Spatial Gene Expression is a next-generation molecular profiling solution for classifying tissue based on total mRNA. Space Ranger is a set of analysis pipelines that process Visium Spatial Gene Expression data with brightfield and fluorescence microscope images. Space Ranger allows users to map the whole transcriptome in formalin fixed paraffin embedded (FFPE) and fresh frozen tissues to discover novel insights into normal development, disease pathology, and clinical translational research. Space Ranger provides pipelines for end to end analysis of Visium Spatial Gene Expression experiments.
Align reads to a reference genome using STAR
0
1
0
1
0
1
0
0
0
log_final
log_out
log_progress
versions
bam
bam_sorted
bam_sorted_aligned
bam_transcript
bam_unsorted
fastq
tab
spl_junc_tab
read_per_gene_tab
junction
sam
wig
bedgraph
STAR is a software package for mapping DNA sequences against a large reference genome, such as the human genome.
Aligns sequences using T_COFFEE
0
1
0
1
0
1
2
0
alignment
lib
versions
A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.
Parallel implementation of the gzip algorithm.
Compares 2 alternative MSAs to evaluate them.
0
1
2
scores
versions
A collection of tools for Multiple Alignments of DNA, RNA, Protein Sequence
Parallel implementation of the gzip algorithm.
Computes a consensus alignment using T_COFFEE
0
1
0
1
0
alignment
eval
versions
A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.
Parallel implementation of the gzip algorithm.
Reformats the header of PDB files with t-coffee
0
1
formatted_pdb
versions
A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.
Computes the irmsd score for a given alignment and the structures.
0
1
0
1
2
irmsd
versions
A collection of tools for Multiple Alignments of DNA, RNA, Protein Sequence
Parallel implementation of the gzip algorithm.
Aligns sequences using the regressive algorithm as implemented in the T_COFFEE package
0
1
0
1
0
1
2
0
alignment
versions
A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.
Parallel implementation of the gzip algorithm.
Reformats files with t-coffee
0
1
formatted_file
versions
A collection of tools for Computing, Evaluating and Manipulating Multiple Alignments of DNA, RNA, Protein Sequences and Structures.
Compute the TCS score for a MSA or for a MSA plus a library file. Outputs the tcs as it is and a csv with just the total TCS score.
0
1
0
1
tcs
scores
versions
A collection of tools for Multiple Alignments of DNA, RNA, Protein Sequence
Parallel implementation of the gzip algorithm.
TransDecoder identifies candidate coding regions within transcript sequences. it is used to build gff file.
0
1
pep
gff3
cds
dat
folder
versions
TransDecoder identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks.
TransDecoder identifies candidate coding regions within transcript sequences. It is used to build gff file. You can use this module after transdecoder_longorf
0
1
0
pep
gff3
cds
bed
versions
TransDecoder identifies candidate coding regions within transcript sequences, such as those generated by de novo RNA-Seq transcript assembly using Trinity, or constructed based on RNA-Seq alignments to the genome using Tophat and Cufflinks.
Import transcript-level abundances and estimated counts for gene-level analysis packages
0
1
0
1
0
tpm_gene
counts_gene
counts_gene_length_scaled
counts_gene_scaled
lengths_gene
tpm_transcript
counts_transcript
lengths_transcript
versions
Module to run UniverSC an open-source pipeline to demultiplex and process single-cell RNA-Seq data
0
1
0
outs
versions
Filtering, downsampling and profiling alignments in BAM/CRAM formats
0
1
bam
versions
In order to judge about candidate indel and structural variants, Varlociraptor needs to know about certain properties of the underlying sequencing experiment in combination with the used read aligner.
0
1
0
1
0
1
alignment_properties_json
versions
Flexible, uncertainty-aware variant calling with parameter free filtration via FDR control.
Constructs a graph from a reference and variant calls or a multiple sequence alignment file
0
1
2
3
0
1
0
1
graph
versions
Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods.
Deconstruct snarls present in a variation graph in GFA format to variants in VCF format
0
1
0
0
vcf
versions
Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods.
write your description here
0
1
xg
vg_index
versions
Variation graph data structures, interchange formats, alignment, genotyping, and variant calling methods.
Cluster sequences using a single-pass, greedy centroid-based clustering algorithm.
0
1
aln
biom
mothur
otu
bam
out
blast
uc
centroids
clusters
profile
msa
versions
VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)
Performs quality filtering and / or conversion of a FASTQ file to FASTA format.
0
1
fasta
log
versions
VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)
Taxonomic classification using the sintax algorithm.
0
1
0
tsv
versions
VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)
Sort fasta entries by decreasing abundance (--sortbysize) or sequence length (--sortbylength).
0
1
0
fasta
versions
VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)
Compare target sequences to fasta-formatted query sequences using global pairwise alignment.
0
1
0
0
0
0
aln
biom
lca
mothur
otu
sam
tsv
txt
uc
versions
VSEARCH is a versatile open-source tool for microbiome analysis, including chimera detection, clustering, dereplication and rereplication, extraction, FASTA/FASTQ/SFF file processing, masking, orienting, pair-wise alignment, restriction site cutting, searching, shuffling, sorting, subsampling, and taxonomic classification of amplicon sequences for metagenomics, genomics, and population genetics. (USEARCH alternative)
a pangenome-scale aligner
0
1
2
3
4
0
0
paf
versions
Convert and filter aligned reads to .npz
0
1
2
0
1
0
1
npz
versions
WIthin-SamplE COpy Number aberration DetectOR, including sex chromosomes
Align reads to a reference genome using YARA
0
1
0
1
bam
bai
versions
Yara is an exact tool for aligning DNA sequencing reads to reference genomes.
Click here to trigger an update.