Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • fasta 9
  • vcf 6
  • database 5
  • variants 5
  • annotation 4
  • qc 4
  • cnv 4
  • sort 3
  • binning 3
  • phage 3
  • ncbi 3
  • bin 3
  • public datasets 3
  • hmmcopy 3
  • wastewater 3
  • amplicon sequencing 3
  • entrez 3
  • vrhyme 3
  • ataqv 3
  • variant calling 2
  • variant 2
  • bcftools 2
  • gzip 2
  • enrichment 2
  • ATAC-seq 2
  • miRNA 2
  • bakta 2
  • rna_structure 2
  • RNA 2
  • CRISPR 2
  • prokka 2
  • functional analysis 2
  • pharokka 2
  • function 2
  • pigz 2
  • find 2
  • nacho 2
  • snpeff 2
  • effect prediction 2
  • nanostring 2
  • mRNA 2
  • deconvolution 2
  • genomics 1
  • gatk4 1
  • structural variants 1
  • align 1
  • gff 1
  • bacteria 1
  • ancient DNA 1
  • VCF 1
  • trimming 1
  • reporting 1
  • consensus 1
  • protein 1
  • aDNA 1
  • annotate 1
  • bcf 1
  • virus 1
  • hmmsearch 1
  • gff3 1
  • antimicrobial peptides 1
  • concatenate 1
  • antibiotic resistance 1
  • cat 1
  • amps 1
  • view 1
  • query 1
  • logratio 1
  • propr 1
  • ampir 1
  • parsing 1
  • amplify 1
  • macrel 1
  • SNP 1
  • DRAMP 1
  • neubi 1
  • gene set analysis 1
  • gene set 1
  • differential expression 1
  • orthology 1
  • mapcounter 1
  • metadata 1
  • calling 1
  • CNV 1
  • cancer genomics 1
  • snpsift 1
  • correction 1
  • linkbins 1
  • extractunbinned 1
  • predictions 1
  • dbnsfp 1
  • functional 1
  • install 1
  • gprofiler2 1
  • gost 1
  • grea 1
  • functional enrichment 1
  • prior knowledge 1
  • biological activity 1
  • omics 1
  • UShER 1
  • bootstrapping 1
  • svannotate 1
  • mkarv 1
  • functional genomics 1
  • rra 1
  • CRISPR-Cas9 1
  • sgRNA 1
  • readcounter 1
  • gccounter 1
  • SNP table 1
  • GATK UnifiedGenotyper 1
  • bam 0
  • fastq 0
  • metagenomics 0
  • genome 0
  • index 0
  • alignment 0
  • assembly 0
  • reference 0
  • bed 0
  • cram 0
  • sam 0
  • filter 0
  • merge 0
  • map 0
  • statistics 0
  • coverage 0
  • quality control 0
  • classification 0
  • download 0
  • nanopore 0
  • gtf 0
  • classify 0
  • split 0
  • k-mer 0
  • MSA 0
  • contamination 0
  • taxonomic profiling 0
  • gfa 0
  • taxonomy 0
  • somatic 0
  • sentieon 0
  • pacbio 0
  • conversion 0
  • convert 0
  • proteomics 0
  • clustering 0
  • count 0
  • quality 0
  • copy number 0
  • single-cell 0
  • rnaseq 0
  • bedtools 0
  • contigs 0
  • long reads 0
  • phylogeny 0
  • imputation 0
  • gvcf 0
  • kmer 0
  • graph 0
  • isoseq 0
  • bisulfite 0
  • build 0
  • sv 0
  • mags 0
  • variation graph 0
  • visualisation 0
  • indexing 0
  • methylation 0
  • methylseq 0
  • bisulphite 0
  • bqsr 0
  • picard 0
  • compression 0
  • table 0
  • long-read 0
  • illumina 0
  • QC 0
  • databases 0
  • wgs 0
  • cna 0
  • serotype 0
  • antimicrobial resistance 0
  • 5mC 0
  • openms 0
  • metrics 0
  • depth 0
  • stats 0
  • imaging 0
  • tsv 0
  • taxonomic classification 0
  • mapping 0
  • sequences 0
  • demultiplex 0
  • example 0
  • histogram 0
  • markduplicates 0
  • DNA methylation 0
  • amr 0
  • samtools 0
  • WGBS 0
  • scWGBS 0
  • haplotype 0
  • structure 0
  • bins 0
  • searching 0
  • pairs 0
  • protein sequence 0
  • matrix 0
  • base quality score recalibration 0
  • cluster 0
  • filtering 0
  • neural network 0
  • pangenome graph 0
  • expression 0
  • plot 0
  • repeat 0
  • validation 0
  • mappability 0
  • iCLIP 0
  • phasing 0
  • completeness 0
  • metagenome 0
  • checkm 0
  • db 0
  • cooler 0
  • germline 0
  • transcript 0
  • seqkit 0
  • bwa 0
  • transcriptome 0
  • aligner 0
  • bisulfite sequencing 0
  • biscuit 0
  • machine learning 0
  • low-coverage 0
  • sequence 0
  • genotype 0
  • damage 0
  • palaeogenomics 0
  • archaeogenomics 0
  • mmseqs2 0
  • plink2 0
  • LAST 0
  • gene 0
  • decompression 0
  • mkref 0
  • umi 0
  • population genetics 0
  • blast 0
  • evaluation 0
  • newick 0
  • hmmer 0
  • dedup 0
  • complexity 0
  • sketch 0
  • segmentation 0
  • mag 0
  • peaks 0
  • feature 0
  • ucsc 0
  • msa 0
  • genotyping 0
  • spatial 0
  • glimpse 0
  • bismark 0
  • kraken2 0
  • short-read 0
  • antimicrobial resistance genes 0
  • scRNA-seq 0
  • json 0
  • mitochondria 0
  • report 0
  • snp 0
  • kmers 0
  • bedGraph 0
  • differential 0
  • low frequency variant calling 0
  • profile 0
  • prokaryote 0
  • deduplication 0
  • vsearch 0
  • extract 0
  • NCBI 0
  • demultiplexing 0
  • multiple sequence alignment 0
  • cnvkit 0
  • reads 0
  • pangenome 0
  • plasmid 0
  • prediction 0
  • mirna 0
  • tumor-only 0
  • duplicates 0
  • single 0
  • splicing 0
  • detection 0
  • isolates 0
  • reference-free 0
  • svtk 0
  • de novo assembly 0
  • FASTQ 0
  • tabular 0
  • call 0
  • 3-letter genome 0
  • mem 0
  • idXML 0
  • fragment 0
  • diversity 0
  • distance 0
  • adapters 0
  • merging 0
  • sourmash 0
  • fastx 0
  • de novo 0
  • arg 0
  • wxs 0
  • coptr 0
  • ptr 0
  • text 0
  • csv 0
  • summary 0
  • single cell 0
  • visualization 0
  • counts 0
  • mpileup 0
  • interval 0
  • microbiome 0
  • benchmark 0
  • indels 0
  • ont 0
  • profiling 0
  • riboseq 0
  • mutect2 0
  • clipping 0
  • MAF 0
  • deamination 0
  • kallisto 0
  • structural 0
  • gridss 0
  • compare 0
  • xeniumranger 0
  • CLIP 0
  • hic 0
  • paf 0
  • umitools 0
  • matching 0
  • sample 0
  • cut 0
  • copy number alteration calling 0
  • bedgraph 0
  • hybrid capture sequencing 0
  • sequencing 0
  • ngscheckmate 0
  • DNA sequencing 0
  • targeted sequencing 0
  • haplotypecaller 0
  • genmod 0
  • ranking 0
  • bgzip 0
  • peak-calling 0
  • microsatellite 0
  • circrna 0
  • read depth 0
  • retrotransposon 0
  • compress 0
  • phylogenetic placement 0
  • dna 0
  • pypgx 0
  • STR 0
  • ccs 0
  • containment 0
  • fgbio 0
  • SV 0
  • mtDNA 0
  • snps 0
  • telomere 0
  • diamond 0
  • bigwig 0
  • deep learning 0
  • genome assembler 0
  • transcriptomics 0
  • quantification 0
  • interval_list 0
  • gsea 0
  • preprocessing 0
  • happy 0
  • redundancy 0
  • HiFi 0
  • ganon 0
  • HMM 0
  • miscoding lesions 0
  • palaeogenetics 0
  • archaeogenetics 0
  • family 0
  • isomir 0
  • bedpe 0
  • rna 0
  • microarray 0
  • ancestry 0
  • union 0
  • skani 0
  • fai 0
  • resistance 0
  • chunk 0
  • image 0
  • BGC 0
  • clean 0
  • biosynthetic gene cluster 0
  • add 0
  • normalization 0
  • bcl2fastq 0
  • malt 0
  • DNA sequence 0
  • chromosome 0
  • fungi 0
  • abundance 0
  • krona 0
  • spark 0
  • pairsam 0
  • benchmarking 0
  • popscle 0
  • survivor 0
  • amplicon sequences 0
  • pan-genome 0
  • covid 0
  • ambient RNA removal 0
  • combine 0
  • genotype-based deconvoltion 0
  • long_read 0
  • regions 0
  • scores 0
  • bracken 0
  • fingerprint 0
  • PCA 0
  • duplication 0
  • minimap2 0
  • html 0
  • ligate 0
  • uLTRA 0
  • bacterial 0
  • transposons 0
  • lineage 0
  • comparisons 0
  • seqtk 0
  • pangolin 0
  • image_analysis 0
  • khmer 0
  • chimeras 0
  • hidden Markov model 0
  • intervals 0
  • mask 0
  • fam 0
  • UMI 0
  • bim 0
  • angsd 0
  • insert 0
  • pileup 0
  • converter 0
  • host 0
  • PacBio 0
  • npz 0
  • bamtools 0
  • windowmasker 0
  • mapper 0
  • cfDNA 0
  • variant_calling 0
  • pseudoalignment 0
  • typing 0
  • structural_variants 0
  • krona chart 0
  • reports 0
  • notebook 0
  • guide tree 0
  • indel 0
  • virulence 0
  • mcmicro 0
  • roh 0
  • highly_multiplexed_imaging 0
  • population genomics 0
  • tabix 0
  • png 0
  • replace 0
  • fastk 0
  • small indels 0
  • panel 0
  • comparison 0
  • fcs-gx 0
  • kraken 0
  • score 0
  • fusion 0
  • cut up 0
  • observations 0
  • arriba 0
  • gene expression 0
  • cool 0
  • cellranger 0
  • dump 0
  • genomes 0
  • zip 0
  • relatedness 0
  • identity 0
  • kinship 0
  • informative sites 0
  • microbes 0
  • eukaryotes 0
  • wig 0
  • somatic variants 0
  • chip-seq 0
  • mzml 0
  • remove 0
  • RNA-seq 0
  • mlst 0
  • repeat expansion 0
  • lossless 0
  • atac-seq 0
  • prokaryotes 0
  • checkv 0
  • sylph 0
  • gatk4spark 0
  • dist 0
  • organelle 0
  • archiving 0
  • mkfastq 0
  • dictionary 0
  • genome assembly 0
  • quality trimming 0
  • hi-c 0
  • C to T 0
  • das tool 0
  • das_tool 0
  • uncompress 0
  • transcripts 0
  • deeparg 0
  • bwameth 0
  • aln 0
  • proteome 0
  • spaceranger 0
  • untar 0
  • genome mining 0
  • subsample 0
  • polishing 0
  • complement 0
  • shapeit 0
  • adapter trimming 0
  • rsem 0
  • prefetch 0
  • nucleotide 0
  • unzip 0
  • genetics 0
  • zlib 0
  • microscopy 0
  • ampgram 0
  • trim 0
  • import 0
  • amptransformer 0
  • concordance 0
  • vcflib 0
  • gem 0
  • variation 0
  • taxon name 0
  • library 0
  • variant pruning 0
  • gstama 0
  • regression 0
  • edit distance 0
  • taxids 0
  • phase 0
  • bfiles 0
  • subset 0
  • MaltExtract 0
  • preseq 0
  • genomad 0
  • interactions 0
  • adapter 0
  • tama 0
  • HOPS 0
  • vg 0
  • ChIP-seq 0
  • trancriptome 0
  • authentication 0
  • bustools 0
  • metamaps 0
  • RiPP 0
  • simulate 0
  • megan 0
  • small variants 0
  • nextclade 0
  • multiallelic 0
  • checksum 0
  • artic 0
  • aggregate 0
  • demultiplexed reads 0
  • RNA-Seq 0
  • NRPS 0
  • tree 0
  • minhash 0
  • nucleotides 0
  • mash 0
  • xz 0
  • archive 0
  • graph layout 0
  • cnvnator 0
  • mudskipper 0
  • k-mer frequency 0
  • k-mer index 0
  • antibiotics 0
  • tumor 0
  • reformatting 0
  • msi 0
  • instability 0
  • MSI 0
  • homoploymer 0
  • antismash 0
  • rrna 0
  • spatial_transcriptomics 0
  • GC content 0
  • resolve_bioscience 0
  • parallelized 0
  • profiles 0
  • tnhaplotyper2 0
  • removal 0
  • assembly evaluation 0
  • rgfa 0
  • transcriptomic 0
  • COBS 0
  • bloom filter 0
  • lift 0
  • read-group 0
  • refine 0
  • ped 0
  • maximum likelihood 0
  • iphop 0
  • instrain 0
  • orf 0
  • SimpleAF 0
  • ichorcna 0
  • reformat 0
  • salmon 0
  • hlala_typing 0
  • hla_typing 0
  • hlala 0
  • hla 0
  • xenograft 0
  • graft 0
  • haplogroups 0
  • leviosam2 0
  • polyA_tail 0
  • purge duplications 0
  • proportionality 0
  • long terminal repeat 0
  • lofreq 0
  • serogroup 0
  • barcode 0
  • primer 0
  • secondary metabolites 0
  • retrotransposons 0
  • mitochondrion 0
  • GPU-accelerated 0
  • long terminal retrotransposon 0
  • pair 0
  • interactive 0
  • krakenuniq 0
  • registration 0
  • kma 0
  • krakentools 0
  • image_processing 0
  • screen 0
  • baf 0
  • concat 0
  • micro-satellite-scan 0
  • samplesheet 0
  • frame-shift correction 0
  • sizes 0
  • long-read sequencing 0
  • sequence analysis 0
  • bases 0
  • gwas 0
  • svdb 0
  • eigenstrat 0
  • validate 0
  • format 0
  • region 0
  • salmonella 0
  • eido 0
  • pharmacogenetics 0
  • rename 0
  • blastp 0
  • deseq2 0
  • rna-seq 0
  • transformation 0
  • awk 0
  • de novo assembler 0
  • emboss 0
  • varcal 0
  • blastn 0
  • soft-clipped clusters 0
  • fixmate 0
  • otu tables 0
  • standardisation 0
  • unaligned 0
  • standardise 0
  • UMIs 0
  • duplex 0
  • taxonomic profile 0
  • fetch 0
  • GEO 0
  • metagenomic 0
  • fusions 0
  • identifier 0
  • intersection 0
  • expansionhunterdenovo 0
  • repeat_expansions 0
  • standardization 0
  • tab 0
  • Pharmacogenetics 0
  • BAM 0
  • heatmap 0
  • dict 0
  • trgt 0
  • human removal 0
  • single cells 0
  • genome bins 0
  • cnv calling 0
  • screening 0
  • cleaning 0
  • cvnkit 0
  • decontamination 0
  • estimation 0
  • split_kmers 0
  • recombination 0
  • corrupted 0
  • eCLIP 0
  • splice 0
  • parse 0
  • hostile 0
  • small genome 0
  • ancient dna 0
  • doublets 0
  • spatial_omics 0
  • random forest 0
  • metagenomes 0
  • sequenzautils 0
  • Streptococcus pneumoniae 0
  • signature 0
  • FracMinHash sketch 0
  • structural-variant calling 0
  • anndata 0
  • fasterq-dump 0
  • sra-tools 0
  • settings 0
  • version 0
  • switch 0
  • shigella 0
  • immunoprofiling 0
  • gene labels 0
  • join 0
  • vdj 0
  • taxon tables 0
  • windows 0
  • collate 0
  • panelofnormals 0
  • msisensor-pro 0
  • WGS 0
  • evidence 0
  • realignment 0
  • cgMLST 0
  • tbi 0
  • intersect 0
  • contig 0
  • normalize 0
  • mirdeep2 0
  • scaffold 0
  • repeats 0
  • polish 0
  • microbial 0
  • MCMICRO 0
  • gatk 0
  • joint genotyping 0
  • ome-tif 0
  • orthologs 0
  • runs_of_homozygosity 0
  • scaffolding 0
  • duplicate 0
  • bayesian 0
  • short reads 0
  • reads merging 0
  • merge mate pairs 0
  • allele-specific 0
  • scatter 0
  • interval list 0
  • haplotypes 0
  • smrnaseq 0
  • rtgtools 0
  • Duplication purging 0
  • allele 0
  • bam2fq 0
  • filtermutectcalls 0
  • Read depth 0
  • junctions 0
  • dereplicate 0
  • reheader 0
  • RNA sequencing 0
  • norm 0
  • simulation 0
  • files 0
  • hmmfetch 0
  • propd 0
  • covariance models 0
  • hash sketch 0
  • scRNA-Seq 0
  • reverse complement 0
  • signatures 0
  • ucsc/liftover 0
  • fracminhash sketch 0
  • Read coverage histogram 0
  • unmarkduplicates 0
  • umicollapse 0
  • transmembrane 0
  • decompose 0
  • network 0
  • htseq 0
  • trna 0
  • downsample 0
  • deduplicate 0
  • wget 0
  • sompy 0
  • SINE 0
  • decoy 0
  • downsample bam 0
  • uniq 0
  • vcfbreakmulti 0
  • subsample bam 0
  • vcf2db 0
  • toml 0
  • gemini 0
  • VCFtools 0
  • snv 0
  • ribosomal RNA 0
  • rRNA 0
  • upd 0
  • constant 0
  • Escherichia coli 0
  • invariant 0
  • uniparental 0
  • SNPs 0
  • genome graph 0
  • plant 0
  • graph projection to vcf 0
  • tnseq 0
  • construct 0
  • disomy 0
  • DNA contamination estimation 0
  • verifybamid 0
  • gtftogenepred 0
  • streptococcus 0
  • spatype 0
  • eucaryotes 0
  • snakemake 0
  • workflow 0
  • decompress 0
  • vcf2bed 0
  • wham 0
  • assembly polishing 0
  • chromosomal rearrangements 0
  • lua 0
  • sliding 0
  • workflow_mode 0
  • whamg 0
  • rdtest 0
  • wavefront 0
  • rdtest2vcf 0
  • mashmap 0
  • createreadcountpanelofnormals 0
  • genome polishing 0
  • hicPCA 0
  • denoisereadcounts 0
  • c to t 0
  • copy number variation 0
  • yahs 0
  • copy number alterations 0
  • geo 0
  • Mycobacterium tuberculosis 0
  • mapad 0
  • adna 0
  • proteus 0
  • eigenvectors 0
  • readproteingroups 0
  • melon 0
  • bedcov 0
  • gender determination 0
  • fast5 0
  • copy number analysis 0
  • polya tail 0
  • copy-number 0
  • copyratios 0
  • readwriter 0
  • boxcox 0
  • variantcalling 0
  • bedtobigbed 0
  • tnscope 0
  • bgen 0
  • scanner 0
  • chloroplast 0
  • genepred 0
  • confidence 0
  • sccmec 0
  • bigbed 0
  • vsearch/sort 0
  • blat 0
  • alr 0
  • sintax 0
  • clr 0
  • refflat 0
  • spa 0
  • groupby 0
  • usearch 0
  • dnamodelapply 0
  • genotype dosages 0
  • all versus all 0
  • coding 0
  • countsvtypes 0
  • baftest 0
  • dnascope 0
  • remove samples 0
  • svtk/baftest 0
  • pangenome-scale 0
  • detecting svs 0
  • long read alignment 0
  • cds 0
  • transcroder 0
  • sequencing adapters 0
  • comp 0
  • helitron 0
  • short-read sequencing 0
  • bedgraphtobigwig 0
  • maf 0
  • predict 0
  • peak picking 0
  • haplotag 0
  • Illumina 0
  • impute-info 0
  • tags 0
  • tag2tag 0
  • hashing-based deconvolution 0
  • host removal 0
  • rank 0
  • java 0
  • script 0
  • haploype 0
  • xml 0
  • svg 0
  • standard 0
  • staging 0
  • drug categorization 0
  • impute 0
  • Staging 0
  • reference compression 0
  • reference panel 0
  • microRNA 0
  • junction 0
  • multiqc 0
  • mass_error 0
  • search engine 0
  • poolseq 0
  • variant-calling 0
  • stardist 0
  • phylogenies 0
  • telseq 0
  • uniques 0
  • Read report 0
  • vsearch/fastqfilter 0
  • paraphase 0
  • cellsnp 0
  • trimfq 0
  • vcflib/vcffixup 0
  • AC/NS/AF 0
  • Pacbio 0
  • guidetree 0
  • bwamem2 0
  • bwameme 0
  • grabix 0
  • ribosomal 0
  • 10x 0
  • SNV 0
  • regulatory network 0
  • transcription factors 0
  • selector 0
  • Read trimming 0
  • sage 0
  • Read filters 0
  • nanoq 0
  • redundant 0
  • extraction 0
  • snippy 0
  • mass spectrometry 0
  • Indel 0
  • cram-size 0
  • orthogroup 0
  • spot 0
  • circular 0
  • realign 0
  • quality check 0
  • size 0
  • vsearch/dereplicate 0
  • hmmscan 0
  • genotype-based demultiplexing 0
  • import segmentation 0
  • paired reads re-pairing 0
  • fix 0
  • malformed 0
  • partitioning 0
  • chip 0
  • updatedata 0
  • subsetting 0
  • run 0
  • logFC 0
  • significance statistic 0
  • pdb 0
  • p-value 0
  • scvi 0
  • solo 0
  • nuclear segmentation 0
  • doublet_detection 0
  • cell segmentation 0
  • block substitutions 0
  • relabel 0
  • resegment 0
  • morphology 0
  • decomposeblocksub 0
  • identity-by-descent 0
  • metagenome assembler 0
  • mgi 0
  • scanpy 0
  • plotting 0
  • regtools 0
  • leafcutter 0
  • recovery 0
  • barcodes 0
  • regex 0
  • hmmpress 0
  • bclconvert 0
  • fastqfilter 0
  • ATACseq 0
  • shift 0
  • ATACshift 0
  • hhsuite 0
  • 16S 0
  • CRISPRi 0
  • setgt 0
  • jvarkit 0
  • translate 0
  • tar 0
  • tarball 0
  • targz 0
  • taxonomic composition 0
  • mzML 0
  • patterns 0
  • emoji 0
  • doublet 0
  • Immune Deconvolution 0
  • Bioinformatics Tools 0
  • Computational Immunology 0
  • catpack 0
  • quality_control 0
  • source tracking 0
  • nucBed 0
  • controlstatistics 0
  • elprep 0
  • prepare 0
  • elfasta 0
  • nucleotide content 0
  • AT content 0
  • donor deconvolution 0
  • lexogen 0
  • vcf file 0
  • parquet 0
  • genotypegvcf 0
  • homozygous genotypes 0
  • heterozygous genotypes 0
  • inbreeding 0
  • parallel 0
  • plastid 0
  • covariance model 0
  • dereplication 0
  • resfinder 0
  • resistance genes 0
  • microbial genomics 0
  • raw 0
  • mgf 0
  • parser 0
  • f coefficient 0
  • dbsnp 0
  • drep 0
  • standardize 0
  • agat 0
  • quarto 0
  • python 0
  • r 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • assay 0
  • phylogenetics 0
  • minimum_evolution 0
  • distance-based 0
  • joint-genotyping 0
  • homologs 0
  • bam2fastq 0
  • bgen file 0
  • site frequency spectrum 0
  • ancestral alleles 0
  • derived alleles 0
  • plink2_pca 0
  • tnfilter 0
  • array_cgh 0
  • cytosure 0
  • vector 0
  • rad 0
  • structural variant 0
  • bam2fastx 0
  • immcantation 0
  • introns 0
  • mutect 0
  • gaps 0
  • linkage equilibrium 0
  • transform 0
  • pruning 0
  • pca 0
  • idx 0
  • deep variant 0
  • airrseq 0
  • comparative genomics 0
  • spectral clustering 0
  • sequence similarity 0
  • homology 0
  • co-orthology 0
  • immunoinformatics 0
  • nucleotide sequence 0
  • longest 0
  • droplet based single cells 0
  • translation 0
  • mygene 0
  • intron 0
  • masking 0
  • go 0
  • pile up 0
  • low-complexity 0
  • GFF/GTF 0
  • trio binning 0
  • nanopore sequencing 0
  • rna velocity 0
  • cobra 0
  • extension 0
  • paired reads merging 0
  • cell_barcodes 0
  • coreutils 0
  • busco 0
  • InterProScan 0
  • MMseqs2 0
  • retrieval 0
  • transposable element 0
  • generic 0
  • gnu 0
  • tandem repeats 0
  • hashing-based deconvoltion 0
  • hamming-distance 0
  • shuffleBed 0
  • long read 0
  • check 0
  • overlap-based merging 0
  • short 0
  • tag 0
  • multi-tool 0
  • cell_phenotyping 0
  • isoform 0
  • hardy-weinberg 0
  • hwe statistics 0
  • hwe equilibrium 0
  • reference-independent 0
  • variancepartition 0
  • genotype likelihood 0
  • collapse 0
  • liftover 0
  • dream 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • cell_type_identification 0
  • machine_learning 0
  • nm 0
  • uq 0
  • structural-variants 0
  • Bayesian 0
  • scimap 0
  • spatial_neighborhoods 0
  • md 0
  • associations 0
  • case/control 0
  • GWAS 0
  • association 0
  • refresh 0
  • clahe 0
  • featuretable 0
  • cumulative coverage 0
  • core 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Haplotypes 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Imputation 0
  • Sample 0
  • TAMA 0
  • low coverage 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • gfastats 0
  • gene model 0
  • gstama/merge 0
  • Salmonella Typhi 0
  • extractvariants 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • abricate 0
  • extract_variants 0
  • gstama/polyacleanup 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • GTDB taxonomy 0
  • Mykrobe 0
  • repeat content 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • genome heterozygosity 0
  • txt 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • file parsing 0
  • gawk 0
  • splitcram 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • mitochondrial 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • element 0
  • biallelic 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • update header 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • atlas 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • TMA dearray 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • Segmentation 0
  • UNet 0
  • pcr duplicates 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • cload 0
  • cutesv 0
  • paired-end 0
  • subcontigs 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • track 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • scatterplot 0
  • corrrelation 0
  • cooler/balance 0
  • nucleotide composition 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • concoct 0
  • protein coding genes 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • cmseq 0
  • polymorphic sites 0
  • qa 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • beagle 0
  • Haemophilus influenzae 0
  • sniffles 0
  • gene finding 0
  • haplotype purging 0
  • cutoff 0
  • panel of normals 0
  • normal database 0
  • genomic intervals 0
  • intervals coverage 0
  • contact maps 0
  • false duplications 0
  • bmp 0
  • jpg 0
  • pretext 0
  • contact 0
  • porechop_abi 0
  • pmdtools 0
  • duplicate purging 0
  • assembly curation 0
  • scoring 0
  • rhocall 0
  • fragment_size 0
  • read_pairs 0
  • experiment 0
  • strandedness 0
  • bamstat 0
  • R 0
  • long uncorrected reads 0
  • Haplotype purging 0
  • subsampling 0
  • neighbour-joining 0
  • quast 0
  • purging 0
  • Assembly curation 0
  • False duplications 0
  • variant genetic 0
  • identifiers 0
  • read distribution 0
  • hybrid-selection 0
  • deletions 0
  • sortvcf 0
  • picard/renamesampleinvcf 0
  • pcr 0
  • liftovervcf 0
  • mate-pair 0
  • phylogenetic composition 0
  • tandem duplications 0
  • illumina datasets 0
  • identification 0
  • prophage 0
  • phantom peaks 0
  • ChIP-Seq 0
  • motif 0
  • insertions 0
  • CoPRO 0
  • whole genome association 0
  • GRO-seq 0
  • recode 0
  • indep pairwise 0
  • indep 0
  • variant identifiers 0
  • exclude 0
  • genetic 0
  • PRO-seq 0
  • GRO-cap 0
  • STRIPE-seq 0
  • csRNA-seq 0
  • RAMPAGE 0
  • NETCAGE 0
  • CAGE 0
  • PRO-cap 0
  • inner_distance 0
  • sequence-based 0
  • read 0
  • pseudohaploid 0
  • sex determination 0
  • induce 0
  • gc_wiggle 0
  • bam2seqz 0
  • freqsum 0
  • pseudodiploid 0
  • random draw 0
  • relative coverage 0
  • selection 0
  • seq 0
  • header 0
  • interleave 0
  • sertotype 0
  • sequence headers 0
  • genetic sex 0
  • rare variants 0
  • subseq 0
  • density 0
  • POA 0
  • SMN2 0
  • SMN1 0
  • CRAM 0
  • sliding window 0
  • features 0
  • boxplot 0
  • error 0
  • exploratory 0
  • shinyngs 0
  • 256 bit 0
  • sha256 0
  • longread 0
  • de-novo 0
  • grep 0
  • variant recalibration 0
  • mapping-based 0
  • LCA 0
  • amplicon 0
  • duplicate marking 0
  • sambamba 0
  • flagstat 0
  • multimapper 0
  • Ancestor 0
  • salsa2 0
  • calmd 0
  • salsa 0
  • rtg-tools 0
  • rocplot 0
  • pedfilter 0
  • rtg 0
  • integrity 0
  • ampliconclip 0
  • faidx 0
  • VQSR 0
  • peak-caller 0
  • applyvarcal 0
  • assembly-binning 0
  • seacr 0
  • chromatin 0
  • cut&run 0
  • cut&tag 0
  • clusteridentifier 0
  • insert size 0
  • cluster analysis 0
  • scramble 0
  • readgroup 0
  • read pairs 0
  • paired 0
  • repair 0
  • pedigrees 0
  • pair-end 0
  • haplotype resolution 0
  • legionella 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • limma 0
  • pneumophila 0
  • clinical 0
  • collapsing 0
  • AMP 0
  • adapter removal 0
  • train 0
  • spliced 0
  • reorder 0
  • combining 0
  • kofamscan 0
  • qualities 0
  • peptide prediction 0
  • pneumoniae 0
  • estimate 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • reduced 0
  • mash/sketch 0
  • taxonomic assignment 0
  • damage patterns 0
  • NGS 0
  • DNA damage 0
  • maximum-likelihood 0
  • kegg 0
  • Klebsiella 0
  • mcr-1 0
  • pos 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • panel_of_normals 0
  • haemophilus 0
  • annotations 0
  • multicut 0
  • hmtnote 0
  • Hidden Markov Model 0
  • amino acid 0
  • HMMER 0
  • genome browser 0
  • pixel classification 0
  • effective genome size 0
  • Jupyter 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • kallisto/index 0
  • papermill 0
  • jupytext 0
  • Python 0
  • pixel_classification 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • interproscan 0
  • probability_maps 0
  • mass-spectroscopy 0
  • MD5 0
  • pbp 0
  • squeeze 0
  • graph viz 0
  • graph formats 0
  • graph unchopping 0
  • graph stats 0
  • combine graphs 0
  • odgi 0
  • graph drawing 0
  • hla-typing 0
  • graph construction 0
  • gender 0
  • Neisseria gonorrhoeae 0
  • ngm 0
  • NextGenMap 0
  • sequencing summary 0
  • tumor/normal 0
  • ILP 0
  • somatic structural variations 0
  • restriction fragments 0
  • subreads 0
  • pbmerge 0
  • pbbam 0
  • graphs 0
  • paragraph 0
  • select 0
  • pairstools 0
  • HLA-I 0
  • pairtools 0
  • ligation junctions 0
  • upper-triangular matrix 0
  • flip 0
  • PCR/optical duplicates 0
  • block-compressed 0
  • mobile element insertions 0
  • cancer genome 0
  • 128 bit 0
  • contour map 0
  • methylation bias 0
  • metaphlan 0
  • unionsum 0
  • ploidy 0
  • smudgeplot 0
  • Merqury 0
  • 3D heat map 0
  • assembler 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • debruijn 0
  • denovo 0
  • megahit 0
  • mbias 0
  • de Bruijn 0
  • contaminant 0
  • mtnucratio 0
  • Beautiful stand-alone HTML report 0
  • bioinformatics tools 0
  • mitochondrial to nuclear ratio 0
  • ratio 0
  • scan 0
  • microrna 0
  • microsatellite instability 0
  • otu table 0
  • mosdepth 0
  • reference genome 0
  • mitochondrial genome 0
  • target prediction 0
  • genome annotation 0

A NATA accredited tool for reporting the presence of antimicrobial resistance genes in bacterial genomes

01

matches partials virulence out txt versions

abritamr:

A pipeline for running AMRfinderPlus and collating results into functional classes

A tool to parse and summarise results from antimicrobial peptides tools and present functional classification.

0100

sample_dir txt csv faa summary_csv summary_html log results_db results_db_dmnd results_db_fasta results_db_tsv versions

ataqv function of a corresponding ataqv tool

012300000

json problems versions

ataqv:

ataqv is a toolkit for measuring and comparing ATAC-seq results. It was written to help understand how well ATAC-seq assays have worked, and to make it easier to spot differences that might be caused by library prep or sequencing.

mkarv function of a corresponding ataqv tool

0

html versions

ataqv:

ataqv is a toolkit for measuring and comparing ATAC-seq results. It was written to help understand how well ATAC-seq assays have worked, and to make it easier to spot differences that might be caused by library prep or sequencing.

This command replaces the former bcftools view caller. Some of the original functionality has been temporarily lost in the process of transition under htslib, but will be added back on popular demand. The original calling model can be invoked with the -c option.

012000

vcf tbi csi versions

view:

View, subset and filter VCF or BCF files by position and filtering expression. Convert between VCF and BCF

bcftools Haplotype-aware consequence caller

01010101

vcf tbi csi versions

reheader:

Haplotype-aware consequence caller

view function to generate vcfs

0100

vcf tsv xls versions

cnvpytor:

calling CNVs using read depth

remove false positives of functional crispr genomics due to CNVs

01200

norm_count_file versions

crisprcleanr:

Analysis of CRISPR functional genomics, remove false positive due to CNVs.

decoupler is a package containing different statistical methods to extract biological activities from omics data within a unified framework. It allows to flexibly test any enrichment method with any prior knowledge resource and incorporates methods that take into account the sign and weight. It can be used with any omic, as long as its features can be linked to a biological process based on prior knowledge. For example, in transcriptomics gene sets regulated by a transcription factor, or in phospho-proteomics phosphosites that are targeted by a kinase.

0100

dc_estimate dc_pvals versions

Fast genome-wide functional annotation through orthology assignment.

010001

annotations orthologs hits versions

Searches a term in a public NCBI database

010

xml versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

Queries an NCBI database using Unique Identifier(s)

0120

xml versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

Queries an NCBI database using an UID

01000

txt versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

A module for concatenation of gzipped or uncompressed files getting around UNIX terminal argument size

01

file_out versions

find:

GNU find searches the directory tree rooted at each given starting-point by evaluating the given expression

pigz:

pigz, which stands for Parallel Implementation of GZip, is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data.

A module for decompressing a large number of gzipped files, getting around the UNIX terminal argument limit

01

file_out versions

find:

GNU find searches the directory tree rooted at each given starting-point by evaluating the given expression

pigz:

pigz, which stands for Parallel Implementation of GZip, is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data.

Bootstrap sample demixing by resampling each site based on a multinomial distribution of read depth across all sites, where the event probabilities were determined by the fraction of the total sample reads found at each site, followed by a secondary resampling at each site according to a multinomial distribution (that is, binomial when there was only one SNV at a site), where event probabilities were determined by the frequencies of each base at the site, and the number of trials is given by the sequencing depth.

012000

lineages summarized versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

specify the relative abundance of each known haplotype

01200

demix versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

downloads new versions of the curated SARS-CoV-2 lineage file and barcodes

0

barcodes lineages_topology lineages_meta versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

call variant and sequencing depth information of the variant

010

variants versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

Adds predicted functional consequence, gene overlap, and noncoding element overlap annotations to SV VCF from GATK-SV pipeline. Input files are an SV VCF, a GTF file containing primary or canonical transcripts, and a BED file containing noncoding elements. Output file is an annotated SV VCF.

0123000

annotated_vcf index versions

gatk4:

Genome Analysis Toolkit (GATK4)

runs a functional enrichment analysis with gprofiler2

010101

all_enrich rds plot_png plot_html sub_enrich sub_plot filtered_gmt session_info versions

gprofiler2:

An R interface corresponding to the 2019 update of g:Profiler web tool.

gcCounter function from HMMcopy utilities, used to generate GC content in non-overlapping windows from a fasta reference

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

mapCounter function from HMMcopy utilities, used to generate mappability in non-overlapping windows from a bigwig file

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

readCounter function from HMMcopy utilities, used to generate read in windows

012

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

Generate a consensus sequence from a BAM file using iVar

0100

fasta qual mpileup versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

Trim primer sequences rom a BAM file with iVar

0120

bam log versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

Call variants from a BAM file using iVar

010000

tsv mpileup versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

mageck count for functional genomics, reads are usually mapped to a specific sgRNA

010

count norm versions

mageck:

MAGeCK (Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout), an algorithm to process, QC, analyze and visualize CRISPR screening data.

Mageck test performs a robust ranking aggregation (RRA) to identify positively or negatively selected genes in functional genomics screens.

01

gene_summary sgrna_summary r_script versions

mageck:

MAGeCK (Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout), an algorithm to process, QC, analyze and visualize CRISPR screening data.

SNP table generator from GATK UnifiedGenotyper with functionality geared for aDNA

010101010000001

full_alignment info_txt snp_alignment snp_genome_alignment snpstatistics snptable snptable_snpeff snptable_uncertainty structure_genotypes structure_genotypes_nomissing json versions

NACHO (NAnostring quality Control dasHbOard) is developed for NanoString nCounter data. NanoString nCounter data is a messenger-RNA/micro-RNA (mRNA/miRNA) expression assay and works with fluorescent barcodes. Each barcode is assigned a mRNA/miRNA, which can be counted after bonding with its target. As a result each count of a specific barcode represents the presence of its target mRNA/miRNA.

0101

normalized_counts normalized_counts_wo_HK versions

NACHO:

R package that uses two main functions to summarize and visualize NanoString RCC files, namely: load_rcc() and visualise(). It also includes a function normalise(), which (re)calculates sample specific size factors and normalises the data. For more information vignette("NACHO") and vignette("NACHO-analysis")

NACHO (NAnostring quality Control dasHbOard) is developed for NanoString nCounter data. NanoString nCounter data is a messenger-RNA/micro-RNA (mRNA/miRNA) expression assay and works with fluorescent barcodes. Each barcode is assigned a mRNA/miRNA, which can be counted after bonding with its target. As a result each count of a specific barcode represents the presence of its target mRNA/miRNA.

0101

nacho_qc_reports nacho_qc_png nacho_qc_txt versions

NACHO:

R package that uses two main functions to summarize and visualize NanoString RCC files, namely: load_rcc() and visualise(). It also includes a function normalise(), which (re)calculates sample specific size factors and normalises the data. For more information vignette("NACHO") and vignette("NACHO-analysis")

Install databases necessary for Pharokka's functional analysis

NO input

pharokka_db versions

pharokka:

Fast Phage Annotation Program

Functional annotation of phages

010

cds_final_merged_output cds_functions length_gc_cds_density card vfdb mash reoriented versions

pharokka:

Fast Phage Annotation Program

Perform Gene Ratio Enrichment Analysis

0101

results versions session_info

grea:

Gene Ratio Enrichment Analysis

Genetic variant annotation and functional effect prediction toolbox

012

cache versions

snpeff:

SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of genetic variants on genes and proteins (such as amino acid changes).

Genetic variant annotation and functional effect prediction toolbox

01001

vcf report summary_html genes_txt versions

snpeff:

SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of genetic variants on genes and proteins (such as amino acid changes).

The dbNSFP is an integrated database of functional predictions from multiple algorithms

012012

vcf versions

snpsift:

SnpSift is a toolbox that allows you to filter and manipulate annotated files

calculate secondary structures of two RNAs with dimerization

01

rnacofold_csv rnacofold_ps versions

viennarna:

calculate secondary structures of two RNAs with dimerization

The program works much like RNAfold, but allows one to specify two RNA sequences which are then allowed to form a dimer structure. RNA sequences are read from stdin in the usual format, i.e. each line of input corresponds to one sequence, except for lines starting with > which contain the name of the next sequence. To compute the hybrid structure of two molecules, the two sequences must be concatenated using the & character as separator. RNAcofold can compute minimum free energy (mfe) structures, as well as partition function (pf) and base pairing probability matrix (using the -p switch) Since dimer formation is concentration dependent, RNAcofold can be used to compute equilibrium concentrations for all five monomer and (homo/hetero)-dimer species, given input concentrations for the monomers. Output consists of the mfe structure in bracket notation as well as PostScript structure plots and โ€œdot plotโ€ files containing the pair probabilities, see the RNAfold man page for details. In the dot plots a cross marks the chain break between the two concatenated sequences. The program will continue to read new sequences until a line consisting of the single character @ or an end of file condition is encountered.

Predict RNA secondary structure using the ViennaRNA RNAfold tools. Calculate minimum free energy secondary structures and partition function of RNAs.

01

rnafold_txt rnafold_ps versions

viennarna:

Calculate minimum free energy secondary structures and partition function of RNAs

The program reads RNA sequences, calculates their minimum free energy (mfe) structure and prints the mfe structure in bracket notation and its free energy. If not specified differently using commandline arguments, input is accepted from stdin or read from an input file, and output printed to stdout. If the -p option was given it also computes the partition function (pf) and base pairing probability matrix, and prints the free energy of the thermodynamic ensemble, the frequency of the mfe structure in the ensemble, and the ensemble diversity to stdout.

Extracting sequences that were unbinnned by vRhyme into a FASTA file

0101

unbinned_sequences versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Linking bins output by vRhyme to create one sequences per bin

01

linked_bins versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Binning virus genomes from metagenomes

0101

bins membership summary versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Click here to trigger an update.