Available Modules

Modules are the building stones of all DSL2 nf-core blocks. You can find more info from nf-core website, if you would like to write your own module.

  • fasta 9
  • vcf 6
  • database 5
  • variants 5
  • annotation 4
  • qc 4
  • cnv 4
  • sort 3
  • binning 3
  • phage 3
  • ncbi 3
  • public datasets 3
  • bin 3
  • hmmcopy 3
  • ataqv 3
  • entrez 3
  • wastewater 3
  • amplicon sequencing 3
  • vrhyme 3
  • variant calling 2
  • variant 2
  • bcftools 2
  • gzip 2
  • enrichment 2
  • ATAC-seq 2
  • bakta 2
  • rna_structure 2
  • RNA 2
  • miRNA 2
  • CRISPR 2
  • prokka 2
  • functional analysis 2
  • effect prediction 2
  • snpeff 2
  • pharokka 2
  • function 2
  • deconvolution 2
  • nanostring 2
  • pigz 2
  • find 2
  • nacho 2
  • mRNA 2
  • genomics 1
  • gatk4 1
  • structural variants 1
  • align 1
  • gff 1
  • bacteria 1
  • VCF 1
  • ancient DNA 1
  • trimming 1
  • reporting 1
  • consensus 1
  • protein 1
  • aDNA 1
  • bcf 1
  • annotate 1
  • virus 1
  • gff3 1
  • hmmsearch 1
  • antimicrobial peptides 1
  • concatenate 1
  • cat 1
  • amps 1
  • antibiotic resistance 1
  • query 1
  • view 1
  • propr 1
  • ampir 1
  • logratio 1
  • parsing 1
  • DRAMP 1
  • neubi 1
  • amplify 1
  • macrel 1
  • SNP 1
  • mapcounter 1
  • snpsift 1
  • cancer genomics 1
  • gene set 1
  • gene set analysis 1
  • differential expression 1
  • orthology 1
  • metadata 1
  • calling 1
  • CNV 1
  • correction 1
  • extractunbinned 1
  • linkbins 1
  • functional 1
  • omics 1
  • biological activity 1
  • prior knowledge 1
  • grea 1
  • functional enrichment 1
  • gprofiler2 1
  • gost 1
  • install 1
  • UShER 1
  • bootstrapping 1
  • svannotate 1
  • mkarv 1
  • predictions 1
  • dbnsfp 1
  • gccounter 1
  • functional genomics 1
  • sgRNA 1
  • rra 1
  • CRISPR-Cas9 1
  • readcounter 1
  • SNP table 1
  • GATK UnifiedGenotyper 1
  • bam 0
  • fastq 0
  • metagenomics 0
  • genome 0
  • index 0
  • alignment 0
  • assembly 0
  • reference 0
  • bed 0
  • cram 0
  • sam 0
  • filter 0
  • merge 0
  • map 0
  • coverage 0
  • statistics 0
  • quality control 0
  • classification 0
  • nanopore 0
  • download 0
  • gtf 0
  • classify 0
  • k-mer 0
  • split 0
  • MSA 0
  • taxonomy 0
  • taxonomic profiling 0
  • gfa 0
  • contamination 0
  • somatic 0
  • sentieon 0
  • pacbio 0
  • convert 0
  • conversion 0
  • clustering 0
  • quality 0
  • count 0
  • proteomics 0
  • copy number 0
  • single-cell 0
  • bedtools 0
  • phylogeny 0
  • contigs 0
  • rnaseq 0
  • long reads 0
  • imputation 0
  • build 0
  • gvcf 0
  • isoseq 0
  • sv 0
  • mags 0
  • kmer 0
  • variation graph 0
  • bisulfite 0
  • graph 0
  • picard 0
  • bisulphite 0
  • compression 0
  • table 0
  • methylseq 0
  • cna 0
  • indexing 0
  • long-read 0
  • visualisation 0
  • illumina 0
  • wgs 0
  • bqsr 0
  • databases 0
  • QC 0
  • methylation 0
  • stats 0
  • sequences 0
  • demultiplex 0
  • mapping 0
  • openms 0
  • imaging 0
  • plink2 0
  • 5mC 0
  • serotype 0
  • taxonomic classification 0
  • metrics 0
  • antimicrobial resistance 0
  • depth 0
  • tsv 0
  • haplotype 0
  • pangenome graph 0
  • markduplicates 0
  • histogram 0
  • neural network 0
  • structure 0
  • cluster 0
  • base quality score recalibration 0
  • scWGBS 0
  • samtools 0
  • matrix 0
  • WGBS 0
  • expression 0
  • plot 0
  • amr 0
  • protein sequence 0
  • pairs 0
  • searching 0
  • DNA methylation 0
  • filtering 0
  • example 0
  • repeat 0
  • bins 0
  • machine learning 0
  • validation 0
  • mmseqs2 0
  • completeness 0
  • low-coverage 0
  • mappability 0
  • phasing 0
  • biscuit 0
  • transcript 0
  • checkm 0
  • metagenome 0
  • aligner 0
  • palaeogenomics 0
  • cooler 0
  • sequence 0
  • LAST 0
  • transcriptome 0
  • gene 0
  • bwa 0
  • genotype 0
  • seqkit 0
  • archaeogenomics 0
  • damage 0
  • iCLIP 0
  • germline 0
  • bisulfite sequencing 0
  • db 0
  • complexity 0
  • evaluation 0
  • genotyping 0
  • feature 0
  • peaks 0
  • kraken2 0
  • hmmer 0
  • mkref 0
  • segmentation 0
  • blast 0
  • decompression 0
  • glimpse 0
  • population genetics 0
  • msa 0
  • spatial 0
  • newick 0
  • ucsc 0
  • umi 0
  • mag 0
  • bismark 0
  • dedup 0
  • sketch 0
  • vsearch 0
  • prokaryote 0
  • reads 0
  • demultiplexing 0
  • antimicrobial resistance genes 0
  • rna 0
  • csv 0
  • extract 0
  • bedGraph 0
  • multiple sequence alignment 0
  • short-read 0
  • tumor-only 0
  • deduplication 0
  • report 0
  • NCBI 0
  • single 0
  • cnvkit 0
  • duplicates 0
  • mirna 0
  • snp 0
  • plasmid 0
  • pangenome 0
  • prediction 0
  • splicing 0
  • scRNA-seq 0
  • json 0
  • low frequency variant calling 0
  • mitochondria 0
  • kmers 0
  • profile 0
  • differential 0
  • mpileup 0
  • idXML 0
  • fragment 0
  • diversity 0
  • profiling 0
  • svtk 0
  • mem 0
  • kallisto 0
  • detection 0
  • riboseq 0
  • fastx 0
  • summary 0
  • text 0
  • counts 0
  • benchmark 0
  • MAF 0
  • gridss 0
  • adapters 0
  • merging 0
  • isolates 0
  • arg 0
  • compare 0
  • indels 0
  • interval 0
  • sourmash 0
  • mutect2 0
  • call 0
  • FASTQ 0
  • microbiome 0
  • visualization 0
  • ptr 0
  • ont 0
  • de novo assembly 0
  • distance 0
  • tabular 0
  • reference-free 0
  • wxs 0
  • de novo 0
  • clipping 0
  • structural 0
  • single cell 0
  • deamination 0
  • 3-letter genome 0
  • coptr 0
  • microsatellite 0
  • deep learning 0
  • gsea 0
  • snps 0
  • haplotypecaller 0
  • mtDNA 0
  • fgbio 0
  • redundancy 0
  • quantification 0
  • CLIP 0
  • read depth 0
  • transcriptomics 0
  • peak-calling 0
  • xeniumranger 0
  • diamond 0
  • circrna 0
  • miscoding lesions 0
  • palaeogenetics 0
  • bedgraph 0
  • ranking 0
  • interval_list 0
  • HiFi 0
  • happy 0
  • preprocessing 0
  • genome assembler 0
  • hic 0
  • bigwig 0
  • retrotransposon 0
  • STR 0
  • archaeogenetics 0
  • cut 0
  • ganon 0
  • phylogenetic placement 0
  • containment 0
  • SV 0
  • sylph 0
  • isomir 0
  • bedpe 0
  • dna 0
  • ngscheckmate 0
  • HMM 0
  • paf 0
  • telomere 0
  • pypgx 0
  • compress 0
  • matching 0
  • ccs 0
  • genmod 0
  • resistance 0
  • BGC 0
  • chunk 0
  • DNA sequencing 0
  • targeted sequencing 0
  • fai 0
  • hybrid capture sequencing 0
  • image 0
  • bgzip 0
  • biosynthetic gene cluster 0
  • malt 0
  • copy number alteration calling 0
  • clean 0
  • abundance 0
  • chromosome 0
  • fungi 0
  • DNA sequence 0
  • fusion 0
  • umitools 0
  • bcl2fastq 0
  • normalization 0
  • union 0
  • ancestry 0
  • add 0
  • sample 0
  • sequencing 0
  • microarray 0
  • skani 0
  • family 0
  • untar 0
  • transposons 0
  • highly_multiplexed_imaging 0
  • unzip 0
  • fastk 0
  • mcmicro 0
  • image_analysis 0
  • duplication 0
  • fusions 0
  • UMI 0
  • uncompress 0
  • html 0
  • krona 0
  • bacterial 0
  • benchmarking 0
  • minimap2 0
  • pileup 0
  • tabix 0
  • quality trimming 0
  • zip 0
  • archiving 0
  • polishing 0
  • remove 0
  • panel 0
  • adapter trimming 0
  • uLTRA 0
  • scaffolding 0
  • small indels 0
  • host 0
  • typing 0
  • bamtools 0
  • checkv 0
  • khmer 0
  • informative sites 0
  • spaceranger 0
  • popscle 0
  • genotype-based deconvoltion 0
  • observations 0
  • chimeras 0
  • lossless 0
  • kinship 0
  • PacBio 0
  • ligate 0
  • virulence 0
  • identity 0
  • dist 0
  • transcripts 0
  • genome assembly 0
  • relatedness 0
  • score 0
  • angsd 0
  • shapeit 0
  • seqtk 0
  • RNA-seq 0
  • subsample 0
  • pseudoalignment 0
  • arriba 0
  • krona chart 0
  • rsem 0
  • reports 0
  • notebook 0
  • indel 0
  • dictionary 0
  • eukaryotes 0
  • prokaryotes 0
  • spark 0
  • survivor 0
  • population genomics 0
  • cfDNA 0
  • genome mining 0
  • hidden Markov model 0
  • mask 0
  • ambient RNA removal 0
  • complement 0
  • long_read 0
  • atac-seq 0
  • fam 0
  • somatic variants 0
  • aln 0
  • cut up 0
  • proteome 0
  • bracken 0
  • bim 0
  • cool 0
  • mzml 0
  • gatk4spark 0
  • mapper 0
  • repeat expansion 0
  • npz 0
  • combine 0
  • comparisons 0
  • prefetch 0
  • windowmasker 0
  • fcs-gx 0
  • bwameth 0
  • guide tree 0
  • amplicon sequences 0
  • kraken 0
  • structural_variants 0
  • chip-seq 0
  • lineage 0
  • wig 0
  • png 0
  • microbes 0
  • pangolin 0
  • covid 0
  • pan-genome 0
  • hi-c 0
  • pairsam 0
  • comparison 0
  • gene expression 0
  • variant_calling 0
  • cellranger 0
  • replace 0
  • mkfastq 0
  • nucleotide 0
  • insert 0
  • C to T 0
  • dump 0
  • das tool 0
  • regions 0
  • roh 0
  • intervals 0
  • mlst 0
  • fingerprint 0
  • organelle 0
  • genomes 0
  • scaffold 0
  • converter 0
  • PCA 0
  • deeparg 0
  • scores 0
  • das_tool 0
  • graph layout 0
  • shigella 0
  • small genome 0
  • haplogroups 0
  • genetics 0
  • duplicate 0
  • copyratios 0
  • k-mer frequency 0
  • tnhaplotyper2 0
  • signature 0
  • interactions 0
  • rrna 0
  • de novo assembler 0
  • ancient dna 0
  • switch 0
  • xz 0
  • hla 0
  • reformat 0
  • megan 0
  • regression 0
  • COBS 0
  • hlala 0
  • hla_typing 0
  • hlala_typing 0
  • read-group 0
  • Read depth 0
  • archive 0
  • rgfa 0
  • zlib 0
  • taxids 0
  • ChIP-seq 0
  • concordance 0
  • variation 0
  • mitochondrion 0
  • contig 0
  • resolve_bioscience 0
  • GPU-accelerated 0
  • assembly evaluation 0
  • ampgram 0
  • amptransformer 0
  • spatial_transcriptomics 0
  • genomad 0
  • runs_of_homozygosity 0
  • profiles 0
  • junctions 0
  • small variants 0
  • gstama 0
  • taxon name 0
  • SimpleAF 0
  • trancriptome 0
  • multiallelic 0
  • FracMinHash sketch 0
  • tama 0
  • image_processing 0
  • nucleotides 0
  • ped 0
  • cnvnator 0
  • registration 0
  • GC content 0
  • proportionality 0
  • phase 0
  • checksum 0
  • leviosam2 0
  • metamaps 0
  • salmon 0
  • barcode 0
  • primer 0
  • soft-clipped clusters 0
  • taxon tables 0
  • otu tables 0
  • varcal 0
  • instability 0
  • pair 0
  • standardisation 0
  • msi 0
  • minhash 0
  • interactive 0
  • krakenuniq 0
  • standardise 0
  • serogroup 0
  • lofreq 0
  • salmonella 0
  • homoploymer 0
  • purge duplications 0
  • library 0
  • bam2fq 0
  • preseq 0
  • collate 0
  • adapter 0
  • retrotransposons 0
  • MSI 0
  • long terminal repeat 0
  • dict 0
  • fixmate 0
  • long terminal retrotransposon 0
  • kma 0
  • import 0
  • mash 0
  • taxonomic profile 0
  • tumor 0
  • ichorcna 0
  • maximum likelihood 0
  • polyA_tail 0
  • sequenzautils 0
  • refine 0
  • svdb 0
  • mudskipper 0
  • reformatting 0
  • iphop 0
  • orf 0
  • vg 0
  • Streptococcus pneumoniae 0
  • bloom filter 0
  • rtgtools 0
  • instrain 0
  • lift 0
  • k-mer index 0
  • nextclade 0
  • transformation 0
  • micro-satellite-scan 0
  • rename 0
  • krakentools 0
  • tree 0
  • variant pruning 0
  • screen 0
  • msisensor-pro 0
  • bustools 0
  • bfiles 0
  • transcriptomic 0
  • parallelized 0
  • standardization 0
  • subset 0
  • Duplication purging 0
  • vcflib 0
  • removal 0
  • polish 0
  • immunoprofiling 0
  • join 0
  • repeat_expansions 0
  • duplex 0
  • fetch 0
  • GEO 0
  • frame-shift correction 0
  • long-read sequencing 0
  • metagenomic 0
  • identifier 0
  • sequence analysis 0
  • expansionhunterdenovo 0
  • reheader 0
  • tab 0
  • intersection 0
  • windows 0
  • pharmacogenetics 0
  • emboss 0
  • doublets 0
  • eigenstrat 0
  • anndata 0
  • validate 0
  • UMIs 0
  • unaligned 0
  • samplesheet 0
  • smrnaseq 0
  • xenograft 0
  • MCMICRO 0
  • graft 0
  • trim 0
  • allele-specific 0
  • mirdeep2 0
  • RNA sequencing 0
  • realignment 0
  • microbial 0
  • microscopy 0
  • Pharmacogenetics 0
  • bayesian 0
  • concat 0
  • tbi 0
  • intersect 0
  • normalize 0
  • norm 0
  • merge mate pairs 0
  • reads merging 0
  • short reads 0
  • region 0
  • sizes 0
  • ome-tif 0
  • trgt 0
  • split_kmers 0
  • corrupted 0
  • cnv calling 0
  • screening 0
  • vdj 0
  • cvnkit 0
  • single cells 0
  • estimation 0
  • genome bins 0
  • recombination 0
  • eCLIP 0
  • splice 0
  • parse 0
  • cleaning 0
  • bases 0
  • heatmap 0
  • format 0
  • eido 0
  • haplotypes 0
  • awk 0
  • BAM 0
  • blastp 0
  • deseq2 0
  • rna-seq 0
  • blastn 0
  • spatial_omics 0
  • human removal 0
  • random forest 0
  • metagenomes 0
  • gene labels 0
  • structural-variant calling 0
  • hostile 0
  • fasterq-dump 0
  • sra-tools 0
  • settings 0
  • decontamination 0
  • version 0
  • interval list 0
  • scatter 0
  • gatk 0
  • NRPS 0
  • evidence 0
  • MaltExtract 0
  • HOPS 0
  • panelofnormals 0
  • baf 0
  • authentication 0
  • edit distance 0
  • dereplicate 0
  • secondary metabolites 0
  • RiPP 0
  • allele 0
  • demultiplexed reads 0
  • antibiotics 0
  • aggregate 0
  • artic 0
  • simulate 0
  • antismash 0
  • RNA-Seq 0
  • WGS 0
  • joint genotyping 0
  • cgMLST 0
  • samples 0
  • orthologs 0
  • ragtag 0
  • repeats 0
  • filtermutectcalls 0
  • qualty 0
  • gem 0
  • gwas 0
  • hmmscan 0
  • short-read sequencing 0
  • alr 0
  • blat 0
  • yahs 0
  • detecting svs 0
  • Bioinformatics Tools 0
  • confidence 0
  • phylogenies 0
  • geo 0
  • chloroplast 0
  • hmmpress 0
  • patch 0
  • hhsuite 0
  • mapad 0
  • covariance models 0
  • trna 0
  • clr 0
  • copy number variation 0
  • missingness 0
  • reference compression 0
  • baftest 0
  • svtk/baftest 0
  • regex 0
  • impute 0
  • scanner 0
  • whamg 0
  • constant 0
  • wham 0
  • reference panel 0
  • modelsegments 0
  • copy-number 0
  • copy number analysis 0
  • unmarkduplicates 0
  • gender determination 0
  • junction 0
  • references 0
  • copy number alterations 0
  • sccmec 0
  • variantcalling 0
  • c to t 0
  • adna 0
  • dnamodelapply 0
  • workflow_mode 0
  • groupby 0
  • createreadcountpanelofnormals 0
  • taxonomic composition 0
  • denoisereadcounts 0
  • metaspace 0
  • metabolite annotation 0
  • readwriter 0
  • mzML 0
  • snakemake 0
  • data-download 0
  • Immune Deconvolution 0
  • ribosomal RNA 0
  • rRNA 0
  • prepare 0
  • hwe 0
  • catpack 0
  • Computational Immunology 0
  • workflow 0
  • tnscope 0
  • genome annotation 0
  • readproteingroups 0
  • dnascope 0
  • 16S 0
  • proteus 0
  • streptococcus 0
  • spa 0
  • spatype 0
  • mobile genetic elements 0
  • integron 0
  • patterns 0
  • signatures 0
  • doublet 0
  • countsvtypes 0
  • eigenvectors 0
  • hicPCA 0
  • fracminhash sketch 0
  • hash sketch 0
  • sliding 0
  • bgen 0
  • CRISPRi 0
  • pruning 0
  • rdtest2vcf 0
  • downsample 0
  • longest 0
  • isoform 0
  • upd 0
  • transcroder 0
  • cds 0
  • uniparental 0
  • disomy 0
  • snv 0
  • variancepartition 0
  • coding 0
  • sequencing adapters 0
  • downsample bam 0
  • subsample bam 0
  • vcf2db 0
  • gemini 0
  • maf 0
  • eucaryotes 0
  • lua 0
  • dream 0
  • toml 0
  • chromosomal rearrangements 0
  • agat 0
  • drep 0
  • vcfbreakmulti 0
  • pca 0
  • linkage equilibrium 0
  • refflat 0
  • genepred 0
  • bedtobigbed 0
  • ucsc/liftover 0
  • bigbed 0
  • f coefficient 0
  • homozygous genotypes 0
  • heterozygous genotypes 0
  • inbreeding 0
  • umicollapse 0
  • microbial genomics 0
  • bedgraphtobigwig 0
  • scRNA-Seq 0
  • plink2_pca 0
  • bgen file 0
  • covariance model 0
  • dereplication 0
  • files 0
  • vcf file 0
  • genotype dosages 0
  • Mycobacterium tuberculosis 0
  • assembly polishing 0
  • rdtest 0
  • SNV 0
  • remove samples 0
  • tandem repeats 0
  • long read 0
  • decompress 0
  • sintax 0
  • vsearch/sort 0
  • vcf2bed 0
  • shuffleBed 0
  • Indel 0
  • trio binning 0
  • host removal 0
  • usearch 0
  • long read alignment 0
  • pangenome-scale 0
  • all versus all 0
  • mashmap 0
  • gtftogenepred 0
  • wavefront 0
  • haploype 0
  • helitron 0
  • polya tail 0
  • fast5 0
  • genome polishing 0
  • network 0
  • bedcov 0
  • uniq 0
  • deduplicate 0
  • paired reads re-pairing 0
  • comp 0
  • md 0
  • VCFtools 0
  • nm 0
  • wget 0
  • uq 0
  • verifybamid 0
  • GFF/GTF 0
  • short 0
  • intron 0
  • DNA contamination estimation 0
  • SINE 0
  • masking 0
  • low-complexity 0
  • plant 0
  • construct 0
  • melon 0
  • graph projection to vcf 0
  • boxcox 0
  • busco 0
  • fix 0
  • tag2tag 0
  • association 0
  • GWAS 0
  • svg 0
  • case/control 0
  • xml 0
  • script 0
  • java 0
  • associations 0
  • rank 0
  • spatial_neighborhoods 0
  • hashing-based deconvolution 0
  • tags 0
  • standard 0
  • impute-info 0
  • Illumina 0
  • scimap 0
  • Bayesian 0
  • uniques 0
  • invariant 0
  • structural-variants 0
  • drug categorization 0
  • refresh 0
  • clahe 0
  • cell_barcodes 0
  • microRNA 0
  • telseq 0
  • stardist 0
  • variant-calling 0
  • poolseq 0
  • multi-tool 0
  • predict 0
  • search engine 0
  • mass_error 0
  • hardy-weinberg 0
  • hwe statistics 0
  • multiqc 0
  • hwe equilibrium 0
  • haplotag 0
  • reference-independent 0
  • genotype likelihood 0
  • Staging 0
  • collapse 0
  • liftover 0
  • probabilistic realignment 0
  • seqfu 0
  • n50 0
  • cell_type_identification 0
  • cell_phenotyping 0
  • machine_learning 0
  • staging 0
  • tag 0
  • mygene 0
  • vsearch/dereplicate 0
  • coreutils 0
  • transcription factors 0
  • regulatory network 0
  • 10x 0
  • ribosomal 0
  • grabix 0
  • hamming-distance 0
  • bwameme 0
  • bwamem2 0
  • guidetree 0
  • hashing-based deconvoltion 0
  • gnu 0
  • Pacbio 0
  • overlap-based merging 0
  • generic 0
  • AC/NS/AF 0
  • vcflib/vcffixup 0
  • trimfq 0
  • cellsnp 0
  • transposable element 0
  • retrieval 0
  • donor deconvolution 0
  • genotype-based demultiplexing 0
  • MMseqs2 0
  • lexogen 0
  • droplet based single cells 0
  • check 0
  • paired reads merging 0
  • Read report 0
  • orthogroup 0
  • go 0
  • Read trimming 0
  • Read filters 0
  • nanoq 0
  • redundant 0
  • pile up 0
  • extraction 0
  • featuretable 0
  • mass spectrometry 0
  • sage 0
  • nanopore sequencing 0
  • rna velocity 0
  • translation 0
  • cobra 0
  • spot 0
  • circular 0
  • extension 0
  • realign 0
  • quality check 0
  • size 0
  • cram-size 0
  • selector 0
  • paraphase 0
  • homologs 0
  • vsearch/fastqfilter 0
  • malformed 0
  • rad 0
  • tnfilter 0
  • plotting 0
  • scanpy 0
  • array_cgh 0
  • cytosure 0
  • metagenome assembler 0
  • vector 0
  • morphology 0
  • resegment 0
  • relabel 0
  • regtools 0
  • cell segmentation 0
  • nuclear segmentation 0
  • structural variant 0
  • bam2fastx 0
  • import segmentation 0
  • bam2fastq 0
  • immcantation 0
  • airrseq 0
  • immunoinformatics 0
  • solo 0
  • scvi 0
  • co-orthology 0
  • derived alleles 0
  • InterProScan 0
  • sequence similarity 0
  • decompose 0
  • partitioning 0
  • Escherichia coli 0
  • chip 0
  • propd 0
  • Read coverage histogram 0
  • updatedata 0
  • run 0
  • reverse complement 0
  • pdb 0
  • simulation 0
  • hmmfetch 0
  • block substitutions 0
  • site frequency spectrum 0
  • transmembrane 0
  • decomposeblocksub 0
  • genome graph 0
  • tnseq 0
  • identity-by-descent 0
  • decoy 0
  • htseq 0
  • mgi 0
  • sompy 0
  • recovery 0
  • peak picking 0
  • leafcutter 0
  • homology 0
  • p-value 0
  • fastqfilter 0
  • translate 0
  • raw 0
  • mgf 0
  • tarball 0
  • parquet 0
  • parser 0
  • dbsnp 0
  • standardize 0
  • quarto 0
  • python 0
  • r 0
  • tar 0
  • jvarkit 0
  • resistance genes 0
  • setgt 0
  • coexpression 0
  • correlation 0
  • corpcor 0
  • ATACshift 0
  • assay 0
  • phylogenetics 0
  • shift 0
  • minimum_evolution 0
  • distance-based 0
  • ATACseq 0
  • nucleotide sequence 0
  • targz 0
  • resfinder 0
  • significance statistic 0
  • gaps 0
  • logFC 0
  • spectral clustering 0
  • comparative genomics 0
  • subsetting 0
  • deep variant 0
  • mutect 0
  • idx 0
  • barcodes 0
  • doublet_detection 0
  • quality_control 0
  • transform 0
  • emoji 0
  • introns 0
  • plastid 0
  • source tracking 0
  • controlstatistics 0
  • elprep 0
  • elfasta 0
  • nucleotide content 0
  • joint-genotyping 0
  • genotypegvcf 0
  • AT content 0
  • nucBed 0
  • bclconvert 0
  • parallel 0
  • ancestral alleles 0
  • methylation bias 0
  • SNPs 0
  • getpileupsummaries 0
  • short variant discovery 0
  • combinegvcfs 0
  • collectsvevidence 0
  • collectreadcounts 0
  • cnnscorevariants 0
  • calibratedragstrmodel 0
  • cross-samplecontamination 0
  • dragstr 0
  • calculatecontamination 0
  • bedtointervallist 0
  • asereadcounter 0
  • vqsr 0
  • variant quality score recalibration 0
  • annotateintervals 0
  • composestrtablefile 0
  • condensedepthevidence 0
  • heattree 0
  • gatherbqsrreports 0
  • germlinecnvcaller 0
  • germline contig ploidy 0
  • panelofnormalscreation 0
  • jointgenotyping 0
  • genomicsdbimport 0
  • genomicsdb 0
  • tranche filtering 0
  • createsequencedictionary 0
  • filtervarianttranches 0
  • filterintervals 0
  • estimatelibrarycomplexity 0
  • duplication metrics 0
  • determinegermlinecontigploidy 0
  • createsomaticpanelofnormals 0
  • targets 0
  • gangstr 0
  • getpileupsumaries 0
  • antibiotic resistance genes 0
  • consensus sequence 0
  • public 0
  • ENA 0
  • SRA 0
  • ANI 0
  • ARGs 0
  • faqcs 0
  • groupreads 0
  • str 0
  • cache 0
  • percent on target 0
  • endogenous DNA 0
  • Streptococcus pyogenes 0
  • swissprot 0
  • duplexumi 0
  • unmapped 0
  • gene-calling 0
  • variant caller 0
  • gamma 0
  • bacterial variant calling 0
  • germline variant calling 0
  • somatic variant calling 0
  • rust 0
  • ubam 0
  • fq 0
  • lint 0
  • random 0
  • generate 0
  • single molecule 0
  • zipperbams 0
  • germlinevariantsites 0
  • readcountssummary 0
  • embl 0
  • Imputation 0
  • gene model 0
  • tama_collapse.py 0
  • genomes on a tree 0
  • merge compare 0
  • GNU 0
  • joint-variant-calling 0
  • Haplotypes 0
  • gstama/merge 0
  • Sample 0
  • low coverage 0
  • gget 0
  • genome statistics 0
  • genome manipulation 0
  • genome summary 0
  • TAMA 0
  • gstama/polyacleanup 0
  • Mykrobe 0
  • abricate 0
  • beagle 0
  • hbd 0
  • ibd 0
  • rgi 0
  • fARGene 0
  • amrfinderplus 0
  • extractvariants 0
  • GTDB taxonomy 0
  • extract_variants 0
  • gvcftools 0
  • gunzip 0
  • gunc 0
  • archaea 0
  • genome taxonomy database 0
  • gfastats 0
  • Salmonella Typhi 0
  • indexfeaturefile 0
  • preprocessintervals 0
  • shiftchain 0
  • selectvariants 0
  • revert 0
  • reblockgvcf 0
  • printsvevidence 0
  • printreads 0
  • postprocessgermlinecnvcalls 0
  • shiftintervals 0
  • snvs 0
  • mutectstats 0
  • mergebamalignment 0
  • leftalignandtrimvariants 0
  • readorientationartifacts 0
  • learnreadorientationmodel 0
  • shiftfasta 0
  • site depth 0
  • repeat content 0
  • file parsing 0
  • genome heterozygosity 0
  • genome size 0
  • models 0
  • compound 0
  • genome profile 0
  • bgc 0
  • txt 0
  • splitcram 0
  • gawk 0
  • variantrecalibrator 0
  • recalibration model 0
  • variantfiltration 0
  • svcluster 0
  • splitintervals 0
  • genbank 0
  • split by chromosome 0
  • Haemophilus influenzae 0
  • illumiation_correction 0
  • BCF 0
  • csi 0
  • deduping 0
  • smaller fastqs 0
  • clumping fastqs 0
  • background_correction 0
  • element 0
  • biallelic 0
  • trimBam 0
  • bamUtil 0
  • bamtools/split 0
  • yaml 0
  • bamtools/convert 0
  • mouse 0
  • update header 0
  • homozygosity 0
  • virulent 0
  • chunking 0
  • subtract 0
  • slopBed 0
  • shiftBed 0
  • multinterval 0
  • overlapped bed 0
  • maskfasta 0
  • jaccard 0
  • autozygosity 0
  • overlap 0
  • getfasta 0
  • genomecov 0
  • closest 0
  • bamtobed 0
  • sorting 0
  • bacphlip 0
  • temperate 0
  • bioawk 0
  • amp 0
  • allele counts 0
  • nuclear contamination estimate 0
  • post Post-processing 0
  • model 0
  • AMPs 0
  • antimicrobial peptide prediction 0
  • Staphylococcus aureus 0
  • installation 0
  • affy 0
  • reference panels 0
  • admixture 0
  • adapterremoval 0
  • antimicrobial reistance 0
  • contiguate 0
  • doCounts 0
  • HLA 0
  • lifestyle 0
  • read group 0
  • autofluorescence 0
  • cycif 0
  • background 0
  • single-stranded 0
  • ancientDNA 0
  • authentict 0
  • bias 0
  • utility 0
  • ATLAS 0
  • sequencing_bias 0
  • post mortem damage 0
  • atlas 0
  • http(s) 0
  • unionBedGraphs 0
  • file manipulation 0
  • deletion 0
  • Segmentation 0
  • cutesv 0
  • gct 0
  • cls 0
  • na 0
  • custom 0
  • Cores 0
  • TMA dearray 0
  • paired-end 0
  • UNet 0
  • mcool 0
  • genomic bins 0
  • makebins 0
  • enzyme 0
  • digest 0
  • pcr duplicates 0
  • track 0
  • cooler/balance 0
  • escherichia coli 0
  • circos 0
  • eklipse 0
  • eigenstratdatabasetools 0
  • pep 0
  • schema 0
  • PEP 0
  • depth information 0
  • corrrelation 0
  • structural variation 0
  • duphold 0
  • segment 0
  • blastx 0
  • cumulative coverage 0
  • scatterplot 0
  • cload 0
  • subcontigs 0
  • sorted 0
  • compartments 0
  • multiomics 0
  • mkvdjref 0
  • cellpose 0
  • hifi 0
  • Assembly 0
  • domains 0
  • topology 0
  • antibody capture 0
  • calder2 0
  • cadd 0
  • postprocessing 0
  • tblastn 0
  • subtyping 0
  • Salmonella enterica 0
  • antigen capture 0
  • crispr 0
  • nucleotide composition 0
  • cmseq 0
  • concoct 0
  • partition histograms 0
  • target 0
  • export 0
  • antitarget 0
  • access 0
  • protein coding genes 0
  • qa 0
  • polymorphic sites 0
  • polymorphic 0
  • polymut 0
  • chromosome_visualization 0
  • duplicate removal 0
  • chromap 0
  • quality assurnce 0
  • mitochondrial 0
  • haplotype resolution 0
  • normal database 0
  • assembly curation 0
  • false duplications 0
  • duplicate purging 0
  • haplotype purging 0
  • cutoff 0
  • panel of normals 0
  • genomic intervals 0
  • False duplications 0
  • intervals coverage 0
  • gene finding 0
  • contact maps 0
  • bmp 0
  • jpg 0
  • pretext 0
  • Haplotype purging 0
  • Assembly curation 0
  • porechop_abi 0
  • strandedness 0
  • sequence-based 0
  • read distribution 0
  • inner_distance 0
  • fragment_size 0
  • read_pairs 0
  • experiment 0
  • bamstat 0
  • purging 0
  • R 0
  • rhocall 0
  • long uncorrected reads 0
  • subsampling 0
  • neighbour-joining 0
  • quast 0
  • contact 0
  • pmdtools 0
  • integrity 0
  • pcr 0
  • CoPRO 0
  • tandem duplications 0
  • insertions 0
  • deletions 0
  • sortvcf 0
  • picard/renamesampleinvcf 0
  • liftovervcf 0
  • PRO-cap 0
  • mate-pair 0
  • hybrid-selection 0
  • phylogenetic composition 0
  • illumina datasets 0
  • identification 0
  • prophage 0
  • GRO-cap 0
  • CAGE 0
  • variant genetic 0
  • variant identifiers 0
  • scoring 0
  • identifiers 0
  • whole genome association 0
  • recode 0
  • indep pairwise 0
  • indep 0
  • exclude 0
  • NETCAGE 0
  • genetic 0
  • GRO-seq 0
  • PRO-seq 0
  • STRIPE-seq 0
  • csRNA-seq 0
  • RAMPAGE 0
  • mapping-based 0
  • rtg 0
  • ChIP-Seq 0
  • gc_wiggle 0
  • error 0
  • rare variants 0
  • relative coverage 0
  • genetic sex 0
  • sex determination 0
  • induce 0
  • bam2seqz 0
  • longread 0
  • freqsum 0
  • pseudodiploid 0
  • pseudohaploid 0
  • random draw 0
  • selection 0
  • seq 0
  • de-novo 0
  • sha256 0
  • interleave 0
  • SMN1 0
  • snippy 0
  • core 0
  • sniffles 0
  • POA 0
  • SMN2 0
  • CRAM 0
  • 256 bit 0
  • sliding window 0
  • features 0
  • density 0
  • boxplot 0
  • exploratory 0
  • shinyngs 0
  • header 0
  • sertotype 0
  • pedfilter 0
  • flagstat 0
  • faidx 0
  • calmd 0
  • ampliconclip 0
  • amplicon 0
  • duplicate marking 0
  • sambamba 0
  • multimapper 0
  • repair 0
  • Ancestor 0
  • LCA 0
  • salsa2 0
  • salsa 0
  • rtg-tools 0
  • rocplot 0
  • insert size 0
  • paired 0
  • sequence headers 0
  • seacr 0
  • grep 0
  • subseq 0
  • variant recalibration 0
  • VQSR 0
  • applyvarcal 0
  • assembly-binning 0
  • chromatin 0
  • read pairs 0
  • cut&run 0
  • cut&tag 0
  • peak-caller 0
  • clusteridentifier 0
  • cluster analysis 0
  • scramble 0
  • readgroup 0
  • phantom peaks 0
  • motif 0
  • clinical 0
  • qualities 0
  • lofreq/filter 0
  • lofreq/call 0
  • Listeria monocytogenes 0
  • limma 0
  • pneumophila 0
  • legionella 0
  • peptide prediction 0
  • collapsing 0
  • adapter removal 0
  • train 0
  • spliced 0
  • reorder 0
  • combining 0
  • AMP 0
  • kegg 0
  • taxonomic assignment 0
  • mass-spectroscopy 0
  • metagenome-assembled genomes 0
  • maxbin2 0
  • representations 0
  • reduced 0
  • mash/sketch 0
  • estimate 0
  • damage patterns 0
  • NGS 0
  • DNA damage 0
  • maximum-likelihood 0
  • kofamscan 0
  • pneumoniae 0
  • MD5 0
  • haemophilus 0
  • genome browser 0
  • js 0
  • igv.js 0
  • igv 0
  • IDR 0
  • panel_of_normals 0
  • pos 0
  • pixel classification 0
  • annotations 0
  • hmtnote 0
  • Hidden Markov Model 0
  • amino acid 0
  • HMMER 0
  • multicut 0
  • pixel_classification 0
  • Klebsiella 0
  • jupytext 0
  • effective genome size 0
  • k-mer counting 0
  • digital normalization 0
  • quant 0
  • kallisto/index 0
  • papermill 0
  • Jupyter 0
  • probability_maps 0
  • Python 0
  • jasmine 0
  • jasminesv 0
  • insertion 0
  • genomic islands 0
  • interproscan 0
  • mcr-1 0
  • 128 bit 0
  • pedigrees 0
  • graph stats 0
  • ILP 0
  • hla-typing 0
  • tumor/normal 0
  • graph viz 0
  • graph formats 0
  • graph unchopping 0
  • combine graphs 0
  • block-compressed 0
  • odgi 0
  • squeeze 0
  • graph drawing 0
  • graph construction 0
  • gender 0
  • Neisseria gonorrhoeae 0
  • HLA-I 0
  • PCR/optical duplicates 0
  • NextGenMap 0
  • graphs 0
  • read 0
  • pair-end 0
  • pbp 0
  • subreads 0
  • pbmerge 0
  • pbbam 0
  • paragraph 0
  • flip 0
  • select 0
  • restriction fragments 0
  • pairstools 0
  • pairtools 0
  • ligation junctions 0
  • upper-triangular matrix 0
  • ngm 0
  • sequencing summary 0
  • megahit 0
  • Merqury 0
  • assembler 0
  • mbias 0
  • metaphlan 0
  • unionsum 0
  • ploidy 0
  • smudgeplot 0
  • contour map 0
  • microrna 0
  • 3D heat map 0
  • Neisseria meningitidis 0
  • rma6 0
  • daa 0
  • debruijn 0
  • denovo 0
  • de Bruijn 0
  • target prediction 0
  • mobile element insertions 0
  • bioinformatics tools 0
  • somatic structural variations 0
  • cancer genome 0
  • contaminant 0
  • Beautiful stand-alone HTML report 0
  • mitochondrial to nuclear ratio 0
  • mitochondrial genome 0
  • ratio 0
  • mtnucratio 0
  • scan 0
  • microsatellite instability 0
  • otu table 0
  • mosdepth 0
  • reference genome 0
  • long-reads 0

A NATA accredited tool for reporting the presence of antimicrobial resistance genes in bacterial genomes

01

matches partials virulence out txt versions

abritamr:

A pipeline for running AMRfinderPlus and collating results into functional classes

A tool to parse and summarise results from antimicrobial peptides tools and present functional classification.

0100

sample_dir txt csv faa summary_csv summary_html log results_db results_db_dmnd results_db_fasta results_db_tsv versions

ataqv function of a corresponding ataqv tool

012300000

json problems versions

ataqv:

ataqv is a toolkit for measuring and comparing ATAC-seq results. It was written to help understand how well ATAC-seq assays have worked, and to make it easier to spot differences that might be caused by library prep or sequencing.

mkarv function of a corresponding ataqv tool

0

html versions

ataqv:

ataqv is a toolkit for measuring and comparing ATAC-seq results. It was written to help understand how well ATAC-seq assays have worked, and to make it easier to spot differences that might be caused by library prep or sequencing.

This command replaces the former bcftools view caller. Some of the original functionality has been temporarily lost in the process of transition under htslib, but will be added back on popular demand. The original calling model can be invoked with the -c option.

012000

vcf tbi csi versions

view:

View, subset and filter VCF or BCF files by position and filtering expression. Convert between VCF and BCF

bcftools Haplotype-aware consequence caller

01010101

vcf tbi csi versions

reheader:

Haplotype-aware consequence caller

view function to generate vcfs

0100

vcf tsv xls versions

cnvpytor:

calling CNVs using read depth

remove false positives of functional crispr genomics due to CNVs

01200

norm_count_file versions

crisprcleanr:

Analysis of CRISPR functional genomics, remove false positive due to CNVs.

decoupler is a package containing different statistical methods to extract biological activities from omics data within a unified framework. It allows to flexibly test any enrichment method with any prior knowledge resource and incorporates methods that take into account the sign and weight. It can be used with any omic, as long as its features can be linked to a biological process based on prior knowledge. For example, in transcriptomics gene sets regulated by a transcription factor, or in phospho-proteomics phosphosites that are targeted by a kinase.

0100

dc_estimate dc_pvals versions

Fast genome-wide functional annotation through orthology assignment.

010001

annotations orthologs hits versions

Searches a term in a public NCBI database

010

xml versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

Queries an NCBI database using Unique Identifier(s)

0120

xml versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

Queries an NCBI database using an UID

01000

txt versions

entrezdirect:

Entrez Direct (EDirect) is a method for accessing the NCBI's set of interconnected databases (publication, sequence, structure, gene, variation, expression, etc.) from a UNIX terminal window. Functions take search terms from command line arguments. Individual operations are combined to build multi-step queries. Record retrieval and formatting normally complete the process.

A module for concatenation of gzipped or uncompressed files getting around UNIX terminal argument size

01

file_out versions

find:

GNU find searches the directory tree rooted at each given starting-point by evaluating the given expression

pigz:

pigz, which stands for Parallel Implementation of GZip, is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data.

A module for decompressing a large number of gzipped files, getting around the UNIX terminal argument limit

01

file_out versions

find:

GNU find searches the directory tree rooted at each given starting-point by evaluating the given expression

pigz:

pigz, which stands for Parallel Implementation of GZip, is a fully functional replacement for gzip that exploits multiple processors and multiple cores to the hilt when compressing data.

Bootstrap sample demixing by resampling each site based on a multinomial distribution of read depth across all sites, where the event probabilities were determined by the fraction of the total sample reads found at each site, followed by a secondary resampling at each site according to a multinomial distribution (that is, binomial when there was only one SNV at a site), where event probabilities were determined by the frequencies of each base at the site, and the number of trials is given by the sequencing depth.

012000

lineages summarized versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

specify the relative abundance of each known haplotype

01200

demix versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

downloads new versions of the curated SARS-CoV-2 lineage file and barcodes

0

barcodes lineages_topology lineages_meta versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

call variant and sequencing depth information of the variant

010

variants versions

freyja:

Freyja recovers relative lineage abundances from mixed SARS-CoV-2 samples and provides functionality to analyze lineage dynamics.

Adds predicted functional consequence, gene overlap, and noncoding element overlap annotations to SV VCF from GATK-SV pipeline. Input files are an SV VCF, a GTF file containing primary or canonical transcripts, and a BED file containing noncoding elements. Output file is an annotated SV VCF.

0123000

annotated_vcf index versions

gatk4:

Genome Analysis Toolkit (GATK4)

runs a functional enrichment analysis with gprofiler2

010101

all_enrich rds plot_png plot_html sub_enrich sub_plot filtered_gmt session_info versions

gprofiler2:

An R interface corresponding to the 2019 update of g:Profiler web tool.

gcCounter function from HMMcopy utilities, used to generate GC content in non-overlapping windows from a fasta reference

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

mapCounter function from HMMcopy utilities, used to generate mappability in non-overlapping windows from a bigwig file

01

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

readCounter function from HMMcopy utilities, used to generate read in windows

012

wig versions

hmmcopy:

C++ based programs for analyzing BAM files and preparing read counts -- used with bioconductor-hmmcopy

Generate a consensus sequence from a BAM file using iVar

0100

fasta qual mpileup versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

Trim primer sequences rom a BAM file with iVar

0120

bam log versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

Call variants from a BAM file using iVar

010000

tsv mpileup versions

ivar:

iVar - a computational package that contains functions broadly useful for viral amplicon-based sequencing.

mageck count for functional genomics, reads are usually mapped to a specific sgRNA

010

count norm versions

mageck:

MAGeCK (Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout), an algorithm to process, QC, analyze and visualize CRISPR screening data.

Mageck test performs a robust ranking aggregation (RRA) to identify positively or negatively selected genes in functional genomics screens.

01

gene_summary sgrna_summary r_script versions

mageck:

MAGeCK (Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout), an algorithm to process, QC, analyze and visualize CRISPR screening data.

SNP table generator from GATK UnifiedGenotyper with functionality geared for aDNA

010101010000001

full_alignment info_txt snp_alignment snp_genome_alignment snpstatistics snptable snptable_snpeff snptable_uncertainty structure_genotypes structure_genotypes_nomissing json versions

NACHO (NAnostring quality Control dasHbOard) is developed for NanoString nCounter data. NanoString nCounter data is a messenger-RNA/micro-RNA (mRNA/miRNA) expression assay and works with fluorescent barcodes. Each barcode is assigned a mRNA/miRNA, which can be counted after bonding with its target. As a result each count of a specific barcode represents the presence of its target mRNA/miRNA.

0101

normalized_counts normalized_counts_wo_HK versions

NACHO:

R package that uses two main functions to summarize and visualize NanoString RCC files, namely: load_rcc() and visualise(). It also includes a function normalise(), which (re)calculates sample specific size factors and normalises the data. For more information vignette("NACHO") and vignette("NACHO-analysis")

NACHO (NAnostring quality Control dasHbOard) is developed for NanoString nCounter data. NanoString nCounter data is a messenger-RNA/micro-RNA (mRNA/miRNA) expression assay and works with fluorescent barcodes. Each barcode is assigned a mRNA/miRNA, which can be counted after bonding with its target. As a result each count of a specific barcode represents the presence of its target mRNA/miRNA.

0101

nacho_qc_reports nacho_qc_png nacho_qc_txt versions

NACHO:

R package that uses two main functions to summarize and visualize NanoString RCC files, namely: load_rcc() and visualise(). It also includes a function normalise(), which (re)calculates sample specific size factors and normalises the data. For more information vignette("NACHO") and vignette("NACHO-analysis")

Install databases necessary for Pharokka's functional analysis

NO input

pharokka_db versions

pharokka:

Fast Phage Annotation Program

Functional annotation of phages

010

cds_final_merged_output cds_functions length_gc_cds_density card vfdb mash reoriented versions

pharokka:

Fast Phage Annotation Program

Perform Gene Ratio Enrichment Analysis

0101

results versions session_info

grea:

Gene Ratio Enrichment Analysis

Genetic variant annotation and functional effect prediction toolbox

012

cache versions

snpeff:

SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of genetic variants on genes and proteins (such as amino acid changes).

Genetic variant annotation and functional effect prediction toolbox

01001

vcf report summary_html genes_txt versions

snpeff:

SnpEff is a variant annotation and effect prediction tool. It annotates and predicts the effects of genetic variants on genes and proteins (such as amino acid changes).

The dbNSFP is an integrated database of functional predictions from multiple algorithms

012012

vcf versions

snpsift:

SnpSift is a toolbox that allows you to filter and manipulate annotated files

calculate secondary structures of two RNAs with dimerization

01

rnacofold_csv rnacofold_ps versions

viennarna:

calculate secondary structures of two RNAs with dimerization

The program works much like RNAfold, but allows one to specify two RNA sequences which are then allowed to form a dimer structure. RNA sequences are read from stdin in the usual format, i.e. each line of input corresponds to one sequence, except for lines starting with > which contain the name of the next sequence. To compute the hybrid structure of two molecules, the two sequences must be concatenated using the & character as separator. RNAcofold can compute minimum free energy (mfe) structures, as well as partition function (pf) and base pairing probability matrix (using the -p switch) Since dimer formation is concentration dependent, RNAcofold can be used to compute equilibrium concentrations for all five monomer and (homo/hetero)-dimer species, given input concentrations for the monomers. Output consists of the mfe structure in bracket notation as well as PostScript structure plots and โ€œdot plotโ€ files containing the pair probabilities, see the RNAfold man page for details. In the dot plots a cross marks the chain break between the two concatenated sequences. The program will continue to read new sequences until a line consisting of the single character @ or an end of file condition is encountered.

Predict RNA secondary structure using the ViennaRNA RNAfold tools. Calculate minimum free energy secondary structures and partition function of RNAs.

01

rnafold_txt rnafold_ps versions

viennarna:

Calculate minimum free energy secondary structures and partition function of RNAs

The program reads RNA sequences, calculates their minimum free energy (mfe) structure and prints the mfe structure in bracket notation and its free energy. If not specified differently using commandline arguments, input is accepted from stdin or read from an input file, and output printed to stdout. If the -p option was given it also computes the partition function (pf) and base pairing probability matrix, and prints the free energy of the thermodynamic ensemble, the frequency of the mfe structure in the ensemble, and the ensemble diversity to stdout.

Extracting sequences that were unbinnned by vRhyme into a FASTA file

0101

unbinned_sequences versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Linking bins output by vRhyme to create one sequences per bin

01

linked_bins versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Binning virus genomes from metagenomes

0101

bins membership summary versions

vrhyme:

vRhyme functions by utilizing coverage variance comparisons and supervised machine learning classification of sequence features to construct viral metagenome-assembled genomes (vMAGs).

Click here to trigger an update.