Description
The DRAGEN DNA Germline Pipeline accelerates the secondary analysis of NGS data by harnessing the tremendous power available on the DRAGEN Platform. The pipeline includes highly optimized algorithms for mapping, aligning, sorting, duplicate marking, and haplotype variant calling. In addition to haplotype variant calling, the pipeline supports calling of copy number and structural variants as well as detection of repeat expansions and targeted calls.
Input
Input expected genotypes (VCF) for checkfingerprint comparison
*.vcfPath to the directory that contains the compression reference and index file
Variant file (.vcf/.vcf.gz) with population allele frequencies to estimate sample contamination
*.vcf{,.gz}BED file containing the set of regions to call (optionally gzip or bgzip compressed)
*.bed{,.gz}BED file containing the set of exclusion regions for SV calling (optionally gzip or bzip compressed)
*.bedSpecify a VCF of structural variants for forced genotyping, meaning these variants will be scored and emitted in the output VCF even if not found in the sample data. These variants will be merged with any additional variants discovered directly from the sample data.
*.vcfSystematic noise BEDPE file containing the set of noisy paired regions for SV calling (optionally gzip or bzip compressed).
*.bed{,.gz}Files of adapter sequences to trim from the 3' end of read 1 and 2
*.{fasta,fa}FASTA files that contains adapter sequences to trim from the 5' end of Read 1 and 2; The sequences should be in reverse order (with respect to their appearance in the FASTQ) but not complemented
*.{fasta,fa}Excluded regions bed specifying where variants will be hard filtered
*.bedList of small variants to force genotype. Can be .vcf or .vcf.gz file
*.vcf{,.gz}BED file containing regions from which to estimate nucleotide substitution biases
*.bedSite specific noise file. This file enables the systematic-noise filter and improves specificity during somatic variant calling
Tools
dragen Documentation
The Illumina DRAGEN™ Bio-IT Platform is based on the highly reconfigurable DRAGEN Bio-IT Processor, which is integrated on a Field Programmable Gate Array (FPGA) card and is available in a preconfigured server that can be seamlessly integrated into bioinformatics workflows. The platform can be loaded with highly optimized algorithms for many different NGS secondary analysis pipelines.
Command copied!