The sanger-tol project is the informatics branch of the Tree of Life programme at the Wellcome Sanger Institute, UK. Here you can read how we organise ourselves, how we are funded and how the sanger-tol project was started.

We welcome external contributions and collaboration on our projects. If you'd like to be involved, drop us a message or comment directly on an existing GitHub issue.

Please note that all community members are expected to adhere to our code of conduct, which is adopted from nf-core.

Informatics Infrastructure team

The Informatics Infrastructure team provides support for the production of reference genome assemblies and large-scale genome analyses in the Tree of Life programme, and helps with the management and use of informatics resources and digital solutions. Current members are listed below:

Safety team

The Safety team is responsible for ensuring the community is a safe place and responding to instances of misconduct. It is made up of community members who have displayed integrity, strong communication, and a genuine concern for community welfare.

The current safety team in 2023 are:

Financial Support

Wellcome Sanger

Tree of Life is a core funded programme within the Wellcome Sanger Institute and is allocated a slice of the Institute’s core Wellcome grant ref. 220540/Z/20/A.

History of sanger-tol

Established at the Wellcome Sanger Institute in 2019, the Tree of Life Programme is part of a global push to sequence the genomes of all eukaryotic life on Earth. That is, to produce reference-quality genomes for every animal, plant, fungi and single-celled protist living on our planet.

The initial focus of the programme was to provide whole genome sequencing capabilities for the Darwin Tree of Life project, a partnership of ten research institutions, with the aim of generating genomes for the estimated 70,000 species living in Britain and Ireland. Darwin Tree of Life makes a significant contribution to the Earth BioGenome Project, launched a year previously, an umbrella organisation of biodiversity genomics projects seeking to sequence all life. It also complements other like-minded projects, for example the Vertebrate Genomes Project which aims to sequence all vertebrates globally, and on which the Sanger Institute was already a partner prior to Tree of Life’s conception.

Almost half a decade and one pandemic later, Tree of Life has grown solid roots and many branches. The trunk of the programme is its genomics production pipeline, which supports collections of species in the field, their ethical and legal transportation to Sanger, extraction of high molecular weight DNA, and then the sequencing, assembly and curation of genomes organised at a chromosomal level to the highest quality. All this data is published freely and openly for researchers to use worldwide. With a demand to tackle so many different lifeforms, the innovation and expertise built by teams within the programme is vast.

Other projects have popped up alongside Darwin Tree of Life. For example, the Aquatic Symbiosis Genomics project which explores the genomics of symbiotic organisms in the ocean. Or BIOSCAN in the UK, which will study one million flying insects over five years to help better understand and monitor the health of their ecosystems. Tree of Life now has four full-time faculty, plus associate faculty members, exploring a multitude of genomic questions from rapid speciation to strange reproduction.

Key to being able to produce top-quality genomes at scale is building top-quality informatics systems. To support genome production, Tree of Life teams have either adopted existing softwares and tailored them to the programme’s needs, or developed bespoke platforms from scratch such as the Samples Tracking System. Within the bioinformatics teams, who piece together the DNA data into whole genome assemblies, new tools are being developed all the time to automate and increase efficiency, and to tackle specific issues, for example mitochondrial genome assemblies or contamination by DNA from other species. The breadth of talent in these teams ranges from early-career researchers to experienced hands whose time at Sanger can be traced back to the original Human Genome Project.

Together, these scientific endeavours, using the latest DNA sequencing technology, will enable us to tackle the extinction crisis, discover new biomedicines and biotechnologies, and better understand 3.5 billion years of life on Earth.