---
res:
  bibo_abstract:
  - Multiple sequence alignments (MSAs) are used for structural and evolutionary
    predictions, but the complexity of aligning large datasets requires the use
    of approximate solutions, including the progressive algorithm. Progressive MSA
    methods start by aligning the most similar sequences and subsequently incorporate
    the remaining sequences, from leaf-to-root, based on a guide-tree. Their accuracy
    declines substantially as the number of sequences is scaled up. We introduce
    a regressive algorithm that enables MSA of up to 1.4 million sequences on a standard
    workstation and substantially improves accuracy on datasets larger than 10,000
    sequences. Our regressive algorithm works the other way around to the progressive
    algorithm and begins by aligning the most dissimilar sequences. It uses an efficient
    divide-and-conquer strategy to run third-party alignment methods in linear time,
    regardless of their original complexity. Our approach will enable analyses of
    extremely large genomic datasets such as the recently announced Earth BioGenome
    Project, which comprises 1.5 million eukaryotic genomes.@eng
  bibo_authorlist:
  - foaf_Person:
      foaf_givenName: Edgar
      foaf_name: Garriga, Edgar
      foaf_surname: Garriga
  - foaf_Person:
      foaf_givenName: Paolo
      foaf_name: Di Tommaso, Paolo
      foaf_surname: Di Tommaso
  - foaf_Person:
      foaf_givenName: Cedrik
      foaf_name: Magis, Cedrik
      foaf_surname: Magis
  - foaf_Person:
      foaf_givenName: Ionas
      foaf_name: Erb, Ionas
      foaf_surname: Erb
  - foaf_Person:
      foaf_givenName: Leila
      foaf_name: Mansouri, Leila
      foaf_surname: Mansouri
  - foaf_Person:
      foaf_givenName: Athanasios
      foaf_name: Baltzis, Athanasios
      foaf_surname: Baltzis
  - foaf_Person:
      foaf_givenName: Hafid
      foaf_name: Laayouni, Hafid
      foaf_surname: Laayouni
  - foaf_Person:
      foaf_givenName: Fyodor
      foaf_name: Kondrashov, Fyodor
      foaf_surname: Kondrashov
      foaf_workInfoHomepage: http://www.librecat.org/personId=44FDEF62-F248-11E8-B48F-1D18A9856A87
    orcid: 0000-0001-8243-4694
  - foaf_Person:
      foaf_givenName: Evan
      foaf_name: Floden, Evan
      foaf_surname: Floden
  - foaf_Person:
      foaf_givenName: Cedric
      foaf_name: Notredame, Cedric
      foaf_surname: Notredame
  bibo_doi: 10.1038/s41587-019-0333-6
  bibo_issue: '12'
  bibo_volume: 37
  dct_date: 2019^xs_gYear
  dct_identifier:
  - UT:000500748900021
  dct_isPartOf:
  - http://id.crossref.org/issn/1087-0156
  - http://id.crossref.org/issn/1546-1696
  dct_language: eng
  dct_publisher: Springer Nature@
  dct_title: Large multiple sequence alignments with a root-to-leaf regressive method@
  fabio_hasPubmedId: '31792410'
...
