
Hi‐C scaffolded short‐ and long‐read genome assemblies of the California sea lion are broadly consistent for syntenic inference across 45 million years of evolution

Peart, Claire R.; Williams, Christina; Pophaly, Saurabh D.; Neely, Benjamin A.; Gulland, Frances M.D.; Adams, David J.; Ng, Bee Ling; Cheng, William; Goebel, Michael E.; Fedrigo, Olivier; Haase, Bettina; Mountcastle, Jacquelyn; Fungtammasan, Arkarachai; Formenti, Giulio; Collins, Joanna; Wood, Jonathan; Sims, Ying; Torrance, James; Tracey, Alan; Howe, Kerstin; Rhie, Arang; Hoffman, Joseph I. ORCID: https://orcid.org/0000-0001-5895-8949; Johnson, Jeremy; Jarvis, Erich D.; Breen, Matthew; Wolf, Jochen B.W. 2021 Hi‐C scaffolded short‐ and long‐read genome assemblies of the California sea lion are broadly consistent for syntenic inference across 45 million years of evolution. Molecular Ecology Resources, 21 (7). 2455-2470. 10.1111/1755-0998.13443

© 2021 The Authors. Molecular Ecology Resources published by John Wiley & Sons Ltd.
1755-0998.13443.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial 4.0.

Abstract/Summary

With the advent of chromatin-interaction maps, chromosome-level genome assemblies have become a reality for a wide range of organisms. Scaffolding quality is, however, difficult to judge. To explore this gap, we generated multiple chromosome-scale genome assemblies of an emerging wild animal model for carcinogenesis, the California sea lion (Zalophus californianus). Short-read assemblies were scaffolded with two independent chromatin interaction mapping data types (Hi-C and Chicago), and long-read assemblies with three data types (Hi-C, optical maps, and 10X linked reads) following the ‘Vertebrate Genomes Project (VGP)’ pipeline. In both approaches, 18 major scaffolds recovered the karyotype (2n=36), with scaffold N50s of 138 Mb and 147 Mb, respectively. Synteny relationships at the chromosome level with other pinniped genomes (2n=32-36), ferret (2n=34), red panda (2n=36) and domestic dog (2n=78) were consistent across approaches and recovered known fissions and fusions. Comparative chromosome painting and multicolor chromosome tiling with a panel of 264 genome-integrated single-locus canine bacterial artificial chromosome (BAC) probes provided independent evaluation of genome organization. Broad-scale discrepancies between the approaches were observed within chromosomes, most commonly in translocations centered around centromeres and telomeres, which were better resolved in the VGP assembly. Genomic and cytological approaches agreed on near-perfect synteny of the X chromosome, and in combination allowed detailed investigation of autosomal rearrangements between dog and sea lion. This study presents high-quality genomes of an emerging cancer model and highlights that even highly fragmented short-read assemblies scaffolded with Hi-C can yield reliable chromosome-level scaffolds suitable for comparative genomic analyses.
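
The scaffold N50 values quoted in the abstract follow the standard definition of the statistic: the length of the scaffold at which the running total, taken over scaffolds sorted from longest to shortest, first reaches half of the total assembly length. The sketch below illustrates that definition only. It is not the authors' code, the function name `n50` is chosen here for illustration, and the scaffold lengths are hypothetical toy values rather than data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    N50 is the length L such that scaffolds of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one scaffold length")
    total = sum(lengths)
    running = 0
    # Walk from the longest scaffold down, accumulating length.
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Hypothetical scaffold lengths in Mb (not taken from the paper).
toy_lengths_mb = [200, 150, 100, 50, 30, 20]
print(n50(toy_lengths_mb))
```

Because N50 depends only on the sorted length distribution, two assemblies can have similar N50s while differing in how sequence is ordered within scaffolds, which is why the abstract relies on synteny and cytogenetic evidence in addition to this statistic.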

Item Type: Publication - Article
Digital Object Identifier (DOI): 10.1111/1755-0998.13443
ISSN: 1755-098X
Additional Keywords: California sea lion (Zalophus californianus), cancer, Carnivora, chromatin interaction mapping, genome assembly, genome evolution
Date made live: 12 Jun 2021 06:22 +0 (UTC)
URI: https://nora.nerc.ac.uk/id/eprint/530500
