loading page

Haplotype-resolved, chromosome-scale genome assembly of Quercus rubra L.
  • +14
  • Beant Kapoor,
  • Jerry Jenkins,
  • Jeremy Schmutz ,
  • Tatyana Zhebentyayeva,
  • Carsten Kuelheim,
  • Mark Coggeshall,
  • Chris Heim,
  • Jesse Lasky,
  • Laura Leites,
  • Nurul Islam-Faridi,
  • Jeanne Romero-Severson,
  • Victoria DeLeo,
  • Sarah Lucas,
  • Desanka Lazic,
  • Oliver Gailing,
  • John Carlson,
  • Margaret Staton
Beant Kapoor
University of Tennessee
Author Profile
Jerry Jenkins
HudsonAlpha Institute for Biotechnology
Author Profile
Jeremy Schmutz
US Department of Energy
Author Profile
Tatyana Zhebentyayeva
University of Kentucky
Author Profile
Carsten Kuelheim
Michigan Technological University
Author Profile
Mark Coggeshall
University of Missouri
Author Profile
Chris Heim
North Carolina State University at Raleigh
Author Profile
Jesse Lasky
Pennsylvania State University
Author Profile
Laura Leites
Pennsylvania State University
Author Profile
Nurul Islam-Faridi
USDA Forest Service Southern Research Station
Author Profile
Jeanne Romero-Severson
University of Notre Dame Department of Biological Sciences
Author Profile
Victoria DeLeo
Pennsylvania State University
Author Profile
Sarah Lucas
Pennsylvania State University
Author Profile
Desanka Lazic
University of Göttingen
Author Profile
Oliver Gailing
University of Göttingen
Author Profile
John Carlson
Pennsylvania State University
Author Profile
Margaret Staton
University of Tennessee

Corresponding Author:[email protected]

Author Profile

Abstract

Northern red oak (Quercus rubra L.) is an ecologically and economically important forest tree native to the northeastern United States. We present a chromosome-scale, haplotype-resolved genome of Q. rubra, a representative red oak species, generated by the combination of PacBio sequences and chromatin conformation capture (Hi-C) scaffolding. This is the first reference genome from the red oak clade (section Lobatae). The Q. rubra assembly spans 739 Megabases (Mb) with 95.27% of the genome sequences scaffolded into 12 chromosomes and 33,333 protein-coding genes. Comparisons to the genomes of Q. lobata and Q. mongolica reveal high collinearity, with intrachromosomal structural variants present. Orthologous gene family analysis with other oak and rosid tree species revealed that gene families associated with defense response were expanding and contracting simultaneously across the Q. rubra genome. Quercus rubra had the most CC-NBS-LRR and TIR-NBS-LRR resistance genes out of the nine species analyzed. Terpene synthase gene family comparisons further reveal tandem gene duplications in TPS-b subfamily, similar to Q. robur. Single major QTL regions were identified for vegetative bud break and marcescence which contain candidate genes for further research, including a putative ortholog of the circadian clock constituent cryptochrome (CRY2) and a family of eight tandemly duplicated genes for serine protease inhibitors, respectively. Genome-environment associations across natural populations identified candidate abiotic stress tolerance genes and predicted performance in a common garden. This high-quality red oak genome represents an essential resource to the oak genomics community which will further supplement the knowledge of Quercus genomics.