Loading [Contrib]/a11y/accessibility-menu.js
Skip to main content
biogenomes
  • Menu
  • Articles
    • Genome Sequencing
    • All
  • For Authors
  • Editorial Board
  • About
  • Open Access
  • Peer Review
  • search

RSS Feed

Enter the URL below into your favorite RSS reader.

http://localhost:47870/feed
Genome Sequencing
October 29, 2022 EDT

The complete genome sequences of Erythroxylum coca and Erythroxylum novogranatense

Dawson White, Lyndel Meinhardt, Bryan Bailey, Stacy Pirro,
erythroxylumgenome
Copyright Logoccby-sa-4.0 • https://doi.org/10.56179/001c.39776
biogenomes
White, Dawson, Lyndel Meinhardt, Bryan Bailey, and Stacy Pirro. 2022. “The Complete Genome Sequences of Erythroxylum Coca and Erythroxylum Novogranatense.” Biodiversity Genomes, October. https:/​/​doi.org/​10.56179/​001c.39776.
Save article as...▾

View more stats

Abstract

The flowering plant genus Erythroxylum contains approximately 300 species, including the economically and socially consequential crops called coca. We present the genome sequences of Erythroxylum coca and E. novogranatense, two cultigens produced for medicinal and quotidian use in the Andes and Amazon regions of South America, as well as the international cocaine industry. Sequencing was performed on an Illumina X-Ten platform, and reads were assembled by a de novo method followed by finishing via comparison with several species from the same genus. The BioProject, raw and assembled data can be accessed in GenBank for E. coca (PRJNA676123; JAJMLV000000000) and E. novogranatense (PRJNA675212; JAJKBF000000000).

Introduction

The leaves of the coca plant have been used as a medicine and mild stimulant in South America for over 8,000 years (Plowman 1984; Dillehay et al. 2010). In more recent history, few plants have had such far reaching effects on human health and international relations (Restrepo et al. 2019). Coca crops produce the alkaloid cocaine: a natural insecticide (Nathanson et al. 1993), Western medicine’s first local anesthetic, and a controlled narcotic whose supply chains and illicit international markets have caused decades of social disaster.

Coca is classified into two species, Erythroxylum coca and E. novogranatense (Erythroxylaceae, Malpighiales), each with two taxonomic varieties. These two species are found only in cultivation, having resulted from independent origins of domestication from the wild E. gracilipes (White et al. 2020).

The two varieties used in this study, E. coca var. ipadu Plowman, known as Amazonian coca, and E. novogranatense var. truxillense (Rusby) Plowman, known as Trujillo coca, are regionally distinct crops. Erythroxylum coca var. ipadu is a cultivated by indigenous groups in the lowland Amazon basin of Colombia, Brazil, and Perú. Erythroxylum novogranatense var. truxillense is grown primarily in the dry valleys of northwestern Perú and is exported as a flavoring agent of Coca Cola®. These taxa have been crossed to produce improved hybrid varieties for the cocaine market, which are currently grown in southern Colombia and possibly southern Mexico (Casale, Mallette, and Jones 2014; Rodríguez Zapata 2015).

Complete genome sequences for E. coca var. ipadu and E. novogranatense var. truxillense will provide insight into the origins, evolution, and modern breeding patterns of coca crops, as well as the of the cocaine biosynthesis pathway.

Methods

DNA from each species was provided by USDA/ARS Sustainable Perennial Crops Laboratory for use in this study.

Sequencing libraries were constructed with the Illumina TruSeq kit using standard protocols for the 2x150 bp format. Sequencing was performed on an Illumina X-Ten platform.

Raw, paired-end sequence data was trimmed of adapter sequence and low-quality regions using Trimmomatic (Bolger, Lohse, and Usadel 2014). Genome preassemblies were constructed using SPAdes (Bankevich et al. 2012), and finished with Zanfona (Kieras et al. 2021).

Results

The results of genome assemblies are as follows:

specimen accession genome size N50
E. coca var. ipadu JAJMLV000000000 584,053,830 71.4 MB
E. novogranatense var. truxillense JAJKBF000000000 573,249,677 50.4 MB

Acknowledgements

Dawson White is supported by an NSF Postdoctoral Fellowship in Biology, award number 2010821.

Funding

Funding was provided by Iridian Genomes, grant # IRGEN_RG_2021-1345 Genomic Studies of Eukaryotic Taxa.

Submitted: October 29, 2022 EDT

Accepted: October 29, 2022 EDT

References

Bankevich, Anton, Sergey Nurk, Dmitry Antipov, Alexey A. Gurevich, Mikhail Dvorkin, Alexander S. Kulikov, Valery M. Lesin, et al. 2012. “SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing.” Journal of Computational Biology 19 (5): 455–77. https:/​/​doi.org/​10.1089/​cmb.2012.0021.
Google ScholarPubMed CentralPubMed
Bolger, Anthony M., Marc Lohse, and Bjoern Usadel. 2014. “Trimmomatic: A Flexible Trimmer for Illumina Sequence Data.” Bioinformatics 30 (15): 2114–20. https:/​/​doi.org/​10.1093/​bioinformatics/​btu170.
Google ScholarPubMed CentralPubMed
Casale, John F., Jennifer R. Mallette, and Laura M. Jones. 2014. “Chemosystematic Identification of Fifteen New Cocaine-Bearing Erythroxylum Cultigens Grown in Colombia for Illicit Cocaine Production.” Forensic Science International 237:30–39. https:/​/​doi.org/​10.1016/​j.forsciint.2014.01.012.
Google Scholar
Dillehay, Tom D., Jack Rossen, Donald Ugent, Anathasios Karathanasis, Víctor Vásquez, and Patricia J. Netherly. 2010. “Early Holocene Coca Chewing In Northern Peru.” Antiquity 84 (326): 939–53. https:/​/​doi.org/​10.1017/​s0003598x00067004.
Google Scholar
Kieras, M., R. Peterson, K. O’Neill, and S. Pirro. 2021. ZANFONA, a Genome Finishing Process for Short Read Assemblies. https:/​/​github.com/​zanfona734/​zanfona.
Google Scholar
Nathanson, J. A., E. J. Hunnicutt, L. Kantham, and C. Scavone. 1993. “Cocaine as a Naturally Occurring Insecticide.” Proceedings of the National Academy of Sciences of the United States of America 90 (20): 9645–48. https:/​/​doi.org/​10.1073/​pnas.90.20.9645.
Google ScholarPubMed CentralPubMed
Plowman, T. 1984. “The Ethnobotany of Coca (Erythroxylum Spp., Erythroxylaceae).” Repos. Inst. CEDRO.
Google Scholar
Restrepo, David A., Ernesto Saenz, Orlando Adolfo Jara-Muñoz, Iván F. Calixto-Botía, Sioly Rodríguez-Suárez, Pablo Zuleta, Benjamin G. Chavez, Juan a. Sanchez, and John C. D’Auria. 2019. “Erythroxylum in Focus: An Interdisciplinary Review of an Overlooked Genus.” Molecules (Basel, Switzerland) 24 (20): 3788. https:/​/​doi.org/​10.3390/​molecules24203788.
Google ScholarPubMed CentralPubMed
Rodríguez Zapata, F.V. 2015. “Genome Size and Descriptors of Leaf Morphology as Indicators of Hybridization in Colombian Cultigens of Coca Erythroxylum Spp.” Thesis, Universidad de los Andes.
White, Dawson M., Jen-Pan Huang, Orlando Adolfo Jara-Muñoz, Santiago Madriñán, Richard H. Ree, and Roberta J. Mason-Gamer. 2020. “The Origins of Coca: Museum Genomics Reveals Multiple Independent Domestications from Progenitor Erythroxylum Gracilipes.” Systematic Biology 70 (1): 1–13. https:/​/​doi.org/​10.1093/​sysbio/​syaa074.
Google ScholarPubMed CentralPubMed

This website uses cookies

We use cookies to enhance your experience and support COUNTER Metrics for transparent reporting of readership statistics. Cookie data is not sold to third parties or used for marketing purposes.

Powered by Scholastica, the modern academic journal management system