De novo assembly of the Tamarindus indica genome as part of the Kingdom of Saudi Arabia Native Genome Project
Name: MSc_Thesis___Mnr.pdf
Size: 41.97Mb
Format: PDF
Description: MS Thesis
Embargo End Date: 2023-08-11
Type
Thesis
Advisors
Wing, Rod Anthony
Committee members
Blilou, Ikram
Merzaban, Jasmeen

Program
Plant Science
KAUST Department
Biological and Environmental Science and Engineering (BESE) Division
Date
2022-08-10
Embargo End Date
2023-08-11
Permanent link to this record
http://hdl.handle.net/10754/680228
Access Restrictions
At the time of archiving, the student author of this thesis opted to temporarily restrict access to it. The full text of this thesis will become available to the public after the expiration of the embargo on 2023-08-11.
Abstract
The Kingdom of Saudi Arabia (KSA) Native Genome Project (NGP) aims to generate genomic resources for all the plants, animals, and associated microbiome species in the Kingdom. Tamarindus indica was identified by the Ministry of Environment, Water and Agriculture (MEWA) as an endangered native species in the KSA and is among the first 15 plant species to be studied in the NGP. A voucher tree was identified in the Rijal Almaa region, from which leaf samples were collected. High-molecular-weight (HMW) DNA was extracted from this tissue and sequenced using circular consensus sequencing (CCS) on the PacBio Sequel II platform. The raw sequencing data were assembled using hifiasm, contaminant contigs were removed, and the 15 largest contigs were selected as the primary T. indica assembly. The genome sequence of Sindora glabra was used as a reference guide for primary scaffolding, and T. indica optical maps were used for super-scaffolding. Secondary scaffolding used Hi-C data to produce a chromosome-level assembly of the T. indica genome. Transposable element analysis and a preliminary annotation were performed on the final assembly. This project represents the first step in studying T. indica for the NGP. The final assembly can serve as a foundation for further genetic studies on this species, as a possible reference for other legume species from the subfamily Detarioideae, and as a resource for neo-domestication and reforestation. The pipeline developed for this project can also be used as a template for sequencing and assembling the remaining species in the NGP.
Citation
Navarrete Rodriguez, M. E. (2022). De novo assembly of the Tamarindus indica genome as part of the Kingdom of Saudi Arabia Native Genome Project [KAUST Research Repository]. https://doi.org/10.25781/KAUST-X9IF9
10.25781/KAUST-X9IF9