We present an EST library, chloroplast genome sequence, and nuclear microsatellite markers that were developed for the semi-domesticated oilseed crop noug (Guizotia abyssinica) from Ethiopia. The EST library consists of 25-711 Sanger reads, assembled into 17-538 contigs and singletons, of which 4781 were functionally annotated using the Arabidopsis Information Resource (TAIR). The age distribution of duplicated genes in the EST library shows evidence of two paleopolyploidizations-a pattern that noug shares with several other species in the Heliantheae tribe (Compositae family). From the EST library, we selected 43 microsatellites and then designed and tested primers for their amplification. The number of microsatellite alleles varied between 2 and 10 (average 4.67), and the average observed and expected heterozygosities were 0.49 and 0.54, respectively. The chloroplast genome was sequenced de novo using Illumina's sequencing technology and completed with traditional Sanger sequencing. No large re-arrangements were found between the noug and sunflower chloroplast genomes, but 1.4% of sites have indels and 1.8% show sequence divergence between the two species. We identified 34 tRNAs, 4 rRNA sequences, and 80 coding sequences, including one region (trnH-psbA) with 15% sequence divergence between noug and sunflower that may be particularly useful for phylogeographic studies in noug and its wild relatives. © 2010 Blackwell Publishing Ltd.
PMID: 18728074;PMCID: PMC2727391;Abstract:
Of the approximately 250,000 species of flowering plants, nearly one in ten are members of the Compositae (Asteraceae), a diverse family found in almost every habitat on all continents except Antarctica. With an origin in the mid Eocene, the Compositae is also a relatively young family with remarkable diversifications during the last 40 My. Previous cytologic and systematic investigations suggested that paleopolyploidy may have occurred in at least one Compositae lineage, but a recent analysis of genomic data was equivocal. We tested for evidence of paleopolyploidy in the evolutionary history of the family using recently available expressed sequence tag (EST) data from the Compositae Genome Project. Combined with data available on GenBank, we analyzed nearly 1 million ESTs from 18 species representing seven genera and four tribes. Our analyses revealed at least three ancient whole-genome duplications in the Compositae - a paleopolyploidization shared by all analyzed taxa and placed near the origin of the family just prior to the rapid radiation of its tribes and independent genome duplications near the base of the tribes Mutisieae and Heliantheae. These results are consistent with previous research implicating paleopolyploidy in the evolution and diversification of the Heliantheae. Further, we observed parallel retention of duplicate genes from the basal Compositae genome duplication across all tribes, despite divergence times of 33-38 My among these lineages. This pattern of retention was also repeated for the paleologs from the Heliantheae duplication. Intriguingly, the categories of genes retained in duplicate were substantially different from those in Arabidopsis. In particular, we found that genes annotated to structural components or cellular organization Gene Ontology categories were significantly enriched among paleologs, whereas genes associated with transcription and other regulatory functions were significantly underrepresented. Our results suggest that paleopolyploidy can yield strikingly consistent signatures of gene retention in plant genomes despite extensive lineage radiations and recurrent genome duplications but that these patterns vary substantially among higher taxonomic categories. © The Author 2008. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved.
Unexpectedly, in plant taxa that reproduce by self-pollination or cloning, more mitochondrial genes have shifted to the nucleus than in taxa that reproduce sexually.