TY - JOUR
T1 - Characterization of the heart transcriptome of the white shark (Carcharodoncarcharias)
AU - Richards, Vincent P.
AU - Suzuki, Haruo
AU - Stanhope, Michael J.
AU - Shivji, Mahmood S.
N1 - Funding Information:
We are very grateful to Dr. L. Natanson (NOAA NEFSC) for providing the white shark heart and the NOAA Office for Law Enforcement for making the confiscated shark carcass available for scientific study. We thank the Cornell Life Sciences Core Laboratory for performing the 454 and Illumina sequencing, and we are grateful to Qi Sun and Jaroslaw Pillardy for their assistance with the bioinformatics analyses. This study was supported by grants from the Save Our Seas Foundation, Guy Harvey Ocean Foundation, and Hai Stiftung/Shark Foundation to MSS.
PY - 2013/10/11
Y1 - 2013/10/11
N2 - Background: The white shark (Carcharodon carcharias) is a globally distributed,apex predator possessing physical, physiological, and behavioral traits thathave garnered it significant public attention. In addition to interest inthe genetic basis of its form and function, as a representative of theoldest extant jawed vertebrate lineage, white sharks are also ofconservation concern due to their small population size and threat fromoverfishing. Despite this, surprisingly little is known about the biology ofwhite sharks, and genomic resources are unavailable. To address thisdeficit, we combined Roche-454 and Illumina sequencing technologies tocharacterize the first transciptome of any tissue for this species. Results: From white shark heart cDNA we generated 665,399 Roche 454 reads (medianlength 387-bp) that were assembled into 141,626 contigs (mean length503-bp). We also generated 78,566,588 Illumina reads, which we aligned tothe 454 contigs producing 105,014 454/Illumina consensus sequences. Tothese, we added 3,432 non-singleton 454 contigs. By comparing thesesequences to the UniProtKB/Swiss-Prot database we were able to annotate21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Ofthese, 19,277 were additionally assigned Gene Ontology (GO) functionalannotations. While acknowledging the limitations of our single tissuetranscriptome, Fisher tests showed the white shark transcriptome to besignificantly enriched for numerous metabolic GO terms compared to the zebrafish and human transcriptomes, with white shark showing more similarity tohuman than to zebra fish (i.e. fewer terms were significantly different). Wealso compared the transcriptome to other available elasmobranch sequences,for signatures of positive selection and identified several genes ofputative adaptive significance on the white shark lineage. The white sharktranscriptome also contained 8,404 microsatellites (dinucleotide,trinucleotide, or tetranucleotide motifs ≥ five perfect repeats).Detailed characterization of these microsatellites showed that ORFs withtrinucleotide repeats, were significantly enriched for transcriptionregulatory roles and that trinucleotide frequency within ORFs was lower thanfor a wide range of taxonomic groups including other vertebrates.Conclusion: The white shark heart transcriptome represents a valuable resource for futureelasmobranch functional and comparative genomic studies, as well as forpopulation and other biological studies vital for effective conservation ofthis globally vulnerable species.
AB - Background: The white shark (Carcharodon carcharias) is a globally distributed,apex predator possessing physical, physiological, and behavioral traits thathave garnered it significant public attention. In addition to interest inthe genetic basis of its form and function, as a representative of theoldest extant jawed vertebrate lineage, white sharks are also ofconservation concern due to their small population size and threat fromoverfishing. Despite this, surprisingly little is known about the biology ofwhite sharks, and genomic resources are unavailable. To address thisdeficit, we combined Roche-454 and Illumina sequencing technologies tocharacterize the first transciptome of any tissue for this species. Results: From white shark heart cDNA we generated 665,399 Roche 454 reads (medianlength 387-bp) that were assembled into 141,626 contigs (mean length503-bp). We also generated 78,566,588 Illumina reads, which we aligned tothe 454 contigs producing 105,014 454/Illumina consensus sequences. Tothese, we added 3,432 non-singleton 454 contigs. By comparing thesesequences to the UniProtKB/Swiss-Prot database we were able to annotate21,019 translated open reading frames (ORFs) of ≥ 20 amino acids. Ofthese, 19,277 were additionally assigned Gene Ontology (GO) functionalannotations. While acknowledging the limitations of our single tissuetranscriptome, Fisher tests showed the white shark transcriptome to besignificantly enriched for numerous metabolic GO terms compared to the zebrafish and human transcriptomes, with white shark showing more similarity tohuman than to zebra fish (i.e. fewer terms were significantly different). Wealso compared the transcriptome to other available elasmobranch sequences,for signatures of positive selection and identified several genes ofputative adaptive significance on the white shark lineage. The white sharktranscriptome also contained 8,404 microsatellites (dinucleotide,trinucleotide, or tetranucleotide motifs ≥ five perfect repeats).Detailed characterization of these microsatellites showed that ORFs withtrinucleotide repeats, were significantly enriched for transcriptionregulatory roles and that trinucleotide frequency within ORFs was lower thanfor a wide range of taxonomic groups including other vertebrates.Conclusion: The white shark heart transcriptome represents a valuable resource for futureelasmobranch functional and comparative genomic studies, as well as forpopulation and other biological studies vital for effective conservation ofthis globally vulnerable species.
KW - Carcharodon carcharias
KW - Enrichment
KW - Heart transcriptome
KW - Microsatellites
KW - Positive selection
KW - White shark
UR - http://www.scopus.com/inward/record.url?scp=84885343927&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84885343927&partnerID=8YFLogxK
U2 - 10.1186/1471-2164-14-697
DO - 10.1186/1471-2164-14-697
M3 - Article
C2 - 24112713
AN - SCOPUS:84885343927
SN - 1471-2164
VL - 14
JO - BMC Genomics
JF - BMC Genomics
IS - 1
M1 - 697
ER -