Show simple item record

dc.contributor.authorDhavala, Soma S.
dc.contributor.authorDatta, Sujay
dc.contributor.authorMallick, Bani K.
dc.contributor.authorCarroll, Raymond J.
dc.contributor.authorKhare, Sangeeta
dc.contributor.authorLawhon, Sara D.
dc.contributor.authorAdams, L. Garry
dc.date.accessioned2016-02-25T12:43:45Z
dc.date.available2016-02-25T12:43:45Z
dc.date.issued2010-09
dc.identifier.citationDhavala SS, Datta S, Mallick BK, Carroll RJ, Khare S, et al. (2010) Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection . Journal of the American Statistical Association 105: 956–967. Available: http://dx.doi.org/10.1198/jasa.2010.ap08327.
dc.identifier.issn0162-1459
dc.identifier.issn1537-274X
dc.identifier.doi10.1198/jasa.2010.ap08327
dc.identifier.urihttp://hdl.handle.net/10754/597652
dc.description.abstractMassively Parallel Signature Sequencing (MPSS) is a high-throughput, counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different strains of Salmonella obtained using the MPSS technology. In this article, we develop an exact ANOVA type model for this count data using a zero-inflatedPoisson distribution, different from existing methods that assume continuous densities. We adopt two Bayesian hierarchical models-one parametric and the other semiparametric with a Dirichlet process prior that has the ability to "borrow strength" across related signatures, where a signature is a specific arrangement of the nucleotides, usually 16-21 base pairs long. We utilize the discreteness of Dirichlet process prior to cluster signatures that exhibit similar differential expression profiles. Tests for differential expression are carried out using nonparametric approaches, while controlling the false discovery rate. We identify several differentially expressed genes that have important biological significance and conclude with a summary of the biological discoveries. This article has supplementary materials online. © 2010 American Statistical Association.
dc.description.sponsorshipSoma S. Dhavala is a Doctoral Candiate, Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843 (E-mail: soma@stat.tamu.edu). Sujay Datta is Senior Scientist and Faculty Member, Statistical Center for HIV/AIDS Research and Prevention, Fred Hutchinson Cancer Research Center, M2-C125,1100 Fairview Avenue N., Seattle, WA 98109 (E-mail: sdatta@fhcrc.org). Bani K. Mal lick is Professor, Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843 (E-mail: bmallick@stat.tamu.edu). Raymond J. Carroll is Distinguished Professor, Department of Statistics, Texas A&M University, 3143 TAMU, College Station, TX 77843 (E-mail: carroll@stat.tamu.edu). Sangeeta Khare is Research Assistant Professor, Department of Veterinary Pathobiology, Texas A&M University. 4467 TAMU, College Station, TX 77843 (E-mail: skhare@cvm.tamu.edu). Sara D. Lawhon is Assistant Professor, Department of Veterinary Pathobiology, Texas A&M University, 4467 TAMU, College Station, TX 77843 (E-mail: slawhon@cvm.tamu.edu). L. Garry Adams is Professor. Department of Veterinary Pathobiology, Texas A&M University, 4467 TAMU, College Station, TX 77843 (E-mail: gadams@cvm.tamu.edu). The research of Bani K. Mal lick and Raymond J. Carroll was supported by from the National Cancer Institute grants (CA 104620 and CA57030, respectively), National Science Foundation grant DMS 0914951. and by award KUS-CI-016-04, made by King Abdullah University of Science and Technology (KAUST). The research of Sujay Datta was supported by a postdoctoral training grant from the National Cancer Institute (CA90301). The research of L. Garry Adams was supported by the grants NIAID 1 RO1 A144170-01A1, USDA 2002-35204-12247, and NSF DMS 0914951. Public Health Service grant AI060933 supported the research of Sara D. Lawhon. The authors are greatful to Dr. David Dahl for discussions, and to the editors and the two anonymous referees for their suggestions and constructive comments.
dc.publisherInforma UK Limited
dc.subjectBayesian semiparametric modeling
dc.subjectDirichlet process mixture
dc.subjectMarkov chain Monte Carlo
dc.subjectZero-inflated Poisson
dc.titleBayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection
dc.typeArticle
dc.identifier.journalJournal of the American Statistical Association
dc.contributor.institutionTexas A and M University, College Station, United States
dc.contributor.institutionFred Hutchinson Cancer Research Center, Seattle, United States
kaust.grant.numberKUS-CI-016-04


This item appears in the following Collection(s)

Show simple item record