Investigation of the effects of MAS5, RMA and GCRMA preprocessing methods on an affymetrix zebrafish genechip dataset using statistical and network parameters

buir.advisor: Konu, Özlen
dc.contributor.author: Öztürk, Ahmet Raşit
dc.date.accessioned: 2016-01-08T18:18:21Z
dc.date.available: 2016-01-08T18:18:21Z
dc.date.issued: 2010
dc.department: Department of Molecular Biology and Genetics
dc.description: Ankara : Department of Molecular Biology and Genetics and the Institute of Engineering and Sciences of Bilkent University, 2010.
dc.description: Thesis (Master's) -- Bilkent University, 2010.
dc.description: Includes bibliographical references leaves 77-84.
dc.description.abstract: Microarray data preprocessing is an important determinant of the accuracy and repeatability of expression profiling studies. Recent studies have compared preprocessing methodologies using differential expression analysis of spike-in datasets and qRT-PCR confirmation. Other approaches compare array-wise and probe-wise correlations and selected gene network parameters. However, zebrafish GeneChip datasets have not been used in such comparisons; furthermore, upregulated and downregulated gene sets have not been well characterized with respect to known network parameters across different preprocessing methodologies. In this study we re-analyzed a public zebrafish hypoxia microarray dataset (GSE4989; Marques et al. 2008) using the MAS5, RMA, and gcRMA methods. Comparisons were made in terms of differentially expressed gene sets and defined network parameters, namely clustering coefficient, degree distribution, and betweenness centrality. Our findings indicated that gcRMA and RMA were more similar to each other than to MAS5 in terms of both differentially expressed genes and network parameters. In addition, the network analysis demonstrated that upregulated and downregulated gene sets had distinct network structures: for positively correlated probesets, downregulated probesets had greater clustering coefficients and degree distributions in all three preprocessing methods. However, gcRMA and RMA accentuated this difference more than MAS5 did, suggesting that preprocessing methods differ in how they modulate gene expression network structure. A group of probesets that showed invariant network structure parameters across RMA, gcRMA, and MAS5 was identified and analyzed functionally for the zebrafish hypoxia dataset. The results of this thesis suggest that preprocessing methods may alter the network structure of datasets differently for upregulated and downregulated gene sets. Accordingly, it might be beneficial to filter for differentially expressed genes that are robust to such network topology modulation in order to increase the repeatability of gene sets.
dc.description.degree: M.S.
dc.description.statementofresponsibility: Öztürk, Ahmet Raşit
dc.format.extent: xvi, 109 leaves, illustrations
dc.identifier.uri: http://hdl.handle.net/11693/15426
dc.language.iso: English
dc.publisher: Bilkent University
dc.rights: info:eu-repo/semantics/openAccess
dc.subject.lcc: QH430 .O98 2010
dc.subject.lcsh: Genetics.
dc.title: Investigation of the effects of MAS5, RMA and GCRMA preprocessing methods on an affymetrix zebrafish genechip dataset using statistical and network parameters
dc.type: Thesis
Files
Original bundle
Name: 0006166.pdf
Size: 7.74 MB
Format: Adobe Portable Document Format