Data Mining

Data Mining And Statistical Approaches In Identifying Contrasting Trends In Reactome And Biocarta

By
Sumayya Iqbal
SP09-BSB-036

Zainab Khan
SP09-BSB-045
BS Thesis (Feb 2009-Jan 2013)
COMSATS Institute of Information Technology
Islamabad, Pakistan
January, 2013
COMSATS Institute of Information Technology

Data Mining And Statistical Approaches In Identifying Contrasting Trends In Reactome And Biocarta
A Thesis Presented to COMSATS Institute of Information Technology, Islamabad
In Partial Fulfilment
Of the requirement for the Degree of

B.S. (Bioinformatics)

By

Sumayya Iqbal
CIIT/SP09-BSB-036/ISB
Zainab Khan
CIIT/SP09-BSB-045/ISB
January, 2013

Data Mining And Statistical Approaches In Identifying Contrasting Trends In Reactome And Biocarta

An undergraduate thesis submitted to the Department of Biosciences in partial fulfillment of the requirements for the award of the degree of B.S. (Bioinformatics).

Name | Registration Number
Sumayya Iqbal | CIIT/SP09-BSB-036/ISB
Zainab Khan | CIIT/SP09-BSB-045/ISB

Supervisors
Dr. Rani Faryal
Mr. Syed Shujaat Ali Zaidi
Department of Biosciences,
CIIT, Islamabad Campus.
January, 2013

Final Approval
This thesis titled
Data Mining And Statistical Approaches In Identifying Contrasting Trends In Reactome And Biocarta
Submitted for the Degree of BS Bioinformatics by Sumayya Iqbal and Zainab Khan
Has been approved for the COMSATS Institute of Information Technology, Islamabad
External Examiner: __________________________________________

Supervisor: _____________________________________________
Dr. Rani Faryal

Co-Supervisor: ________________________________________________
Mr. Syed Shujaat Ali Zaidi

Head of Department/ Chairman: ________________________________________________
Dr. Syed Habib Bokhari
Associate Professor

Declaration
We Sumayya Iqbal and



