The Similarity Analysis of DNA Sequence Model Based on Graph Theory and Blast Program

Authors

  • Y.A. Lesnussa, Mathematics Department, Faculty of Mathematics and Natural Sciences, University of Pattimura, Jl. Ir M. Putuhena, Kampus Unpatti Poka-Ambon, Indonesia
  • S. Kappuw, Mathematics Department, Faculty of Mathematics and Natural Sciences, University of Pattimura, Jl. Ir M. Putuhena, Kampus Unpatti Poka-Ambon, Indonesia
  • B.P. Tomasouw, Mathematics Department, Faculty of Mathematics and Natural Sciences, University of Pattimura, Jl. Ir M. Putuhena, Kampus Unpatti Poka-Ambon, Indonesia
  • E.R. Persulessy, Mathematics Department, Faculty of Mathematics and Natural Sciences, University of Pattimura, Jl. Ir M. Putuhena, Kampus Unpatti Poka-Ambon, Indonesia

DOI:

https://doi.org/10.37134/ejsmt.vol4.1.6.2017

Keywords:

DNA Sequence, Graph, Cosine, Correlation, Euclid

Abstract

DNA is a nucleic acid in the form of a double helix that carries the genetic instructions governing the biological development of all cellular life forms, and it underlies the inheritance of genetic characteristics. In this research, we examine the similarity between pairs of DNA sequences taken from human, orangutan, and gorilla. The method used to analyze similarity is graph theory: each DNA sequence is modeled as a graph, its adjacency matrix is constructed, and a vector is built from that matrix for each graph. The similarity of two DNA sequences is then determined from these vectors using the cosine, correlation, and Euclidean measures, where a smaller distance indicates greater similarity. The results of the graph-theoretic approach are then compared with the results of the Basic Local Alignment Search Tool (BLAST) program. The research shows that the human and gorilla DNA sequences are closely similar. 2010 Mathematics Subject Classification: 00A69.
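
The pipeline described in the abstract (sequence, graph, adjacency matrix, vector, distance) can be illustrated with a minimal Python sketch. The abstract does not specify how the graph is constructed, so the sketch assumes a simple weighted directed graph on the four nucleotides in which the weight of edge (x, y) counts how often x is immediately followed by y. This construction, the function names, and the toy sequences are illustrative assumptions, not the authors' model or data.

```python
import math

NUCLEOTIDES = "ACGT"  # assumed vertex set of the graph


def adjacency_matrix(seq):
    """Weighted adjacency matrix: entry [i][j] counts how often
    nucleotide i is immediately followed by nucleotide j (assumed model)."""
    index = {n: k for k, n in enumerate(NUCLEOTIDES)}
    matrix = [[0] * len(NUCLEOTIDES) for _ in NUCLEOTIDES]
    for a, b in zip(seq, seq[1:]):
        matrix[index[a]][index[b]] += 1
    return matrix


def to_vector(matrix):
    """Flatten the matrix row by row and normalize by the total edge weight.
    Requires a sequence of length >= 2 so that the total is non-zero."""
    total = sum(sum(row) for row in matrix)
    return [value / total for row in matrix for value in row]


def cosine_distance(u, v):
    """1 - cos(angle between u and v); both vectors must be non-zero."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def correlation_distance(u, v):
    """1 - Pearson correlation, computed as the cosine distance of the
    mean-centered vectors; neither vector may be constant."""
    mean_u = sum(u) / len(u)
    mean_v = sum(v) / len(v)
    return cosine_distance([a - mean_u for a in u], [b - mean_v for b in v])


def euclidean_distance(u, v):
    """Straight-line distance between the two vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Toy sequences made up for illustration; they are not real genomic data.
toy_a = "ATGGTGCACCTGACTCCTGAG"
toy_b = "ATGGTGCACTTGACTCCTGTG"

vec_a = to_vector(adjacency_matrix(toy_a))
vec_b = to_vector(adjacency_matrix(toy_b))

print("cosine:     ", cosine_distance(vec_a, vec_b))
print("correlation:", correlation_distance(vec_a, vec_b))
print("euclidean:  ", euclidean_distance(vec_a, vec_b))
```

Under all three measures a smaller value indicates that the two sequences are more alike, which matches the interpretation given in the abstract. A richer graph model would change only `adjacency_matrix`; the distance functions would stay the same.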

Published

2017-06-22

How to Cite

Lesnussa, Y., Kappuw, S., Tomasouw, B., & Persulessy, E. (2017). The Similarity Analysis of DNA Sequence Model Based on Graph Theory and Blast Program. EDUCATUM Journal of Science, Mathematics and Technology, 4(1), 41–51. https://doi.org/10.37134/ejsmt.vol4.1.6.2017