Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison

Tung Hoang University of Illinois at Chicago Changchuan Yin University of Illinois at Chicago Stephen S.-T.. Yau Tsinghua University

General Mathematics mathscidoc:1611.13005

Numerical encoding plays an important role in DNA sequence analysis via computational methods, in which numerical values are associated with corresponding symbolic characters. After numerical representation, digital signal processing methods can be exploited to analyze DNA sequences. To reflect the biological properties of the original sequence, it is vital that the representation is one-to-one. Chaos Game Representation (CGR) is an iterative mapping technique that assigns each nucleotide in a DNA sequence to a respective position on the plane that allows the depiction of the DNA sequence in the form of image. Using CGR, a biological sequence can be transformed one-to-one to a numerical sequence that preserves the main features of the original sequence. In this research, we propose to encode DNA sequences by considering 2D CGR coordinates as complex numbers, and apply digital signal processing methods to analyze their evolutionary relationship. Computational experiments indicate that this approach gives comparable results to the state-of-the-art multiple sequence alignment method, Clustal Omega, and is significantly faster. The MATLAB code for our method can be accessed from: www.mathworks.com/matlabcentral/fileexchange/57152
DNA sequence, Chaos game representation
[ Download ] [ 2016-11-26 11:54:37 uploaded by cyinbox ] [ 1972 downloads ] [ 0 comments ]
@inproceedings{tungnumerical,
  title={Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison},
  author={Tung Hoang, Changchuan Yin, and Stephen S.-T.. Yau},
  url={http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20161126115437269863659},
}
Tung Hoang, Changchuan Yin, and Stephen S.-T.. Yau. Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison. http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20161126115437269863659.
Please log in for comment!
 
 
Contact us: office-iccm@tsinghua.edu.cn | Copyright Reserved