Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences

Kun Tian Tsinghua University Xiaoqian Yang Tsinghua University Qin Kong Tsinghua University Changchuan Yin The University of Illinois at Chicago Rong L. He Chicago State University Stephen S.-T. Yau Tsinghua University

Statistics Theory and Methods mathscidoc:1702.33001

Distinguished Paper Award in 2019

PLOS ONE, 1-19, 2015.9
Comparing DNA or protein sequences plays an important role in the functional analysis of genomes. Despite many methods available for sequences comparison, few methods retain the information content of sequences. We propose a new approach, the Yau-Hausdorff method, which considers all translations and rotations when seeking the best match of graphical curves of DNA or protein sequences. The complexity of this method is lower than that of any other two dimensional minimum Hausdorff algorithm. The Yau-Hausdorff method can be used for measuring the similarity of DNA sequences based on two important tools: the Yau-Hausdorff distance and graphical representation of DNA sequences. The graphical representations of DNA sequences conserve all sequence information and the Yau-Hausdorff distance is mathematically proved as a true metric. Therefore, the proposed distance can preciously measure the similarity of DNA sequences. The phylogenetic analyses of DNA sequences by the Yau-Hausdorff distance show the accuracy and stability of our approach in similarity comparison of DNA or protein sequences. This study demonstrates that Yau-Hausdorff distance is a natural metric for DNA and protein sequences with high level of stability. The approach can be also applied to similarity analysis of protein sequences by graphic representations, as well as general two dimensional shape matching.
No keywords uploaded!
[ Download ] [ 2017-02-03 16:45:15 uploaded by Stephenyau ] [ 1256 downloads ] [ 0 comments ]
  • DOI:10.1371/journal.pone.0136577
@inproceedings{kun2015two,
  title={Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences},
  author={Kun Tian, Xiaoqian Yang, Qin Kong, Changchuan Yin, Rong L. He, and Stephen S.-T. Yau},
  url={http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20170203164515581995156},
  booktitle={PLOS ONE},
  pages={1-19},
  year={2015},
}
Kun Tian, Xiaoqian Yang, Qin Kong, Changchuan Yin, Rong L. He, and Stephen S.-T. Yau. Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences. 2015. In PLOS ONE. pp.1-19. http://archive.ymsc.tsinghua.edu.cn/pacm_paperurl/20170203164515581995156.
Please log in for comment!
 
 
Contact us: office-iccm@tsinghua.edu.cn | Copyright Reserved