

Publisher: Bentham Science Publishers
E-ISSN: 1875-6697|2|3|275-285
ISSN: 1573-4099
Source: Current Computer - Aided Drug Design, Vol.2, Iss.3, 2006-09, pp. : 275-285
Disclaimer: Any content in publications that violate the sovereignty, the constitution or regulations of the PRC is not accepted or approved by CNPIEC.
Abstract
The primary sequence of DNA is a sequence of nucleotides over the four-letters alphabet {A, C, G, T}. Characteristic sequences of a DNA sequence are given in term of classification of bases of nucleotides. Using the characteristic sequences, we construct a set of 3 x 8 matrices and a set of 2 x 2 matrices to represent DNA primary sequences and define the information entropy, which is based on counting all triplets of characteristic sequences. Similarity and dissimilarity analysis based on the condensed matrices and the information entropies are given for the first exon of beta-globin genes sequences belonging to eleven different species.
Related content








Correlating Low-Similarity Peptide Sequences and Allergenic Epitopes
Current Pharmaceutical Design, Vol. 14, Iss. 3, 2008-01 ,pp. :


Metal Nanoparticle-Based Detection for DNA Analysis
Current Pharmaceutical Biotechnology, Vol. 8, Iss. 5, 2007-10 ,pp. :