e-journal
Fast Entropic Profiler: An Information Theoretic Approach for the Discovery of Patterns in Genomes
Information theory has been used for quite some time in the area of computational biology. In this paper we present a pattern discovery method, named Fast Entropic Profiler, that is based on a local entropy function that captures the importance of a region with respect to the whole genome. The local entropy function has been introduced by Vinga and Almeida in [29], here we discuss and improve the original formulation. We provide a linear time and linear space algorithm called Fast Entropic Profiler (FastEP), as opposed to the original quadratic implementation. Moreover we propose an alternative normalization that can be also efficiently implemented. We show that FastEP is suitable for large genomes and for the discovery of patterns with unbounded length. FastEP is available at http://www.dei.unipd.it/~ciompin/main/FastEP.html.
Index Terms—Pattern discovery, information theory, local entropy, computational biology
Tidak ada salinan data
Tidak tersedia versi lain