ISCID Forums


Post New Topic  Post A Reply
my profile | search | faq | forum home
  next oldest topic   next newest topic
» ISCID Forums   » General   » Literature Review   » Shannon information in complete genomes

   
Author Topic: Shannon information in complete genomes
Itzpapalotl
Member
Member # 623

Icon 1 posted 05. May 2005 16:37      Profile for Itzpapalotl   Email Itzpapalotl   Send New Private Message       Edit/Delete Post 
Chang-Heng Chang1, Li-Ching Hsieh1, Ta-Yuan Chen1, Hong-Da Chen1
Liaofu Luo3 and Hoong-Chien Lee

Abstract
Shannon information in the genomes of all completely
sequenced prokaryotes and eukaryotes are
measured in word lengths of two to ten letters. It is
found that in a scale-dependent way, the Shannon
information in complete genomes are much greater
than that in matching random sequences - thousands
of times greater in the case of short words.
Furthermore, with the exception of the 14 chromosomes
of Plasmodium falciparum, the Shannon information
in all available complete genomes belong
to a universality class given by an extremely simple
formula. The data are consistent with a model
for genome growth composed of two main ingredients:
random segmental duplications that increase
the Shannon information in a scale-independent way,
and random point mutations that preferentially reduces
the larger-scale Shannon information. The
inference drawn from the present study is that the
large-scale and coarse-grained growth of genomes
was selectively neutral and this suggests an independent
corroboration of Kimura’s neutral theory
of evolution.

http://sansan.phy.ncu.edu.tw/~hclee/rpr/Lee_H_Shannon.pdf

see also:

Divergence and Shannon information in genomes http://sansan.phy.ncu.edu.tw/~hclee/rpr/Shan_proofs0504.pdf

"Our analysis of the data combined with computer simulation of genome
growth models suggest a simple coarse-grain representation of genome growth: the ancestors of the
genomes began to grow when they were no greater than 300 b in length via a mechanism whose
main components were neutral stochastic segmental replicative translocations and random small
mutations."

Universality in large-scale structure of complete genomes http://sansan.phy.ncu.edu.tw/~hclee/rpr/Hsieh_gb-2004-5-3-p7.pdf

Complexity,Universality and Growth of Genomes http://www.nchc.org.tw/KING/KING_zh/data1/01.pdf

homepage of H. C. Lee: http://sansan.phy.ncu.edu.tw/~hclee/pub/allpub.html

Is information theory an enemy of ID?

IP: Logged
eswrite
Member
Member # 1672

Icon 1 posted 07. June 2005 15:43      Profile for eswrite   Email eswrite   Send New Private Message       Edit/Delete Post 
I get a little lost in the math, but the overall message (see last paragraph) is that information can be added through duplication of genome segments. Of course, there is a large assumption here: that those segments, which presumably are initially neutral are:
a) preserved from generation to generation, so that
b) they can mutate into meaningful coding segments for evolved structures and/or functions.

IP: Logged


All times are East Coast  
Post New Topic  Post A Reply Close Topic    Move Topic    Delete Topic    Top Topic next oldest topic   next newest topic
 - Printer-friendly view of this topic
Hop To:

Contact Us | ISCID

All content © ISCID and content contributor 2001-2003

The ISCID Forums are aimed at generating insight into the nature of complex systems (e.g. biological complexity, organizational complexity, etc.) and the ontological status of purpose, especially from the vantage point of various information- and design-theoretic models.

Indexed by UBB Spider Hack  |  Powered by Infopop Corporation UBB.classicTM 6.3.1.1

PCID | Encyclopedia | Brainstorms | The Archive | News | Essay Contests | Chat Events | Membership