|
Author
|
Topic: Convergence or Divergence?
|
Jay
Member
Member # 268
|
posted 11. July 2002 11:59
Hi Yersinia,
I love that paper that you cited! I hope to get around to it more later, as I think that the DNA replication systems are another excellent example of possible common design.
Now, I'm aware that some feel that the sequence similarity that they find between these rings justifies calling them homologous. But it needs to be further explained exactly what kind of similarity we're looking at here.
Once again, between the respective ring types, there is a very clear pattern. The alignments look good, and clearly show distinct types. In other words, there are two 1 billion year+ patterns that are nicely kept.
But again, when you then compare the two types (say, in ClustalW) you can see that any similarity between them is quite strained, to say the least. BLAST2, for instance, cannot even pick up on any similarity at all, and ClustalW jumps all over the place desperately trying to come up with something.
But, if you just assume that they *must* be related (since they are obviously too similar to not be related, right?), you can doubtless find a few areas of 'homology'. I found, for instance, a three amino acid stretch that happened to line up between the two when I forced them to align with each other.
You can go fishing for sequence similarity between almost anything if you really want to, but you need to look closer at the overall historical pattern before suggesting that they really fit a homology pattern. I do not believe that was done here.
But I am interested in what similarities they actually found. I read that paper and didn't find any actual values for the degree of similarity. Likewise, I haven't seen any alignments between the two done by those who advocate their homology.
To make things even more mysterious, they cite references 6 and 7 when they speak of this supposed similarity. But look at what one of these very references has to say:
Krishna, TS, et al. Crystal structure of the eukaryotic DNA polymerase processivity factor PCNA. Cell. 1994 Dec 30;79(7):1233-43.
"The trimeric PCNA ring is strikingly similar to the dimeric ring formed by the beta subunit (processivity factor) of E. coli DNA polymerase III holoenzyme, with which it shares no significant sequence identity."
The guys who crystalized the thing also found, as I did, no significant sequence identity! But they further go on to make an interesting closing stament in their abstract:
"This structural correspondence further substantiates the mechanistic connection between eukaryotic and prokaryotic DNA replication that has been suggested on biochemical grounds."
'Mechanistic connection' is very much common design friendly, if you ask me. Again, the homology interpretation is rather strained here, yet the data openly suggests that a common idea was shared.
And just for good measure, the other paper that they cited says in their abstract: "A potential structural relationship is suggested between the beta subunit and proliferating cell nuclear antigen (PCNA, the eukaryotic polymerase delta [and epsilon] processivity factor), and the gene 45 protein of the bacteriophage T4 DNA polymerase."
Notice again the structural, but not sequence relationship. I haven't read this last paper yet, but, I have to wonder how these authors came up with sequence similarity from these two references, especially when one of them outright contradicts that idea.
One more, then I gotta run:
Bruck, I., O'Donnell M. The ring-type polymerase sliding clamp family. Genome Biol. 2001;2(1). Review.
"Both PCNA and beta form a ring around DNA, which is made up of two subunits of three domains each in beta but three subunits of two domains each in PCNA. Despite this difference and a lack of detectable sequence homology, the structures of the two rings are very similar."
And this paper, IMO, is much newer than the one cited claiming supposed homology. If you ask me, this idea of significant sequence homology between these two types is sort of an urban legend that managed to get passed around.
Thanks, jay
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 11. July 2002 12:11
Hey James,
quote:
What I would like to focus on here, instead, is the question of archetypes as an explanation for analogies, as opposed to "selection pressure" as the main mode of explanation of these striking phenomena. You only allude to archetypes' being in the mind of God, but what if they are somehow immanent in the dynamics of the cell itself? More specifically, I would be very interested to know what you think of recent (and not so recent) "structuralist" trends in biological thought?
I think I may not have communicated one of my distinctions adequately. In my mind, archetypes were (are?) invoked as an explanation for homologies rather than analogies.
Analogies, on Owen's definition, are similarities that are functionaly necessary -- i.e., any high resolution eye will need a lense, etc. These kinds of functionally-necessary similarities can, at my current crude level of analysis, be explained by selection for the same function, or by design (by the same designer, or different designers) for the same function (I think there are reasons to favor selection but that is a different topic). A high-resolution eye simply has to have certain things to be a high resolution eye (although of course a low-resolution eye, or a light-sensing device, will not need all those things...)
So I guess I would find the "structuralist" ideas unnecessary for explaining analogy, either from an evolutionary or design perspective. Some homology, on the other hand, might I suppose be explained by structuralism, in which case we might have an additional explanation for instances homology, one that could produce functionally unnecessary similarity in a nonhierarchical pattern.
If you accept common descent then IMO we would have the same issue, namely, determining if there was any significant "homology signal" that could be explained by structuralism. There might be some, my gut feeling is not much.
In general I would expect that self-organization, constained morphospace, & similar concepts could only produce rather vague similarities between unrelated "parts" (or specific similarities, but in cases where there are only a few possibilities).
quote:
I am thinking, for example, of D'Arcy Thompson's "On Growth and Form" (orig. publ. 1917, numerous editions), of Gerry Webster and Brian Goodwin's "Form and Transformation" (Cambridge UP, 1996), of the numerous Santa Fe Institue and A-Life studies on the abstract patterns underlying biological order, and the fewer but very interesting efforts to quantify an abstract "morphospace" for a particular type of organic form quantitatively (e.g., R.D.K. Thomas and W.E. Reif, "The Skeleton Space: A Finite Set of Organic Designs," Evolution, 1993, 47: 341--360).
I know that most Darwinian thinkers are fairly comfortable with this work, feeling that it will simply be incorporated into an expanded Darwinian "paradigm." I personally feel it is far more subversive than that. But, at any rate, I was just wondering what you thought about the naturalistic, structuralist alternative to understanding analogies as archetypes.
Like I said, it seems extraneous to me for explaining analogies which are by defintion functionally necessary similarities, and archetypes were invoked to explain extra similarity. Homologies therefore, maybe, but most likely in the form of shared developmental mechanisms, which of course could be inherited, which as you say allows for all this being absorbed into evolution/development studies.
All of this is rather far from protein sequence/structural similarity however, I'm unaware of similar "structuralist" work in your sense there.
nic
[PS in edit; err, Yersinia] [ 11 July 2002, 12:33: Message edited by: yersinia ]
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 11. July 2002 13:09
Hey Jay,
The paper says that the sequence similarity was detected with the PSI-BLAST method, which they say is more sensitive than the methods you are citing.
In any case, even if they're right I will concede that the sequence similarity is very small at best (at least between beta and PCNA). My initial assumption was that this was the case, thus I attempted to lay out an argument for homology from protein structure. I'd like you to address this when you can.
quote:
Once again, between the respective ring types, there is a very clear pattern. The alignments look good, and clearly show distinct types. In other words, there are two 1 billion year+ patterns that are nicely kept.
Please be specific about what you're talking about here. The 2 vs. 3 domain distinction?
quote:
But, if you just assume that they *must* be related (since they are obviously too similar to not be related, right?), you can doubtless find a few areas of 'homology'. I found, for instance, a three amino acid stretch that happened to line up between the two when I forced them to align with each other.
You can go fishing for sequence similarity between almost anything if you really want to, but you need to look closer at the overall historical pattern before suggesting that they really fit a homology pattern. I do not believe that was done here.
Dude, the overall historical pattern for informational genes is this:
archaebacteria + eukaryotes are one group, eubacteria are another group
The DNA sliding clamps appear to fit this pattern quite well.
Anyway, I would be quite happy to leave aside the sequence similarity question (although I very much doubt that it's an "urban legend") and focus on whether or not structural similarity is a reasonable basis for homology. The basic argument is that many different secondary structures can perform the same function. Therefore, detailed structural similarities like those we see here are not explained by common function, but rather need an additional explanation.
nicsinia
IP: Logged
|
|
Jay
Member
Member # 268
|
posted 11. July 2002 13:45
Hello Nic!
I was so tempted to 'accidentally' slip and call you that last reply, just for fun. But I remember you sharing with us how you didn't like your identity getting out when you didn't want it to, so I didn't want to ruin it for you. Anyway...
I'm not trying to bring this up again to beat a dead horse, but I wanted to just quickly expound on one of those papers I cited to put this sequence homology myth to rest:
__________________________ The ring-type polymerase sliding clamp family. Bruck I, O'Donnell M. Genome Biology 2001, 2:reviews3001.1-3001.3 (9 January 2001)
"The ring-type polymerases are found in all organisms, both prokaryote and eukaryote. The existing body of genome sequence information indicates that the B sliding clamp proteins are highly conserved in prokaryotes, and PCNA is highly conserved among eukaryotes. Interestingly, B and PCNA show no sequence homology, even though they have very similar three-dimensional structure [6]."
They do, in fact, mention below that idea of the 6-1 hypothesis. But.. I'll have more to say on that one later, as I do not think it to be a robust explanation.
"The structure of eukaryotic PCNA is practically superimposable on that of the B clamp [4,9]. The monomeric unit is only about two-thirds the size of B, however; it consists of two globular domains instead of three and trimerizes to form a six-domain ring the size of the B dimer. Although the PCNA domain structure is essentially the same as that of the domain structure in B, no sequence homology is detected between the two families. Perhaps the multidomain structure evolved from a common ancestral gene encoding one domain that later underwent duplications and fusion events to form the three-domain monomer."
Finally, in the references section, they have this, including the commentary below it:
TSR Krishna, X-P Kong, S Gary, P Burgers, J Kuriyan: Crystal structure of the eukaryotic DNA polymerase processivity factor PCNA. Cell 1994, 79: 1233-1243 The yeast PCNA was the second sliding clamp structure to be solved. This report describes the amazing similarity in structure to B, despite the completely different amino acid sequence.
'Amazing similarity' indeed! But note the insistence on the part of these guys (as well as others), who actually did alignments and looked into this stuff - even after PSI BLAST came out, IMO - that there really is no significant sequence similarity.
So... we have two types that have been around for at least a billion years each, and yet are pretty darn conserved within their types, but completely different between themselves. I stress this over and over because it will frame the way that we approach any thoughts of sequence divergence and structural conservation. It should also be noted that researchers in this field really aren't sure what to make of these two types. They are not really sure whether to call them homologous or not, and as we see from O'Donnell's 6-1 suggestion, the idea that they were related is recieved as tentative at best.
So, the question then becomes whether they somehow held onto their 3D structures, but underwent incredibly rapid evolutionary divergence.
Also, I see that you remember my hypothesis that all three domains are independent designs. And you are right that these rings would support (or at least not contradict) a 2 domain ancestry. I can accept that. However, I base my 3 domain hypothesis on a much wider array of genes, and have found that some genes, for instance, cluster eubacteria and archs, some archs and euks (like the rings) and some even eubacteria and euks. If we had this discussion over another protein, you might be accusing me of claiming a 2 domain hypothesis with eubacteria and eukaryotes as one group!
In truth, I believe that during common design, some objects were directly shared (meaning down to the sequence level) between some of the domains (and in some cases, all of the domains), while in other cases, only the ideas were shared, while in others, an idea only pops up in one of them... but that is another discussion. Suffice to say that whenever I find any sort of break like this between any of the domains (whatever combination), I see it as supporting the three commonly designed cell types.
Anyway, I have to get back to work. I'll try to get back to the structural evolution later. But are you on record, then, as feeling that the most likely explanation for these two ring types is large divergence with structural conservation? What data leads you to think this?
Thanks, jay [ 11 July 2002, 13:48: Message edited by: Jay ]
IP: Logged
|
|
James A. Barham
Member
Member # 50
|
posted 11. July 2002 16:14
Yersinia:
Sorry if I misread what you wrote about Owen (which I should have known anyway---I'll have to go back and look at his writings again).
But it seems to me, at any rate, that analogies do pose a problem for Darwinism. On the one hand, Darwinians want to stress the contingency and "tinkering" of natural selection. Everything is supposed to be a "glorious accident," in the words of Stephen Jay Gould. Everything would be different "if the tape were played twice."
On the other hand, the massive convergences (the placentals and marsupials, the vertebrate and cephalopod camera eyes, etc.) are considered no problem---just similar "selection pressures" doing their thing. But Darwinians can't have it both ways! It seems to me that evolution is far more constrained than is generally admitted, and that analogies (or convergences or "homoplasies") are a good example of this fact.
But I understand that you are bound to disagree with me here. Also, I realize this discussion is tangential to the main point of this thread, so perhpas if there is interest in pursuing structuralism and convergences further, it would be better to start a new thread.
At any rate, thanks for your reply. [ 11 July 2002, 16:17: Message edited by: James A. Barham ]
IP: Logged
|
|
Moderator
Administrator
Member # 1
|
posted 11. July 2002 16:28
I concur with James.
Why doesn't someone go ahead and start a thread on structuralism and convergences.
IP: Logged
|
|
Frances
Member
Member # 169
|
posted 11. July 2002 22:08
James suggestst that "Darwinists can't have it both ways" refering to the analogies found in nature. I would be interested in seeing him explain why this is the case. I personally find this puzzling but my admittedly limited familiarity with biology may explain why this is just me. Are analogies problematic for Darwinism? Is James suggesting that analogies are intelligently designed even at the marsupials level for instance? This is a fascinating topic and I would like to delve into some of the particulars as they are very relevant to this thread.
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 13. July 2002 02:21
quote:
Hello Nic!
I was so tempted to 'accidentally' slip and call you that last reply, just for fun. But I remember you sharing with us how you didn't like your identity getting out when you didn't want it to, so I didn't want to ruin it for you. Anyway...
Well, I knew you knew as soon as I saw the smiley. I expect that it's a rather small pool of people who know, let alone care, about the similarities in structure of DNA sliding clamps.
quote:
I'm not trying to bring this up again to beat a dead horse, but I wanted to just quickly expound on one of those papers I cited to put this sequence homology myth to rest:
__________________________ The ring-type polymerase sliding clamp family. Bruck I, O'Donnell M. Genome Biology 2001, 2:reviews3001.1-3001.3 (9 January 2001)
Here is the link if you have access (one can get a 1-month free trial).
quote:
"The ring-type polymerases are found in all organisms, both prokaryote and eukaryote. The existing body of genome sequence information indicates that the B sliding clamp proteins are highly conserved in prokaryotes, and PCNA is highly conserved among eukaryotes. Interestingly, B and PCNA show no sequence homology, even though they have very similar three-dimensional structure [6]."
They do, in fact, mention below that idea of the 6-1 hypothesis. But.. I'll have more to say on that one later, as I do not think it to be a robust explanation.
Cross that bridge when we get there...
quote:
"The structure of eukaryotic PCNA is practically superimposable on that of the B clamp [4,9]. The monomeric unit is only about two-thirds the size of B, however; it consists of two globular domains instead of three and trimerizes to form a six-domain ring the size of the B dimer. Although the PCNA domain structure is essentially the same as that of the domain structure in B, no sequence homology is detected between the two families. Perhaps the multidomain structure evolved from a common ancestral gene encoding one domain that later underwent duplications and fusion events to form the three-domain monomer."
Hey! They stole my hypothesis!
quote:
[snip] 'Amazing similarity' indeed! But note the insistence on the part of these guys (as well as others), who actually did alignments and looked into this stuff - even after PSI BLAST came out, IMO - that there really is no significant sequence similarity.
So... we have two types that have been around for at least a billion years each, and yet are pretty darn conserved within their types, but completely different between themselves. I stress this over and over because it will frame the way that we approach any thoughts of sequence divergence and structural conservation.
Regarding PSI-BLAST, here is a free-online paper I came across in Nucleic Acids Research:
quote: Nucleic Acids Res 2000 Sep 15;28(18):3570-80
PSI-BLAST searches using hidden markov models of structural repeats: prediction of an unusual sliding DNA clamp and of beta-propellers in UV-damaged DNA-binding protein.
da link
[the clamp-relevant portion of the abstract]
We have designed hidden Markov models (HMMs) of structurally conserved repeats that, based on pairwise comparisons, are unconserved at the sequence level. To model secondary structure features these HMMs assign higher probabilities of transition to insert or delete states within sequence regions predicted to form loops. HMMs were optimized using a sampling procedure based on the degree of statistical uncertainty associated with parameter estimates. A PSI-BLAST search initialized using a checkpoint-recovered profile derived from simulated sequences emitted by such a HMM can reveal distant structural relationships with, in certain instances, substantially greater sensitivity than a normal PSI-BLAST search. This is illustrated using two examples involving DNA- and RNA-associated proteins with structurally conserved repeats. In the first example a putative sliding DNA clamp protein was detected in the thermophilic bacterium Thermotoga maritima. This protein appears to have arisen by way of a duplicated beta-clamp gene that then acquired features of a PCNA-like clamp, perhaps to perform a PCNA-related function in association with one or more of the many archaeal-like proteins present in this organism.
To me, it looks like scientists are using ever-more sophisticated statistical techinques to find related proteins even in the "midnight zone" where percent amino acid similarity is no higher than random. Here, they've predicted the existence of a previously unknown DNA-sliding clamp, (1) without having the crystallographic-determined structure and (2) despite the lack of any obvious sequence similarity. When the structure of this protein is resolved empirically (or the function of a DNA sliding clamp is obtained by other means), their claim will be directly tested, so they are sticking their necks out for their technique here.
quote: It should also be noted that researchers in this field really aren't sure what to make of these two types. They are not really sure whether to call them homologous or not, and as we see from O'Donnell's 6-1 suggestion, the idea that they were related is recieved as tentative at best.
It appears that all of the later papers call them homologous (at least the beta-PCNA pair; the viral proteins are structurally distinct in addition to sequence-distinct. IMO the general resemblance is still there, but the strong criterion of "superimposability" for very strong structural similarity is not met. The viral clamps do give us a (minimum) estimate of the other possible protein structures that can form a DNA-clamping ring; IMO this strengthens the structural homology argument for PCNA/beta clamps, as the viral clamps prove there is no functional necessity for the superimposable-structure similarity of beta and PCNA clamps)
Certainly, the Nucleic Acids Research paper that I am citing argues that, while it is true that at the gross level there is no detectable pairwise sequence similarity, more statistically sensitive analyses of sequences can still find similarities that are statistically significantly better than chance:
quote:
Sliding DNA clamps are ring-shaped proteins that allow DNA polymerase to achieve high processivity during chromosome replication by tethering the polymerase catalytic subunit to DNA. From the crystal structures of these proteins it appears that they can encircle duplex DNA without steric hindrance, a property that presumably allows them to non-specifically attach to and move rapidly along DNA without dissociating (for reviews see 9-12). The structures of three distinct families of sliding clamps are available and include the Escherichia coli ß-clamp (13), the human and yeast proliferating cell nuclear antigen (PCNA) (14,15) and the bacteriophage RB69 and T4 sliding clamp proteins (16,17). All of these structures share a 12-fold symmetry around the ring consisting of a simple ß-[alpha]-ß-ß-ß structural repeat (13), though there is structural divergence in some of the repeats. Bacterial ß-clamps contain six ß-[alpha]-ß-ß-ß repeats per subunit with two subunits per ring while the eukaryotic and bacteriophage clamps contain four repeats per subunit with three subunits per ring. Pairs of these repeats form a domain, which has been termed the ‘processivity fold’ (18); thus the ring of the sliding clamp contains six domains and therefore is often described as having 6-fold symmetry (19). A structural representative of a fourth family of processivity fold proteins, namely the herpes simplex virus UL42 protein, is also available (18). UL42 does not form a ring-shaped clamp, however, but rather functions as a monomer and interacts with DNA quite differently than do sliding clamps; it has been suggested that UL42 resembles a primitive ancestor of sliding clamps (see 18 and references therein). Despite their structural similarity, proteins in each of these four families lack significant pairwise sequence similarity to proteins in the other families, suggesting that additional, unrelated processivity fold proteins remain to be found. Sensitive sequence analysis methods offer an opportunity to identify additional sliding clamps. For example, the fission yeast DNA repair proteins Rad1p, Rad9p and Hus1p were recently predicted to be distant relatives of PCNA using PSI-BLAST and similar procedures (20-22).
So, can we agree that:
1) "proteins in each of these four families lack significant pairwise sequence similarity to proteins in the other families" (which is your main point)
but,
2) "Sensitive sequence analysis methods" can still detect statistically significant similarities (keep in mind all the various kinds of "similarity", in addition to raw sequence similarity, that are possible; e.g., gross sequence similarity might be random, but widely spaced positions might be shared across all the clamps -- I'm not an expert, but this is kind of thing they look for). This is serious, up-to-date, peer-reviewed, well-cited science, not "fishing" as you put it.
quote:
So, the question then becomes whether they somehow held onto their 3D structures, but underwent incredibly rapid evolutionary divergence.
I think this reveals several fundamental misconceptions you are operating under:
1) You think sequence divergence while maintaining the same structure is impossible, or very unlikely. On the contrary, it is a well-established pattern that structure decays very little even as sequence similarity dwindles to randomness. E.g., this kind of pattern is common: consider a pair of proteins, e.g. one from humans, and one chosen from increasingly distant relatives. Cytochrome C would be an archetypal example:
Sequence similarity | Structural similarity 100% | Superimposable 90% | Superimposable 80% | Superimposable 70% | Superimposable 60% | Superimposable 50% | Superimposable 40% | Superimposable 30% | Superimposable
I believe cytochrome C sequence differences bottom out at around 30%. The same pattern could be cited for numerous other proteins. See the pattern? Since you apparently accept common descent for the eukaryotes (your arguments based on the peculiarities of prok/euk relations force you to, as these pecularities do not really exist within eukaryotes)
Notably, this massive sequence redundancy for structure (zillions of potential amino acid sequences get you cytochrome C, Hubert Yockey estimated there were 10^93 of them IIRC) means that practically any sequence similarity (except perhaps small active sites of a few amino acids) is functionally unnecessary similarity (homology on my/Owen's defintion) that needs a function-independent explanation.
2) "(I)ncredibly rapid evolutionary divergence" is required to explain the twin facts of (a) high sequence conservation within e.g. the eukaryotes and eubacteria, respectively (How high, BTW?; you haven't given us numbers...50% similarity among all eukaryotes, or what?) and (b) low sequence conservation between the euks/eubacteria. You are exagerating to an extreme degree here (I am not exagerating ).
Recall how many billions of years separate eubacteria from archebacteria/euks (it is very unconstrained, but estimates range from 3 billion -- ~1 billion AFAIK). On evolutionary accounts, this should be the most extreme divide in known extant life. You are arguing that the degree of divergence is too large for this hypothesis, because within e.g. eukaryotes the clamp proteins have diverged only (say) 50% within the last 750-1500 million years (say, for a yeast-human or plant-human comparison).
But what do we know about rates of molecular divergence:
It will be slowest if all of these things are constant:
- function - interaction with other proteins - environment, particularly temperature
If these stay constant, then the rate of change will be slow (and approximately constant, the protein being subject only to stochastic substitutions). But if any of these things change, then all bets are off, primarily because natural selection can drive substitution to adapt to new function, new/addition interactions, or changed environments (and yes, we have numerous examples of rather rapid, although geologically moderate, sequence change driven by selection, as evidenced by e.g. rates of synonymous and non-synonymous substitution between closely related organisms, e.g. various Drosophila species).
In the case of DNA sliding clamps, the basic function has remained the same across all the groups, but eukaryotes attach a number of extra proteins to the clamps in comparison to prokaryotes, and archaebacteria are well-known extremophiles. The common ancestor of archaebacteria and eukaryotes was probably hyperthermophilic (adapted to ultrahigh temperatures that few-to-no eubacteria have ever reached). Adaptation to extreme temperature change is well-known to necessitate both changes in DNA/RNA G+C ratios (to maintain stability) and amino acid changes (to avoid denaturing the protein -- think taq polymerase, used in high-temperature PCR and discovered in an organism in a hot spring). Moving from high temperature back to low temperature would necessitate similar changes (although perhaps not as extreme).
These kinds of considerations make it perfectly plausible to propose accelerated substitution rates for (a) a eubacterial-->archaebacterial transition, and (b) an archaebacterial --> eukaryote transition. Even an orders-of-magnitude acceleration in substitution rate for a normally conserved protein would still be a substition rate much lower than that known for other proteins.
In the case of the FtsZ/tubulin relationship, we have a functional change in addition to the above considerations applying.
quote:
Also, I see that you remember my hypothesis that all three domains are independent designs. And you are right that these rings would support (or at least not contradict) a 2 domain ancestry. I can accept that. However, I base my 3 domain hypothesis on a much wider array of genes, and have found that some genes, for instance, cluster eubacteria and archs, some archs and euks (like the rings) and some even eubacteria and euks. If we had this discussion over another protein, you might be accusing me of claiming a 2 domain hypothesis with eubacteria and eukaryotes as one group!
You are ignoring important known facts, like the descent of the eukaryote mitochondrium from a eubacterium (alpha-proteobacteria IIRC), and the discovery of alpha-proteobacteria genes (maintaining even the eubacterial organization IIRC) in the eukaryote nuclear genome, which even maintain their function of servicing the mitochondrion IIRC, adding up to an open-and-shut case for transfer of genes from the symbiont to the host genome.
The crucial thing is the *patterns* of these similarities in interdomain shared genes. E.g., eukaryotes & archaebacteria tend to share more closely related informational genes, whereas the genes shared by eukaryotes and eubacteria tend to be metabolic. The former are the most difficult kinds of genes to laterally transfer, whereas the latter are the simplest to laterally transfer, especially from a symbiont.
The similarities which are closest between eubacterial and archaebacterial genes are IIRC rarer, and IIRC tend to have patterns like correlating with physical proximity -- e.g., the thermophilic eubacterium Thermotoga is the most extreme "chimera" that I've heard about (24% archaeal genes IIRC) -- but the archaeal genes it does have are most similar to those of the archaeabacteria it is living with, a clear indicator of LGT mediated by physical proximity.
quote:
In truth, I believe that during common design, some objects were directly shared (meaning down to the sequence level) between some of the domains (and in some cases, all of the domains), while in other cases, only the ideas were shared, while in others, an idea only pops up in one of them... but that is another discussion. Suffice to say that whenever I find any sort of break like this between any of the domains (whatever combination), I see it as supporting the three commonly designed cell types.
I expect you do. But "the designer(s) did whatever (s)he did, for no particularly apparent reason" does not appear to have any legs as a scientific hypothesis. It boils down to "find an apparent problem for simplistic versions of evolutionary biology" IMO.
quote:
Anyway, I have to get back to work. I'll try to get back to the structural evolution later. But are you on record, then, as feeling that the most likely explanation for these two ring types is large divergence with structural conservation? What data leads you to think this?
Adequately discussed above I trust.
If you do get around to addressing my argument purely based on structural homology, let's save time and stick to the basics:
1) Premise: Numerous potential secondary structures can perform the same function (examples are cited in my original posts, e.g. the Genome Biology article I cited originally, and their supplementary data; and the equivalent function of the non-superimposable viral clamps with the superimposable euk/prok clamps.
2) Therefore superimposable structural similarity is similarity that is not explained by common function (and indeed, for a pair like eubacterial-cytokinesis FtsZ and ciliary tubulin, there is not common function available to do any explaining). It is "similarity in excess of that required for the similarity in function", i.e. Owen's homology.
Step 3 is perhaps for another thread, but it is: Owen's homology is well-explained by common descent, but not really explained at all by hypothesizing an archetype in the designer's mind. To quote Darwin
(OoS, chapter 13)
(Darwin is speaking of homologies and their explantion; readers will have to convert morphology --> protein structure and species --> domain in order to see the relevance to Jay's hypothesis)
quote:
Naturalists try to arrange the species, genera, and families in each class, on what is called the Natural System. But what is meant by this system? ... But many naturalists think that something more is meant by the Natural System; they believe that it reveals the plan of the Creator; but unless it be specified whether order in time or space, or what else is meant by the plan of the Creator, it seems to me that nothing is thus added to our knowledge.
[...]
Nothing can be more hopeless than to attempt to explain this similarity of pattern in members of the same class, by utility or by the doctrine of final causes. The hopelessness of the attempt has been expressly admitted by Owen in his most interesting work on the `Nature of Limbs.' On the ordinary view of the independent creation of each being, we can only say that so it is; that it has so pleased the Creator to construct each animal and plant.
[...]
How inexplicable are these facts on the ordinary view of creation! ... Why should similar bones have been created in the formation of the wing and leg of a bat, used as they are for such totally different purposes? Why should one crustacean, which has an extremely complex mouth formed of many parts, consequently always have fewer legs; or conversely, those with many legs have simpler mouths? Why should the sepals, petals, stamens, and pistils in any individual flower, though fitted for such widely different purposes, be all constructed on the same pattern ?
On the theory of natural selection, we can satisfactorily answer these questions. In the vertebrata, we see a series of internal vertebrae bearing certain processes and appendages; in the articulata, we see the body divided into a series of segments, bearing external appendages; and in flowering plants, we see a series of successive spiral whorls of leaves. An indefinite repetition of the same part or organ is the common characteristic (as Owen has observed) of all low or little-modified forms; therefore we may readily believe that the unknown progenitor of the vertebrata possessed many vertebrae; the unknown progenitor of the articulata, many segments; and the unknown progenitor of flowering plants, many spiral whorls of leaves. We have formerly seen that parts many times repeated are eminently liable to vary in number and structure; consequently it is quite probable that natural selection, during a long-continued course of modification, should have seized on a certain number of the primordially similar elements, many times repeated, and have adapted them to the most diverse purposes. And as the whole amount of modification will have been effected by slight successive steps, we need not wonder at discovering in such parts or organs, a certain degree of fundamental resemblance, retained by the strong principle of inheritance.
The similarity of Darwin's argument from the common "duplicate structure and modify" pattern in biology, and my argument from a similar pattern in proteins (go back to the post documenting same-fold, different function, and same function, different fold), is, I am sure, accidental.
PS:
Here is another model of a DNA sliding clamp, with a cross-section of the DNA double helix inside.

Consider how many other potential ways there probably are to form a ring of these approximate dimensions, with a loose affliation for DNA on the inside (recall that it is a *sliding* clamp). Is the 6-domain pattern functionally necessary? Couldn't 5 or 7 or 8 domains work if designing from scratch? Couldn't you have the ring specified by just one big domain, with a gap on one side to clamp open or shut (which is how the current sliding clamps work IIRC)?
Superimposable structure is not functionally necessary; you can attempt to explain it by designer archetype, but then are you really explaining much at all, asks Darwin. Common descent, OTOH...
I am going in circles now. Too many rings in my head I guess. I've said my bit for now...
nicsinia [ 13 July 2002, 02:29: Message edited by: yersinia ]
IP: Logged
|
|
Mike Gene
Member
Member # 149
|
posted 13. July 2002 15:40
Concerning my 50 Rule (and others), I noted, "Furthermore, it is my intention to hold to the spirit of these rules, rather than the letter of the rules, as sometimes a very good discussion may be cause for me to bend them." Since I usually enjoy Nic's arguments, and he has taken us back to the original topic, I can't help but bend my rules and join back in.
I like Nic's argument about Owen's homology, probably because it so closely reflects my thinking. One of Nic's basic points is as follows:
quote: 2) Therefore superimposable structural similarity is similarity that is not explained by common function (and indeed, for a pair like eubacterial-cytokinesis FtsZ and ciliary tubulin, there is not common function available to do any explaining). It is "similarity in excess of that required for the similarity in function", i.e. Owen's homology.
This is essentially the same argument I was making when Frances asked me, "How do we limit the explanatory power of design without understanding the designer or the causal history or the pathway?" I replied:
For starters, engineers usually design things for a reason. Thus, similarities between ftsZ and tubulin, for example, that cannot be explained in terms of function are unlikely to have arisen from design.
Nic is starting to think like a design theorist, as he is attempting to eliminate teleological mechanisms by imposing the constraints that come with design (i.e., "These similarities, being homologies, are not explainable by reference to a common function, either via a common designer designing for the same function, or natural selection designing for the same function.) I encourage Frances to contemplate Nic's arguments, as he is showing one way to employ teleological thinking without independent evidence of the designers and their mechanisms.
So the question then is whether ftsZ and tubulin show "similarity in excess of that required for the similarity in function." And things are squishy here. What if we disrupt the tertiary structure or GTP motif in either tubulin or FtsZ? I'd predict some loss of function. So far, we do have a functional basis to explain the similarity.
Crucial to Nic's argument is the notion that numerous potential secondary structures can perform the same function. To support this contention, Nic cites work that focusing on enzymes. Whether one can extrapolate the catalysis of moving a set of atoms from one chemical species to another to the function of ftsZ/tubulin is questionable. Nevertheless, I had already considered the basic argument Nic makes and in fact my whole web essay on tubulin/ftsZ emerges from such considerations:
quote: If we entertain teleological explanations, there is nothing that compels us to move beyond a form/function similarity. But an ID theorist can take it further than this and ask why such similar structures would be employed differently? There should be good design reasons for employing essentially the same structure differently. First, what is the functional similarity? Apparently, to bind and hydrolyzes GTP such that assembly into protofilaments occur that tap into the design principle of dynamic instability (where GTP binding and hydrolysis functions as a switching mechanism). Let's call this the GTPase-dependent Protofilament Design (GPD) and we can think of it as a framework. But this framework/function, by itself, is biologically useless. In fact, unless it is tied to a biological function, it works merely as an wasteful energy sink (a futile GTPase cycle). Of course, for the biological function we need a biological context. This context then works to specify how one decorates the framework to tap into GPD's functional potential. In this case, we have the distinct bacterial and eukaryal contexts that differ in many significant ways. In a eukaryote, GPD is built upon to become tubulin, whereby it will play an important scaffold function in accord with the tensigrity model. What's more, it will serve as tracks for eukayal motor proteins, kinesin and dynein. It will also be called upon to form motility structures (flagella) and to play an essential role in segregating newly formed genetic material. In bacteria, GPD is built upon differently to serve the bacterial context. Here, it will play an important role in cytokinesis and thus interact with a different set of players. It is probably also associated with a bacterial cytoskeleton which may play more of a organizational role than a true skeletal role (the latter role is largely fulfilled by the cell wall). [1] The point is that a similar framework may have simply been exploited by a designer in different contexts (as a crude analogy, I look at my fan blowing air on me this hot night and am reminded that in another context, a very similar structure is used to move boats about). This ID hypothesis predicts several things...
Thus, the task of the design theorist is to try to reverse engineer this "GPD." Similarities between ftsZ and tubulin are interpreted through a functional template (something homology cannot provide if we are to maintain the inference to homology). Then it's not simply a question of whether the ftsZ/tubulin architectures are the only way to reflect the GPD, but whether both GPD and these concrete expressions represent very good designs. In a sense, we can predict they do, as both tubulin and ftsZ "got it right from the beginning," as billions of years of evolutionary tinkering have built on them rather than redesigned them.
The key here is that teleological thinking provides very fertile ground for research. The homology inference depends on "similarity in excess of that required for the similarity in function" as a function of the constraints involved in common design. If we uncover such a thing, the homology inference approach mandates that we stop right there. To explain the excess similarities such that they are no longer excess similarities to take away from the homology inference and move into the realm of common design. Thus, the design approach provides the impetus to look deeper, to see if excess similarities are explained in some functional way.
Perhaps another example might make this clearer. Nic asserts "the equivalent function of the non-superimposable viral clamps with the superimposable euk/prok clamps." Not so fast. I haven't looked into the sliding clamps very much, but we have to be on the lookout for hidden reductionist assumptions. That is, compared to viruses, cells have to be more concerned about such things as processivity and fidelity. Thus, an ID hypothesis might look to explain the difference between the clamps in such functional terms. And then comes to whole arena of cell biology, where things like checkpoints and assembly dynamics might come into play.
Like I said, squishy squashy.
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 13. July 2002 16:28
Hi Mike,
Ah, but regarding FtsZ & tubulin, even just considering the GTP-binding motif, do we:
(a) have any reason to suspect that the function of FtsZ and/or tubulin would have to make use of GTP, rather than, say, ATP?
(admittedly here there may only be a few options for energy-providing molecules in the context of life-as-we-know-it, so perhaps this one could be reasonably explained by chance)
(b) Perhaps more importantly, is there any particular reason to suspect that this particular GTP-binding motif is the only way to have a GTP-binding motif? It is my impression that one of things that lead researchers to suspect FtsZ-tubulin homology was that (1) tubulins all share a particular GTP-binding motif that other proteins, including those that bind GTP, do not share; and, (2) FtsZ shares this particular GTP-binding motif, rather than the many others it could have had.
IMO if this is correct then a few ticks are added to the "similarity in excess of that being necessitated by similar function" side of the scale.
Thanks, nic
IP: Logged
|
|
Frances
Member
Member # 169
|
posted 13. July 2002 18:19
Mike suggests that looking at Nic's argument about homology will help us infer common design without knowing the designers or the causal history.
I fail to see how this follows from Nic's argument. In fact analogy can be explained very well by common design as well as natural selection and other mechanisms so we are facing the same problems. Withou a mechanism, how are we going to test for common design?
Mike suggests that teleological thinking can provide us with a very fertile ground for investigation. In fact I have no problem with that but teleology and ID seem to be quite different in nature and I would argue that teleology is already used in some form in sciences. Homology stops but science does not when we "discover such a thing".
The design approach does not seem to take us places homology/analogy would take us. [ 13 July 2002, 18:24: Message edited by: Frances ]
IP: Logged
|
|
Jay
Member
Member # 268
|
posted 14. July 2002 14:32
Hi all,
I was working on a longer reply, but I don't have the patience/time right now, so I'll just kinda break it up!
Mike: "Nic asserts "the equivalent function of the non-superimposable viral clamps with the superimposable euk/prok clamps." Not so fast. I haven't looked into the sliding clamps very much, but we have to be on the lookout for hidden reductionist assumptions. That is, compared to viruses, cells have to be more concerned about such things as processivity and fidelity. Thus, an ID hypothesis might look to explain the difference between the clamps in such functional terms."
Before I start, I'd like to say that I also appreciate Nic/Owen's way of looking for homology and even implicitly distinguishing it from common design. I feel like the discussion is really getting somewhere!
Regarding the rings - Nic, it seems to me that you make the homology inference largely because these rings are so similar that you deem this unlikely to be for merely functional reasons. You point out that clamps likely need not look exactly like each other, as we do see several other different types. However, I think that there are some circumstantial reasons to think that this PCNA/Beta Clamp ring shape is something special for engineering reasons, and has good reason to be so similar.
First, note the simple fact that over the couple of billion years of evolution, the ring stays exactly the same shape. We have organisms that reduce sulfur, live in thermal vents, live inside of other cells, live in frozen wastelands, are unicellular with high replication needs, to being much slower and very mutlicellular, etc... in other words, we have any non-viral organism that you can imagine with all of their extremes of selection pressures all keeping this very same shape.
Now, if this ring shape is just some arbitrary shape among millions, then I'd expect to see lots of variation, and perhaps even new ring shapes springing up. And if we are attempting to argue that these two very different ring types (from a sequence/subunit perspective) that share the same shape really are related, then we are saying that there were *lots* of sequence intermediates between them, meaning that there was lots of evolutionary experimenting going on!
But nevertheless, even with this enormous alleged divergence between the two types, as well as the wide variation of selection pressures within said types the shape/function is kept superimposable. It seems to me that this probably is a pretty good shape after all!
So really, from a functional perspecitve, the rings are good - probably much better than we experimentally realize as they are so conserved. From a design perspecitve, it would make sense to share a really good idea like this. And likewise, I suspect that we'll find a functional reason why the prokaryotic archaea kept a PCNA type, while the eubacteria have the Beta Clamp. We have a functional, design perspective to lead us to ask why two superimposable structures would be built differently. I'm sure that it has to do with the fact that the archaea/eukaryotes share similar DNA replication machinery in contrast to the eubacteria. It would be interesting to see the functional link between a 2 subunit Beta Clamp and a eubacterial replication system, and a PCNA clamp and the archaea/eukaryotic system. There's some common design guided research for you! So now I think that we have reason to suspect that these rings are a very good design, with possibly a good design reason to keep them the same shape (in other words, we have considered Owen's test). Now I think that a real key to divining whether these truly are common design is to look and see what the data really say about their historical relationship. It is a good start to see if there are functional reasons that things are as they are, but it is the finishing touch to then see if the historical pattern suggests how they got there. I'll try to delve some more into that later.
But one other point. You pointed out that there are other ring types. But notice where these are always found - viruses. There appears to be a functional link between the type of ring clamp that we get, and the kind of organism that we are dealing with. Notice that the ring type changes when the replication apparatus/needs change.
To sum it up we have these apparent functional links between the ring type and the replication type:
Eubacteria - Beta Clamps Archaea/eukaryotes - PCNAs Viruses - Gp45, UL42, RB69
In each of these groups, there are different replication needs and abilities, and these appear to be tied into the clamp type. Again, eubacteria have their own (apparently also non-homologous to archaea/euks by even mainstream interpretations) replication system. Archaea and euks have their own type. Viruses are, well, just weird and have their own special needs, and often need to defeat the hosts's replication system. And I would further bet that the different viral ring clamps fall into specific viral categories, each with different needs.
So, the bottom line is that these different ring types do seem to have functional connections, possibly a la common design? And because of 3 billion years of history, we have good reason to think that the Beta Clamp/PCNA shape is a darn good one, and so have reason to ask why such a shape would be created twice from different components, and also why, from a functional perspective, the viruses were endowed with different ring shapes.
Thanks, jay [ 14 July 2002, 14:34: Message edited by: Jay ]
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 14. July 2002 15:44
Having no time, I will just make a few brief points:
- Jay has not identified any particular functional reason why arch/euks have one kind of clamp, viruses another, and eubacteria another. He also hasn't proposed why, for example, an 5- or 8-domain clamp wouldn't work.
(now, it is likely that the other components of DNA replication are adapted for the 6-domain clamp, but you then have to ask if those components couldn't be redesigned to function with a different clamp)
- Conservatism in structure is easy to explain -- if it ain't broke, don't fix it. I expect that if the ancestral clamp had, by chance, had five domains instead of six, then *that* pattern would have been conserved instead.
- According to the paper I cited, it appears that some replication components are homologous (Owen's def'n) across all domains, and some are not (apparently; structural data may change some of these evaluations). So it is innaccurate to characterize the entirety of the replication system as "non-homologous" in any case.
- "so have reason to ask why such a shape would be created twice from different components"
Arrgh! They're not created from different components, merely the same component, duplicated and fused in two different ways! Do you deny that either (a) gene duplications (creating adjacent genes) or (b) sequence deletions (just deleting the spacers between adjecent genes) are common, normal mutational processes?
Appear to be going in circles again. (This was fast writing, take the abruptness with a )
See ya, Nick [ 14 July 2002, 15:46: Message edited by: yersinia ]
IP: Logged
|
|
Jay
Member
Member # 268
|
posted 14. July 2002 18:47
Hi Nic,
Nic: "Jay has not identified any particular functional reason why arch/euks have one kind of clamp, viruses another, and eubacteria another. He also hasn't proposed why, for example, an 5- or 8-domain clamp wouldn't work."
Of course I didn't, hence the call to research!! You appear to be of the mindset that somehow, IDsts have to know everything about a system and prove beyond doubt that it is the product of design before they can suspect it as a good tentative interpretation. Well, those who wish for this will likely be eternally disappointed, as things are never that solid in this field. The 'hyper-skeptic' as Mike calls them, will likely never be convinced. However, that is not to say that there is not some good circumstantial evidence to suspect things. Here, I'm merely pointing out some general strcuture/function correlations that I find quite friendly to my interpretation, and that I believe help address Owen's criteria. Those correlations are between the very different modes of DNA replication vs. the type of clamp. We do see a correlation here, where new replication systems tend to come with new ring types. Why this is, I have no idea at this point, but I made the point that we have a reason to suspect that it *is* functional and rational, should we suspect common design here. So in other words, we've found a good tentative pattern that we can now pursue further. Such is the nature of a useful hypothesis. You may not find this at all convincing or useful, and that's fine. We all have different levels of proof needed to suspect something. After all, I didn't find that PSI-II remote sequence similarity to be anything near a convincing case for homology!
Nic: "Conservatism in structure is easy to explain -- if it ain't broke, don't fix it. I expect that if the ancestral clamp had, by chance, had five domains instead of six, then *that* pattern would have been conserved instead."
Well, feel free to suspect this all you want, but on the other hand, I find my suspicion that 3 billion years of evolution not changing the structure at all across the millions of different organisms (and especially if there really was this huge divergence between the two types) to be a rather nice reason to suspect that this is a very solid, good design for functional reasons. And should this be an ideal way to assist replication, a smart designer would do well to use a similar construction... In other words, I have a feeling that this is not simply some arbitrary shape that just happened to be stumbled upon near the OOL, and that it just happened to be good enough to happen to stick for 3 billion years exactly as is, and I base this off of the strong data patterns that we do see. But you never know.
But perhaps a better way to get a more whole flavor of this idea of common design is to look at some more systems that appear to borrow similar good design ideas. Mike's FtsZ/tubulin, and the different DNA polymerases are a good start, in my opinion. One system, by itself, is not likely to be a solid case for common design, just as finding one instance of homology would not be evidence for universal common ancestry. But when you start seeing patterns that recur like this, i.e. when you start seeing systems that appear to borrow good design ideas but not be physically related, it starts to add up, at least in my mind. It find this way of looking at the domains and their components rather exciting, and fruitful to further investigative research!
Nic: "According to the paper I cited, it appears that some replication components are homologous (Owen's def'n) across all domains, and some are not (apparently; structural data may change some of these evaluations). So it is innaccurate to characterize the entirety of the replication system as "non-homologous" in any case."
Well, I'm not sure if the entire system is non-homologous or not, but look at the title of the very paper you cited - "Did DNA replication evolve twice independently?" We'll just have to analyze the components part by part like this and see what we get. And it should be noted that this interpretation of homology, at least in this paper, looked like more sequence-fishing .i.e find some sequence similarity (even remote PSI-II) or any other remarkable instance of similarity and then call it homology - they're not even following Owen. I doubt that they went through and rigorously determined that the similarities between the systems were unnecessary and only explained by common descent. As with >90% of all of these comparison papers, homology assignment = sequence and/or really good structural conservation. It's rather shallow.
Nic: "Arrgh! They're not created from different components, merely the same component, duplicated and fused in two different ways! Do you deny that either (a) gene duplications (creating adjacent genes) or (b) sequence deletions (just deleting the spacers between adjecent genes) are common, normal mutational processes?"
You are rather confident of that 6-1 hypothesis, I see. Rather odd when dealing with such a fuzzy topic as this, don't you think? And I have already repeated that I do not believe these different subunits, *by themselves*, to be good reason to suspect uncommon descent and call them different components. Instead, it is the pattern of sequence found in these respective types *as well as* the fact that the subunits are different sizes, which fold and look somewhat different, that all add up *together* to suggest to me that they are truly independent. You can't take these subunits in isolation if their sequences to say whether they are due to the 6-1 or not. And once you introduce the sequences, that hypothesis starts to look rather unfavorable, in my opinion. There is every reason to *suspect* (not prove!!) that these parts are not related, but yet share clear common design principles. And also good reason to suspect that this is for functional reasons.
Thanks, jay
P.S. No prob on the abruptness - I need to start being a bit more abrupt lest my real work begin to suffer! [ 14 July 2002, 19:11: Message edited by: Jay ]
IP: Logged
|
|
yersinia
Member
Member # 324
|
posted 14. July 2002 21:58
Thanks for the dialog Jay, I think I've said my bit a coupla times already.
It appears that all agree that making the homology/analogy distinction, with Owen's definitions (basically; IMO we've been even a bit more explicit), makes things somewhat clearer.
E.g. it has focused us on the question of whether or not the differences in the sliding clamps are necessitated by whatever differences in function may exist. I agree that we lack the data to make a definitive decision at this point. Too me it seems like the function is basically "sliding clamp" (surprise surprise), with the purpose of serving as an attachment to hold other components near to the DNA strand, and it seems that a variety of ring structures could serve this purpose.
Sure, there are correlations, in that our basic clamp groups (euks/arch, eubacteria, viruses -- although are these really one group?) also have basic differences in numerous other respects. The trick will be to see if any of these correlated differences make the structural differences necessary, or if the clamp differences and group differences are simply both products of billions of years of separated evolution.
I too need to get back to other things, Thanks, yersinia
IP: Logged
|
|
|