[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Subject Index][Author Index]

Consistency index was Re: Clarification of scope of paleoart->uses

To: DML <dinosaur@usc.edu>
Subject: Consistency index was Re: Clarification of scope of paleoart->uses
From: David Marjanovic <david.marjanovic@gmx.at>
Date: Thu, 17 Mar 2011 13:45:34 +0100
Authentication-results: msg-ironport1.usc.edu; dkim=neutral (message not signed) header.i=none
In-reply-to: <AANLkTi=D44Yy_4=NLiTpiT5shwsH=bhei49-D8yDi4Pk@mail.gmail.com>
References: <314568.96436.qm@web39320.mail.mud.yahoo.com> <4D813B95.8000508@gmx.at> <AANLkTiky9kRB9RAmx4ZpogK2xWAB=2GTPoG1Lxz3raAd@mail.gmail.com> <4D814BF8.5090008@gmx.at> <AANLkTi=D44Yy_4=NLiTpiT5shwsH=bhei49-D8yDi4Pk@mail.gmail.com>
Reply-to: david.marjanovic@gmx.at
Sender: owner-DINOSAUR@usc.edu

 Sure it is. If they're in the matrix, they're already scored for
 the N taxa, and the person adding taxon N+1 only has to score it
 once, not N+1 times. You've already done the work, so why throw it
 away?

I'm not saying throw it away. Mention in the text what autapomorphiesyou've discovered that were previously unknown. Just why put thisinformation into the matrix?

(This is assuming that you only use parsimony. Bayesian analyses do useparsimony-uninformative characters to help determine the model ofevolution.)

> Keeping them in has disadvantages. It makes your matrix appear
> bigger than it is (...impressive as it is, of the 720 characters in
> the supermatrix by Sigurdsen & Green [2011] only 335 are
> informative; no surprise, because they only kept those 25 taxa, out
> of something like 110 or 120, that are represented in all three
> input matrices...) [...]

 So state in your abstract how many of the characters are
 parsimony-informative.


Great idea. Nobody does it.

> [...] and it increases the CI. Fine, PAUP* will give you the CI
> with and without parsimony-uninformative characters, but it seems
> to be normal to report the former instead of the latter and thus
> make the trees look more robust than they are. And of course, the
> bigger a matrix, the more opportunities there are for glitches.

 A side-question: does anyone pay attention to CI? (In practice, it
 seems to be basically a measure of how small the matrix is.) If any
 number can top the Impact Factor for uninformativeness, it's surely
 the CI.


I pay attention to the CI.

If it's insanely high, like the 0.8 to 0.9 of Sereno's early analyses,this is a good reason to suspect that the characters were cherry-picked(deliberately or just by laziness!) to support the authors' pethypothesis or that other manipulations were going on.

If it's low for the size of the matrix, like the 0.49 of McGowan (2002,Zool. J. Linn. Soc., albanerpetontids and origin of lissamphibians) fora matrix of 19 ingroup taxa and 41 characters, that shows that thematrix is "balanced" and not (or not much) biased towards any particularhypothesis, even though it's so tiny that one should expect randomimbalances from this alone.

Finally, if it's insanely high but manipulation would be a veryunparsimonious assumption, I am suitably impressed. The case I've seenis Rexová et al. (2003, Cladistics). That's an analysis of a matrix with85 Indo-European languages and 200 meanings. These meanings are takenfrom a standardized list of 200 meanings that are considered "corevocabulary" (words that are probably less easily borrowed than mostothers -- body parts, basic kinship terms, personal pronouns...). Theaim of that study was to show that vocabulary data alone, without datafrom grammar or from the sound system, are enough to reconstruct thephylogeny of languages to a useful degree. Some historical linguists hadclaimed that only morphology (grammar at the word level) is of any use,which would mean that the phylogeny of families of isolating languages(which lack grammatical endings or the like) would be impossible toreconstruct; the CI of 0.84 proves them wrong. Indeed, this incrediblyhigh CI makes me think that core vocabulary could be used to look forrelatives of Indo-European, something very few people have everattempted and some, perhaps many, consider completely futile.

Follow-Ups:
- Re: Consistency index was Re: Clarification of scope of paleoart->uses
  - From: Mike Taylor <mike@indexdata.com>
- Re: Consistency index was Re: Clarification of scope of paleoart->uses
  - From: Augusto Haro <augustoharo@gmail.com>

References:
- Re: Clarification of scope of paleoart->uses
  - From: Paul P <turtlecroc@yahoo.com>
- Re: Clarification of scope of paleoart->uses
  - From: David Marjanovic <david.marjanovic@gmx.at>
- Re: Clarification of scope of paleoart->uses
  - From: Mike Taylor <mike@indexdata.com>
- Re: Clarification of scope of paleoart->uses
  - From: David Marjanovic <david.marjanovic@gmx.at>
- Re: Clarification of scope of paleoart->uses
  - From: Mike Taylor <mike@indexdata.com>

Prev by Date: Re: Oxalaia, new spinosaur from Brazil
Next by Date: Papers from the 4th International Symposium on Dinosaur Eggs and Babies
Previous by thread: Re: Clarification of scope of paleoart->uses
Next by thread: Re: Consistency index was Re: Clarification of scope of paleoart->uses
Indexes: