Posted by: Robert Allgeuer at 17 January 2004 13:05:53:
Following some posts in CCC claiming that Ruffian Leiden is stronger than Ruffian 2.0.0, I let the two play a match of 20 games against each other (Win2k, ponder off, 5-piece EGTBs, time control 300+2, WinBoard 4.2.3, wbtm 0.60, ELOStat 1.1b, 96 MB hash, default books, Athlon TB 1.1 GHz).
Ruffian v2.0.0 playing 20 games against each other engine. Score: 9.5 / 20 (47%)
Rank|No  |Name          |                    |Pts             |
----|----|--------------|--------------------|----------------|
  1.|  2.|Ruffian Leiden|=0==01=11===100==101| 10.5 / 20 52%  |
As this match was very close, I extended the test and ran a second match of 50 games under the same conditions:
Ruffian v2.0.0 playing 50 games against each other engine. Score: 22.5 / 50 (45%)
Rank|No  |Name          |                                                  |Pts             |
----|----|--------------|--------------------------------------------------|----------------|
  1.|  2.|Ruffian Leiden|=======0==10===11=11=0=10101=1=001=0==1=01==10=11=| 27.5 / 50 55%  |
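For anyone who wants to check the point totals, the result strings can be tallied by hand or with a few lines of Python. This is only a sketch: the function names are my own, and it assumes that '1' is a win, '=' a draw and '0' a loss for the engine named in the row (Ruffian Leiden), which is what the printed totals suggest.

```python
def tally(results: str) -> tuple[int, int, int]:
    """Count (wins, draws, losses) in a result string of '1', '=' and '0'."""
    return results.count("1"), results.count("="), results.count("0")


def points(results: str) -> float:
    """Points scored: 1 per win, 0.5 per draw, 0 per loss."""
    wins, draws, _losses = tally(results)
    return wins + 0.5 * draws


# Result strings copied from the two tables above (assumed to be read
# from Ruffian Leiden's side).
MATCH_20 = "=0==01=11===100==101"
MATCH_50 = "=======0==10===11=11=0=10101=1=001=0==1=01==10=11="

for games in (MATCH_20, MATCH_50):
    wins, draws, losses = tally(games)
    print(f"{len(games)} games: +{wins} ={draws} -{losses}, "
          f"{points(games)} points")
```

Adding the two matches gives Ruffian Leiden +21 =34 -15 over the 70 games.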
All 70 games were unique; the overall result is as follows:
   Program            Elo   +   -  Games  Score   Av.Op.  Draws
 1 Ruffian Leiden  : 2695  80  51     70  54.3 %    2665  48.6 %
 2 Ruffian v2.0.0  : 2665  51  80     70  45.7 %    2695  48.6 %
This result is not statistically significant, but it may indeed be true that Ruffian Leiden has a slight edge over Ruffian 2.0.0.
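A rough way to see why 70 games are not enough is sketched below. It is not ELOStat's own calculation, so the interval will not match the +/- columns above exactly. It assumes the usual logistic Elo formula and a normal approximation for the mean score per game, and all function names are my own. The win/draw/loss counts are Ruffian Leiden's totals as read off the two result strings.

```python
import math


def elo_diff(score: float) -> float:
    """Elo difference implied by a score fraction (logistic Elo model)."""
    return 400.0 * math.log10(score / (1.0 - score))


def score_mean_and_se(wins: int, draws: int, losses: int) -> tuple[float, float]:
    """Mean score per game and its standard error from W/D/L counts."""
    n = wins + draws + losses
    mean = (wins + 0.5 * draws) / n
    # Per-game variance of the outcomes 1, 0.5 and 0 around the mean.
    var = (wins * (1.0 - mean) ** 2
           + draws * (0.5 - mean) ** 2
           + losses * (0.0 - mean) ** 2) / n
    return mean, math.sqrt(var / n)


# Ruffian Leiden over all 70 games (tallied from the result strings).
WINS, DRAWS, LOSSES = 21, 34, 15

mean, se = score_mean_and_se(WINS, DRAWS, LOSSES)
z = (mean - 0.5) / se                       # distance from an even score
low, high = mean - 1.96 * se, mean + 1.96 * se  # approx. 95% interval

print(f"score {mean:.1%}, Elo difference {elo_diff(mean):+.0f}")
print(f"z = {z:.2f} (about 1.96 needed for 95% confidence)")
print(f"95% interval in Elo: {elo_diff(low):+.0f} to {elo_diff(high):+.0f}")
```

Under these assumptions the interval for the score still includes 50%, so an even match cannot be ruled out from these games alone.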
Possibly I will include Ruffian Leiden in my YABRL rating list in order to get a better estimate.
Robert