Geschrieben von: / Posted by: Robert Allgeuer at 03 March 2004 14:57:58:
I have compared several Ruffian version in a kind of qualification tournament for my YABRL Blitz rating list (see http://f11.parsimony.net/forum16635/messages/62408.htm for latest published list).
Conditions:
Time control 300+2, all 3,4 and 5 men EGTB, hash 96MB, ponder off, Athlon 1.1GHz, Win2k, Winboard and WBTM tourney manager 0.60, Elostat 1.1b
Participants:
Ruffian 1.0.1 with 1.0.1 book
Ruffian 2.0.0 with 2.0.0 book
Ruffian Leiden with the Leiden book
Ruffian 2.0.2 with 2.0.0 book
Ruffian 2.1.0 with 2.0.0 book
Ruffian 08.02.2004 (this is a beta version before release before 2.0.2 and 2.1.0) with 2.0.0 book
and as opponents:
Smarthink 0.17a
Gromit 3.8.2
Thinker 4.5b
Crafty 17.14DC
Crafty MPC
Aristarch 4.37
All versions of Ruffian have played matches of 20 games each against each other and against each opponent (Ruffian 1.0.1 had one duplicate game which was removed).
Results:
Program  Elo +  -  Games  Score  Av.Op. Draws
1 Ruffian 08.02.2004 : 2722  39 36  220 61.6 %  2640  37.7 %
2 Ruffian Leiden : 2713  40 35  220 60.2 %  2640  38.6 %
3 Ruffian v2.1.0 : 2703  41 35  220 58.9 %  2641  37.7 %
4 Ruffian v2.0.2 : 2695  42 31  220 57.5 %  2642  45.0 %
5 Ruffian v2.0.0 : 2664  45 31  220 52.7 %  2645  41.8 %
7 Ruffian v1.0.1 : 2650  47 32  219 50.5 %  2646  37.9 %
Observations:
1. This applies of course only to the conditions of this test (Blitz etc.)
2. Ruffian 2.0.0 appears to be stronger than the free Ruffian 1.0.1, although only a bit. In this test it is 14 ELO points, in my more accurate YABRL rating list - after more than 800 games each - it is 28 points.
3. Ruffian Leiden and the newer versions (2.0.2, 2.1.0 and 08.02.2004) are stronger than version 2.0.0. However, they are close to each other and it appears difficult to determine which of them is indeed the strongest.
4. When looking at the results of version 2.1.0 in more detail, it becomes apparent that it scores consistently less than the other Ruffian versions (except 1.0.1), but "saves" its high rating only by scoring high in the direct matches against Ruffian 2.0.2 and 08.02.2004. Nevertheless 2.1.0 appears to be the weakest of the new Ruffian version in matches against other non-Ruffian engines.
5. From the characteristics of its results it becomes apparent that 08.02.2004 is a (late) beta version of Ruffian 2.0.2 (and not 2.1.0). It would be highly interesting, whether this version is in fact identical to 2.0.2 (the measured 27 points difference in strength are within the error margin) or there were some changes made before release of 2.0.2, which might have decreased 2.0.2's playing strength.
6. The Leiden version seems to be one of the strongest. Version 2.0.2 is a bug-fix version of 2.0.0 and some 30 points stronger than 2.0.0. If I had a wish, I would ask for a bug-fix Leiden version; that one would most probably be the strongest Ruffian of all.
Robert
YABRL (Yet Another Blitz Rating List)