Geschrieben von: / Posted by: Sune Fischer at 04 March 2004 01:16:01:
Als Antwort auf: / In reply to: Re: How can you test which opening book is better? geschrieben von: / posted by: Norm Pollock at 04 March 2004 00:45:13:
Let me clarify the second part of my test. Suppose I have 2 books X and Y for Crafty. Then I choose 2 comparable engines, each with their own book that is not related to either book X or book Y. For example: ktulu and aristarch. Then Crafty/X plays 100 against ktulu and 100 against aristarch. Same for Crafty/Y. So all together there are 600 games: 200 with crafty/X vs crafty/Y, 100 with crafty/X vs ktulu, 100 with crafty/X vs aristarch,100 with crafty/Y vs ktulu, and 100 with crafty/Y vs aristarch.
Also all 600 matches are on the same machine.
Ok yes, that's how I understood it.
But you see, in this test you first play:
*) Crafty/X plays 100 against ktulu and 100 against aristarch.
then you do the "Same for Crafty/Y".
So basicly you are only testing one book at the time.
By playing them directly against eachother you get to test them both at the
same time, two birds with one stone

Then there are some things to consider, like will playing against third party
books provide better testing of the books due to higher variaty?
Imagine e.g. if book X was twice as big as Y, that means book Y would only be
able to test half of book X, so no matter the amount of games we play we could
never detect if X is half full of junk lines.
I don't know how likely that is, actually I think it is a good assumption
to take a small sample of a book, say 1 percent, and use this as a representativ
set. That is how statistics and polls are usually done AFAIK so why shouldn't it
work here.
Something I'm more concerned with is what happens when you take a known
distribution and pit it against an unknown distribution?
Each book has some percent of good and bad lines, now we want to know the
distribution in X and Y, but we couldn't care less for the distributions of
Z and W, yet they must be just as significant factors as X and Y.
-S.