10 million chess games

Everything what does not fit in the other forums. Chess related or not, trivial or sophisticated, but keep it civilized and respect others please.

Moderator: Andres Valverde

10 million chess games

Postby Dann Corbit » 25 Jun 2010, 20:26

5243 Files, comprising 1,684,447,833 bytes after bzip2 compression:
http://cap.connx.com/a-openings/
http://cap.connx.com/b-openings/
http://cap.connx.com/c-openings/
http://cap.connx.com/d-openings/
http://cap.connx.com/e-openings/

Sure, it's junk. That's why we call it junkbase. The collection has actually grown so large now that there are not really any tools that handle it well. ChessAssistant, ChessBase, Scid... All of them die if I feed the whole pile to them and ask the tool to do something useful. So I am not sure how you can fully utilize the data, but have fun trying.

If you want real high quality game sets, buy a professional one. But if you are a starving college student and you want to examine a VOG chess game from 1989, then this is the collection for you.
Dann Corbit
 

Re: 10 million chess games

Postby Dann Corbit » 25 Jun 2010, 23:46

Dann Corbit wrote:5243 Files, comprising 1,684,447,833 bytes after bzip2 compression:
http://cap.connx.com/a-openings/
http://cap.connx.com/b-openings/
http://cap.connx.com/c-openings/
http://cap.connx.com/d-openings/
http://cap.connx.com/e-openings/

Sure, it's junk. That's why we call it junkbase. The collection has actually grown so large now that there are not really any tools that handle it well. ChessAssistant, ChessBase, Scid... All of them die if I feed the whole pile to them and ask the tool to do something useful. So I am not sure how you can fully utilize the data, but have fun trying.

If you want real high quality game sets, buy a professional one. But if you are a starving college student and you want to examine a VOG chess game from 1989, then this is the collection for you.


You can also find SCID versions here:
http://cap.connx.com/scid/

they have been compressed with bzip2 so you will need bzip2 to decompress them or 7-zip or some other file manager that knows how to deal with the bz2 extension.

Caveat:
The full collection (jbase) is really too large for Scid to manage and so it is unreliable. The subsets (sorted by ECO) a,b,c,d and e are more trustable. If (for instance) you try to save the jbase collection as PGN, the GUI will churn for a long time, write out 6 GB of PGN and then crash.
Dann Corbit
 


Return to Anything else

Who is online

Users browsing this forum: No registered users and 2 guests

cron