Saturday, September 20, 2014
iCE 2 first ratings are being established
After an official release one is always curious whether one was able to improve the engine and if so by how much. I was pretty sure that the new iCE is better, my test against the old version at very short time controls showed a real improvement. But I was unsure how this scales to other opponents and much longer time controls.
As with iCE 1 Lars Hallerström performed an initial rating test and the results look encouraging. In his private rating list iCE 2 gained 200 ELO and climbed 66 spots. Considering the fact that it now already is faced with some 4 core engines while iCE is running only on 1 this is nice progress.
53 Ice 2.0 (64) : 2954 19 19 1000 57.2 % 2904 26.2 %
119 Ice 1.0 (64) : 2754 19 19 900 52.3 % 2738 26.7 %
53 Ice 2.0 (64) : 2954 1000 (+441,=262,-297), 57.2 %
Fruit 2.3.1 : 40 (+ 26,= 10,- 4), 77.5 %
Ktulu 9 : 40 (+ 28,= 7,- 5), 78.8 %
Spike 1.4 : 40 (+ 14,= 12,- 14), 50.0 %
Komodo CCT (64) : 40 (+ 3,= 9,- 28), 18.8 %
Junior 12.5.0.3 (4cores) : 40 (+ 14,= 9,- 17), 46.2 %
Deuterium 14.01 (64) : 40 (+ 14,= 11,- 15), 48.8 %
Gaviota 0.86 (64) : 40 (+ 21,= 13,- 6), 68.8 %
Dirty Feb 26 2014 (64) : 40 (+ 27,= 7,- 6), 76.2 %
Sjeng WC2008 (64) : 40 (+ 21,= 8,- 11), 62.5 %
Crafty-23.8 (64) : 40 (+ 16,= 13,- 11), 56.2 %
Texel 1.03 (64) : 40 (+ 14,= 18,- 8), 57.5 %
Bug2 1.9 (64) : 40 (+ 27,= 8,- 5), 77.5 %
Scorpio 2.7.6 JA (64) : 40 (+ 10,= 14,- 16), 42.5 %
EX 7.11b (64) : 40 (+ 25,= 12,- 3), 77.5 %
Djinn 1.010 (64) : 60 (+ 39,= 19,- 2), 80.8 %
Cheng 4.36 (64-4cores) : 40 (+ 14,= 15,- 11), 53.8 %
Rybka 4.1 SSE42 (64-4cores) : 40 (+ 3,= 13,- 24), 23.8 %
Hannibal 1.4a (64-4cores) : 40 (+ 5,= 9,- 26), 23.8 %
Chiron 2 (64-4cores) : 40 (+ 3,= 10,- 27), 20.0 %
Senpai 1.0 sse42 (64-4cores) : 40 (+ 1,= 12,- 27), 17.5 %
BobCat 3.25 (64) : 40 (+ 20,= 11,- 9), 63.7 %
Brutus 8.05 JA (64) : 40 (+ 34,= 6,- 0), 92.5 %
GreKo 12.0 JV (64) : 60 (+ 55,= 4,- 1), 95.0 %
Spark 1.0 (64-4cores) : 40 (+ 7,= 12,- 21), 32.5 %
Thanks Lars for providing the data
Sunday, September 7, 2014
iCE 2 has been released
Yesterday I uploaded the new release of iCE to my website and released it into the wild. Again more than 1 year development and tuning went into this release. The last release was dominated by changes to the evaluation and weight
tuning. Most of the committed changes in this release are now related to
the search framework.
Of course it has no established rating yet but I'm confident that it is stronger than iCE 1. I have not really an idea by how much. I tried to pair it with some free engines but I got contradicting results. Against some engines iCE ended up only as crushed iCE while it was able to really freeze some other engines despite the fact that all those engines are close together in their CCRL rating. Probably the time control I test with or my test setup is just bad so I leave the actual testing to the experts.
Here a summary of the changes for this release.
Changes:
Bugfixes: Bugs related to recognizing a trapped rook and to the 50 move rule were fixed
Pawn Hash Handling: King position is
now stored into the pawn hash which allows the hashing of additional
eval terms while the hit rate goes down a bit.
Endgame Knowledge: Some special code to handle certain 5 piece endgames was added. iCE now uses the material hash to recognize the endgame types.
The ugly list of if ... then ... statements that checks for piece combinations is now gone.
Evaluation: Some changes to the
evaluation to understand certain positions better. An example are
positions with an isolated queens pawn. Tests showed only limited impact
on the strength. I
kept them because I hope the overall playing style of iCE
looks more natural now in those positions. Some minor terms have been
dropped or have been replaced by others.
My pawn structure evaluation is still clumsy, all attempts to
improved it failed.
Table Bases: iCE is not using
external table bases during play. iCE 1 had the DTM data for all the
3-men TB built into the executable. Those tables were removed in iCE 2.
They are calculated on the
fly when the engine starts. The calculation is fairly
optimized and takes less than 100 ms on a decent 1 core system.
Search Changes: Added History
Heuristics, Late Move Pruning, Razoring and Counter Move Heuristics.
Change LMR to be less aggressive. Lazy eval removed. Added LMR at the
root node. And tuning, tuning, tuning ...
Code Cleanup: A lot of code was
refactored and simplified. Especially similar code for WHITE and BLACK
was merged using C++ templates. Overall from iCE 1 to iCE 2 5600 lines
of code were removed while 3800 lines were added.
Unchanged iCE is still a CLOP free zone and does not access endgame tablebases in play. However the new endgame knowledge was
verified against Gaviota table bases. Tribut and thanks for providing them goes to Miguel Ballicora. Also everything related to opening books
is unchanged and still works the same with iCE 2.
I recorded my work on iCE over the last years in this Blog. It contains all the dirty details. So someone interested to learn about my approach to automated parameter tuning using an GA might find something interesting here
Subscribe to:
Posts (Atom)