Objective pawn eval #4218

vdbergh · 2022-11-03T09:37:11Z

This is an alternative proposal to #4216

Formula:

s=w+d/2 (w,d,l obtained from the win rate model)
eval_cp=400*log10(s/(1-s))
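
A minimal sketch of the formula above, for illustration only. The helper name `objective_eval_cp`, the use of probabilities in [0, 1] (rather than per mille values), and the clamping of `s` are assumptions and are not taken from the PR's code.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>

// Hypothetical helper (not the PR's code): map win and draw probabilities,
// both in [0, 1], to the proposed "objective" centipawn eval.
inline double objective_eval_cp(double w, double d) {
    assert(w >= 0.0 && d >= 0.0 && w + d <= 1.0 + 1e-9);
    // Expected score: a win counts 1, a draw counts 1/2.
    double s = w + d / 2.0;
    // Assumption: keep s away from 0 and 1 so the logarithm stays finite.
    const double eps = 1e-9;
    s = std::clamp(s, eps, 1.0 - eps);
    return 400.0 * std::log10(s / (1.0 - s));
}
```

For example, w = 0.25 and d = 0.5 give s = 0.5 and an eval of 0, while s = 10/11 gives an eval of 400.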

amchess · 2022-11-03T10:26:27Z

Great idea!
Congratulations!

ddobbelaere · 2022-11-03T11:49:00Z

I personally like this PR in the sense that it puts more emphasis on the WDL model. If the latter is accurate, indeed a more objective pawn eval is obtained. There are still two things that slightly bother me:

* How does the eval "feel"? Will it not be too "confusing" for the user? The reason being that the now non-linear formula between internal value (and ply) and reported value might lead to compression/decompression for low/high evals w.r.t. the current SF behavior.
* The reported value depends on ply, will this not cause inconsistencies during analysis? E.g. setting up a board position without the move number might lead to different reported values.

robbai · 2022-11-03T12:06:42Z

src/uci.cpp

  }

+int win_rate_model(Value v, int ply) {
+     // Return the win rate in per mille units rounded to the nearest value
+     return int(0.5 + 1000*win_rate_model(v, ply));


Suggested change

-     return int(0.5 + 1000*win_rate_model(v, ply));
+     return int(0.5 + 1000*win_rate_model_double(v, ply));

OMG that's a bug.
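
To make the bug concrete: as written, the `int` wrapper calls itself and never reaches the floating-point model. The sketch below shows the corrected call structure under stated assumptions. `Value` is aliased to `int` as a stand-in, and the body of `win_rate_model_double` is a placeholder logistic, not Stockfish's fitted model.

```cpp
#include <cassert>
#include <cmath>

using Value = int;  // assumption: stand-in for Stockfish's Value type

// Placeholder model, NOT Stockfish's fitted curve: a plain logistic in v
// that ignores ply, used only so the example compiles and runs.
double win_rate_model_double(Value v, int ply) {
    assert(ply >= 0);
    (void)ply;
    return 1.0 / (1.0 + std::exp(-double(v) / 100.0));
}

// Win rate in per mille units, rounded to the nearest integer.
// Calling win_rate_model(v, ply) here instead would recurse forever.
int win_rate_model(Value v, int ply) {
    return int(0.5 + 1000 * win_rate_model_double(v, ply));
}
```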

vdbergh · 2022-11-03T14:23:25Z

It seems that this method works well for small evals (e.g. the starting position has an eval of ~30). However, for big evals it is rather counterintuitive. If I remove the white queen, then master gives an eval of ~ -1200 but this method gives an eval of ~ -4500.

Of course SF will win against god with a queen ahead, so this high negative eval simply reflects the fact that the game is lost.
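
To see why the numbers grow so quickly, the formula can be inverted to recover the expected score implied by a reported eval. This is a sketch, and the helper name `expected_score_from_cp` is hypothetical. The resulting logistic has the same form as the Elo expected-score curve.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper: invert eval_cp = 400*log10(s/(1-s)) to recover the
// expected score s implied by a reported eval.
inline double expected_score_from_cp(double eval_cp) {
    assert(std::isfinite(eval_cp));
    return 1.0 / (1.0 + std::pow(10.0, -eval_cp / 400.0));
}
```

Under this formula an eval of -4500 corresponds to an expected score below 1e-11, i.e. a position the model considers essentially lost. For comparison, if -1200 were read through the same formula it would correspond to an expected score of about 0.001.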

nimr0d · 2022-11-03T15:10:58Z

This is a cool idea. Would a centipawn then directly mean the Elo difference when starting from that position? It would give some more meaning to "average centipawn loss".

vdbergh · 2022-11-03T17:59:44Z

> I personally like this PR in the sense that it puts more emphasis on the WDL model. If the latter is accurate, indeed a more objective pawn eval is obtained. There are still two things that slightly bother me:
>
> * How does the eval "feel"? Will it not be too "confusing" for the user? The reason being that the now non-linear formula between internal value (and ply) and reported value might lead to compression/decompression for low/high evals w.r.t. the current SF behavior.

This appears to be the case. But I am not sure if this is bad. For example, the eval is currently clamped to 8000cp. I observed in many games that if this value is reached then SF is close to announcing mate. So there is a smooth transition from the eval to mate scores.

> * The reported value depends on ply, will this not cause inconsistencies during analysis? E.g. setting up a board position without the move number might lead to different reported values.

Personally I regret that the win rate model depends on the ply. Logically it can and should only depend on the board. One may try to replace ply by game phase, but according to Vondele this gives a less good fit.

ddobbelaere · 2022-11-03T18:11:42Z

> Personally I regret that the win rate model depends on the ply. Logically it can and should only depend on the board. One may try to replace ply by game phase, but according to Vondele this gives a less good fit.

There was an interesting suggestion by zz4032 on Discord to let WDL depend on number of pieces instead of ply. Don't know if that would work equally well though (i.e. give an accurate model).

vdbergh · 2022-11-04T05:16:39Z

I am going to close this. After thinking it over I decided it is probably too controversial.

vondele · 2022-11-04T19:47:33Z

I think it is interesting, but yes, I'm sure the larger values in particular would surprise people. This would also make it a little more difficult to derive the win_rate_model from PGNs that contain search evals.

vdbergh changed the title ~~Objective pawn_eval~~ Objective pawn eval Nov 3, 2022

vdbergh force-pushed the objective_eval branch 2 times, most recently from 5468cc3 to 29467f6 Compare November 3, 2022 10:25

vdbergh force-pushed the objective_eval branch from 29467f6 to 210bb9f Compare November 3, 2022 10:52

ddobbelaere mentioned this pull request Nov 3, 2022

Normalize evaluation #4216

Merged

robbai reviewed Nov 3, 2022

vdbergh force-pushed the objective_eval branch from 210bb9f to b654765 Compare November 3, 2022 18:04

vdbergh force-pushed the objective_eval branch from b654765 to 2efa546 Compare November 3, 2022 18:26

Objective pawn_eval.

0062234

Formula: s=w+d/2 (w,d,l obtained from the win rate model) eval_cp=400*log10(s/(1-s))

vdbergh force-pushed the objective_eval branch from 2efa546 to 0062234 Compare November 3, 2022 20:03

vdbergh closed this Nov 4, 2022

LovelyChess mentioned this pull request Feb 3, 2023

Update WLD model #4373

Closed
