Recent progress #4

ZHANGRUI666 · 2020-04-28T04:57:53Z

Hi，Yuri！how are you there? and I want to know about your recent progress in Muzero project, has your model converged？I built my Muzero to play renju，but after several hundred epochs of training, it still showed little enhancement too, which makes me kind of frustrated, would you share with me some intermediate output of the model such as hideen status, the predicated probability or evaluated value of one particular board configuration? we can communicate each other and analyse the problem

YuriCat · 2020-05-18T20:18:03Z

Hi,

These days I'm not touching Muzero code.
I would be appreciate if you find new key points to archirve good result.

After other RL experiments, I found that ReLU sometimes worked worse than other activations for small neural nets.

ZHANGRUI666 · 2020-05-22T09:51:37Z

Thanks for your reply! Yuri, I am glad to hear your response, I inspect the hidden status these days, and i try to reveal as more details as possible , if i have any discoveries,i will tell you .
ReLU-driven neural nets do have some flaw i think, and i expect your further advance in this field

YuriCat · 2020-07-17T19:18:21Z

Hi, @ZHANGRUI666

I found careless bug in tree search method.
Encoded abstract state had not been updated when descending search tree. Unbelievable!
After I fixed it, training looks going well.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recent progress #4

Recent progress #4

ZHANGRUI666 commented Apr 28, 2020

YuriCat commented May 18, 2020

ZHANGRUI666 commented May 22, 2020

YuriCat commented Jul 17, 2020

Recent progress #4

Recent progress #4

Comments

ZHANGRUI666 commented Apr 28, 2020

YuriCat commented May 18, 2020

ZHANGRUI666 commented May 22, 2020

YuriCat commented Jul 17, 2020