Deep reinforcement learning
slazebni.cs.illinois.edu/fall17/lec22_deep_rl.pdf

Review: AlphaGo
• Policy network: initialized by supervised training on a large number of human games