diff --git a/units/en/_toctree.yml b/units/en/_toctree.yml index 704be139..cc96faff 100644 --- a/units/en/_toctree.yml +++ b/units/en/_toctree.yml @@ -160,6 +160,8 @@ title: Advantage Actor Critic (A2C) - local: unit6/hands-on title: Advantage Actor Critic (A2C) using Robotics Simulations with Panda-Gym 🤖 + - local: unit6/quiz + title: Quiz - local: unit6/conclusion title: Conclusion - local: unit6/additional-readings diff --git a/units/en/unit6/quiz.mdx b/units/en/unit6/quiz.mdx new file mode 100644 index 00000000..0c49305f --- /dev/null +++ b/units/en/unit6/quiz.mdx @@ -0,0 +1,123 @@ +# Quiz + +The best way to learn and [to avoid the illusion of competence](https://www.coursera.org/lecture/learning-how-to-learn/illusions-of-competence-BuFzf) **is to test yourself.** This will help you to find **where you need to reinforce your knowledge**. + + +### Q1: What of the following interpretations of bias-variance tradeoff is the most accurate in the field of Reinforcement Learning? + + + +### Q2: Which of the following statements are True, when talking about models with bias and/or variance in RL? + + + + +### Q3: Which of the following statements are true about Monte-carlo method? + + + +### Q4: What is the Advanced Actor-Critic Method (A2C)? + +
+Solution + +The idea behind Actor-Critic is that we learn two function approximations: +1. A `policy` that controls how our agent acts (π) +2. A `value` function to assist the policy update by measuring how good the action taken is (q) + +Actor-Critic, step 2 + +
+ +### Q5: Which of the following statemets are True about the Actor-Critic Method? + + + + + +### Q6: What is `Advantage` in the A2C method? + +
+Solution + +Instead of using directly the Action-Value function of the Critic as it is, we could use an `Advantage` function. The idea behind an `Advantage` function is that we calculate the relative advantage of an action compared to the others possible at a state, averaging them. + +In other words: how taking that action at a state is better compared to the average value of the state + +Advantage in A2C + +
+ +Congrats on finishing this Quiz 🥳, if you missed some elements, take time to read the chapter again to reinforce (😏) your knowledge.