mirror of
https://github.com/gsi-upm/sitc
synced 2026-02-08 23:58:17 +00:00
Update 2_6_1_Q-Learning_Exercises.ipynb
This commit is contained in:
committed by
GitHub
parent
11f260b782
commit
921eda4c9f
@@ -38,7 +38,7 @@
|
|||||||
"## Taxi\n",
|
"## Taxi\n",
|
||||||
"Analyze the [Taxi problem](https://gymnasium.farama.org/environments/toy_text/taxi/) and solve it applying Q-Learning. You can find a solution as the one previously presented [here](https://www.oreilly.com/learning/introduction-to-reinforcement-learning-and-openai-gym), and the notebook is [here](https://github.com/wagonhelm/Reinforcement-Learning-Introduction/blob/master/Reinforcement%20Learning%20Introduction.ipynb). Take into account that Gymnasium has changed, so you will have to adapt the code.\n",
|
"Analyze the [Taxi problem](https://gymnasium.farama.org/environments/toy_text/taxi/) and solve it applying Q-Learning. You can find a solution as the one previously presented [here](https://www.oreilly.com/learning/introduction-to-reinforcement-learning-and-openai-gym), and the notebook is [here](https://github.com/wagonhelm/Reinforcement-Learning-Introduction/blob/master/Reinforcement%20Learning%20Introduction.ipynb). Take into account that Gymnasium has changed, so you will have to adapt the code.\n",
|
||||||
"\n",
|
"\n",
|
||||||
"Analyze the impact of not changing the learning rate or changing it in a different way. "
|
"Analyze the impact of not changing the learning rate or changing it differently. "
|
||||||
]
|
]
|
||||||
},
|
},
|
||||||
{
|
{
|
||||||
@@ -49,7 +49,7 @@
|
|||||||
"Select one of the following exercises.\n",
|
"Select one of the following exercises.\n",
|
||||||
"\n",
|
"\n",
|
||||||
"## Blackjack\n",
|
"## Blackjack\n",
|
||||||
"Analyze how to appy Q-Learning for solving Blackjack.\n",
|
"Analyze how to apply Q-Learning for solving Blackjack.\n",
|
||||||
"You can find information in this [article](https://gymnasium.farama.org/tutorials/training_agents/blackjack_tutorial/).\n",
|
"You can find information in this [article](https://gymnasium.farama.org/tutorials/training_agents/blackjack_tutorial/).\n",
|
||||||
"\n",
|
"\n",
|
||||||
"## Doom\n",
|
"## Doom\n",
|
||||||
@@ -82,7 +82,7 @@
|
|||||||
"cell_type": "markdown",
|
"cell_type": "markdown",
|
||||||
"metadata": {},
|
"metadata": {},
|
||||||
"source": [
|
"source": [
|
||||||
"The notebook is freely licensed under under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
"The notebook is freely licensed under the [Creative Commons Attribution Share-Alike license](https://creativecommons.org/licenses/by/2.0/). \n",
|
||||||
"\n",
|
"\n",
|
||||||
"© Carlos Á. Iglesias, Universidad Politécnica de Madrid."
|
"© Carlos Á. Iglesias, Universidad Politécnica de Madrid."
|
||||||
]
|
]
|
||||||
|
|||||||
Reference in New Issue
Block a user