# UC-Berkeley-CS-294-Deep-RL
**Repository Path**: colin-cheng/UC-Berkeley-CS-294-Deep-RL
## Basic Information
- **Project Name**: UC-Berkeley-CS-294-Deep-RL
- **Description**: UC-Berkeley-CS-294-Deep-RL
- **Primary Language**: Unknown
- **License**: Not specified
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No
## Statistics
- **Stars**: 0
- **Forks**: 0
- **Created**: 2022-09-01
- **Last Updated**: 2022-09-02
## Categories & Tags
**Categories**: Uncategorized
**Tags**: None
## README
# CS 294 at UC Berkeley: Deep Reinforcement Learning
This repo includes my solutions to the assigments of the UC Berkeley Deep Reinforcement Learning course offered in Fall 2018, taught by Sergey Levine.
Course URL: http://rail.eecs.berkeley.edu/deeprlcourse/
Course GitHub: https://github.com/berkeleydeeprlcourse/homework
I would like to thank the instructor and the TAs for making this wonderful course.
## Code structure
- `./hw1/`: contains code for the implementation of **imitation learning**, including direct **behavior cloning** and the **DAgger** algorithm. See my report for this homework [here](./hw1/README.md).
- `./hw2/`: contains code for the implementation of **policy gradient** and its variants, including **variance reduction methods**. See my report for this homework [here](./hw2/README.md). Below you can see some cool videos showing the comparison between the learning agent before and after training using the policy gradient.
| |Before training | After training
|--|--|--|
| CartPole |
|
|
| LunarLander |
|
|
| InvertedPendulum |
|
|
| HalfCheetah |
|
|
- `./hw3/`: contains code for the implementation of **Deep Q-Learning** and **Actor-Critic**. See my report for this homework [here](./hw3/README.md). Below you can see some cool videos showing the comparison between the learning agent before and after training using Deep Q-Learning.
| |Before training | After training
|--|--|--|
| LunarLander |
|
|
`Pong Atari (green pedal is controlled by our learning agent) `
| After 0 timesteps | After 500K timesteps | After 1M timesteps | After 1.5M timesteps | After 2M timesteps
|--|--|--|--|--|
|
|
|
|
|
|
- `./hw4/`: contains code for the implementation of an algorithm for the **model-based reinforcement learning (MBRL)**. See my report for this homework [here](./hw4/README.md).