# diffusion-rl

**Repository Path**: mirrors_twitter/diffusion-rl

## Basic Information

- **Project Name**: diffusion-rl
- **Description**: No description available
- **Primary Language**: Unknown
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 0
- **Created**: 2022-08-13
- **Last Updated**: 2026-04-04

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

# Diffusion-Offline-RL
In this work, we propose Diffusion-QL which utilizes a diffusion model as a highly expressive policy class for behavior cloning and policy regularization. In our approach we learn an action-value function and we add a term maximising action-values to the the training loss of the diffusion model, which results in a loss that seeks optimal actions that are near the behavior policy. 

## Dependencies
Plese see the ``requirements.txt`` file for the detailed python package dependencies for our project. 

## Run our Code
Running our code is quite easy, such as an example below, 
```.bash
python run_offline.py --env_name walker2d-medium-expert-v2 --algo pcq 
```