# diffusion-rl **Repository Path**: mirrors_twitter/diffusion-rl ## Basic Information - **Project Name**: diffusion-rl - **Description**: No description available - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2022-08-13 - **Last Updated**: 2025-09-13 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Diffusion-Offline-RL In this work, we propose Diffusion-QL which utilizes a diffusion model as a highly expressive policy class for behavior cloning and policy regularization. In our approach we learn an action-value function and we add a term maximising action-values to the the training loss of the diffusion model, which results in a loss that seeks optimal actions that are near the behavior policy. ## Dependencies Plese see the ``requirements.txt`` file for the detailed python package dependencies for our project. ## Run our Code Running our code is quite easy, such as an example below, ```.bash python run_offline.py --env_name walker2d-medium-expert-v2 --algo pcq ```