# Logic-RL **Repository Path**: math345/Logic-RL ## Basic Information - **Project Name**: Logic-RL - **Description**: No description available - **Primary Language**: Unknown - **License**: Apache-2.0 - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-02-10 - **Last Updated**: 2025-02-10 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # Logic Rl ## 🎉 **Successfully reproduced DeepSeek R1 Zero on 2K Logic Puzzle Dataset.** ## 📢 **Our detailed technical report is coming soon! Stay tuned!** See project explanation : [here](https://evxpwrsfkdb.feishu.cn/docx/NokEdaMBmo6aqZxVdxkcSm2cnab?from=from_copylink). Wandb project : [here](https://wandb.ai/ustc_ai/GRPO_logic_KK/reports/GRPO-Zero--VmlldzoxMTIwOTYyNw?accessToken=gnbnl5mu5pwfww7gtwxymohg85w7d7vthvjvbl4w8yxg0a99vf1k22m11e61cvv8). --- ## Enhanced Features (After Rule-Based RL) | 🚩 Uncertainty Marking | 📝 Progressive Summarization | ✅ Self Verification | 🌐 Multilingual Switching | |------------------------|-----------------------------|---------------------|--------------------------| | Flag ambiguous steps for verification | Maintain intermediate conclusions | First verify then answer | Chinese reasoning traces with English answers | --- ## 📸 Results Preview
![]() |
![]() |
Test Score Plot | Average Output Length Plot |
![]() |
Model Output Example |