【Java学习+面试指南】 一份涵盖大部分Java程序员所需要掌握的核心知识。
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
Companion webpage to the book "Mathematics For Machine Learning"
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Important machine learning knowledge, each article deeply analyzes theoretical knowledge
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
code and docs for my EMNLP paper "DeepPath: A Reinforcement Learning Method for Knowledge Graph Reasoning"
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)