The Top 207 Policy Gradient Open Source Projects on GitHub
Reinforcement Learning With Tensorflow
⭐
5,721
Simple Reinforcement learning tutorials
Tianshou
⭐
4,011
An elegant PyTorch deep reinforcement learning library.
Reinforcement Learning
⭐
3,220
Learn Deep Reinforcement Learning in 60 days! Lectures & Code in Python. Reinforcement Learning + Deep Learning
Easy Rl
⭐
2,913
A Chinese-language reinforcement learning tutorial. Read online at https://datawhalechina.github.io/easy-rl/
Reinforcement Learning
⭐
2,797
Minimal and Clean Reinforcement Learning Examples
Minimalrl
⭐
1,944
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Deep Reinforcement Learning With Pytorch
⭐
1,645
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3, and more.
Slm Lab
⭐
987
Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".
Btgym
⭐
825
Scalable, event-driven, deep-learning-friendly backtesting library
Pytorch Rl
⭐
638
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
Rlseq2seq
⭐
610
Deep Reinforcement Learning For Sequence to Sequence Models
Hands On Reinforcement Learning With Python
⭐
596
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Ppo Pytorch
⭐
524
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Tensorflow Reinforce
⭐
477
Implementations of Reinforcement Learning Models in Tensorflow
Deer
⭐
462
DEEp Reinforcement learning framework
Seqgan
⭐
441
A simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Awesome Monte Carlo Tree Search Papers
⭐
439
A curated list of Monte Carlo tree search papers with implementations.
Deep Rl Keras
⭐
436
Keras Implementation of popular Deep RL Algorithms (A3C, DDQN, DDPG, Dueling DDQN)
Rl_algorithms
⭐
412
Structural implementation of RL key algorithms
Lagom
⭐
365
lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.
Reinforcement_learning_tutorial_with_demo
⭐
357
Reinforcement Learning Tutorial with Demo: DP (Policy and Value Iteration), Monte Carlo, TD Learning (SARSA, Q-Learning), Function Approximation, Policy Gradient, DQN, Imitation, Meta Learning, Papers, Courses, etc.
Pytorch Rl
⭐
356
This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch
Trpo
⭐
315
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
Openai_lab
⭐
314
An experimentation framework for Reinforcement Learning using OpenAI Gym, Tensorflow, and Keras.
Text_summurization_abstractive_methods
⭐
310
Multiple implementations of abstractive text summarization, using Google Colab
Reinforcement Learning Kr
⭐
238
Examples from the book [Reinforcement Learning with Python and Keras]
Multihopkg
⭐
227
Multi-hop knowledge graph reasoning learned via policy gradient with reward shaping and action dropout
Handyrl
⭐
186
HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
A2c
⭐
159
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
Deep Algotrading
⭐
145
A resource for learning about deep learning techniques from regression to LSTM and Reinforcement Learning using financial data and the fitness functions of algorithmic trading
Show Adapt And Tell
⭐
142
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
Policy Gradient
⭐
124
Minimal Monte Carlo Policy Gradient (REINFORCE) Algorithm Implementation in Keras
Deeprl_algorithms
⭐
112
Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)
Paddle Rlbooks
⭐
108
Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.
Torchrl
⭐
107
Highly Modular and Scalable Reinforcement Learning
Mlds2018spring
⭐
106
Machine Learning and having it Deep and Structured (MLDS) in 2018 spring
Reinforcement_learning
⭐
104
Reinforcement learning tutorials
Deep Reinforcement Learning With Python
⭐
94
Master classic RL, deep RL, distributional RL, inverse RL, and more using OpenAI Gym and TensorFlow with extensive Math
Reinforcement_learning
⭐
89
Implementations of basic reinforcement learning algorithms
Yarll
⭐
82
Combining deep learning and reinforcement learning.
Codegan
⭐
74
[Deprecated] Source Code Generation using Sequence Generative Adversarial Networks
Rl Course Experiments
⭐
73
Pytorch Rl
⭐
64
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Sqddpg
⭐
62
A framework for research on multi-agent reinforcement learning, with implementations of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
Imitation_learning
⭐
56
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Drl_in_cv
⭐
54
A course on Deep Reinforcement Learning in Computer Vision.
Spinning Up A Pong Ai With Deep Rl
⭐
53
Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.
Fruit Api
⭐
50
A Universal Deep Reinforcement Learning Framework
Cst_captioning
⭐
48
PyTorch Implementation of Consensus-based Sequence Training for Video Captioning
Reinforcementlearning
⭐
48
Reinforcing Your Learning of Reinforcement Learning
Photo Editing Tensorflow
⭐
47
Photo Optimizing Adversarial Net with Policy Gradient Method
Sharkstock
⭐
43
Automate swing trading using deep reinforcement learning. The deep deterministic policy gradient-based neural network model trains to choose an action to sell, buy, or hold the stocks to maximize the gain in asset value. The paper also acknowledges the need for a system that predicts the trend in stock value to work along with the reinforcement learning algorithm. We implement a sentiment analysis model using a recurrent convolutional neural network to predict the stock trend from the financial news. The objective of this paper is not to build a better trading bot, but to prove that reinforcement learning is capable of learning the tricks of stock trading.
Pytorch Learn Reinforcement Learning
⭐
42
A collection of various RL algorithms such as policy gradients, DQN, and PPO. The goal of this repo is to become a go-to resource for learning about RL: how to visualize, debug, and solve RL problems. I've additionally included playground.py for learning more about OpenAI Gym, etc.
Reinforcement Learning
⭐
39
Personal experiments on Reinforcement Learning
Rl_implementations
⭐
36
Policy Gradient Methods
⭐
35
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Chainer Seqgan
⭐
34
Implementation of SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient
Seqgan Pytorch
⭐
30
Implementation of Sequence Generative Adversarial Nets with Policy Gradient in PyTorch
On The Fly Fgsbir
⭐
30
[CVPR 2020, Oral] "Sketch Less for More: On-the-Fly Fine-Grained Sketch Based Image Retrieval", IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.
Explorer
⭐
30
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
Deep_rl_acrobot
⭐
30
TensorFlow A2C to solve Acrobot, with synchronized parallel environments
Optimization_of_image_description_metrics_using_policy_gradient_methods
⭐
29
TensorFlow implementation of the paper "Optimization of image description metrics using policy gradient methods"
Rl Implementation Impala
⭐
28
A test implementation of the IMPALA algorithm (DeepMind, 2018)
Connect4
⭐
27
Solving board games like Connect4 using Deep Reinforcement Learning
Policy Gradient Pong
⭐
27
TensorFlow implementation of Andrej Karpathy's blog post about reinforcement learning. http://karpathy.github.io/2016/05/31/rl/
Practical_rl
⭐
24
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
Td Reg
⭐
24
TD-Regularized Actor-Critic Methods
Deep Reinforcement Learning
⭐
23
A collection of several Deep Reinforcement Learning techniques (Deep Q Learning, Policy Gradients, ...), gets updated over time.
Ppo_tf
⭐
23
Implementation of proximal policy optimization (PPO) with TensorFlow
Parl Sample
⭐
20
Deep reinforcement learning using Baidu PARL (maze, Flappy Bird, and so on)
Rl
⭐
19
A set of RL experiments. Currently including: (1) the MDP rank experiment, based on a policy gradient algorithm
Adl2019
⭐
19
Applied Deep Learning (2019 Spring) @ NTU
Rpg
⭐
18
Ranking Policy Gradient
Image Captioning Gan
⭐
17
Deep Reinforcement Learning Algorithm Collection
⭐
17
Collection of Deep Reinforcement Learning Algorithms implemented in PyTorch.
Deep Reinforcement Learning Cs285 Pytorch
⭐
17
Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework
Deep_rl_pong_keras
⭐
16
Deep Reinforcement Learning Policy Gradients Method - Pong game - Keras
Deep Rl Mxnet
⭐
16
Mxnet implementation of Deep Reinforcement Learning papers, such as DQN, PG, DDPG, PPO
Emotional_dialogue
⭐
15
A Deep Reinforcement Learning Approach (LSTM + policy gradient) to create a chatbot that produces coherent, emotional dialogue.
Trpo Tensorflow
⭐
15
Trust Region Policy Optimization (TRPO) in pure TensorFlow
Policy Gradient Methods
⭐
15
Modular PyTorch implementation of policy gradient methods
Sources Of Reinforcement Learning
⭐
15
All the source code and lectures for reinforcement learning.
Taa Pg
⭐
15
Usage of policy gradient reinforcement learning to solve portfolio optimization problems (Tactical Asset Allocation).
Mips
⭐
14
Minimal Policy Search Toolbox
Deep_trading
⭐
14
This project aims to select a supervised algorithm that can predict stock prices based on historical data, and to use the resulting predictor to form trading strategies.
Pg_rnn
⭐
13
There are a few caveats when you want to use a Recurrent Neural Network (RNN) policy with Policy Gradient algorithms. This repository explains them and provides a solution. Please see the blog for more details.
Tensorl
⭐
12
Simple and self-contained TensorFlow implementation of reinforcement learning algorithms for continuous control, integrated with OpenAI Gym and other physics engines.
Actor Critic Pytorch
⭐
12
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
Deep Rl Torcs
⭐
12
Autonomous Navigation using Deep Reinforcement Learning
Neural Architecture Search
⭐
11
Re-implementation of Neural Architecture Search using Reinforcement Learning
Rl Botics
⭐
10
Deep Reinforcement Learning Toolbox for Robotics using Keras and TensorFlow
Rl_webots
⭐
10
Webots project to show how to use Deep Reinforcement Learning with Webots in C++.
Lwdrlc
⭐
10
Lightweight deep RL library for continuous control.
Pong With Policy Gradients
⭐
9
Code for an intro to RL workshop. You'll be training a simple agent to play pong using policy gradients. Adapted from http://karpathy.github.io/2016/05/31/rl/
Ct_keras_pong
⭐
9
An AI that plays Atari 2600 Pong. Trained with reinforcement learning using OpenAI Gym and Keras
Policygradient_ponggame
⭐
9
Pong Game problem solving using RL - Policy Gradient with OpenAI Gym Framework and Tensorflow
Ddpg_numpy_only
⭐
9
Implementation of DDPG with NumPy only (without TensorFlow)
Rlin200lines
⭐
9
PyTorch implementations of Reinforcement Learning algorithms in fewer than 200 lines
Pacman Rl
⭐
9
Implements several reinforcement learning algorithms, tested and visualized on Pacman.
Deep Bayesian Quadrature Policy Optimization
⭐
9
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
