Deep reinforcement learning with smooth policy update: Application to robotic cloth manipulation
CRANK
Deep Reinforcement Learning (DRL), which can learn complex policies with high-dimensional observations as inputs, e.g., images, has been successfully …