Switchable Lightweight Anti-symmetric Processing (SLAP) with CNN to Reduce Sample Size and Speed up Learning – Application in Gomoku Reinforcement Learning

01/11/2023
by   Chi-Hang Suen, et al.
0

To replace data augmentation, this paper proposed a method called SLAP to intensify experience to speed up machine learning and reduce the sample size. SLAP is a model-independent protocol/function to produce the same output given different transformation variants. SLAP improved the convergence speed of convolutional neural network learning by 83 game states, with only one eighth of the sample size compared with data augmentation. In reinforcement learning for Gomoku, using AlphaGo Zero/AlphaZero algorithm with data augmentation as baseline, SLAP reduced the number of training samples by a factor of 8 and achieved similar winning rate against the same evaluator, but it was not yet evident that it could speed up reinforcement learning. The benefits should at least apply to domains that are invariant to symmetry or certain transformations. As future work, SLAP may aid more explainable learning and transfer learning for domains that are not invariant to symmetry, as a small step towards artificial general intelligence.

READ FULL TEXT
research
10/29/2020

Self-paced Data Augmentation for Training Neural Networks

Data augmentation is widely used for machine learning; however, an effec...
research
10/19/2019

Towards More Sample Efficiency in Reinforcement Learning with Data Augmentation

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
10/19/2019

Towards More Sample Efficiency inReinforcement Learning with Data Augmentation

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
09/24/2019

Invariant Transform Experience Replay

Deep reinforcement learning (DRL) is a promising approach for adaptive r...
research
02/25/2021

Learning with invariances in random features and kernel models

A number of machine learning tasks entail a high degree of invariance: t...
research
04/25/2021

Accuracy Improvement for Fully Convolutional Networks via Selective Augmentation with Applications to Electrocardiogram Data

Deep learning methods have shown suitability for time series classificat...
research
07/15/2019

Multi-hop Federated Private Data Augmentation with Sample Compression

On-device machine learning (ML) has brought about the accessibility to a...

Please sign up or login with your details

Forgot password? Click here to reset