desTROY

Learning to play the TROY game using reinforcement learning methods such as policy-gradient actor-critic models, Deep Q-Learning, Deep Deterministic Policy Gradient (DDPG), and Asynchronous Advantage Actor-Critic (A3C), with technologies such as PyTorch and Cython.

results

gen0

Starting training on the actor-critic model.

gen1

After 3 hours of training with the actor-critic model, the agent clearly recognizes the game boundaries and tries to avoid colliding with itself by going in circles.

The results are promising, but frankly, training with this algorithm is too slow for me, so I will come back to this problem later to explore the A3C and DDPG learning methods.
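
For readers unfamiliar with actor-critic training, here is a minimal sketch of the per-transition quantities such a model typically optimizes. It is not taken from `ac_runner.py`: the function names, the four-action layout, the discount factor, and the one-step TD advantage are all assumptions made for illustration, and it uses plain Python rather than PyTorch so it runs without dependencies.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of action logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def actor_critic_losses(logits, action, reward, value, next_value, done, gamma=0.99):
    """Return (actor_loss, critic_loss, advantage) for one transition.

    Hypothetical helper: the advantage is the one-step TD error
        reward + gamma * V(s') - V(s)
    with V(s') treated as 0 when the episode has ended.
    """
    target = reward + (0.0 if done else gamma * next_value)
    advantage = target - value
    log_prob = math.log(softmax(logits)[action])
    actor_loss = -log_prob * advantage  # policy-gradient term
    critic_loss = advantage ** 2        # squared TD error for the value estimate
    return actor_loss, critic_loss, advantage


# Example: four equally likely moves, and the chosen move ends the episode
# with an assumed crash reward of -1.
actor_loss, critic_loss, advantage = actor_critic_losses(
    logits=[0.0, 0.0, 0.0, 0.0], action=1,
    reward=-1.0, value=0.5, next_value=0.0, done=True,
)
```

In a real training loop the advantage would be treated as a constant when computing the actor loss, and both losses would be averaged over a batch before the gradient step.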

gen2

After 1 hour of training a dueling double deep Q-network (dueling DDQN), results are much better, even though each episode takes longer to run (due to bigger batches from the replay buffer).
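
The two ideas behind a dueling double DQN can be sketched in a few lines. This is an illustration in plain Python, not the code in `dueling_ddqn_runner.py`: the function names and the example numbers are hypothetical, and a real implementation would compute these quantities on batched tensors.

```python
def dueling_q(value, advantages):
    """Combine a state value and per-action advantages into Q-values.

    Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean(A(s, .))
    """
    mean_adv = sum(advantages) / len(advantages)
    return [value + a - mean_adv for a in advantages]


def double_dqn_target(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Double DQN target for one transition.

    The online network selects the next action and the target network
    evaluates it, which reduces the overestimation of plain Q-learning.
    """
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Example with made-up numbers for three actions.
q_values = dueling_q(value=1.0, advantages=[1.0, 2.0, 3.0])
target = double_dqn_target(
    reward=1.0, done=False,
    next_q_online=[0.1, 0.9, 0.3],   # online net prefers action 1
    next_q_target=[5.0, 2.0, 7.0],   # target net scores action 1 as 2.0
    gamma=0.5,
)
```

Subtracting the mean advantage keeps the value and advantage streams identifiable, since otherwise a constant could be shifted freely between them.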

Trying to speed up learning by giving incentive rewards for survival and for strategically cutting off the opponent.
This GIF is from a training phase where I removed the opponent to see if the model converges faster. Since I'm using self-play, both players show this learned knowledge when in dueling mode.
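
One way to express the survival and cut-off incentives mentioned above is a shaped reward. The sketch below is an assumption, not the reward used in `troy_env.py`: it measures "cutting off" as the drop in the number of free cells the opponent can still reach (via flood fill), and every constant and function name is hypothetical.

```python
from collections import deque


def reachable_cells(grid, start):
    """Count free cells reachable from `start` with 4-neighbour flood fill.

    `grid[r][c]` is True where a cell is blocked (wall or trail).
    The start cell itself is not counted.
    """
    rows, cols = len(grid), len(grid[0])
    seen = {start}
    queue = deque([start])
    count = 0
    while queue:
        r, c = queue.popleft()
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            if (0 <= nr < rows and 0 <= nc < cols
                    and not grid[nr][nc] and (nr, nc) not in seen):
                seen.add((nr, nc))
                queue.append((nr, nc))
                count += 1
    return count


def shaped_reward(crashed, opp_space_before, opp_space_after,
                  survival_bonus=0.01, cutoff_scale=0.001, crash_penalty=-1.0):
    """Small bonus per surviving step plus a bonus for shrinking the
    opponent's reachable area; a fixed penalty on crashing."""
    if crashed:
        return crash_penalty
    squeezed = max(0, opp_space_before - opp_space_after)
    return survival_bonus + cutoff_scale * squeezed


# Example: a 3x3 board where a trail down the middle column walls off
# the opponent standing at the top-left corner.
open_board = [[False] * 3 for _ in range(3)]
split_board = [[False, True, False] for _ in range(3)]
before = reachable_cells(open_board, (0, 0))
after = reachable_cells(split_board, (0, 0))
step_reward = shaped_reward(False, before, after)
```

Keeping the shaping terms small relative to the crash penalty is a common precaution, so the agent does not learn to chase bonuses at the cost of losing the game.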

future work

add more algorithms
explore evolutionary strategies for this game (integrate my Evolve project with desTROY)

