Evolutionary Social Dilemma Games

This is a repo for the capstone project of the Bio-Inspired AI course at the University of Trento. Agents are represented by decision trees, whose structure is defined by an evolving grammar, that is the genome. We seek for understanding of possible emergent social strategies emphasised by natural selection, while using interpretable algorithms.

Supported MARL environments

Level-based foraging

Added features

Generation-wise player event logging
Hall of Fame checkpointing
Data analysis scripts

Training

Run in single-tree evolving mode:

    python run_base_script.py --jobs 16 --stats exp1 --players 5 --food 3 --field_size 8 --sight 5 --seed 42 --n_actions 6 --learning_rate 0.001 --df 0.7 --episodes 50 --lambda_ 50 --generations 50 --cxp 0.5 --mp 0.7 --low -1 --up 1 --mutation "function-tools.mutUniformInt#low-0#up-40000#indpb-0.05" --eps 0.4 --episode_len 200

Run in advanced, multi-tree evolving mode:

 python run_multi_dt.py --config_file

Following the configuration format:

 {
    "stats": "exp1_19_04_23",
    "players": 5,
    "field_size": 8,
    "sight": 8,
    "food": 3,
    "cooperation": false,
    "grid": true,
    "max_player_level": 5,
    "jobs": 16,
    "seed": 42,
    "n_actions": 6,
    "learning_rate": "auto",
    "df": 0.9,
    "eps": 1,
    "episodes": 50,
    "episode_len": 200,
    "lambda_": 30,
    "generations": 50,
    "cxp": 0.1,
    "mp": 1,
    "mutation": {
        "function": "tools.mutUniformInt",
        "low": 0,
        "up": 40000,
        "indpb": 0.05
    },
    "crossover": {
        "function": "tools.cxOnePoint"
    },
    "selection": {
        "function": "tools.selTournament",
        "tournsize": 2
    },
    "random_init": false,
    "decay": 0.99,
    "genotype_len": 100,
    "low": -1,
    "up": 1
}

Parameters:

    usage: run_base_script.py [-h] [--players PLAYERS] [--field_size FIELD_SIZE] [--sight SIGHT] [--food FOOD]
                          [--cooperation COOPERATION] [--grid GRID] [--max_player_level MAX_PLAYER_LEVEL] [--jobs JOBS]
                          [--seed SEED] [--n_actions N_ACTIONS] [--learning_rate LEARNING_RATE] [--df DF] [--eps EPS]
                          [--episodes EPISODES] [--episode_len EPISODE_LEN] [--lambda_ LAMBDA_] [--generations GENERATIONS]
                          [--cxp CXP] [--mp MP] [--mutation MUTATION] [--crossover CROSSOVER] [--selection SELECTION]
                          [--genotype_len GENOTYPE_LEN] [--low LOW] [--up UP]

    optional arguments:
    -h, --help            show this help message and exit
    --players PLAYERS     Number of players involved.
    --field_size FIELD_SIZE
                            # of tiles of space lenght. The final game space is a NxN area.
    --sight SIGHT         # of tiles of view range. The final view is a KxK area.
    --food FOOD           Max # of food laid on the ground.
    --cooperation COOPERATION
                            Force players cooperation.
    --grid GRID           Use the grid observation space.
    --max_player_level MAX_PLAYER_LEVEL
                            Max achievable player (and food?) level
    --jobs JOBS           The number of jobs to use for the evolution
    --seed SEED           Random seed
    --n_actions N_ACTIONS
                            The number of action that the agent can perform in the environment
    --learning_rate LEARNING_RATE
                            The learning rate to be used for Q-learning. Default is: 'auto' (1/k)
    --df DF               The discount factor used for Q-learning
    --eps EPS             Epsilon parameter for the epsilon greedy Q-learning
    --episodes EPISODES   Number of episodes that the agent faces in the fitness evaluation phase
    --episode_len EPISODE_LEN
                            The max length of an episode in timesteps
    --lambda_ LAMBDA_     Population size
    --generations GENERATIONS
                            Number of generations
    --cxp CXP             Crossover probability
    --mp MP               Mutation probability
    --mutation MUTATION   Mutation operator. String in the format function-value#function_param_-value_1... The operators from    
                            the DEAP library can be used by setting the function to 'function-tools.<operator_name>'. Default:      
                            Uniform Int Mutation
    --crossover CROSSOVER
                            Crossover operator, see Mutation operator. Default: One point
    --selection SELECTION
                            Selection operator, see Mutation operator. Default: tournament of size 2
    --genotype_len GENOTYPE_LEN
                            Length of the fixed-length genotype
    --low LOW             Lower bound for the random initialization of the leaves
    --up UP               Upper bound for the random initialization of the leaves

Testing

To visualize the behavior of evolved agents in game sessions:

    python evaluate.py --config config.json --genome stats/competitive6_29_04_23/generations/gen_36/hof.json

Name		Name	Last commit message	Last commit date
Latest commit History 24 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
configs		configs
dt		dt
lbforaging		lbforaging
src		src
.gitignore		.gitignore
README.md		README.md
distributed_genes.py		distributed_genes.py
eval.json		eval.json
evaluate.py		evaluate.py
experiments.md		experiments.md
full_run.sh		full_run.sh
grammatical_evolution.py		grammatical_evolution.py
requirements.txt		requirements.txt
run_arena.py		run_arena.py
run_base_script.py		run_base_script.py
run_custom.py		run_custom.py
run_multi_dt.py		run_multi_dt.py
stats.ipynb		stats.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evolutionary Social Dilemma Games

Supported MARL environments

Added features

Training

Testing

Papers

About

Releases

Packages

Languages

halixness/evolutionary-social-dilemma-games

Folders and files

Latest commit

History

Repository files navigation

Evolutionary Social Dilemma Games

Supported MARL environments

Added features

Training

Testing

Papers

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages