Adding heterogeneous observation social dilemma environment#4
Adding heterogeneous observation social dilemma environment#4marimeireles wants to merge 38 commits intoBarfussLab:mainfrom
Conversation
… opacity to keep consistency among the code
…observations = 0.5 for partial observable agents
|
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
| return histSjA_RewardTensor(self.baseenv, self.h) | ||
|
|
||
| def ObservationTensor(self): |
There was a problem hiding this comment.
I've changed this function to be able to generate different observation tensors for each agent.
| @@ -0,0 +1,170 @@ | |||
| # AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/02_HeterogeneousObservationsEnv.ipynb. | |||
There was a problem hiding this comment.
Most of the changes are within this file. It's largely adapting the ebase file to deal with multiple observations.
| @@ -0,0 +1,127 @@ | |||
| # AUTOGENERATED! DO NOT EDIT! File to edit: ../../nbs/Environments/12_MultipleObsSocialDilemma.ipynb. | |||
There was a problem hiding this comment.
This file simply implements the social dilemma layer into the heterogeneous observation env. file.
I've initially tried to incorporate the "contract" idea because I saw this in the Uncertain Environment file, however, I don't really understand the dynamics of contract and I don't think it's fully functional, I need to work on it.
I thought it wasn't relevant for our project as IPD only has the one state .. Please let me know if I misunderstood this.
|
I'm also a bit confused on whether it's possible to have observation tensors summing for numbers > 1 or < 1. I guess the only reason why we cannot is because of the |
No description provided.