add pets environments and reward functions by runjerry · Pull Request #752 · HorizonRobotics/alf

runjerry · 2021-01-11T21:00:39Z

No description provided.

Haichao-Zhang · 2021-01-11T22:39:38Z

alf/examples/mbrl_reward_functions.py

+
+
+@gin.configurable
+def reward_function_for_pendulum(obs, action):


There is already a reward function for pendulum? It seems you are trying to organizing all mbrl reward functions in a single file here.

Haichao-Zhang · 2021-01-11T22:41:06Z

alf/examples/mbrl_reward_functions.py

+
+@gin.configurable
+def reward_function_for_halfcheetah(obs, action):
+    """Function for computing reward for gym CartPole environment. It takes


CartPole -> halfcheetah

Haichao-Zhang · 2021-01-11T22:41:18Z

alf/examples/mbrl_reward_functions.py

+
+@gin.configurable
+def reward_function_for_pusher(obs, action):
+    """Function for computing reward for gym CartPole environment. It takes


CartPole -> Pusher

Haichao-Zhang · 2021-01-11T22:41:32Z

alf/examples/mbrl_reward_functions.py

+
+@gin.configurable
+def reward_function_for_reacher(obs, action):
+    """Function for computing reward for gym CartPole environment. It takes


CartPole -> Reacher

Haichao-Zhang · 2021-01-11T22:50:28Z

alf/examples/mbrl_reward_functions.py

+
+
+@gin.configurable
+def reward_function_for_cartpole(obs, action):


About the reward functions, sometimes its association with the corresponding env/task is clear, such as pendulum, as that is a standard task from Gym.
Sometimes it might be necessary to make the association more explicit. For example,
The cartpole reward here is not for CartPole-v0 from Gym, which also a cartpole task but with discrete actions.
Similarly for others such the halfcheetah reward etc.

Haichao-Zhang · 2021-01-11T22:55:45Z

alf/examples/mbrl_reward_functions.py

+                new_rot_axis, new_rot_perp_axis, cur_end + length * new_rot_axis
+
+        cost = torch.sum(
+            torch.square(cur_end - common.get_gym_env_attr('goal')), dim=-1)


Will this way of retrieving the goal information still correct if we have multiple parallel environment?
It seems we are using
gym_env = _env.envs[0].gym in get_gym_env_attr in this case?

same concern

Haichao-Zhang · 2021-01-11T22:59:36Z

alf/environments/gym_pets/envs/assets/half_cheetah.xml

@@ -0,0 +1,95 @@
+<!-- Cheetah Model
+
+    The state space is populated with joints in the order that they are


For the xml files, not sure whether should check gym/mujoco license as well apart from reference to pets, if to include them.
Another possible way might be to provide pointers/scripts to download them?

emailweixu · 2021-01-15T23:50:22Z

alf/environments/gym_pets/envs/pusher.py

+
+from __future__ import division
+from __future__ import print_function
+from __future__ import absolute_import


future imports here and in other files can be removed

emailweixu · 2021-01-15T23:52:49Z

alf/examples/mbrl_reward_functions.py

+                new_rot_axis, new_rot_perp_axis, cur_end + length * new_rot_axis
+
+        cost = torch.sum(
+            torch.square(cur_end - common.get_gym_env_attr('goal')), dim=-1)


same concern

emailweixu · 2021-01-15T23:54:03Z

alf/environments/gym_pets/envs/cartpole.py

+from gym.envs.mujoco import mujoco_env
+
+
+class CartpoleEnv(mujoco_env.MujocoEnv, utils.EzPickle):


need descriptions for all these new environments.

add pets environments and reward functions

0d6ec55

runjerry requested review from Haichao-Zhang and emailweixu January 11, 2021 21:00

Haichao-Zhang requested changes Jan 11, 2021

View reviewed changes

emailweixu reviewed Jan 15, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add pets environments and reward functions#752

add pets environments and reward functions#752
runjerry wants to merge 1 commit intopytorchfrom
PR_mbrl_pets_env

runjerry commented Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

emailweixu Jan 15, 2021

Uh oh!

Haichao-Zhang Jan 11, 2021

Uh oh!

emailweixu Jan 15, 2021

Uh oh!

emailweixu Jan 15, 2021

Uh oh!

emailweixu Jan 15, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		@gin.configurable
		def reward_function_for_pendulum(obs, action):



		@gin.configurable
		def reward_function_for_cartpole(obs, action):

		@@ -0,0 +1,95 @@
		<!-- Cheetah Model

		The state space is populated with joints in the order that they are

		from gym.envs.mujoco import mujoco_env


		class CartpoleEnv(mujoco_env.MujocoEnv, utils.EzPickle):

Conversation

runjerry commented Jan 11, 2021

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants