Add Randomizers: model and physics #177

diegoferigo · 2020-04-19T17:33:57Z

This PR complements #176 and adds the missing piece used in the new Python package: Randomizers.

Before, the combination of Task + Runtime was enough to make an environment. That is no longer the case. Take the following as a reference:

Updated launch_cartpole.py example.
CartPoleDiscreteBalancing task, same as before.
Updated GazeboRuntime that works with the new ScenarI/O APIs.
Environment registration same as before.

This structure is comparable to what we used to have before. However, the env obtained from the gym.make factory is not usable by default.

Randomizers are developed as environment wrappers, and they mainly extend the gym.Env.reset method. They will add the model used by the task in the world [1] and optionally perform one or more of the following steps:

Randomize the model using the new SDFRandomizer (for example the dynamics parameters of the model - mass, inertia, joint friction, ... - and the friction with ground). See the test in 52d16f3 for an example.
Randomize the physics (at the moment we only support loading DART and randomizing gravity, but as soon as ign-physics has more back-ends, the physics engine in use and other exposed parameters could be randomized).

Two randomizer examples, one that does not apply any randomization and another that fully randomizes what's currently supported, can be seen in 4d47415.

[1] This was previously done by the runtime since it was the only setup we were supporting. Now the randomizer could add as many models as are required. For example, in manipulation environments they could reset the manipulator and the work area.
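To make the wrapper idea concrete, here is a minimal, standard-library-only sketch of a randomizer that extends reset. It does not use gym or the gym_ignition API: BareEnv, EnvRandomizer, world_models and gravity_z are hypothetical names chosen for illustration, and the gravity distribution is an arbitrary assumption.

```python
import random


class BareEnv:
    """Hypothetical stand-in for the env returned by gym.make: its world is empty."""

    def __init__(self):
        self.world_models = {}  # model name -> model description
        self.gravity_z = -9.8

    def reset(self):
        if "cartpole" not in self.world_models:
            raise RuntimeError("no model in the world: wrap the env in a randomizer")
        return [0.0, 0.0, 0.0, 0.0]  # dummy observation


class EnvRandomizer:
    """Hypothetical wrapper that extends reset(), in the spirit of the description above."""

    def __init__(self, env, seed=None, randomize_physics=False):
        self.env = env
        self.rng = random.Random(seed)
        self.randomize_physics = randomize_physics

    def reset(self):
        # 1. (Re)insert the model used by the task into the world.
        self.env.world_models["cartpole"] = {"pole_mass": 1.0}
        # 2. Optionally randomize the physics (only gravity in this sketch).
        if self.randomize_physics:
            self.env.gravity_z = self.rng.gauss(-9.8, 0.2)
        # 3. Delegate to the wrapped environment.
        return self.env.reset()


# A wrapper that performs no randomization still makes the bare env usable.
observation = EnvRandomizer(BareEnv()).reset()
```

Seeding the wrapper's own random generator, as done here, is one simple way to keep randomized resets reproducible.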

diegoferigo · 2020-04-20T13:54:55Z

The SDFRandomizer was designed after the discussion of #41 (comment). It was inspired by ryanjulian/rllab#61 and rlworkgroup/garage#51. If you're curious about the pattern used, you can refer to the Builder Pattern that produces a Fluent Interface.

The MuJoCo randomization is somewhat more complicated than what we can do with SDF. In fact, MuJoCo models store most of their parameters in the attributes of the DOM, whereas SDF always uses elements.

Here is a list of additional features I implemented:

The randomizer can operate on multiple elements matching an XPath. This means that, for example, a single configuration entry can randomize the masses of all the links with the same statistics.
Novel Method::Additive to alter the variable with additive noise. This is useful in combination with a Gaussian distribution, since we can add zero-mean noise.
Novel force_positive method. Depending on the selected randomization method and distribution, a randomized value could become less than zero. To avoid seeing dragons, e.g. with negative masses, this method clips randomized values to force them to be positive.
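The three features above can be sketched with the standard library alone. This is not the SDFRandomizer implementation (which depends on lxml): the randomize_additive helper, the toy SDF string and the sigma value are assumptions made for illustration, and xml.etree.ElementTree supports only a subset of XPath.

```python
import random
import xml.etree.ElementTree as ET

# Toy SDF: the masses are stored in elements, not in attributes.
SDF = """
<sdf version="1.7">
  <model name="cartpole">
    <link name="cart"><inertial><mass>1.0</mass></inertial></link>
    <link name="pole"><inertial><mass>0.1</mass></inertial></link>
  </model>
</sdf>
"""


def randomize_additive(root, xpath, rng, sigma, force_positive=True):
    """Add zero-mean Gaussian noise to the text of every element matching xpath."""
    matches = root.findall(xpath)
    for element in matches:
        value = float(element.text) + rng.gauss(0.0, sigma)
        if force_positive:
            # Clip so that quantities such as masses never become negative.
            value = max(value, 0.0)
        element.text = repr(value)
    return len(matches)


root = ET.fromstring(SDF)
rng = random.Random(0)

# A single entry randomizes the masses of all the links with the same statistics.
count = randomize_additive(root, "./model/link/inertial/mass", rng, sigma=0.5)
masses = [float(mass.text) for mass in root.iter("mass")]
```

With sigma=0.5 and a nominal pole mass of 0.1, unclipped samples can easily fall below zero, which is the situation the clipping step guards against.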

raffaello-camoriano · 2020-04-23T12:46:15Z

This structure is comparable to what we used to have before. However, the env obtained from the gym.make factory is not usable by default.

As far as I can tell from the above comment and from 4d47415#diff-64386c8ba5955b2a8256341650413e50, in a use case without any need for randomization the user would be required anyways to use an extension wrapper GazeboEnvRandomizer around the env produced by gym.make.
Is this correct? Would it be feasible to make the raw gym.make env compatible with the framework?

diegoferigo · 2020-04-23T13:30:35Z

in a use case without any need for randomization the user would be required anyways to use an extension wrapper GazeboEnvRandomizer around the env produced by gym.make.
Is this correct? Would it be feasible to make the raw gym.make env compatible with the framework?

It is correct and it was a consequence of the redesign. If you want the wrapped Task class to be independent from Gazebo, using a randomizer that just inserts the model and performs no randomizations is the only solution. In fact, the method World.insertModel(urdf, pose, name) is specific to Gazebo, and if used in the Task class it will prevent the task from working in a RealTimeRuntime.

However, in the case of a Task class that's supposed to work only with the Gazebo runtime (i.e. it does not target real-time transparency), populating the world in the Task.reset_task method is a great solution. In this case, a randomizer is not necessary. I always consider generic tasks in my explanations since we're aiming for runtime transparency, but maybe for many users it is not such a strict requirement. We should keep it in mind when writing the documentation.

raffaello-camoriano · 2020-04-23T22:02:21Z

in a use case without any need for randomization the user would be required anyways to use an extension wrapper GazeboEnvRandomizer around the env produced by gym.make.
Is this correct? Would it be feasible to make the raw gym.make env compatible with the framework?

It is correct and it was a consequence of the redesign. If you want the wrapped Task class to be independent from Gazebo, using a randomizer that just inserts the model and performs no randomizations is the only solution. In fact, the method World.insertModel(urdf, pose, name) is specific to Gazebo, and if used in the Task class it will prevent the task from working in a RealTimeRuntime.

However, in the case of a Task class that's supposed to work only with the Gazebo runtime (i.e. it does not target real-time transparency), populating the world in the Task.reset_task method is a great solution. In this case, a randomizer is not necessary.

I see, thanks.

I always consider generic tasks in my explanations since we're aiming for runtime transparency, but maybe for many users it is not such a strict requirement. We should keep it in mind when writing the documentation.

Agreed.

python/gym_ignition/randomizers/base/physics.py

raffaello-camoriano · 2020-04-23T22:33:51Z

python/gym_ignition/randomizers/base/model.py

+    @abc.abstractmethod
+    def randomize_model(self) -> str:
+        """
+        Randomize the model.


In the future documentation it would be very useful to have a list of all the randomizable quantities.

Given that you can access all the elements through XPath patterns, you can randomize everything in the SDF. The only constraint is that randomized quantities must be elements and not attributes. This was a design choice that matches SDF, where all relevant data are stored in elements.

Users may find it useful to refer to a set of examples. The cartpole ones in 4d47415 are a great starting point.
In the future, it may be valuable to also provide an example involving a more complex robot like Panda or iCub.

This PR introduces a test with a Panda:

https://github.com/robotology/gym-ignition/blob/b1ff4419743c71e49e457b42213170a6c0f2f415/tests/test_gym_ignition/test_sdf_randomizer.py#L215

In general, tests are a great piece of implicit documentation that many ignore.

python/gym_ignition/randomizers/base/task.py

python/gym_ignition/randomizers/model/sdf.py

raffaello-camoriano · 2020-04-24T02:18:19Z

python/gym_ignition/randomizers/model/sdf.py

+    def insert(self, randomization_data) -> None:
+        self._randomizations.append(randomization_data)


Why are the randomization configuration data appended? How is self._randomizations organized? Is it a list with a randomization_data configuration object for each element to be randomized?

Tip: always check the type hints; they're there to help you understand the type of the variables even in a language that is not statically typed.

You will find that _randomizations is a List[RandomizationData]. It is used in the add method of the *Builder to insert the new randomization in a buffer of this class.

See documentation ed0c1cb.

Ok, thanks.
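For readers unfamiliar with the pattern discussed in this thread, the following is a minimal sketch of a builder that exposes a fluent interface and stores each finished configuration in a List[RandomizationData]. Only insert, add, _randomizations and the RandomizationData name come from the discussion above. The fields of RandomizationData, the Randomizer class, and the new_randomization, at_xpath, method and force_positive builder methods are assumptions made for illustration and may not match the real SDFRandomizer.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class RandomizationData:
    """Configuration of one randomization (the fields are hypothetical)."""

    xpath: Optional[str] = None
    method: Optional[str] = None
    force_positive: bool = False


class RandomizationDataBuilder:
    """Collects the settings of one randomization through chained calls."""

    def __init__(self, randomizer: "Randomizer") -> None:
        self._randomizer = randomizer
        self._data = RandomizationData()

    def at_xpath(self, xpath: str) -> "RandomizationDataBuilder":
        self._data.xpath = xpath
        return self  # returning self is what makes the interface fluent

    def method(self, method: str) -> "RandomizationDataBuilder":
        self._data.method = method
        return self

    def force_positive(self) -> "RandomizationDataBuilder":
        self._data.force_positive = True
        return self

    def add(self) -> None:
        # Hand the finished configuration over to the randomizer's buffer.
        self._randomizer.insert(self._data)


class Randomizer:
    def __init__(self) -> None:
        self._randomizations: List[RandomizationData] = []

    def new_randomization(self) -> RandomizationDataBuilder:
        return RandomizationDataBuilder(self)

    def insert(self, randomization_data: RandomizationData) -> None:
        self._randomizations.append(randomization_data)


randomizer = Randomizer()
randomizer.new_randomization().at_xpath("*/link/inertial/mass").method("additive").force_positive().add()
randomizer.new_randomization().at_xpath("*/joint/axis/dynamics/friction").method("absolute").add()
```

Each call to add appends one entry, so the buffer ends up holding one configuration object per configured randomization, which is why the data are appended rather than assigned.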

python/gym_ignition/randomizers/physics/dart.py

python/gym_ignition/runtimes/gazebo_runtime.py

diegoferigo added complexity::medium pr::status::in-progress labels Apr 19, 2020

diegoferigo self-assigned this Apr 19, 2020

diegoferigo force-pushed the refactor/randomizers branch 4 times, most recently from 8569b90 to 52d16f3 Compare April 20, 2020 13:26

diegoferigo force-pushed the refactor/randomizers branch 4 times, most recently from 240d2ad to f6e7d78 Compare April 20, 2020 15:26

diegoferigo added pr::status::feedback and removed pr::status::in-progress labels Apr 20, 2020

diegoferigo marked this pull request as ready for review April 20, 2020 15:27

diegoferigo requested review from paolo-viceconte, traversaro and raffaello-camoriano April 20, 2020 15:28

diegoferigo added 12 commits April 20, 2020 18:40

New model, physics, and task randomizers base classes

2216756

New SDFRandomizer class

6789a9c

New DART physics dummy randomizer

00416a9

Use DART randomizer in GazeboRuntime

64d954a

New GazeboEnvRandomizer

e4f89a4

Register demo environments of gym_ignition_environments

0015a94

Add lxml dependency

fdef53c

New environment randomizers

c2b74a2

Add URDF to SDF conversion helpers

b5673c9

Add test for SDFRandomizer

3b7405d

Update example

ed89799

Add test to ensure randomizers reproducibility

1781356

diegoferigo force-pushed the refactor/randomizers branch from f6e7d78 to 1781356 Compare April 20, 2020 16:40

raffaello-camoriano reviewed Apr 23, 2020

python/gym_ignition/randomizers/base/physics.py (outdated, resolved)

raffaello-camoriano reviewed Apr 23, 2020

python/gym_ignition/randomizers/base/task.py (outdated, resolved)

raffaello-camoriano reviewed Apr 24, 2020

traversaro approved these changes Apr 24, 2020

diegoferigo added 3 commits April 26, 2020 12:36

Change method name to improve clarity

3a8e360

Fix typo

af2c664

Document the SDFRandomizer class

ed0c1cb

diegoferigo merged commit b1ff441 into robotology-legacy:refactoring Apr 26, 2020

diegoferigo added pr::resolution::merged and removed pr::status::feedback labels Apr 26, 2020

diegoferigo deleted the refactor/randomizers branch April 26, 2020 18:35

diegoferigo added the component::gym_ignition label May 6, 2020

diegoferigo mentioned this pull request Jul 15, 2020

Randomized inertia tensor does not satisfy the triangle inequality #218

Open

diegoferigo mentioned this pull request Aug 12, 2020

Introduce ScenarI/O abstract classes #219

Merged
