Adjustments needed to make for running my task

Hi Antymon,
Thanks very much for the repo so that people can export the trained agent from python to c++.

I would like to confirm the changes I have to make in order to run my own task: 
1. Make my environment inherit from Env abstract class under env\env.hpp

2. Modify the main ppo2.cpp which creates instance of an environment and passes it to PPO

*I have two questions:
a. if I would like to run inference only, should I use algorithm.eval(obs) directly? As it seems it uses get_deterministic_action() and I noticed there are also step(const tensorflow::Tensor& obs) and value(const tensorflow::Tensor& obs) which I can't tell exactlly what the differences are between them.

b. It seems a way to resume training using your implementation so that online learning can be achieved?

3. Create own computational graph and potentially make some small modifications to the core algorithm if using more involved policies (currently implementation supports only MLP policies). Graph generation is mentioned below.
* I used 'tf.train.export_meta_graph(graph=model.graph, filename='my-model.meta', clear_devices=True, clear_extraneous_savers=True, strip_default_attrs=True)' as it mentioned [here](https://stable-baselines.readthedocs.io/en/master/guide/export.html) but it has many redundant tensors in the generated model when I visualize it. Some say its because some stuff used for training are preserved. I was wondering have you encountered such situation and how did you solve it?

Thank you again!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adjustments needed to make for running my task #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Adjustments needed to make for running my task #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions