Ferry

NOTE: readme is temporarily outdated, but the code for the python version is fairly solid, and this approach will be continued going forward

Ferry is a tool that enables interacting with Gymnasium environments over a network connection (including locally) via memory mapped files ("MemServer").

The potential applications include:

Simulating the environment on a separate machine
Interfacing with environments built with different languages, without a dedicated Python link
(future) Interacting with environments running in real time

The main intent is interfacing with games built in powerful engines like Unity and Unreal. Adding a client or a server in the environment code will expose it for interaction with the standard Gymnasium API.

There are two possible paradigms -- the environment runs either as a server, or as a client.

ClientEnv has a relatively intuitive interpretation. The server maintains an instance of the environment, and calls its methods according to the MemServer calls. The user (or the RL algorithm) calls the methods of ClientEnv, which in turn calls the MemServer methods on the server.

ServerEnv works the other way around. It expects that the user creates a server which implements a policy, and the environment lives in a client which can query that policy. When the client queries the server, it sends an observation, and receives the following observation.

In summary, in ClientEnv:

The underlying environment logic lives on the server
The Env instance exists in the client
The algorithmic logic is in the client

In ServerEnv:

The underlying environment logic is in the client
The Env instance exists on the server
The algorithmic logic is on the server

The ServerEnv implementation is inspired by ML-Agents, but we generally recommend using ClientEnv.

TODO: profiling with fast/slow languages on the server/client

Protocol

ClientBackend - ServerEnv:

At the beginning, there's a handshake, client sends, server also sends
Backend starts execution, performing initial setup
Backend sends an initial request, the response must be a ResetArgs
In a loop, Backend sends current ORTTI and listens for a response. Response can be either ResetArgs or Action

IMPORTANT NOTE: step returns only after the backend reaches a new decision step and sends a new request.

redtachyon / ferry Goto Github PK

ferry's Introduction

Ferry

Protocol

ferry's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs