https://github.com/takuseno/kiox

A composable experience replay buffer library

Host: GitHub
URL: https://github.com/takuseno/kiox
Owner: takuseno
License: mit
Created: 2022-01-27T10:12:11.000Z (almost 3 years ago)
Default Branch: master
Last Pushed: 2022-03-20T13:47:10.000Z (almost 3 years ago)
Last Synced: 2024-10-23T04:05:51.046Z (2 months ago)
Language: Python
Homepage:
Size: 143 KB
Stars: 2
Watchers: 5
Forks: 2
Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE

# kiox: A composable experience replay buffer library

[![PyPI version](https://badge.fury.io/py/kiox.svg)](https://badge.fury.io/py/kiox)

[![test](https://github.com/takuseno/kiox/actions/workflows/test.yml/badge.svg)](https://github.com/takuseno/kiox/actions/workflows/test.yml)

[![codecov](https://codecov.io/gh/takuseno/kiox/branch/master/graph/badge.svg?token=sI8AYW2kYp)](https://codecov.io/gh/takuseno/kiox)

[![Maintainability](https://api.codeclimate.com/v1/badges/f2f0d2bde462dbb37767/maintainability)](https://codeclimate.com/github/takuseno/kiox/maintainability)

![MIT](https://img.shields.io/badge/license-MIT-blue)

kiox is a composable experience replay buffer library.

```py
from kiox.kiox import Kiox
from kiox.transition_buffer import FIFOTransitionBuffer
from kiox.transition_factory import SimpleTransitionFactory

kiox = Kiox(FIFOTransitionBuffer(1000), SimpleTransitionFactory())

# collect experiences
# (observation, action, reward and terminal are placeholders for values from your environment)
kiox.collect(observation, action, reward, terminal)

# sample batch
batch = kiox.sample(256)
```

## key features

### :zap: Composable experience replay buffer

kiox is a composable and fully Pythonic library. You can add your own sampling algorithms and inject sampling-time logic (e.g. loading image data from disk just before sampling).
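To make the idea of composability concrete, here is a minimal standard-library sketch of a buffer that takes a pluggable sampler and a sampling-time hook. It is not kiox's API: `UniformSampler`, `ComposableBuffer` and `on_sample` are hypothetical names chosen for illustration, and the "disk" is a plain dictionary standing in for image files.

```python
import random
from collections import deque
from typing import Any, Callable, Deque, List, Optional


class UniformSampler:
    """Hypothetical sampler: picks indices uniformly at random, with replacement."""

    def __init__(self, seed: Optional[int] = None) -> None:
        self._rng = random.Random(seed)

    def sample_indices(self, size: int, batch_size: int) -> List[int]:
        if size == 0:
            raise ValueError("cannot sample from an empty buffer")
        return [self._rng.randrange(size) for _ in range(batch_size)]


class ComposableBuffer:
    """Hypothetical buffer: FIFO storage plus a pluggable sampler and hook."""

    def __init__(
        self,
        maxlen: int,
        sampler: UniformSampler,
        on_sample: Callable[[Any], Any] = lambda item: item,
    ) -> None:
        # A bounded deque drops the oldest item once maxlen is reached.
        self._items: Deque[Any] = deque(maxlen=maxlen)
        self._sampler = sampler
        self._on_sample = on_sample

    def append(self, item: Any) -> None:
        self._items.append(item)

    def sample(self, batch_size: int) -> List[Any]:
        indices = self._sampler.sample_indices(len(self._items), batch_size)
        # The hook runs only for sampled items, e.g. to load data lazily.
        return [self._on_sample(self._items[i]) for i in indices]

    def __len__(self) -> int:
        return len(self._items)


# Usage: store lightweight keys and resolve them only when they are sampled.
fake_disk = {f"frame_{i}.png": [i, i, i] for i in range(5)}  # stands in for files
buffer = ComposableBuffer(
    maxlen=3,
    sampler=UniformSampler(seed=0),
    on_sample=fake_disk.__getitem__,
)
for name in fake_disk:
    buffer.append(name)
batch = buffer.sample(4)
```

Swapping in a different sampler (for example a prioritized one) only requires another object with a `sample_indices` method; the storage and the hook stay unchanged.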

### :beginner: User-friendly API

kiox provides a user-friendly API so that you can quickly integrate kiox into your RL algorithms.

### :rocket: Distributed RL training support

kiox supports distributed RL training by using Protocol Buffers and gRPC. Your custom modules will work without any code changes.

## installation

kiox supports Linux, macOS and Windows.

```
$ pip install kiox
```

## examples

More [examples](examples) are available in the repository.

### distributed training

In actor process:

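```py
from kiox.distributed.step_sender import StepSender

sender = StepSender("localhost", 8000, 1)

# observation, action, reward and terminal are placeholders for values from your environment
sender.collect(observation, action, reward, terminal)
```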

In trainer process:

```py
# trainer process
from kiox.distributed.server import KioxServer
from kiox.transition_buffer import FIFOTransitionBuffer
from kiox.transition_factory import SimpleTransitionFactory

def transition_buffer_builder():
    return FIFOTransitionBuffer(1000)

def transition_factory_builder():
    return SimpleTransitionFactory()

# setup server
server = KioxServer(
    host="localhost",
    port=8000,
    observation_shape=(4,),
    action_shape=(1,),
    reward_shape=(1,),
    batch_size=8,
    transition_buffer_builder=transition_buffer_builder,
    transition_factory_builder=transition_factory_builder,
)
server.start()

# sample batch
batch = server.sample()
```
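The actor/trainer split above can be illustrated without any networking. The following sketch is an assumption-laden stand-in, not how kiox works internally: a `queue.Queue` plays the role of the gRPC transport, threads play the role of actor processes, and the helper names `actor` and `drain` are hypothetical. The step shapes mirror the server configuration shown above (4-dimensional observation, 1-dimensional action and reward, batch size 8).

```python
import queue
import random
import threading
from collections import deque


def actor(step_queue: queue.Queue, actor_id: int, num_steps: int) -> None:
    """Hypothetical actor: emits dummy steps and marks the last one terminal."""
    for t in range(num_steps):
        observation = [float(t)] * 4   # shape (4,)
        action = [0.0]                 # shape (1,)
        reward = [1.0]                 # shape (1,)
        terminal = t == num_steps - 1
        step_queue.put((actor_id, observation, action, reward, terminal))


def drain(step_queue: queue.Queue, buffer: deque) -> int:
    """Hypothetical trainer-side helper: moves all queued steps into the buffer."""
    moved = 0
    while True:
        try:
            buffer.append(step_queue.get_nowait())
        except queue.Empty:
            return moved
        moved += 1


step_queue: queue.Queue = queue.Queue()  # stands in for the network transport
buffer: deque = deque(maxlen=1000)       # FIFO storage on the trainer side

actors = [
    threading.Thread(target=actor, args=(step_queue, i, 10)) for i in range(2)
]
for thread in actors:
    thread.start()
for thread in actors:
    thread.join()

received = drain(step_queue, buffer)
batch = random.sample(list(buffer), 8)   # one training batch of 8 steps
```

Keeping the actor identifier with each step lets the trainer tell interleaved episodes apart, which matters as soon as more than one actor writes to the same buffer.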

### from offline data

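```py
# from offline data
from kiox.offline import create_simple_kiox_from_data

# observations, actions, rewards and terminals are placeholders for your logged data
kiox = create_simple_kiox_from_data(
    observations=observations,
    actions=actions,
    rewards=rewards,
    terminals=terminals,
)
```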
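For readers unfamiliar with this data layout, the sketch below shows one common way to turn flat, time-ordered logs into transitions. It is not the implementation of `create_simple_kiox_from_data`; the `Transition` tuple and `build_transitions` helper are hypothetical, and the code assumes that the four sequences are aligned step by step and that a true terminal flag ends an episode.

```python
from typing import List, NamedTuple, Optional, Sequence


class Transition(NamedTuple):
    observation: Sequence[float]
    action: int
    reward: float
    next_observation: Optional[Sequence[float]]  # None when no successor exists
    terminal: bool


def build_transitions(
    observations: Sequence[Sequence[float]],
    actions: Sequence[int],
    rewards: Sequence[float],
    terminals: Sequence[bool],
) -> List[Transition]:
    """Pairs each step with its successor, without crossing episode ends."""
    n = len(observations)
    if not (len(actions) == len(rewards) == len(terminals) == n):
        raise ValueError("all inputs must have the same length")

    transitions = []
    for t in range(n):
        # No successor after a terminal step or at the end of the log.
        has_next = not terminals[t] and t < n - 1
        next_observation = observations[t + 1] if has_next else None
        transitions.append(
            Transition(
                observation=observations[t],
                action=actions[t],
                reward=rewards[t],
                next_observation=next_observation,
                terminal=bool(terminals[t]),
            )
        )
    return transitions


# Usage: two episodes (3 steps and 2 steps) stored back to back.
transitions = build_transitions(
    observations=[[0.0], [1.0], [2.0], [3.0], [4.0]],
    actions=[0, 1, 0, 1, 0],
    rewards=[0.0, 0.0, 1.0, 0.0, 1.0],
    terminals=[False, False, True, False, True],
)
```

The terminal flag is what prevents the last step of one episode from being paired with the first observation of the next.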

## build

```
$ pip install grpcio-tools
$ scripts/build-protobuf
$ pip install -e .
```

## contributions

Any kind of contribution to kiox would be highly appreciated!

Please check the [contribution guide](CONTRIBUTING.md).