https://github.com/golemfactory/ya-httpx-client

golem
Last synced: 19 days ago
JSON representation
Host: GitHub
URL: https://github.com/golemfactory/ya-httpx-client
Owner: golemfactory
Created: 2021-05-12T09:52:44.000Z (almost 4 years ago)
Default Branch: master
Last Pushed: 2021-10-06T15:31:51.000Z (over 3 years ago)
Last Synced: 2025-02-09T08:44:14.706Z (2 months ago)
Topics: golem
Language: Python
Homepage:
Size: 287 KB
Stars: 1
Watchers: 5
Forks: 2
Open Issues: 7
Metadata Files:
- Readme: README.md
Awesome Lists containing this project

README

        # ya-httpx-client

Communicate with a provider-based HTTP server the way you communicate with any other HTTP server.

## Introduction

The documentation assumes reader knows what [Golem](https://www.golem.network/) is and understand basics of the 

development of [yapapi-based services](https://handbook.golem.network/requestor-tutorials/service-development).

Features:

1. Deploy an HTTP server on a Golem provider in an easy way. There are no requirements on the server implementation

   (doesn't even have to be in `python`) except it must be capable of running in the 

   [Golem provider image](https://handbook.golem.network/requestor-tutorials/convert-a-docker-image-into-a-golem-image).

2. Send requests to the provider-based server using [httpx.AsyncClient](https://www.python-httpx.org/async/), the same way

   you would send them to any other HTTP server. `httpx` is similar to more popular `requests`, but with async support.

3. Restart the service on a new provider every time it stops (for whatever reason), in a seamless way that ensures no request is ever lost.

4. Maintain multiple providers running the same server (efficient load balancing included).

5. Change the number of providers running the server in a dynamic way.

NOTE: features 3-5 are useful only if your server is stateless (i.e. request/response history never matters).

This library is built on top of [yapapi](https://github.com/golemfactory/yapapi), there is nothing here that couldn't be done with the pure `yapapi`.

## Installation

```bash

pip3 install git+https://github.com/golemfactory/ya-httpx-client.git#egg=ya-httpx-client[requestor]

```

## How to

Every `ya-httpx-client` application is made up of 3 main components:

* HTTP server code (that has nothing to do with Golem at all)

* `Dockerfile` that will serve as a base for the Golem provider image

* Requestor agent that initializes `ya-httpx-client.Session`, starts server(s) on provider(s), and uses them

### Examples

#### Calculator

There is a simple complete application [here](examples/calculator). It consists of:

* `calculator_server.py` - a Flask-based http server that answers to requests like `GET base_url/add/7/8`

* `calculator.Dockerfile` - base of a Golem provider image that includes `calculator_server.py`

* `run_calculator.py` - requestor agent that starts the calculator on provider and sends some requests

Usage: `python3 examples/calculator/run_calculator.py`

#### Requestor proxy

[Second example](examples/requestor_proxy) is a generic HTTP server running on the **requestor** side that:

* On startup starts a http server on provider(s). There are no assumptions about this server, in our example it is a simple echo server.

* Accepts *any* request

* Forwards every request to the provider and responds with the response received

So, in other words, requestor serves as a proxy to the server running on a provider.

Usage:

```bash

# Install additonal requestor-side dependencies (Quart)

pip3 install -r examples/requestor_proxy/requirements.txt

# Start the server

python3 examples/requestor_proxy/requestor_proxy.py devnet-beta

# (other console) use the "local" server

curl http://localhost:5000/add/3/4

```

For the details on the configuration, check comments in the code.

### Requestor agent quickstart

```python

#   1.  Initialize the session with yapapi.Golem configuration. This should be done exactly once. 

executor_cfg = {'budget': 10, 'subnet_tag': 'devnet-beta'}

session = Session(executor_cfg)

#   2.  Define a service. You may define as many services as you want, provided they have different urls.

session.add_url(

    #   All HTTP requests directed to host "some_name" will be processed by ...

    url='http://some_name',

    #   ...a http server running on provider, in VM based on this image ...

    image_hash='25f09e17c34433f979331edf4f3b47b2ca330ba2f8acbfe2e3dbd9c3',

    #   ... that will be started with this command (this should start the server in the background,

    #   service will be operative only after this command finished) ...

    entrypoint=("sh", "-c", "start_my_http_server.sh"),

    #   ... and to be more exact, by one of indistinguishable services running on different providers.

    #   This is the initial number of services that can be changed at any time by session.set_cluster_size().

    #   Also check "load balancing" section.

    init_cluster_size=1

)

#   3.  Use the service(s)

async def run():

    async with session.client() as client:

        #   Note 'http://some_name' prefix - this is our url from @session.startup

        req_url = 'http://some_name/do/something'

        res = await client.post(req_url, json={'foo': 'bar'})

        assert res.status_code == 200

    #   This is necessary for a graceful shutdown

    await session.close()

#   4.  Optional: change the number of services. Check "load balancing" section for more details.

session.set_cluster_size('http://some_name', 7)

```

### Communication protocol

There are two communication protocols, determined by `ya_httpx_client.cluster.Cluster.USE_VPN` value, VPN and FileSerialization.

Differences:

* VPN is faster

* VPN requires provider supporting `vpn` capability (-> `yagna` 0.8.0 or higher)

* FileSerialization requires `ya-httpx-client[provider]` installed on provider (--> check some example Dockerfile for details)

* Http servers running on providers should listen on `0.0.0.0:80` for VPN and on `unix:///tmp/golem.sock` for FileSerialization

VPN is the default mode, other one is legacy - it will probably be removed one day.

## Load balancing

By default, a single instance of a service is created. We can change this by setting `init_cluster_size` in `@session.startup`

to a different value, or by calling `session.set_cluster_size`. This value doesn't have to be an integer, it might also be a

callable that takes a `ya_httpx_client.Cluster` object as an argument and returns an integer:

```python

def calculate_new_size(cluster):

    if cluster.request_queue.qsize() > 7:

        return 2

    else:

        return 1

session.set_cluster_size('http://some_name', calculate_new_size)

```

or even better, anything that could be evaluated as an integer:

```python

class LoadBalancer:

    def __init__(self, cluster):

        self.cluster = cluster

    

    def __int__(self):

        if self.cluster.request_queue.empty():

            return 0

        return 1

session.set_cluster_size('http://some_name', LoadBalancer)

```

If the last case, we have no control on when and how often `int(load_balancer_object)` will be called, so the implementation

should be a little more clever, at least to avoid too frequent changes - check [SimpleLoadBalancer](ya_httpx_client/provider_auto_balance.py)

for an example.

NOTE: setting size to anything other than an integer should be considered an experimental feature.

## Future development

Currently, when the service stops, it is restarted on another provider. This makes sense only for stateless services, but there is no way to turn this off.

The developer should decide if they want to restart the service or not (or maybe stop the execution?).
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/golemfactory/ya-httpx-client

Awesome Lists containing this project

README