https://github.com/mccorby/PhotoLabellerServer

Federated Learning: Parameter Server doing aggregation of updates to a model coming from clients participating in a Federated Learning setup. See also the Android application companion at https://github.com/mccorby/PhotoLabeller
https://github.com/mccorby/PhotoLabellerServer

cifar-10 deeplearning4j dl4j federated-learning kotlin machine-learning

Last synced: about 2 months ago
JSON representation

Host: GitHub
URL: https://github.com/mccorby/PhotoLabellerServer
Owner: mccorby
License: mit
Created: 2018-03-09T14:38:22.000Z (over 7 years ago)
Default Branch: master
Last Pushed: 2018-10-07T07:08:44.000Z (over 6 years ago)
Last Synced: 2025-04-23T18:14:46.443Z (2 months ago)
Topics: cifar-10, deeplearning4j, dl4j, federated-learning, kotlin, machine-learning
Language: Kotlin
Homepage:
Size: 80.1 KB
Stars: 50
Watchers: 3
Forks: 13
Open Issues: 6
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-federated-computing - Server

README

# Federated Learning. The server side bit

This project works in conjunction with the Photo Labeller Android app

The server is in charge of the following tasks:

* Create an initial model
* Control the training rounds: when to open and close them
* Do the update of the model once a round is finished using the info uploaded by the clients

## Installation instructions
Use IntelliJ to build the project

The server expects a `local.properties` file to provide some information

Example:

```python
model_dir = some_directory_in_your_local_machine
# In milliseconds. 24 hours
time_window = 86400000
min_updates = 1000
layer_index = 3

```

This file must be located in the `server` module at the same level as the `build.gradle` file

## Training the initial model and testing it
To run the initial model training, use the [`Main.kt`](https://github.com/mccorby/PhotoLabellerServer/blob/master/model/src/main/kotlin/com/mccorby/photolabeller/ml/Main.kt) file

The *training process* expects three arguments:
* `train` string to trigger the process
* A valid directory to save the resulting model
* The third optional argument can be `web` to start the UI monitor provided by DL4J

You can also run a prediction to test your model. The expected arguments are:
* Location of the model
* Location of an image to classify

## Running the Federated Parameter Server
The server is in charge of starting the training rounds and to keep the updates sent by the clients until the shared model has been updated

The training round starts when the server is initialised. In a real life system, the training round would be triggered either automatically or manually by someone

Execute the `JobQueueServer` main method and the server will be up and running

The REST API is quite simple:
* `GET` model to get the latest shared model in this server
* `POST` model to upload the updates from the clients
* `GET` round to obtain the info about the current round

To check that the server is up and running in your local installation, hit this URL

```http://localhost:9997/service/federatedservice/available```

### Flow of an open round
* Make sure the server has opened a new round. This can be checked at
```http://localhost:9997/service/federatedservice/currentRound```
If the round is not opened yet, this service will create a new one with the following JSON format

```
{
"modelVersion": "round_20181007_080305",
"startDate": 1538895785617,
"endDate": 1538982185617,
"minUpdates": 1000
}
```
* Client update is received

When a client sends the update to the model, this is stored by the server in the current round directory

The update is also cached in memory to help with the post process

* Closing the round and updating the model

A training round can be closed by reaching the minimum number of clients updates or by timing out

Once the round is closed, the updates are processed by using the `UpdatesStrategy` injected in the server. The current implementation of this strategy performs a `Federated Averaging`

The model is then updated (actually just the last feature layer) and saved. From this moment this is considered the latest version of the shared model

All client updates are deleted so that no trace of them remains in the server

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/mccorby/PhotoLabellerServer

Awesome Lists containing this project

README