https://github.com/jveverka/data-lab
Data Lab Project
https://github.com/jveverka/data-lab
elasticsearch image-processing microservices tensorflow tensorflow2 yolov3
Last synced: 2 months ago
JSON representation
Data Lab Project
- Host: GitHub
- URL: https://github.com/jveverka/data-lab
- Owner: jveverka
- License: apache-2.0
- Created: 2019-09-23T19:51:29.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2021-10-15T20:40:20.000Z (over 4 years ago)
- Last Synced: 2025-03-29T05:13:06.960Z (about 1 year ago)
- Topics: elasticsearch, image-processing, microservices, tensorflow, tensorflow2, yolov3
- Language: Java
- Homepage:
- Size: 21.7 MB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
[](https://opensource.org/licenses/Apache-2.0)
[](https://img.shields.io/badge/java-11-blue)
[](https://img.shields.io/badge/gradle-v6.5-blue)
[](https://travis-ci.org/jveverka/data-lab?branch=master)
# Data Lab Project
__Data Lab Project__ provides advanced analytics and query services on various document sources like
images, video streams, text documents, file system. This project is work in progress.

## Features
* __File system indexing__ - queries on file system meta-data
* __Image meta-data indexing__ - queries on exif and geo-location meta data.
* __Video meta-data indexing__ - queries on exif and geo-location meta data.
* __Image content object recognition__ - queries on objects contained in images.
### Microservices
* [__data-scanner-service__](data-scanner-service) - [__microservice__] simple service for scanning file system.
* [__ml-services__](ml-services) - [__microservices__] simple services utilizing using machine learning.
* [__message-broker__](message-broker) - [__microservice__]
### Components
* [__file-system-service__](file-system-service) - [__library__] simple library for scanning file system.
* [__elasticsearch-service__](elasticsearch) - [__library__] service for easy ElasticSearch read/write access.
* [__data-scanner-service__](data-scanner-service) - [__library__] service for scanning data directory and annotating data files.
### Architecture

### Technology stack
* __Microservices__ - REST, Message Broker integrations, K8s, WIP
* __ElasticSearch 7.15.x__ - main meta-data database
* __Kibana 7.15.x__ - basic data visualizations
* __RabbitMQ 3.8__ - message broker
* __Java 11__ - microservice implementations
* __Python 3.8.x__ - microservice implementations
* __TensorFlow 2.0 / Keras__ - ML related tasks
* __Gradle 7.2.x__ - build system or later
* __Ubuntu 20.04 LTS__ - default target environment
### Build, Test and Run
```
gradle clean installDist distZip test
```
Follow [this user](docs/user-guide.md) guide to run microservices locally.