An open API service indexing awesome lists of open source software.

https://github.com/misterzurg/itmo_technologies_and_infrastructure_for_big_data

📊 My smth from ITMO; Dis - BiggusDatus
https://github.com/misterzurg/itmo_technologies_and_infrastructure_for_big_data

apache-spark big-data big-data-and-ml clickhouse itmo-labs kubernetes

Last synced: about 1 month ago
JSON representation

📊 My smth from ITMO; Dis - BiggusDatus

Awesome Lists containing this project

README

          

Here are my Jupyter Notebook labs from ITMO Second semester

# Discipline
Technologies and Infrastructure for Big Data

## Instructors
[Denis Nasonov](https://en.itmo.ru/en/viewperson/1252/Denis_Nasonov.htm)

[Nikolay Butakov](https://en.itmo.ru/en/viewperson/1257/Nikolay_Butakov.htm)

[Sergey Teryshkin](https://ru.linkedin.com/in/sergey-teryoshkin-67ba02170)

## Performed by
- Me and
- Michael Grigoriev [@Dormant512](https://github.com/Dormant512)

## WorkShops
- [Connecting to Cluster](Workshops/Connection/README.md)

## Labs
1. [K8s](Labs/Lab-1-k8s/README.md)
2. [PySpark](Labs/Lab-2-PySpark/README.md)
3. [ClickHouse](Labs/Lab-3-ClickHouse/README.md)

## CourseWork
> Automating anime image collection from several resources for future model training.

## Seminar Topic
Mem~~e~~cached — general-purpose distributed memory-caching system.