An open API service indexing awesome lists of open source software.

https://github.com/mohammadzainabbas/bdm

Big Data Management ✨
https://github.com/mohammadzainabbas/bdm

apache-hadoop apache-hbase apache-spark distributed-systems hdfs mongodb

Last synced: about 1 month ago
JSON representation

Big Data Management ✨

Awesome Lists containing this project

README

          

## BDM @ UPC 👨🏻‍💻





### Table of contents

- [Introduction](#introduction)
- [About the Data](#about-the-data)
- [Guides](#guide)

---

#### 1. Introduction

This repo is all about what we have done in Big Data Management (BDM) during our semester at UPC. As a BDMA student, we really need to know how to manage big data.

---

#### 2. About the Data

Since, the aim of this project is to develop a proof of concept (aka a prototype) for our startup `TripHawk` and to get a hands-on practice on typical workflows usually seen in any big data project, we had decided to go with _events_ data in Barcelona. List below is the external data sources which we are using right now:

- [_Cultural Agenda in the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/agenda-cultural)
- [_Tourist points of interest in the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/punts-informacio-turistica)
- [_Agenda of events and activities of the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/agenda-diaria)

---

#### 3. Guide

You can find some helpful guides [here](https://github.com/mohammadzainabbas/BDM/blob/main/doc/GUIDE.md)