https://github.com/mohammadzainabbas/bdm
Big Data Management ✨
https://github.com/mohammadzainabbas/bdm
apache-hadoop apache-hbase apache-spark distributed-systems hdfs mongodb
Last synced: about 1 month ago
JSON representation
Big Data Management ✨
- Host: GitHub
- URL: https://github.com/mohammadzainabbas/bdm
- Owner: mohammadzainabbas
- License: mit
- Created: 2022-03-29T23:36:09.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2023-12-15T02:59:43.000Z (over 2 years ago)
- Last Synced: 2025-06-15T00:36:35.198Z (12 months ago)
- Topics: apache-hadoop, apache-hbase, apache-spark, distributed-systems, hdfs, mongodb
- Language: Jupyter Notebook
- Homepage:
- Size: 782 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## BDM @ UPC 👨🏻💻
### Table of contents
- [Introduction](#introduction)
- [About the Data](#about-the-data)
- [Guides](#guide)
---
#### 1. Introduction
This repo is all about what we have done in Big Data Management (BDM) during our semester at UPC. As a BDMA student, we really need to know how to manage big data.
---
#### 2. About the Data
Since, the aim of this project is to develop a proof of concept (aka a prototype) for our startup `TripHawk` and to get a hands-on practice on typical workflows usually seen in any big data project, we had decided to go with _events_ data in Barcelona. List below is the external data sources which we are using right now:
- [_Cultural Agenda in the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/agenda-cultural)
- [_Tourist points of interest in the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/punts-informacio-turistica)
- [_Agenda of events and activities of the city of Barcelona_](https://opendata-ajuntament.barcelona.cat/data/en/dataset/agenda-diaria)
---
#### 3. Guide
You can find some helpful guides [here](https://github.com/mohammadzainabbas/BDM/blob/main/doc/GUIDE.md)