https://github.com/erinenovita/data-management
This project aims to translate business requirements into data management solutions. It will involve a complete process of extracting, refining, and delivering datasets extracted from both structured databases and unstructured data sources.
https://github.com/erinenovita/data-management
database sql
Last synced: 4 months ago
JSON representation
This project aims to translate business requirements into data management solutions. It will involve a complete process of extracting, refining, and delivering datasets extracted from both structured databases and unstructured data sources.
- Host: GitHub
- URL: https://github.com/erinenovita/data-management
- Owner: erinenovita
- License: mit
- Created: 2021-05-20T06:58:59.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-05-22T20:19:35.000Z (almost 4 years ago)
- Last Synced: 2024-08-13T07:11:59.930Z (8 months ago)
- Topics: database, sql
- Language: R
- Homepage:
- Size: 40 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- jimsghstars - erinenovita/data-management - This project aims to translate business requirements into data management solutions. It will involve a complete process of extracting, refining, and delivering datasets extracted from both structured (R)
README
# data-management
This project aims to translate business requirements into data management solutions. It will involve a complete process of extracting, refining and delivering datasets extracted from both structured databases and unstructured data sources.From the structured datasets, we would explore the design of databases and exploit the data using SQL. This includes the principles of relational databases, relational mapping and getting into the right business understanding.
From the semi-structured and unstructured data sources, we delved deeper into transforming the dataset into formats that would be ready for further analysis. We would perform both web-crawling and web-scraping of the unstructured data. Ultimately, we would construct a dashboard that would depict several business requirements.