https://github.com/george-mountain/data-extraction-integration-and-analysis---clustering-operations
This repository documents a project that walks step by step through scraping data, integrating data from various sources, and analyzing the combined data. It also shows how APIs can be harnessed for data engineering operations. In this project, the Foursquare API was used for the location data.
clustering-algorithm dataingestion dataintegration dataproject datascience datascraping foursquare-api machine-learning
- Host: GitHub
- URL: https://github.com/george-mountain/data-extraction-integration-and-analysis---clustering-operations
- Owner: george-mountain
- Created: 2020-05-21T09:38:29.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2023-02-21T05:13:06.000Z (about 2 years ago)
- Last Synced: 2025-01-21T06:11:33.427Z (3 months ago)
- Topics: clustering-algorithm, dataingestion, dataintegration, dataproject, datascience, datascraping, foursquare-api, machine-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 373 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
### Data Project - Scraping, Integration, Analysis and Clustering Operations
This repository documents a project that walks step by step through scraping data, integrating data from various sources, and analyzing the combined data. It also shows how APIs can be harnessed for data engineering operations. In this project, the Foursquare API was used for the location data, while K-means clustering was used to obtain insights from the data. A detailed discussion of the methodology, findings, and conclusions is in the repository file named 'Project report'. Check that file to learn more about the purpose of the project, the methodology, and the data insights.
To learn about the step-by-step coding performed in this project, check out the file named 'project report'.
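
As a rough illustration of the K-means clustering step mentioned above, here is a minimal sketch using only the Python standard library. It is not the project's actual code: the function names (`k_means`, `squared_distance`, `mean_point`) and the sample coordinates are hypothetical, and the project itself may well rely on a library implementation instead. Plain Euclidean distance on latitude/longitude pairs is also a simplification that only makes sense for toy data like this.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_point(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))


def k_means(points, k, max_iter=100, seed=0):
    """Lloyd's algorithm: alternate assignment and centroid update steps."""
    rng = random.Random(seed)
    # Start from k distinct data points chosen at random.
    centroids = rng.sample(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: squared_distance(p, centroids[c]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so we have converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = mean_point(members)
    return labels, centroids


# Hypothetical (latitude, longitude) pairs standing in for venue locations.
venues = [
    (40.71, -74.00), (40.72, -74.01), (40.73, -73.99),
    (34.05, -118.24), (34.06, -118.25), (34.04, -118.23),
]
labels, centroids = k_means(venues, k=2)
print(labels)
```

With two well-separated groups like these, the first three points end up sharing one label and the last three share the other. Which group gets label 0 depends on the random initialization.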