Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/divyanshu9822/github-top-repos-scrapper
Github featured topics repo data scrapping and trend analysis
https://github.com/divyanshu9822/github-top-repos-scrapper
beautifulsoup data-science data-visualization dataanalytics datanalysis datascrapping pandas python python3
Last synced: 4 days ago
JSON representation
Github featured topics repo data scrapping and trend analysis
- Host: GitHub
- URL: https://github.com/divyanshu9822/github-top-repos-scrapper
- Owner: Divyanshu9822
- Created: 2023-10-25T10:35:03.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2023-10-25T10:40:27.000Z (about 1 year ago)
- Last Synced: 2024-11-09T17:14:12.399Z (2 months ago)
- Topics: beautifulsoup, data-science, data-visualization, dataanalytics, datanalysis, datascrapping, pandas, python, python3
- Language: Jupyter Notebook
- Homepage:
- Size: 907 KB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# GitHub Trend Analysis
GitHub Trend Analysis is a data scraping and analysis project that explores trending topics on GitHub. The project involves scraping data from the top repositories for each topic, analyzing the usage of programming languages and tags, and visualizing the findings.
## Aim
The aim of this project is to gain insights into GitHub's trending topics, identify the most used programming languages and tags, and present the findings through data visualizations. Specifically, the project includes:
- Data scraping from the top 20 repositories for each of the 10 trending topics on GitHub.
- Creation of individual CSV datasets for each topic, including repository information.
- Analysis and visualization of the top 10 most used tags and programming languages for each topic.
- Compilation of a dataset of the top 200 repositories by aggregating data from all topics.
- Visualization of the most used tags and programming languages across all 200 repositories using pie charts.## Screenshots
![image](https://github.com/Divyanshu9822/github-top-repos-scrapper/assets/90151736/35b06218-eab6-43c9-89f2-a4941f6abf3d)
![image](https://github.com/Divyanshu9822/github-top-repos-scrapper/assets/90151736/f7702720-2804-44c3-89c2-787a614d8bba)
## Results
The project has produced the following key results:
- Identified the most used programming languages in the top 200 repositories, with JavaScript, Shell, and Python being the top choices.
- Uncovered the most frequently used tags in repository descriptions, including JavaScript, Hacktoberfest, Shell, and Android.