https://github.com/param302/tds-project1
- Host: GitHub
- URL: https://github.com/param302/tds-project1
- Owner: Param302
- Created: 2024-10-21T17:58:38.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-02T10:01:16.000Z (over 1 year ago)
- Last Synced: 2025-04-06T01:45:50.220Z (about 1 year ago)
- Language: Jupyter Notebook
- Size: 20.7 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# TDS Project 1
This is the first project of the TDS course of the IITM BS Degree Programme. In this project, we have to scrape data from GitHub, analyse it, and submit answers to the questions given in the project.
- I used Python to scrape the data, with **multiprocessing** to speed up the scraping, especially for repositories. The detailed scraping code is in the [data_loading.ipynb](./data_loading.ipynb) file.
- During the [analysis](./data_analysis.ipynb), I found that most developers do not list a company/org name, which suggests that they are solo developers, and that their leader strength is comparatively low.
- My advice to developers is to contribute to open source projects, join communities, and work in a team to gain experience and learn from others.
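
The multiprocessing approach mentioned above can be sketched roughly as follows. This is a minimal illustration and not the code from `data_loading.ipynb`: the helper names (`repos_url`, `fetch_repos`, `scrape_all`), the `GITHUB_TOKEN` environment variable, and the page limit are all assumptions made for the example. It uses the public GitHub REST endpoint for listing a user's repositories and only the Python standard library.

```python
import json
import os
import urllib.request
from multiprocessing import Pool

API_ROOT = "https://api.github.com"


def repos_url(login, page=1, per_page=100):
    """Build the REST URL for one page of a user's public repositories."""
    return f"{API_ROOT}/users/{login}/repos?per_page={per_page}&page={page}"


def fetch_repos(login, max_pages=5):
    """Download up to max_pages pages of repositories for one user."""
    headers = {"Accept": "application/vnd.github+json"}
    token = os.environ.get("GITHUB_TOKEN")  # assumed variable name, optional
    if token:
        headers["Authorization"] = f"Bearer {token}"
    repos = []
    for page in range(1, max_pages + 1):
        request = urllib.request.Request(repos_url(login, page), headers=headers)
        with urllib.request.urlopen(request, timeout=30) as response:
            batch = json.load(response)
        repos.extend(batch)
        if len(batch) < 100:  # a short page means there are no more pages
            break
    return login, repos


def scrape_all(logins, processes=4):
    """Fetch repositories for many users in parallel worker processes."""
    with Pool(processes=processes) as pool:
        # imap_unordered yields each (login, repos) pair as soon as it is ready
        return dict(pool.imap_unordered(fetch_repos, logins))
```

A call such as `scrape_all(["octocat"])` would return a dictionary mapping each login to its list of repository records. Because each request spends most of its time waiting on the network, several workers can make progress at once, although the API rate limit still caps the overall throughput. On platforms where worker processes are spawned rather than forked, the worker function generally has to live in an importable module rather than in a notebook cell.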
### Extras
- This repository has two branches: `main` (the default) and `dev`. I did all of the project work on the `dev` branch and pushed only the necessary files to `main`. To see the `dev` branch, switch to it:
```bash
git checkout dev
```