{"id":15322593,"url":"https://github.com/dbgjerez/ml-github","last_synced_at":"2026-05-04T00:33:19.061Z","repository":{"id":45519841,"uuid":"512473221","full_name":"dbgjerez/ml-github","owner":"dbgjerez","description":"A Github's repositories analysis","archived":false,"fork":false,"pushed_at":"2024-05-10T01:01:39.000Z","size":169709,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-10-27T15:37:03.198Z","etag":null,"topics":["analytics","github","pandas","python"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dbgjerez.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2022-07-10T15:41:41.000Z","updated_at":"2024-05-10T01:01:42.000Z","dependencies_parsed_at":"2023-12-14T03:22:20.749Z","dependency_job_id":"123455e6-870e-418d-b7f0-17a8c1fef980","html_url":"https://github.com/dbgjerez/ml-github","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/dbgjerez/ml-github","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbgjerez%2Fml-github","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbgjerez%2Fml-github/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbgjerez%2Fml-github/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositor
ies/dbgjerez%2Fml-github/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dbgjerez","download_url":"https://codeload.github.com/dbgjerez/ml-github/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dbgjerez%2Fml-github/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":32590294,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-03T22:12:39.696Z","status":"ssl_error","status_checked_at":"2026-05-03T22:09:10.534Z","response_time":103,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["analytics","github","pandas","python"],"created_at":"2024-10-01T09:16:55.168Z","updated_at":"2026-05-04T00:33:19.037Z","avatar_url":"https://github.com/dbgjerez.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"GitHub provides an API to extract the last 14 days of stats for our repositories. \n\nThis project aims to amplify and analyse them.\n\n# Clean data\nThis repository contains my stats in the ```data``` folder. You can remove it, because you will generate your own data. \n\n# Get the data\n\nTo get the data, use the ```app.py``` file. \n\nThe best way to run it is inside a virtual environment. \n\n## Configure\n\nThe application needs a GitHub token that grants read-only access to your public repositories. 
\n\nThe application reads the token from a ```.env``` file. Create the ```.env``` file with the following content:\n\n```properties\nGITHUB_TOKEN={your-token}\n```\n\n## Execute the code\n\nOnce you have created the ```.env``` file with the ```GITHUB_TOKEN``` variable, you can run the code.\n\nThe best way is to use a virtual environment. If you don't know how, I explain it [here](https://blog.dbgjerez.es/posts/python-virtualenv/).\n\nExecute the Python app: \n\n```zsh\n❯ python app.py\n```\n\nThe application will create a directory structure with one ```.csv``` file per day:\n\n```zsh\n❯ ls -lchR data\ndata:\ntotal 4,0K\ndrwxr-xr-x 4 db db 4,0K jul 12 19:03 2022\n\ndata/2022:\ntotal 8,0K\ndrwxr-xr-x 2 db db 4,0K jul 12 19:03 06\ndrwxr-xr-x 2 db db 4,0K jul 13 06:34 07\n\ndata/2022/06:\ntotal 12K\n-rw-r--r-- 1 db db 164 jul 12 19:03 20220628.csv\n-rw-r--r-- 1 db db 192 jul 13 12:22 20220629.csv\n-rw-r--r-- 1 db db 272 jul 13 12:22 20220630.csv\n\ndata/2022/07:\ntotal 52K\n-rw-r--r-- 1 db db 257 jul 13 12:22 20220701.csv\n-rw-r--r-- 1 db db 195 jul 13 12:22 20220702.csv\n-rw-r--r-- 1 db db 237 jul 13 12:22 20220703.csv\n-rw-r--r-- 1 db db 497 jul 13 12:22 20220704.csv\n-rw-r--r-- 1 db db 493 jul 13 12:22 20220705.csv\n-rw-r--r-- 1 db db 559 jul 13 12:22 20220706.csv\n-rw-r--r-- 1 db db 614 jul 13 12:22 20220707.csv\n-rw-r--r-- 1 db db 433 jul 13 12:22 20220708.csv\n-rw-r--r-- 1 db db 238 jul 13 12:22 20220709.csv\n-rw-r--r-- 1 db db 304 jul 13 12:22 20220710.csv\n-rw-r--r-- 1 db db 402 jul 13 12:22 20220711.csv\n-rw-r--r-- 1 db db 638 jul 13 12:22 20220712.csv\n-rw-r--r-- 1 db db 357 jul 13 12:22 20220713.csv\n```\n\n## Makefile\n\nA ```Makefile``` is also provided, so you can run: \n\n```zsh\nmake update\n```\n\n# Analyse the data\n\nThe ```repos_analysis.ipynb``` file runs a light analysis of the data. \n\nTo update it, use whichever editor you prefer. In my case, I use ```jupyter-notebook``` in local environments. 
\n\n```zsh\n❯ jupyter nbconvert --execute --to notebook --inplace repos_analysis.ipynb\n```\n\n# References\n* https://github.com/PyGithub/PyGithub\n* https://blog.dbgjerez.es/posts/python-virtualenv/\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdbgjerez%2Fml-github","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdbgjerez%2Fml-github","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdbgjerez%2Fml-github/lists"}