{"id":20874433,"url":"https://github.com/imagingdatacommons/github_analytics","last_synced_at":"2025-12-25T10:52:27.389Z","repository":{"id":198498377,"uuid":"573431340","full_name":"ImagingDataCommons/github_analytics","owner":"ImagingDataCommons","description":"Scripts for monitoring and archival of GitHub insights for IDC-related repositories","archived":false,"fork":false,"pushed_at":"2025-02-11T00:00:31.000Z","size":241,"stargazers_count":0,"open_issues_count":0,"forks_count":1,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-02-11T01:19:37.604Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ImagingDataCommons.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-12-02T12:57:36.000Z","updated_at":"2025-02-11T00:00:34.000Z","dependencies_parsed_at":null,"dependency_job_id":"6798d89f-7fb2-4464-8934-d624f5542efa","html_url":"https://github.com/ImagingDataCommons/github_analytics","commit_stats":null,"previous_names":["imagingdatacommons/github_analytics"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ImagingDataCommons%2Fgithub_analytics","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ImagingDataCommons%2Fgithub_analytics/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ImagingDataCommons%2Fgithub_analytics/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ImagingDataCommons%2Fgithub_analytics/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ImagingDataCommons","download_url":"https://codeload.github.com/ImagingDataCommons/github_analytics/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243248245,"owners_count":20260752,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-18T06:32:28.042Z","updated_at":"2025-12-25T10:52:27.344Z","avatar_url":"https://github.com/ImagingDataCommons.png","language":"Jupyter Notebook","readme":"# GitHub and Google Analytics Data Collection\n\nThis repository contains Python code for collecting various GitHub analytics data from public repositories of Imaging Data Commons (IDC) and select repositories from Quantitative Image Informatics for Cancer Research (QIICR) GitHub organizations. Additionally, anonymous data captured using Google Analytics 4 is retrieved via the Google Analytics API. GitHub Actions automate the execution of Python scripts daily.\n\nThe collected data is stored in Google BigQuery and visualized in an Apache Superset instance, enabling us to gain insights into utilization and engagement with IDC.\n\n## Table of Contents\n- [GitHub Analytics](#github-analytics)\n  - [Clone Traffic](#clone-traffic)\n  - [Views Traffic](#views-traffic)\n  - [Top Referrers](#top-referrers)\n  - [Top Paths](#top-paths)\n  - [Contributor Commit Activity](#contributor-commit-activity)\n- [Google Analytics 4](#google-analytics-4)\n  - [Audience Overview](#audience-overview)\n  - [Acquisition Overview](#acquisition-overview)\n  - [Behavior Overview](#behavior-overview)\n\n## GitHub Analytics\n\n### Clone Traffic\nCollects clone and unique clone counts for specified repositories.\n\n### Views Traffic\nGathers view and unique view counts for repositories.\n\n### Top Referrers\nCaptures the top 10 referrers over the last 14 days.\n\n### Top Paths\nRecords the top 10 paths over the last 14 days.\n\n### Contributor Commit Activity\nTracks commits by contributors, including a Weekly Hash.\n\n## Google Analytics 4\n\nA Python script is used for collecting and analyzing data from Google Analytics 4 (GA4), including Audience Overview, Acquisition Overview, and Behavior Overview.\n\n### Audience Overview\nRetrieves and analyzes data related to user engagement on specified websites. Metrics include total users, new users, sessions, engaged sessions, screen page views, and average session duration.\n\n### Acquisition Overview\nFocuses on data related to user acquisition and traffic sources. Provides insights into how users are finding and accessing the websites.\n\n### Behavior Overview\nAnalyzes user behavior on websites, including page views, session durations, and engagement.\n\n\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fimagingdatacommons%2Fgithub_analytics","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fimagingdatacommons%2Fgithub_analytics","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fimagingdatacommons%2Fgithub_analytics/lists"}