https://github.com/michaelmior/json-fetcher
https://github.com/michaelmior/json-fetcher
Last synced: 18 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/michaelmior/json-fetcher
- Owner: michaelmior
- Created: 2023-09-05T14:02:08.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-04-14T17:42:31.000Z (19 days ago)
- Last Synced: 2025-04-14T18:44:35.539Z (19 days ago)
- Language: Python
- Size: 39.1 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# json-fetcher
[](https://results.pre-commit.ci/latest/github/michaelmior/json-fetcher/main)## Setup
Dependencies for this project are managed by [pipenv](https://pipenv.pypa.io/) and can be installed with `pipenv install`.
When fetching from GitHub, an [access token](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/creating-a-personal-access-token) is required and the `GITHUB_TOKEN` environment variable must be set.
The simplest way to do this is to use a `.env` file in the root of the project.## Usage
1. Collect a list of of JSON documents using the GitHub search API. URLs to documents are stored in the `files` directory.
pipenv run python search.py
2. Download and validate the schemas JSON documents using the URLs collected in the first step. Schemas will be written to the `schemas` directory and one JSON document per line will be written to the `jsons` directory.
pipenv run python download.py