{"id":19065266,"url":"https://github.com/hackersandslackers/jsonld-scraper-tutorial","last_synced_at":"2025-04-28T11:27:20.247Z","repository":{"id":37033205,"uuid":"276641289","full_name":"hackersandslackers/jsonld-scraper-tutorial","owner":"hackersandslackers","description":"🌎 🖥 Supercharge your scraper to extract quality page metadata by parsing JSON-LD data via Python's extruct library.","archived":false,"fork":false,"pushed_at":"2025-04-20T22:03:01.000Z","size":515,"stargazers_count":14,"open_issues_count":24,"forks_count":2,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-04-28T11:27:15.711Z","etag":null,"topics":["beautifulsoup","extruct","json-ld","python","scraper","structured-data","tutorial"],"latest_commit_sha":null,"homepage":"https://hackersandslackers.com/scrape-metadata-json-ld/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/hackersandslackers.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":".github/CODEOWNERS","security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null},"funding":{"custom":["https://www.buymeacoffee.com/hackersslackers"]}},"created_at":"2020-07-02T12:32:38.000Z","updated_at":"2025-02-26T00:56:45.000Z","dependencies_parsed_at":"2023-11-24T20:31:11.891Z","dependency_job_id":"f0654672-bb78-4eaa-ad0c-34f6eeacca95","html_url":"https://github.com/hackersandslackers/jsonld-scraper-tutorial","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hackersandslackers%2Fjsonld-scraper-tutorial","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hackersandslackers%2Fjsonld-scraper-tutorial/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hackersandslackers%2Fjsonld-scraper-tutorial/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/hackersandslackers%2Fjsonld-scraper-tutorial/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/hackersandslackers","download_url":"https://codeload.github.com/hackersandslackers/jsonld-scraper-tutorial/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251303873,"owners_count":21567767,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup","extruct","json-ld","python","scraper","structured-data","tutorial"],"created_at":"2024-11-09T00:49:15.970Z","updated_at":"2025-04-28T11:27:20.213Z","avatar_url":"https://github.com/hackersandslackers.png","language":"Python","readme":"# Structured Data Scraping Tutorial\n\n![Python](https://img.shields.io/badge/Python-v^3.8-blue.svg?logo=python\u0026longCache=true\u0026logoColor=white\u0026colorB=5e81ac\u0026style=flat-square\u0026colorA=4c566a)\n![Extruct](https://img.shields.io/badge/Extruct-v0.9.0-blue.svg?longCache=true\u0026logo=flask\u0026style=flat-square\u0026logoColor=white\u0026colorB=5e81ac\u0026colorA=4c566a)\n![Requests](https://img.shields.io/badge/Requests-v2.24.0-blue.svg?longCache=true\u0026logo=flask\u0026style=flat-square\u0026logoColor=white\u0026colorB=5e81ac\u0026colorA=4c566a)\n![GitHub Last Commit](https://img.shields.io/github/last-commit/google/skia.svg?style=flat-square\u0026colorA=4c566a\u0026colorB=a3be8c\u0026logo=GitHub)\n[![GitHub Issues](https://img.shields.io/github/issues/hackersandslackers/jsonld-scraper-tutorial.svg?style=flat-square\u0026colorA=4c566a\u0026logo=GitHub\u0026colorB=ebcb8b)](https://github.com/hackersandslackers/jsonld-scraper-tutorial/issues)\n[![GitHub Stars](https://img.shields.io/github/stars/hackersandslackers/jsonld-scraper-tutorial.svg?style=flat-square\u0026colorA=4c566a\u0026logo=GitHub\u0026colorB=ebcb8b)](https://github.com/hackersandslackers/jsonld-scraper-tutorial/stargazers)\n[![GitHub Forks](https://img.shields.io/github/forks/hackersandslackers/jsonld-scraper-tutorial.svg?style=flat-square\u0026colorA=4c566a\u0026logo=GitHub\u0026colorB=ebcb8b)](https://github.com/hackersandslackers/jsonld-scraper-tutorial/network)\n\n![Extruct Tutorial](.github/json-ld-pyld-1@2x.jpg)\n\nSupercharge your scraper to extract quality page metadata by parsing JSON-LD data via Python's [extruct](https://github.com/scrapinghub/extruct) library.\n\nThis repository contains source code for the accompanying tutorial on Hackers and Slackers: https://hackersandslackers.com/scrape-metadata-json-ld/\n\n\n## Installation\n\n**Installation via `requirements.txt`**:\n\n```shell\n$ git clone https://github.com/hackersandslackers/jsonld-scraper-tutorial.git\n$ cd jsonld-scraper-tutorial\n$ python3 -m venv myenv\n$ source myenv/bin/activate\n$ pip3 install -r requirements.txt\n$ python3 main.py\n```\n\n**Installation via [Pipenv](https://pipenv-fork.readthedocs.io/en/latest/)**:\n\n```shell\n$ git clone https://github.com/hackersandslackers/jsonld-scraper-tutorial.git\n$ cd jsonld-scraper-tutorial\n$ pipenv shell\n$ pipenv update\n$ python3 main.py\n```\n\n**Installation via [Poetry](https://python-poetry.org/)**:\n\n```shell\n$ git clone https://github.com/hackersandslackers/jsonld-scraper-tutorial.git\n$ cd jsonld-scraper-tutorial\n$ poetry shell\n$ poetry update\n$ poetry run\n```\n\n## Usage\n\nTo change the URL targeted by this script, update the `URL` variable in **config.py**.\n\n-----\n\n**Hackers and Slackers** tutorials are free of charge. If you found this tutorial helpful, a [small donation](https://www.buymeacoffee.com/hackersslackers) would be greatly appreciated to keep us in business. All proceeds go towards coffee, and all coffee goes towards more content.\n","funding_links":["https://www.buymeacoffee.com/hackersslackers"],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhackersandslackers%2Fjsonld-scraper-tutorial","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhackersandslackers%2Fjsonld-scraper-tutorial","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhackersandslackers%2Fjsonld-scraper-tutorial/lists"}