{"id":25231487,"url":"https://github.com/notshrirang/article-reader-app","last_synced_at":"2025-04-05T15:24:44.035Z","repository":{"id":220525331,"uuid":"751825723","full_name":"NotShrirang/Article-Reader-App","owner":"NotShrirang","description":"This Python script is designed to extract structured data from various news articles. It utilizes web scraping techniques to extract information such as article titles and bodies from different news websites. The script supports multiple websites, and you can easily extend it to include more by adding functions for each website.","archived":false,"fork":false,"pushed_at":"2024-02-05T11:29:56.000Z","size":13,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-02-11T12:36:29.837Z","etag":null,"topics":["beautifulsoup","regex"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/NotShrirang.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-02-02T12:12:49.000Z","updated_at":"2024-02-02T13:45:36.000Z","dependencies_parsed_at":"2025-02-11T12:33:55.393Z","dependency_job_id":"73a32127-03d4-4de2-9935-8e6ffbaa6085","html_url":"https://github.com/NotShrirang/Article-Reader-App","commit_stats":null,"previous_names":["notshrirang/article-reader-app"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotShrirang%2FArticle-Reader-App","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotShrirang%2FArticle-Reader-App/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotShrirang%2FArticle-Reader-App/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/NotShrirang%2FArticle-Reader-App/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/NotShrirang","download_url":"https://codeload.github.com/NotShrirang/Article-Reader-App/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247355486,"owners_count":20925639,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beautifulsoup","regex"],"created_at":"2025-02-11T12:28:49.736Z","updated_at":"2025-04-05T15:24:44.013Z","avatar_url":"https://github.com/NotShrirang.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Article-Reader-App\n\nThis Python script is designed to extract structured data from various news articles. It utilizes web scraping techniques to extract information such as article titles and bodies from different news websites. The script supports multiple websites, and you can easily extend it to include more by adding functions for each website.\n\n## Table of Contents\n\n- [Prerequisites](#prerequisites)\n- [Installation](#installation)\n- [Usage](#usage)\n- [Configuration](#configuration)\n\n## Prerequisites\n\n- Python 3.x\n- Required Python libraries (install via `pip install -r requirements.txt`):\n  - `requests`\n  - `bs4`\n\n## Installation\n\n1. Clone the repository:\n\n   ```bash\n   git clone https://github.com/NotShrirang/Article-Reader-App\n   ```\n\n2. Navigate to the project directory:\n   \n  ```bash\n  cd Article-Reader-App\n  ```\n\n3. Install the required dependencies:\n\n  ```bash\n  pip install -r requirements.txt\n  ```\n## Usage\n\nEdit the config.json file to configure the list of news article URLs.\n\nRun the main script:\n\n```bash\npython main.py\n```\nThe extracted data will be saved as output.json in the project directory.\n\n## Configuration\nconfig.json: This file contains the configuration for the script. It includes a list of news article URLs that you want to extract data from. Add or remove URLs as needed.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnotshrirang%2Farticle-reader-app","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnotshrirang%2Farticle-reader-app","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnotshrirang%2Farticle-reader-app/lists"}