{"id":21696440,"url":"https://github.com/madetech/souper-web-scraping","last_synced_at":"2026-05-06T02:36:14.922Z","repository":{"id":197009896,"uuid":"664583501","full_name":"madetech/souper-web-scraping","owner":"madetech","description":null,"archived":false,"fork":false,"pushed_at":"2024-01-17T12:13:15.000Z","size":62678,"stargazers_count":2,"open_issues_count":5,"forks_count":1,"subscribers_count":5,"default_branch":"main","last_synced_at":"2025-01-25T14:11:40.363Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/madetech.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-10T09:50:31.000Z","updated_at":"2023-11-16T10:19:14.000Z","dependencies_parsed_at":"2024-11-25T19:33:03.971Z","dependency_job_id":null,"html_url":"https://github.com/madetech/souper-web-scraping","commit_stats":null,"previous_names":["madetech/souper-web-scraping"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/madetech%2Fsouper-web-scraping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/madetech%2Fsouper-web-scraping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/madetech%2Fsouper-web-scraping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/madetech%2Fsouper-web-scraping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/madetech","download_url":"https://codeload.github.com/madetech/souper-web-scraping/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244636003,"owners_count":20485441,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-25T19:19:53.368Z","updated_at":"2025-10-17T07:06:24.295Z","avatar_url":"https://github.com/madetech.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# souper-web-scraping\nThe main aim of this project is to collect information from \n[Service Standard Reports](https://www.gov.uk/service-standard-reports \"Service Standard Reports\").\n\nThe information is to be used in analysing the outcomes of Service Assessments and the reports produced.\nThis will enable us to look for patterns in data and stats that are currently obscured.\n\ne.g, Can we determine if a larger proportion of the Digital Services presented by government do not have appropriate service monitoring in place in terms of meeting Service Standard number 14, i.e.: Monitoring the status of your service.\n\n# Quick setup\nThe application consists of three components: frontend, backend and database.\n\nIn your project directory you will need a .env, like the .env.example. Ask a project colleague for specific information about filling in the template.\n\nThe single command to start all three parts is:\n\n`docker-compose up --build --force-recreate -d`\n\nThe web interface will be visible at the REACT_APP_FRONTEND specified in your .env file. To start a scrape, click the 'Run scrape' button in the top right corner. It may take a while. Once completed you will see entries in your database and the REACT_APP_FRONTEND address will show a list of reports scraped and some of their information.\n\n# Design information\nThe database is Postgres, and database migrations make use of [Alembic] (https://alembic.sqlalchemy.org/en/latest/ \"Alembic\") which works with SLQAlchemy.\nThe frontend is built with React, and the backend is written in Python (requires Python \u003e= 3.10).\n\nThe application has been containerized to make for easier hosting.\n\n# Running locally\nTo run the application locally outside containers, see backend/README and frontend/README.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmadetech%2Fsouper-web-scraping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmadetech%2Fsouper-web-scraping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmadetech%2Fsouper-web-scraping/lists"}