{"id":23632203,"url":"https://github.com/harmonydata/pdf-questionnaire-extraction","last_synced_at":"2025-07-05T18:33:10.696Z","repository":{"id":295027357,"uuid":"819367018","full_name":"harmonydata/pdf-questionnaire-extraction","owner":"harmonydata","description":null,"archived":false,"fork":false,"pushed_at":"2024-07-19T05:55:30.000Z","size":205,"stargazers_count":0,"open_issues_count":3,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-05-23T08:44:23.845Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/harmonydata.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2024-06-24T11:11:44.000Z","updated_at":"2024-07-19T05:55:33.000Z","dependencies_parsed_at":"2025-05-23T08:54:46.238Z","dependency_job_id":null,"html_url":"https://github.com/harmonydata/pdf-questionnaire-extraction","commit_stats":null,"previous_names":["harmonydata/pdf-questionnaire-extraction"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/harmonydata/pdf-questionnaire-extraction","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harmonydata%2Fpdf-questionnaire-extraction","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harmonydata%2Fpdf-questionnaire-extraction/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harmonydata%2Fpdf-questionnaire-extraction/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harmonydata%2Fpdf-questionnaire-extraction/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/harmonydata","download_url":"https://codeload.github.com/harmonydata/pdf-questionnaire-extraction/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/harmonydata%2Fpdf-questionnaire-extraction/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":263784855,"owners_count":23510986,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-28T03:27:59.736Z","updated_at":"2025-07-05T18:33:10.677Z","avatar_url":"https://github.com/harmonydata.png","language":"Python","readme":"![The Harmony Project logo](https://raw.githubusercontent.com/harmonydata/brand/main/Logo/PNG/%D0%BB%D0%BE%D0%B3%D0%BE%20%D1%84%D1%83%D0%BB-05.png)\n\n\u003ca href=\"https://harmonydata.ac.uk\"\u003e\u003cspan align=\"left\"\u003e🌐 harmonydata.ac.uk\u003c/span\u003e\u003c/a\u003e\n\u003ca href=\"https://www.linkedin.com/company/harmonydata\"\u003e\u003cimg align=\"left\" src=\"https://raw.githubusercontent.com//harmonydata/.github/main/profile/linkedin.svg\" alt=\"Harmony | LinkedIn\" width=\"21px\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://twitter.com/harmony_data\"\u003e\u003cimg align=\"left\" src=\"https://raw.githubusercontent.com//harmonydata/.github/main/profile/x.svg\" alt=\"Harmony | X\" width=\"21px\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://www.instagram.com/harmonydata/\"\u003e\u003cimg align=\"left\" src=\"https://raw.githubusercontent.com//harmonydata/.github/main/profile/instagram.svg\" alt=\"Harmony | Instagram\" width=\"21px\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://www.facebook.com/people/Harmony-Project/100086772661697/\"\u003e\u003cimg align=\"left\" src=\"https://raw.githubusercontent.com//harmonydata/.github/main/profile/fb.svg\" alt=\"Harmony | Facebook\" width=\"21px\"/\u003e\u003c/a\u003e\n\u003ca href=\"https://www.youtube.com/channel/UCraLlfBr0jXwap41oQ763OQ\"\u003e\u003cimg align=\"left\" src=\"https://raw.githubusercontent.com//harmonydata/.github/main/profile/yt.svg\" alt=\"Harmony | YouTube\" width=\"21px\"/\u003e\u003c/a\u003e\n\n [![Harmony on Twitter](https://img.shields.io/twitter/follow/harmony_data.svg?style=social\u0026label=Follow)](https://twitter.com/harmony_data) \n\n\n# Harmony PDF extraction\n\n\u003c!-- badges: start --\u003e\n[![PyPI package](https://img.shields.io/badge/pip%20install-harmonydata-brightgreen)](https://pypi.org/project/harmonydata/) ![my badge](https://badgen.net/badge/Status/In%20Development/orange) [![License](https://img.shields.io/github/license/harmonydata/harmony)](https://github.com/harmonydata/harmony/blob/main/LICENSE)\n[![tests](https://github.com/harmonydata/harmony/actions/workflows/test.yml/badge.svg)](https://github.com/harmonydata/harmony/actions/workflows/test.yml)\n[![Current Release Version](https://img.shields.io/github/release/harmonydata/harmony.svg?style=flat-square\u0026logo=github)](https://github.com/harmonydata/harmony/releases)\n[![pypi Version](https://img.shields.io/pypi/v/harmonydata.svg?style=flat-square\u0026logo=pypi\u0026logoColor=white)](https://pypi.org/project/harmonydata/)\n [![version number](https://img.shields.io/pypi/v/harmonydata?color=green\u0026label=version)](https://github.com/harmonydata/harmony/releases) [![PyPi downloads](https://static.pepy.tech/personalized-badge/harmonydata?period=total\u0026units=international_system\u0026left_color=grey\u0026right_color=orange\u0026left_text=pip%20downloads)](https://pypi.org/project/harmonydata/)\n[![forks](https://img.shields.io/github/forks/harmonydata/harmony)](https://github.com/harmonydata/harmony/forks)\n[![docker](https://img.shields.io/badge/docker-pull-blue.svg?logo=docker\u0026logoColor=white)](https://hub.docker.com/r/harmonydata/harmonywithtika)\n\n# How to get started\n\nRun `train.py` to create the CRF model.\n\nYou will need the training data, please contact Thomas Wood for the data.\n\n## ‎😃💁 Who worked on Harmony?\n\nHarmony is a collaboration project between [Ulster University](https://ulster.ac.uk/), [University College London](https://ucl.ac.uk/), the [Universidade Federal de Santa Maria](https://www.ufsm.br/), and [Fast Data Science](http://fastdatascience.com/).  Harmony is funded by [Wellcome](https://wellcome.org/) as part of the [Wellcome Data Prize in Mental Health](https://wellcome.org/grant-funding/schemes/wellcome-mental-health-data-prize).\n\nThe core team at Harmony is made up of:\n\n* [Dr Bettina Moltrecht, PhD](https://profiles.ucl.ac.uk/60736-bettina-moltrecht) (UCL)\n* [Dr Eoin McElroy](https://www.ulster.ac.uk/staff/e-mcelroy) (University of Ulster)\n* [Dr George Ploubidis](https://profiles.ucl.ac.uk/48171-george-ploubidis) (UCL)\n* [Dr Mauricio Scopel Hoffmann](https://ufsmpublica.ufsm.br/docente/18264) (Universidade Federal de Santa Maria, Brazil)\n* [Thomas Wood](https://freelancedatascientist.net/) ([Fast Data Science](https://fastdatascience.com))\n\n## 📜 License\n\nMIT License. Copyright (c) 2023 Ulster University (https://www.ulster.ac.uk)\n\n## 📜 How do I cite Harmony?\n\nMcElroy, E., Moltrecht, B., Ploubidis, G.B., Scopel Hoffman, M., Wood, T.A., Harmony [Computer software], Version 1.0, accessed at https://harmonydata.ac.uk/app. Ulster University (2023)\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fharmonydata%2Fpdf-questionnaire-extraction","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fharmonydata%2Fpdf-questionnaire-extraction","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fharmonydata%2Fpdf-questionnaire-extraction/lists"}