{"id":25426832,"url":"https://github.com/microsoft/reconner","last_synced_at":"2025-10-31T16:30:27.352Z","repository":{"id":41444663,"uuid":"241975909","full_name":"microsoft/reconner","owner":"microsoft","description":"ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.","archived":false,"fork":false,"pushed_at":"2020-07-26T08:17:47.000Z","size":2925,"stargazers_count":34,"open_issues_count":0,"forks_count":4,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-02-16T10:18:32.027Z","etag":null,"topics":["ai","data","ner-data","nlp"],"latest_commit_sha":null,"homepage":"https://microsoft.github.io/reconner","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/microsoft.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"docs/contributing.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null}},"created_at":"2020-02-20T19:52:08.000Z","updated_at":"2025-01-17T09:53:31.000Z","dependencies_parsed_at":"2022-08-01T00:08:29.217Z","dependency_job_id":null,"html_url":"https://github.com/microsoft/reconner","commit_stats":null,"previous_names":[],"tags_count":14,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Freconner","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Freconner/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Freconner/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/microsoft%2Freconner/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/microsoft","download_url":"https://codeload.github.com/microsoft/reconner/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239214052,"owners_count":19601077,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","data","ner-data","nlp"],"created_at":"2025-02-17T00:22:46.296Z","updated_at":"2025-10-31T16:30:27.270Z","avatar_url":"https://github.com/microsoft.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003ca href=\"https://microsoft.github.io/reconner\"\u003e\u003cimg src=\"https://microsoft.github.io/reconner/img/recon.svg\" alt=\"Recon\"\u003e\u003c/a\u003e\n\u003c/p\u003e\n\u003cp align=\"center\"\u003e\n    \u003cem\u003eRecon NER, Debug annotated Named Entity Recognition (NER) data for inconsitencies and get insights on improving the quality of your data.\u003c/em\u003e\n\u003c/p\u003e\n\u003cp align=\"center\"\u003e\n\u003ca href=\"https://pypi.org/project/reconner\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/pypi/v/reconner?style=for-the-badge\" alt=\"PyPi Package version\"\u003e\n\u003c/a\u003e\n\u003ca href=\"https://dev.azure.com/kabirkhan1137/ReconNER/_build\" target=\"_blank\"\u003e\n    \u003cimg alt=\"Azure Pipelines Build badge\" src=\"https://img.shields.io/azure-devops/build/kabirkhan1137/ReconNER/microsoft.reconner?style=for-the-badge\"\u003e\n\u003c/a\u003e\n\u003ca href=\"https://codecov.io/gh/microsoft/reconner\" rel=\"nofollow\"\u003e\n  \u003cimg alt=\"Codecov badge\" src=\"https://img.shields.io/codecov/c/gh/microsoft/reconner?style=for-the-badge\" style=\"max-width:100%;\"\u003e\n\u003c/a\u003e\n\n\u003ca href=\"https://pypi.org/project/reconner\" target=\"_blank\"\u003e\n    \u003cimg src=\"https://img.shields.io/pypi/l/reconner?style=for-the-badge\" alt=\"PyPi Package license\"\u003e\n\u003c/a\u003e\n\n\u003c/p\u003e\n\n---\n\n**Documentation**: \u003ca href=\"https://microsoft.github.io/reconner\" target=\"_blank\"\u003ehttps://microsoft.github.io/reconner\u003c/a\u003e\n\n**Source Code**: \u003ca href=\"https://github.com/microsoft/reconner\" target=\"_blank\"\u003ehttps://github.com/microsoft/reconner\u003c/a\u003e\n\n---\n\nReconNER is a library to help you fix your annotated NER data and identify examples that are hardest for your model to predict so you can strategically prioritize the examples you annotate.\n\nThe key features are:\n\n* **Data Validation and Cleanup**: Easily Validate the format of your NER data. Filter overlapping Entity Annotations, fix missing properties.\n* **Model Insights**: Analyze how well your model does on your Dataset. Identify the top errors your model is making so you can prioritize data collection and correction strategically.\n* **Model Insights**: Analyze how well your model does on your Dataset. Identify the top errors your model is making so you can prioritize data collection and correction strategically.\n* **Dataset Management**: ReconNER provides a `Dataset` class to manage the train/dev/test split of your data and apply the same functions across all splits in your data + a concatenation of all examples. Operate inplace to consistently transform your data.\n* **Serializable Dataset**: Serialize and Deserialize your data to and from JSON to the ReconNER type system.\n* **Type Hints**: Comprehensive Typing system based on Python 3.6+ Type Hints\n\n## Requirements\n\nPython 3.6+\n\n`ReconNER` is built on a few comprehensive, high-performing packages.\n\n* \u003ca href=\"https://spacy.io\" class=\"external-link\" target=\"_blank\"\u003espaCy\u003c/a\u003e\n* \u003ca href=\"https://pydantic-docs.helpmanual.io/\" class=\"external-link\" target=\"_blank\"\u003ePydantic (Type system and JSON Serialization)\u003c/a\u003e\n* \u003ca href=\"https://typer.tiangolo.com\" class=\"external-link\" target=\"_blank\"\u003eTyper (CLI)\u003c/a\u003e.\n\n\n## Installation\n\n\u003cdiv class=\"termy\"\u003e\n\n```console\n$ pip install reconner\n---\u003e 100%\nSuccessfully installed reconner\n```\n\n\u003c/div\u003e\n\n## License\n\nThis project is licensed under the terms of the MIT license.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmicrosoft%2Freconner","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmicrosoft%2Freconner","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmicrosoft%2Freconner/lists"}