{"id":23237021,"url":"https://github.com/nicolay-r/arekit","last_synced_at":"2025-10-13T12:36:17.779Z","repository":{"id":37405651,"uuid":"225708027","full_name":"nicolay-r/AREkit","owner":"nicolay-r","description":"Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large  text collections with ML and for ML","archived":false,"fork":false,"pushed_at":"2025-01-18T11:40:41.000Z","size":23534,"stargazers_count":63,"open_issues_count":6,"forks_count":3,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-09-27T05:31:01.189Z","etag":null,"topics":["batching","bulk-operation","frames","language-model","nlp","pipelines","pipelines-library","relation-extraction","relationship-extraction","sentiment-analysis"],"latest_commit_sha":null,"homepage":"https://nicolay-r.github.io/arekit-page/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/nicolay-r.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-12-03T20:20:46.000Z","updated_at":"2025-03-16T01:19:56.000Z","dependencies_parsed_at":"2024-11-01T15:31:17.245Z","dependency_job_id":"fe84b992-1730-4c8d-9368-dfd60b691365","html_url":"https://github.com/nicolay-r/AREkit","commit_stats":{"total_commits":2597,"total_committers":2,"mean_commits":1298.5,"dds":0.0003850596842510656,"last_synced_commit":"2dce7db2c3b6dbb5d155797ad0f32be10adeb212"},"previous_names":[],"tags_count":19,"template":false,"template_full_name":null,"purl":"pkg:github/nicolay-r/AREkit","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nicolay-r%2FAREkit","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nicolay-r%2FAREkit/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nicolay-r%2FAREkit/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nicolay-r%2FAREkit/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/nicolay-r","download_url":"https://codeload.github.com/nicolay-r/AREkit/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/nicolay-r%2FAREkit/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279015056,"owners_count":26085643,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-13T02:00:06.723Z","response_time":61,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["batching","bulk-operation","frames","language-model","nlp","pipelines","pipelines-library","relation-extraction","relationship-extraction","sentiment-analysis"],"created_at":"2024-12-19T04:13:24.482Z","updated_at":"2025-10-13T12:36:17.764Z","avatar_url":"https://github.com/nicolay-r.png","language":"Python","readme":"# AREkit 0.25.1\n\n![](https://img.shields.io/badge/Python-3.9+-brightgreen.svg)\n[![PyPI downloads](https://img.shields.io/pypi/dm/arekit.svg)](https://pypistats.org/packages/arekit)\n\n\n\u003cp align=\"center\"\u003e\n    \u003cimg src=\"logo.png\"/\u003e\n\u003c/p\u003e\n\n**AREkit** (Attitude and Relation Extraction Toolkit) --\nis a python toolkit, devoted to document level Attitude and Relation Extraction between text objects from mass-media news. \n\n## Description\n\n\nThis toolkit aims at memory-effective data processing in [Relation Extraction (RE)](https://nlpprogress.com/english/relationship_extraction.html) related tasks.\n\n\u003cp align=\"center\"\u003e\n    \u003cimg src=\"docs/arekit-pipeline-concept.png\"/\u003e\n\u003c/p\u003e\n\n\u003e Figure: AREkit pipelines design. More on \n\u003e **[ARElight: Context Sampling of Large Texts for Deep Learning Relation Extraction](https://link.springer.com/chapter/10.1007/978-3-031-56069-9_23)** paper\n\nIn particular, this framework serves the following features: \n* ➿ [pipelines](https://github.com/nicolay-r/AREkit/wiki/Pipelines:-Text-Opinion-Annotation) and iterators for handling large-scale collections serialization without out-of-memory issues.\n* 🔗 EL (entity-linking) API support for objects, \n* ➰ avoidance of cyclic connections,\n* :straight_ruler: distance consideration between relation participants (in `terms` or `sentences`),\n* 📑 relations annotations and filtering rules,\n* *️⃣ entities formatting or masking, and more.\n\nThe core functionality includes: \n* API for document presentation with EL (Entity Linking, i.e. Object Synonymy) support \nfor sentence level relations preparation (dubbed as contexts);\n* API for contexts extraction;\n* Relations transferring from sentence-level onto document-level, and more.\n\n## Installation \n\n```bash\npip install git+https://github.com/nicolay-r/AREkit.git@0.25.1-rc\n```\n\n## Usage\n\nPlease follow the **[tutorial section on project Wiki](https://github.com/nicolay-r/AREkit/wiki/Tutorials)** for mode details.\n\n## How to cite\nA great research is also accompanied by the faithful reference. \nif you use or extend our work, please cite as follows:\n\n```bibtex\n@inproceedings{rusnachenko2024arelight,\n  title={ARElight: Context Sampling of Large Texts for Deep Learning Relation Extraction},\n  author={Rusnachenko, Nicolay and Liang, Huizhi and Kolomeets, Maxim and Shi, Lei},\n  booktitle={European Conference on Information Retrieval},\n  year={2024},\n  organization={Springer}\n}\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnicolay-r%2Farekit","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fnicolay-r%2Farekit","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fnicolay-r%2Farekit/lists"}