{"id":24511767,"url":"https://github.com/scribe-org/scribe-data","last_synced_at":"2025-04-04T14:03:52.380Z","repository":{"id":45349701,"uuid":"437073864","full_name":"scribe-org/Scribe-Data","owner":"scribe-org","description":"Wikidata, Wiktionary and Wikipedia language data extraction","archived":false,"fork":false,"pushed_at":"2024-10-24T15:16:37.000Z","size":320909,"stargazers_count":27,"open_issues_count":36,"forks_count":67,"subscribers_count":5,"default_branch":"main","last_synced_at":"2024-10-24T17:17:12.168Z","etag":null,"topics":["cli","data","data-pipeline","database","dictionary","education","emoji","etl","grammar","hacktoberfest","language","learning","open-source","python","sparql","translation","translator","wikidata","wikipedia","wiktionary"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"gpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/scribe-org.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE.txt","code_of_conduct":".github/CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2021-12-10T18:15:52.000Z","updated_at":"2024-10-24T15:16:41.000Z","dependencies_parsed_at":"2023-10-16T12:42:36.770Z","dependency_job_id":"374b052c-9bfb-4ffe-a8de-dfa00373c02e","html_url":"https://github.com/scribe-org/Scribe-Data","commit_stats":{"total_commits":145,"total_committers":3,"mean_commits":"48.333333333333336","dds":0.1448275862068965,"last_synced_commit":"03d940910f6007d9f22859a744e562d2a9f7f61a"},"previous_names":[],"tags_count":10,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribe-org%2FScribe-Data","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribe-org%2FScribe-Data/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribe-org%2FScribe-Data/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribe-org%2FScribe-Data/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/scribe-org","download_url":"https://codeload.github.com/scribe-org/Scribe-Data/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247190233,"owners_count":20898700,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["cli","data","data-pipeline","database","dictionary","education","emoji","etl","grammar","hacktoberfest","language","learning","open-source","python","sparql","translation","translator","wikidata","wikipedia","wiktionary"],"created_at":"2025-01-22T00:41:33.558Z","updated_at":"2025-04-04T14:03:52.363Z","avatar_url":"https://github.com/scribe-org.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cdiv align=\"center\"\u003e\n  \u003ca href=\"https://github.com/scribe-org/Scribe-Data\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/scribe-org/Scribe-Data/main/.github/resources/images/ScribeDataLogo.png\" height=150 alt=\"Scribe-Data Logo\"\u003e\u003c/a\u003e\n\u003c/div\u003e\n\n[![platform](https://img.shields.io/badge/Wikidata-990000.svg?logo=wikidata\u0026logoColor=ffffff)](https://github.com/scribe-org/Scribe-Data)\n[![rtd](https://img.shields.io/readthedocs/scribe-data.svg?label=%20\u0026logo=read-the-docs\u0026logoColor=ffffff)](http://scribe-data.readthedocs.io/en/latest/)\n[![issues](https://img.shields.io/github/issues/scribe-org/Scribe-Data?label=%20\u0026logo=github)](https://github.com/scribe-org/Scribe-Data/issues)\n[![language](https://img.shields.io/badge/Python%203-306998.svg?logo=python\u0026logoColor=ffffff)](https://github.com/scribe-org/Scribe-Data/blob/main/CONTRIBUTING.md)\n[![pypi](https://img.shields.io/pypi/v/scribe-data.svg?label=%20\u0026color=4B8BBE)](https://pypi.org/project/scribe-data/)\n[![pypistatus](https://img.shields.io/pypi/status/scribe-data.svg?label=%20)](https://pypi.org/project/scribe-data/)\n[![license](https://img.shields.io/github/license/scribe-org/Scribe-Data.svg?label=%20)](https://github.com/scribe-org/Scribe-Data/blob/main/LICENSE.txt)\n[![coc](https://img.shields.io/badge/Contributor%20Covenant-ff69b4.svg)](https://github.com/scribe-org/Scribe-Data/blob/main/.github/CODE_OF_CONDUCT.md)\n[![mastodon](https://img.shields.io/badge/Mastodon-6364FF.svg?logo=mastodon\u0026logoColor=ffffff)](https://wikis.world/@scribe)\n[![matrix](https://img.shields.io/badge/Matrix-000000.svg?logo=matrix\u0026logoColor=ffffff)](https://matrix.to/#/#scribe_community:matrix.org)\n\n### Wikidata and Wikipedia language data extraction\n\n**Scribe-Data** is a convenient command-line interface (CLI) for extracting and formatting language data from [Wikidata](https://www.wikidata.org/) and [Wikipedia](https://www.wikipedia.org/). Functionality includes allowing users to list, download, and manage language data directly from the terminal.\n\n\u003e [!NOTE]\\\n\u003e The [contributing](#contributing) section has information for those interested, with the articles and presentations in [featured by](#featured-by) also being good resources for learning more about Scribe.\n\nScribe applications are available on [iOS](https://github.com/scribe-org/Scribe-iOS), [Android](https://github.com/scribe-org/Scribe-Android) (WIP) and [Desktop](https://github.com/scribe-org/Scribe-Desktop) (planned).\n\nCheck out Scribe's [architecture diagrams](https://github.com/scribe-org/Organization/blob/main/ARCHITECTURE.md) for an overview of the organization including our applications, services and processes. It depicts the projects that [Scribe](https://github.com/scribe-org) is developing as well as the relationships between them and the external systems with which they interact. Also check out the [Wikidata and Scribe Guide](https://github.com/scribe-org/Organization/blob/main/WIKIDATAGUIDE.md) for an overview of [Wikidata](https://www.wikidata.org/) and getting language data from it.\n\n\u003ca id=\"contents\"\u003e\u003c/a\u003e\n\n# **Contents**\n\n- [Process](#process)\n- [Installation](#installation)\n- [CLI Usage](#cli-usage)\n- [Contributing](#contributing)\n- [Environment Setup](#environment-setup)\n- [Featured By](#featured-by)\n\n\u003ca id=\"Process\"\u003e\u003c/a\u003e\n\n# Process [`⇧`](#contents)\n\nThe CLI commands defined within [scribe_data/cli](https://github.com/scribe-org/Scribe-Data/blob/main/src/scribe_data/cli) and the notebooks within the various [scribe_data](https://github.com/scribe-org/Scribe-Data/tree/main/src/scribe_data) directories are used to update all data for [Scribe-iOS](https://github.com/scribe-org/Scribe-iOS), with this functionality later being expanded to update [Scribe-Android](https://github.com/scribe-org/Scribe-Android) and [Scribe-Desktop](https://github.com/scribe-org/Scribe-Desktop) once they're active.\n\nThe main data update process in triggers [language based SPARQL queries](https://github.com/scribe-org/Scribe-Data/tree/main/src/scribe_data/wikidata/language_data_extraction) to query language data from [Wikidata](https://www.wikidata.org/) using [SPARQLWrapper](https://github.com/RDFLib/sparqlwrapper) as a URI. The autosuggestion process derives popular words from [Wikipedia](https://www.wikipedia.org/) as well as those words that normally follow them for an effective baseline feature until natural language processing methods are employed. Functions to generate autosuggestions are ran in [gen_autosuggestions.ipynb](https://github.com/scribe-org/Scribe-Data/blob/main/src/scribe_data/wikipedia/gen_autosuggestions.ipynb). Emojis are further sourced from [Unicode CLDR](https://github.com/unicode-org/cldr), with this process being ran via the `scribe-data get -lang LANGUAGE -dt emoji-keywords` command.\n\n\u003ca id=\"installation\"\u003e\u003c/a\u003e\n\n# Installation [`⇧`](#contents)\n\nScribe-Data is available for installation via [pip](https://pypi.org/project/scribe-data/):\n\n```bash\npip install scribe-data\n\n# For a development build:\ngit clone https://github.com/scribe-org/Scribe-Data.git  # or ideally your fork\ncd Scribe-Data\npip install -e .\n```\n\n\u003ca id=\"cli-usage\"\u003e\u003c/a\u003e\n\n# CLI Usage [`⇧`](#contents)\n\nScribe-Data provides a command-line interface (CLI) for efficient interaction with its language data functionality. Please see the [usage guide](https://github.com/scribe-org/Scribe-Data/blob/main/USAGE.md) or the [official documentation](https://scribe-data.readthedocs.io/) for detailed instructions.\n\n### Basic Usage\n\nTo utilize the Scribe-Data CLI, you can execute variations of the following command in your terminal:\n\n```bash\nscribe-data -h  # view the cli options\nscribe-data [command] [arguments]\n```\n\n### Available Commands\n\n- `list` (`l`): Enumerate available languages, data types and their combinations.\n- `get` (`g`): Retrieve data from Wikidata for specified languages and data types.\n- `total` (`t`): Display the total available data for given languages and data types.\n- `convert` (`c`): Transform data returned by Scribe-Data into different file formats.\n\n### Command Examples\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/user-attachments/assets/653941a7-68bb-4d72-a0f1-3e29c75c5a16\" alt=\"List, Total and Get GIF\" width=\"500\" height=\"300\"\u003e\n\u003c/p\u003e\n\n```bash\n# Commands used in the above GIF:\nscribe-data list --language\nscribe-data list --data-type\nscribe-data get --language English --data-type verbs -od ./scribe-data\nscribe-data total --language English\n```\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"https://github.com/user-attachments/assets/4cbb85ed-d853-4008-8db9-b77ffcbe2e84\" alt=\"Interactive GIF\" width=\"500\" height=\"300\"\u003e\n\u003c/p\u003e\n\n```bash\n# Commands used in the above GIF:\nscribe-data get -i\nscribe-data total -i\n```\n\n\u003ca id=\"contributing\"\u003e\u003c/a\u003e\n\n# Contributing [`⇧`](#contents)\n\n\u003ca href=\"https://matrix.to/#/#scribe_community:matrix.org\"\u003e\u003cimg src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/MatrixLogoGrey.png\" height=\"50\" alt=\"Public Matrix Chat\" align=\"right\"\u003e\u003c/a\u003e\n\nScribe uses [Matrix](https://matrix.org/) for communications. You're more than welcome to [join us in our public chat rooms](https://matrix.to/#/#scribe_community:matrix.org) to share ideas, ask questions or just say hi :)\n\nPlease see the [contribution guidelines](https://github.com/scribe-org/Scribe-Data/blob/main/CONTRIBUTING.md) and [Wikidata and Scribe Guide](https://github.com/scribe-org/Organization/blob/main/WIKIDATAGUIDE.md) if you are interested in contributing to Scribe-Data. Work that is in progress or could be implemented is tracked in the [issues](https://github.com/scribe-org/Scribe-Data/issues) and [projects](https://github.com/scribe-org/Scribe-Data/projects).\n\n\u003e [!NOTE]\\\n\u003e Just because an issue is assigned on GitHub doesn't mean that the team isn't interested in your contribution! Feel free to write [in the issues](https://github.com/scribe-org/Scribe-Data/issues) and we can potentially reassign it to you.\n\nThose interested can further check the [`-next release-`](https://github.com/scribe-org/Scribe-Data/labels/-next%20release-) and [`-priority-`](https://github.com/scribe-org/Scribe-Data/labels/-priority-) labels in the [issues](https://github.com/scribe-org/Scribe-Data/issues) for those that are most important, as well as those marked [`good first issue`](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+is%3Aopen+label%3A%22good+first+issue%22) that are tailored for first time contributors.\n\nAfter your first few pull requests organization members would be happy to discuss granting you further rights as a contributor, with a maintainer role then being possible after continued interest in the project. Scribe seeks to be an inclusive and supportive organization. We'd love to have you on the team!\n\n### Ways to Help [`⇧`](#contents)\n\n- [Reporting bugs](https://github.com/scribe-org/Scribe-Data/issues/new?assignees=\u0026labels=bug\u0026template=bug_report.yml) as they're found 🐞\n- Working on [new features](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+is%3Aopen+label%3Afeature) ✨\n- [Documentation](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+is%3Aopen+label%3Adocumentation) for onboarding and project cohesion 📝\n- Adding language data to [Scribe-Data](https://github.com/scribe-org/Scribe-Data/issues) via [Wikidata](https://www.wikidata.org/)! 🗃️\n\n### Road Map [`⇧`](#contents)\n\nThe Scribe road map can be followed in the organization's [project board](https://github.com/orgs/scribe-org/projects/1) where we list the most important issues along with their priority, status and an indication of which sub projects they're included in (if applicable).\n\n\u003e [!NOTE]\\\n\u003e Consider joining our [bi-weekly developer syncs](https://etherpad.wikimedia.org/p/scribe-dev-sync)!\n\n### Data Edits [`⇧`](#contents)\n\n\u003e [!NOTE]\\\n\u003e Please see the [Wikidata and Scribe Guide](https://github.com/scribe-org/Organization/blob/main/WIKIDATAGUIDE.md) for an overview of [Wikidata](https://www.wikidata.org/) and how Scribe uses it.\n\nScribe does not accept direct edits to the grammar JSON files as they are sourced from [Wikidata](https://www.wikidata.org/). Edits can be discussed and the queries themselves will be changed and ran before an update. If there is a problem with one of the files, then the fix should be made on [Wikidata](https://www.wikidata.org/) and not on Scribe. Feel free to let us know that edits have been made by [opening a data issue](https://github.com/scribe-org/Scribe-Data/issues/new?assignees=\u0026labels=data\u0026template=data_wikidata.yml) and we'll be happy to integrate them!\n\n\u003ca id=\"environment-setup\"\u003e\u003c/a\u003e\n\n# Environment Setup [`⇧`](#contents)\n\n\u003e [!IMPORTANT]\n\u003e\n\u003e \u003cdetails\u003e\u003csummary\u003eSuggested IDE extensions\u003c/summary\u003e\n\u003e\n\u003e \u003cp\u003e\n\u003e\n\u003e VS Code\n\u003e\n\u003e - [blokhinnv.wikidataqidlabels](https://marketplace.visualstudio.com/items?itemName=blokhinnv.wikidataqidlabels)\n\u003e - [charliermarsh.ruff](https://marketplace.visualstudio.com/items?itemName=charliermarsh.ruff)\n\u003e - [qwtel.sqlite-viewer](https://marketplace.visualstudio.com/items?itemName=qwtel.sqlite-viewer)\n\u003e - [streetsidesoftware.code-spell-checker](https://marketplace.visualstudio.com/items?itemName=streetsidesoftware.code-spell-checker)\n\u003e\n\u003e \u003c/p\u003e\n\u003e \u003c/details\u003e\n\nThe development environment for Scribe-Data can be installed via the following steps:\n\n1. [Fork](https://docs.github.com/en/get-started/quickstart/fork-a-repo) the [Scribe-Data repo](https://github.com/scribe-org/Scribe-Data), clone your fork, and configure the remotes:\n\n\u003e [!NOTE]\n\u003e\n\u003e \u003cdetails\u003e\u003csummary\u003eConsider using SSH\u003c/summary\u003e\n\u003e\n\u003e \u003cp\u003e\n\u003e\n\u003e Alternatively to using HTTPS as in the instructions below, consider SSH to interact with GitHub from the terminal. SSH allows you to connect without a user-pass authentication flow.\n\u003e\n\u003e To run git commands with SSH, remember then to substitute the HTTPS URL, `https://github.com/...`, with the SSH one, `git@github.com:...`.\n\u003e\n\u003e - e.g. Cloning now becomes `git clone git@github.com:\u003cyour-username\u003e/Scribe-Data.git`\n\u003e\n\u003e GitHub also has their documentation on how to [Generate a new SSH key](https://docs.github.com/en/authentication/connecting-to-github-with-ssh/generating-a-new-ssh-key-and-adding-it-to-the-ssh-agent) 🔑\n\u003e\n\u003e \u003c/p\u003e\n\u003e \u003c/details\u003e\n\n```bash\n# Clone your fork of the repo into the current directory.\ngit clone https://github.com/\u003cyour-username\u003e/Scribe-Data.git\n# Navigate to the newly cloned directory.\ncd Scribe-Data\n# Assign the original repo to a remote called \"upstream\".\ngit remote add upstream https://github.com/scribe-org/Scribe-Data.git\n```\n\n- Now, if you run `git remote -v` you should see two remote repositories named:\n  - `origin` (forked repository)\n  - `upstream` (Scribe-Data repository)\n\n2. Use [Python venv](https://docs.python.org/3/library/venv.html) to create the local development environment within your Scribe-Data directory:\n\n- On Unix or MacOS, run:\n\n  ```bash\n  python3 -m venv venv  # make an environment named venv\n  source venv/bin/activate # activate the environment\n  ```\n\n- On Windows (using Command Prompt), run:\n\n  ```bash\n  python -m venv venv\n  venv\\Scripts\\activate.bat\n  ```\n\n- On Windows (using PowerShell), run:\n\n  ```bash\n  python -m venv venv\n  venv\\Scripts\\activate.ps1\n  ```\n\nAfter activating the virtual environment, install the required dependencies and set up [pre-commit](https://pre-commit.com/) by running:\n\n```bash\npip install --upgrade pip  # make sure that pip is at the latest version\npip install -r requirements-dev.txt  # install development dependencies\npip install -e .  # install the local version of Scribe-Data\npre-commit install  # install pre-commit hooks\n# pre-commit run --all-files  # lint and fix common problems in the codebase\n```\n\nSee the [contribution guidelines](https://github.com/scribe-org/Scribe-Data/blob/main/CONTRIBUTING.md) for a more detailed explanation and troubleshooting.\n\n\u003e [!NOTE]\n\u003e Feel free to contact the team in the [Data room on Matrix](https://matrix.to/#/#ScribeData:matrix.org) if you're having problems getting your environment setup!\n\n\u003ca id=\"featured-by\"\u003e\u003c/a\u003e\n\n# Featured By [`⇧`](#contents)\n\n\u003cdetails open\u003e\u003csummary\u003e\u003cstrong\u003eArticles and Presentations on Scribe\u003c/strong\u003e\u003c/summary\u003e\n\u003cp\u003e\n\n\u003cstrong\u003e2024\u003c/strong\u003e\n\n- October: [Blog post on Medium](https://medium.com/@arpita151103/scribe-an-open-source-solution-for-language-learning-and-data-accessibility-092dab026fd6) discussing the [Scribe-Data](https://github.com/scribe-org/Scribe-Data) development process, community and features\n- October: [Blog post on medium](https://medium.com/@mhmohona/ins-and-outs-of-scribe-data-cli-bd51202aa7c6) describing the main features of [Scribe-Data](https://github.com/scribe-org/Scribe-Data)\n- September: [Final Google Summer of Code report](https://medium.com/@mhmohona/the-final-stretch-gsoc-journey-with-scribe-data-1740084c958d) on the creation of the [Scribe-Data](https://github.com/scribe-org/Scribe-Data) CLI\n- August: [Final Google Summer of Code report](https://jagmarcel.hashnode.dev/gsoc-2024-final-report) on the creation of Scribe's cross-language translation functionality\n- July: [Blog post on Medium](https://medium.com/@mhmohona/halfway-there-my-gsoc-adventure-with-scribe-data-cli-2ffe6d727ecb) about the progress on creating the [Scribe-Data](https://github.com/scribe-org/Scribe-Data) CLI\n- July: [Blog post on Hashnode](https://jagmarcel.hashnode.dev/gsoc-2024-midterm-report) providing an midterm report on the localization and translation expansion for [Scribe-iOS](https://github.com/scribe-org/Scribe-iOS)\n- July: [Blog post on Hashnode](https://jagmarcel.hashnode.dev/my-first-experiences-with-gsoc) about the initial steps towards the localization of [Scribe-iOS](https://github.com/scribe-org/Scribe-iOS)\n- June: [Blog post on Medium](https://medium.com/@mhmohona/first-month-as-a-gsoc-intern-building-scribe-data-cli-d0c12c9e8371) about the planned [Scribe-Data](https://github.com/scribe-org/Scribe-Data) CLI\n- April: [Blog post on Medium](https://medium.com/@mhmohona/scribe-data-a-guide-to-open-source-language-data-a801c59db4c9) about [Scribe-Data](https://github.com/scribe-org/Scribe-Data) and its functionalities\n- February: [Presentation slides](https://docs.google.com/presentation/d/1lMhYiQx1R99SVGhbikUGjOVaFgPPASvbzM2Bsu3NXSg/edit?usp=sharing) for Scribe's participation at the [Wikimedia Tech Safari Program](https://www.mediawiki.org/wiki/Wikimedia_Tech_Safari_Program)\n\n\u003cstrong\u003e2023\u003c/strong\u003e\n\n- August: [Scribe-iOS final submission report for Google Summer of Code 2023](https://saurabhjamadagni.hashnode.dev/gsoc-23-final-work-submission)\n- June: [Scribe-iOS development blog post on Nested UITableViews \u0026 Apple's built-in ViewControllers in app menu](https://saurabhjamadagni.hashnode.dev/nested-uitableviews-apples-built-in-viewcontrollers) for [GSoC '23](https://www.mediawiki.org/wiki/Google_Summer_of_Code/2023#Accepted_projects:~:text=links%3A%20Phabricator%20issue-,3.%20Adding%20a%20Menu%20and%20Keyboards%20to%20Scribe%2DiOS,-%5Bedit%5D)\n- March: [Presentation slides](https://docs.google.com/presentation/d/1W4ZkGi9UDDiTxM_silEij0gTE8YEubluHxe78xoqEP0/edit?usp=sharing) for a talk at [Berlin Hack and Tell](https://bhnt.c-base.org/) ([Hack of the month winner 🏆](https://bhnt.c-base.org/2023-03-28-no87-moore-hacks))\n\n\u003cstrong\u003e2022\u003c/strong\u003e\n\n- August: [Presentation slides](https://docs.google.com/presentation/d/12WNSt5xgNIAmSxPfvjno9-sBMGlvxG_xSaAxmHQDRNQ/edit?usp=sharing) for a session at the [2022 Wikimania Hackathon](https://wikimania.wikimedia.org/wiki/2022:Hackathon)\n- July: [Presentation slides](https://docs.google.com/presentation/d/10Ai0-b8XUj5u9Hw4UgBtB7ufiPhvfFrb1vEUEyXYr5w/edit?usp=sharing) for a talk at [CocoaHeads Berlin](https://www.meetup.com/cocoaheads-berlin/)\n- July: [Video on Scribe](https://www.youtube.com/watch?v=4GpFN0gGmy4\u0026list=PL66MRMNlLyR7p9wsYVfuqJOjKZpbuwp8U\u0026index=6) for [Wikimedia Celtic Knot 2022](https://meta.wikimedia.org/wiki/Celtic_Knot_Conference_2022)\n- June: [Presentation slides](https://docs.google.com/presentation/d/1K2lj8PPgdx12I-xuhm--CBLrGm-Cz50NJmbp96zpGrk/edit?usp=sharing) for a talk with the [LD4 Wikidata Affinity Group](https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group)\n- June: [Scribe](https://github.com/scribe-org) featured for new developers on [MediaWiki](https://www.mediawiki.org/wiki/New_Developers#Scribe)\n- May: [Presentation slides](https://docs.google.com/presentation/d/1Cu3VwQ3lJUp5W84YDe0AFYS-6zfBxKsm0MI-OMl_IzY/edit?usp=sharing) for [Wikimedia Hackathon 2022](https://www.mediawiki.org/wiki/Wikimedia_Hackathon_2022)\n- March: [Blog post](https://tech-news.wikimedia.de/en/2022/03/18/lexicographical-data-for-language-learners-the-wikidata-based-app-scribe/) on [Scribe-iOS](https://github.com/scribe-org/Scribe-iOS) for [Wikimedia Tech News](https://tech-news.wikimedia.de/en/homepage/) ([DE](https://tech-news.wikimedia.de/2022/03/18/sprachenlernen-mit-lexikografische-daten-die-wikidata-basierte-app-scribe/) / [Tweet](https://twitter.com/wikidata/status/1507335538596106257?s=20\u0026t=YGRGamftI-5B_VwQ_bFRhA))\n- March: [Presentation slides](https://docs.google.com/presentation/d/16ld_rCbwJCiAdRrfhF-Fq9Wm_ciHCbk_HCzGQs6TB1Q/edit?usp=sharing) for [Wikidata Data Reuse Days 2022](https://diff.wikimedia.org/event/wikidata-data-reuse-days-2022/)\n\n\u003c/p\u003e\n\u003c/details\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003cbr\u003e\n    \u003ca href=\"https://tech-news.wikimedia.de/en/2022/03/18/lexicographical-data-for-language-learners-the-wikidata-based-app-scribe/\"\u003e\u003cimg height=\"120\"src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/WikimediaDeutschlandLogo.png\" alt=\"Wikimedia Deutschland logo linking to an article on Scribe in the tech news blog.\"\u003e\u003c/a\u003e\n    \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n    \u003ca href=\"https://www.mediawiki.org/wiki/New_Developers#Scribe\"\u003e\u003cimg height=\"120\" src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/WikimediaFoundationLogo.png\" alt=\"Wikimedia Foundation logo linking to the MediaWiki new developers page.\"\u003e\u003c/a\u003e\n    \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n  \u003cbr\u003e\n\u003c/div\u003e\n\n\u003cdiv align=\"center\"\u003e\n  \u003cbr\u003e\n    \u003ca href=\"https://summerofcode.withgoogle.com/\"\u003e\u003cimg height=\"120\" src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/GSoCLogo.png\" alt=\"Google Summer of Code logo linking to its website.\"\u003e\u003c/a\u003e\n    \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n    \u003ca href=\"https://www.outreachy.org/\"\u003e\u003cimg height=\"90\" src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/OutreachyLogo.png\" alt=\"Outreachy logo linking to its website.\"\u003e\u003c/a\u003e\n    \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n  \u003cbr\u003e\n\u003c/div\u003e\n\n# Powered By [`⇧`](#contents)\n\n### Contributors\n\nMany thanks to all the [Scribe-Data contributors](https://github.com/scribe-org/Scribe-Data/graphs/contributors)! 🚀\n\n\u003ca href=\"https://github.com/scribe-org/Scribe-Data/graphs/contributors\"\u003e\n  \u003cimg src=\"https://contrib.rocks/image?repo=scribe-org/Scribe-Data\" /\u003e\n\u003c/a\u003e\n\n### Blog posts\n\n\u003cdetails\u003e\u003csummary\u003e\u003cstrong\u003eList of referenced posts\u003c/strong\u003e\u003c/summary\u003e\n\u003cp\u003e\n\n- [Building a Recommendation System Using Neural Network Embeddings](https://towardsdatascience.com/building-a-recommendation-system-using-neural-network-embeddings-1ef92e5c80c9) by [WillKoehrsen](https://github.com/WillKoehrsen)\n\n- [Wikipedia Data Science: Working with the World’s Largest Encyclopedia](https://towardsdatascience.com/wikipedia-data-science-working-with-the-worlds-largest-encyclopedia-c08efbac5f5c) by [WillKoehrsen](https://github.com/WillKoehrsen)\n\n\u003c/p\u003e\n\u003c/details\u003e\n\n### Wikimedia Communities\n\n\u003cdiv align=\"center\"\u003e\n  \u003cbr\u003e\n  \u003ca href=\"https://www.wikidata.org/\"\u003e\u003cimg height=\"175\" src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/WikidataLogo.png\" alt=\"Wikidata logo\"\u003e\u003c/a\u003e\n  \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n  \u003ca href=\"https://www.wikipedia.org/\"\u003e\u003cimg height=\"190\" src=\"https://raw.githubusercontent.com/scribe-org/Organization/main/resources/images/logos/WikipediaLogo.png\" alt=\"Wikipedia logo\"\u003e\u003c/a\u003e\n  \u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\u0026nbsp;\n  \u003cbr\u003e\n\u003c/div\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscribe-org%2Fscribe-data","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fscribe-org%2Fscribe-data","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscribe-org%2Fscribe-data/lists"}