{"id":15103566,"url":"https://github.com/explosion/spacy-benchmarks","last_synced_at":"2025-09-27T02:31:40.817Z","repository":{"id":25459518,"uuid":"28889797","full_name":"explosion/spacy-benchmarks","owner":"explosion","description":"💫  Runtime performance comparison of spaCy against other NLP libraries","archived":true,"fork":false,"pushed_at":"2022-08-31T14:31:52.000Z","size":23,"stargazers_count":20,"open_issues_count":2,"forks_count":12,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-09-21T09:32:46.729Z","etag":null,"topics":["benchmarking","benchmarks","natural-language-processing","nlp","spacy"],"latest_commit_sha":null,"homepage":"https://spacy.io","language":"Python","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/explosion.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-01-07T00:09:11.000Z","updated_at":"2023-01-27T21:45:29.000Z","dependencies_parsed_at":"2023-01-14T02:46:30.341Z","dependency_job_id":null,"html_url":"https://github.com/explosion/spacy-benchmarks","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fspacy-benchmarks","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fspacy-benchmarks/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fspacy-benchmarks/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/explosion%2Fspacy-benchmarks/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/explosion","download_url":"https
://codeload.github.com/explosion/spacy-benchmarks/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":219871850,"owners_count":16554459,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["benchmarking","benchmarks","natural-language-processing","nlp","spacy"],"created_at":"2024-09-25T19:40:32.006Z","updated_at":"2025-09-27T02:31:35.577Z","avatar_url":"https://github.com/explosion.png","language":"Python","readme":"\u003ca href=\"https://explosion.ai\"\u003e\u003cimg src=\"https://explosion.ai/assets/img/logo.svg\" width=\"125\" height=\"125\" align=\"right\" /\u003e\u003c/a\u003e\n\n# Runtime performance comparison of spaCy against other NLP libraries\n\n\u003e ⚠️ **This repository is old and deprecated.** For up-to-date benchmark scripts, see the [`projects`](https://github.com/explosion/projects/) repo.\n\n## Set up the corpus DB\n\nThe speed test expects to read documents from a simple SQLite table. More corpus\ningestors need to be written. 
So far there's one to create the table from the Gigaword\ncorpus.\n\n```bash\nfab corpus.giga:path_to_gigaword/\n```\n\n## Set up the tools\n\n```bash\nfab init\n```\n\nThis should download and install spaCy and other NLP libraries.\n\n## Run a benchmark\n\n```bash\nfab speed:parse,spacy,n=1000\nfab speed:tag,spacy\nfab speed:tag,spacy,nltk,n=10000\nfab speed:tokenize,spacy,clearnlp\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexplosion%2Fspacy-benchmarks","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fexplosion%2Fspacy-benchmarks","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fexplosion%2Fspacy-benchmarks/lists"}