{"id":44069369,"url":"https://github.com/scribeocr/scribe.js","last_synced_at":"2026-02-08T04:06:14.376Z","repository":{"id":253222065,"uuid":"842818379","full_name":"scribeocr/scribe.js","owner":"scribeocr","description":"JavaScript OCR and text extraction for images and PDFs.","archived":false,"fork":false,"pushed_at":"2026-02-05T00:42:52.000Z","size":312944,"stargazers_count":255,"open_issues_count":26,"forks_count":17,"subscribers_count":3,"default_branch":"master","last_synced_at":"2026-02-05T12:54:06.957Z","etag":null,"topics":["javascript","ocr","tesseract","webassembly"],"latest_commit_sha":null,"homepage":"","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/scribeocr.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2024-08-15T06:41:32.000Z","updated_at":"2026-02-05T00:42:56.000Z","dependencies_parsed_at":"2025-09-14T22:26:12.673Z","dependency_job_id":null,"html_url":"https://github.com/scribeocr/scribe.js","commit_stats":null,"previous_names":["scribeocr/scribe.js"],"tags_count":28,"template":false,"template_full_name":null,"purl":"pkg:github/scribeocr/scribe.js","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribeocr%2Fscribe.js","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribeocr%2Fscribe.js/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribeocr%2Fscribe.js/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribeocr%2Fscribe.js/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/scribeocr","download_url":"https://codeload.github.com/scribeocr/scribe.js/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scribeocr%2Fscribe.js/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29219394,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-08T03:18:47.732Z","status":"ssl_error","status_checked_at":"2026-02-08T03:15:31.985Z","response_time":57,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["javascript","ocr","tesseract","webassembly"],"created_at":"2026-02-08T04:06:13.204Z","updated_at":"2026-02-08T04:06:14.369Z","avatar_url":"https://github.com/scribeocr.png","language":"JavaScript","readme":"# Scribe.js\nScribe.js is a JavaScript library that performs OCR and extracts text from images and PDFs.  \n\nCommon use cases:\n1. Recognize text from images.\n2. Extract text from user-uploaded `.pdf` files.\n\t1. If the `.pdf` file is already text-native, scribe.js can extract the existing text.\n\t2. If the `.pdf` file is image-native, scribe.js can recognize text using OCR.\n3. Write `.pdf` files that include a high-quality invisible text layer.\n\t1. scribe.js can insert text into an existing `.pdf` file, making it searchable.\n\nScribe.js is a library intended for developers.  End users who want to scan documents should see the officially-supported GUI at [scribeocr.com](https://scribeocr.com/) (repo [here](https://github.com/scribeocr/scribeocr)).\n\n# Setup\nInstall from `npm` by running the following:\n```sh\nnpm i scribe.js-ocr\n```\n\nScribe.js is written in JavaScript using ESM, so can be imported directly from browser or Node.js JavaScript code without a build step.\n```js\n// Import statement in browser:\nimport scribe from 'node_modules/scribe.js-ocr/scribe.js';\n// Import statement for Node.js:\nimport scribe from 'scribe.js-ocr';\n\n// Basic usage\nscribe.extractText(['https://tesseract.projectnaptha.com/img/eng_bw.png'])\n\t.then((res) =\u003e console.log(res))\n```\n\nWhen using Scribe.js in the browser, all files must be served from the same origin as the file importing Scribe.js.  This means that importing Scribe.js from a CDN will not work.  There is no UMD version.\n\n# Templates\nThe following are template repos showing how Scribe.js can be used within various frameworks/build systems.  \n\n- Browser with ESM (no build): https://github.com/scribeocr/scribe.js-example-esm-browser\n- Browser with Next.js: https://github.com/scribeocr/scribe.js-example-next.js\n- Browser with Webpack 5: https://github.com/scribeocr/scribe.js-example-webpack5\n- Browser with Vue.js v2: https://github.com/scribeocr/scribe.js-example-vue2\n\nContributions are appreciated--if you are using Scribe.js within a framework not listed below, consider making a basic repo and adding to this list with a PR, especially if non-obvious steps were required.\n\n# Scribe.js vs. Tesseract.js\nConsidering whether Scribe.js or Tesseract.js is better for your project?  Read [this article](./docs/scribe_vs_tesseract.md).\n\n# Documentation\n- [Basic Browser Examples](./examples/browser/)\n- [Basic Node.js Examples](./examples/node/)\n- [Scribe.js vs. Tesseract.js Comparison](./docs/scribe_vs_tesseract.md)\n- [API](./docs/API.md)\n\n## Projects and Examples\nThe following are examples and projects built using Scribe.js.  Additional examples can be found in the [examples](https://github.com/scribeocr/scribe.js/tree/master/examples) directory. \n\n- Projects\n   - Scribe OCR: officially supported GUI front-end for Scribe.js\n      - Site at [scribeocr.com](https://scribeocr.com/), repo at [github.com/scribeocr/scribeocr](https://github.com/scribeocr/scribeocr)\n\nIf you have a project or example repo that uses Scribe.js, feel free to add it to this list using a pull request. Examples submitted should be well documented such that new users can run them; projects should be functional and actively maintained.\n\n# Contributing\nTo work on a local copy, simply clone with `--recurse-submodules` and install.  Please run the automated tests before making a PR.\n```sh\n## Clone the repo, including recursively cloning submodules\ngit clone --recurse-submodules git@github.com:scribeocr/scribe.js.git\ncd scribe.js\n\n## Install dependencies\nnpm i\n\n## Make changes\n## [...]\n\n## Run automated tests before making PR\nnpm run test\n```\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscribeocr%2Fscribe.js","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fscribeocr%2Fscribe.js","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscribeocr%2Fscribe.js/lists"}