{"id":22127558,"url":"https://github.com/capjavert/jawa","last_synced_at":"2025-07-25T17:31:38.769Z","repository":{"id":61566893,"uuid":"548835527","full_name":"capJavert/jawa","owner":"capJavert","description":"Visual scraper interface, exports to puppeteer script which you can run anywhere.","archived":false,"fork":false,"pushed_at":"2024-05-14T11:05:47.000Z","size":1269,"stargazers_count":14,"open_issues_count":2,"forks_count":3,"subscribers_count":3,"default_branch":"master","last_synced_at":"2024-11-15T22:10:00.348Z","etag":null,"topics":["hacktoberfest","nextjs","puppeteer","react","scraper","starwars","typescript","visual-scraping","yargs"],"latest_commit_sha":null,"homepage":"https://jawa.sh","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/capJavert.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":"FUNDING.yml","license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"capJavert"}},"created_at":"2022-10-10T08:58:34.000Z","updated_at":"2024-07-18T20:29:33.000Z","dependencies_parsed_at":"2023-10-11T00:24:23.415Z","dependency_job_id":"4a82993d-5383-4525-bd32-11936e2d54b7","html_url":"https://github.com/capJavert/jawa","commit_stats":null,"previous_names":["capjavert/vscraper"],"tags_count":6,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/capJavert%2Fjawa","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/capJavert%2Fjawa/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/capJavert%2Fjawa/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/capJavert%2Fjawa/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/capJavert","download_url":"https://codeload.github.com/capJavert/jawa/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":227598549,"owners_count":17791623,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["hacktoberfest","nextjs","puppeteer","react","scraper","starwars","typescript","visual-scraping","yargs"],"created_at":"2024-12-01T17:19:47.712Z","updated_at":"2024-12-01T17:19:48.424Z","avatar_url":"https://github.com/capJavert.png","language":"TypeScript","funding_links":["https://github.com/sponsors/capJavert"],"categories":[],"sub_categories":[],"readme":"# Jawa - Visual Scraper\n\n[![npm](https://img.shields.io/npm/v/jawa)](https://www.npmjs.com/package/jawa)\n[![Chrome Web Store](https://img.shields.io/chrome-web-store/v/icjgianfpiifbdpddkadmpcegiffiglk)](https://chrome.google.com/webstore/detail/clippy/icjgianfpiifbdpddkadmpcegiffiglk)\n\n[🇭🇷 Started in Croatia](https://startedincroatia.com)\n\n![DALL·E 2022-10-17 03 53 08 (2)](https://user-images.githubusercontent.com/9803078/196301040-1f1f34b4-e983-4cd8-859b-951b7fa51068.png)\n\nVisual scraper interface, exports to puppeteer script which you can run anywhere. You can try it out here https://jawa.sh\n\nJawa allows you to visually click elements of any website and then export selectors as a config that you can run in any node environment to scrape the content when needed.\n\nThis repo consists of the:\n- web app\n- cli\n- browser extension\n\n## Web app\n\nWeb app that provides embedded browser for visually selecting elements and creating the scraper config that you can download and run through the CLI or Cloud.\n\n### Cloud scraping (Beta)\n\nIt is now supported to run your scraper config in the cloud directly from web app. Cloud scrapers use the same Jawa CLI. Currently cloud scrapers have limited availability.\n\nIf you need more usage you can check out [Jawa Pro](https://jawa.sh/pro?ref=github). \n\n## CLI\n\nSimple CLI to run configs created and exported from web app. You can run it like this:\n\n```\nnpx jawa path/to/scraper/config/file.json\n```\n\nor `npx jawa --help` to see all the options.\n\n`jawa` package now also exports `scrape` function so it can be used outside of CLI in your apps or services:\n```js\nimport { scrape } from 'jawa'\n```\n```js\nconst { scrape } = require('jawa')\n```\n\n## Browser extension\n\nBrowser extension that runs the embedded browser which powers the visual scraper interface. \n\nIt is available on:\n- [Chrome Web Store](https://chrome.google.com/webstore/detail/jawa-visual-scraper/icjgianfpiifbdpddkadmpcegiffiglk)\n- Chrome extensions also work on all Chromium based browsers like:\n    - Opera\n    - Microsoft Edge\n    - Brave\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcapjavert%2Fjawa","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcapjavert%2Fjawa","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcapjavert%2Fjawa/lists"}