{"id":35072190,"url":"https://github.com/kkiapay/lulz-scraping","last_synced_at":"2026-04-13T20:32:48.792Z","repository":{"id":43368203,"uuid":"187387597","full_name":"kkiapay/lulz-scraping","owner":"kkiapay","description":"Scraping the data you want from a website by specifying your output in parser.yml","archived":false,"fork":false,"pushed_at":"2023-02-15T21:37:28.000Z","size":46,"stargazers_count":4,"open_issues_count":3,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2023-03-11T10:02:46.043Z","etag":null,"topics":["json","parser","regex","scraping","selector","yaml"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kkiapay.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null}},"created_at":"2019-05-18T17:37:48.000Z","updated_at":"2022-03-05T08:29:04.000Z","dependencies_parsed_at":"2023-01-24T12:00:14.612Z","dependency_job_id":null,"html_url":"https://github.com/kkiapay/lulz-scraping","commit_stats":null,"previous_names":[],"tags_count":null,"template":null,"template_full_name":null,"purl":"pkg:github/kkiapay/lulz-scraping","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kkiapay%2Flulz-scraping","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kkiapay%2Flulz-scraping/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kkiapay%2Flulz-scraping/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kkiapay%2Flulz-scraping/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kkiapay","download_url":"https://codeload.github.com/kkiapay/lulz-scraping/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kkiapay%2Flulz-scraping/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31770720,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-13T20:17:16.280Z","status":"ssl_error","status_checked_at":"2026-04-13T20:17:08.216Z","response_time":93,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["json","parser","regex","scraping","selector","yaml"],"created_at":"2025-12-27T11:57:23.802Z","updated_at":"2026-04-13T20:32:48.783Z","avatar_url":"https://github.com/kkiapay.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# lulz-scraping\n\n## Example\n\nLet's take the following `HTML example`:\n\n```html\n\u003ctable width=\"100%\" id=\"listeEntreprise\" border=\"0\" class=\"zone\" style=\"width:100%;margin-top:0px;margin-bottom:10px\"\u003e\n    \u003cthead\u003e\n        \u003ctr style=\"background:#7fb33e; color:#FFFFFF; font-style:'Helvetica' font-size:12px\" valign=\"middle\"\u003e\n            \n            \u003cth style=\"margin-left:0px; width:10px; margin-right:-10px\"\u003eDate\u003c/th\u003e\n            \u003cth style=\"margin-left:0px, width:20%\"\u003eN° RCCM\u003c/th\u003e\n            \u003cth style=\"margin-left:0px, width:60%\"\u003eRaison Sociale\u003c/th\u003e\n            \u003cth style=\"margin-left:0px, width:20%\"\u003eStatut Juridique\u003c/th\u003e\t\n        \u003c/tr\u003e\n    \u003c/thead\u003e\n    \u003ctbody\u003e\n        \u003ctr style=\"font-size:12px;\" id=\"contenu\"\u003e\n            \u003ctd style=\"padding:-1px-1px-1px-1px; margin-left:5px; width:10px; margin-right:5px; margin-right:-10px\"\u003e\n                08/05/2019\n            \u003c/td\u003e\n            \u003ctd style=\"width:250px; height:10px\"\u003eCI-ABJ-2019-B-10428\u003c/td\u003e\n            \u003ctd style=\"text-align:left; width:500px\"\u003e\n                \u003ca href=\"https://www.cepici.ci/rapports_generes/pdf/65684.pdf\" target=\"_blank\"\u003e\n                    AMADEUS ABIDJANAIS\n                \u003c/a\u003e\n            \u003c/td\u003e\n            \u003ctd style=\"width:200px; height:10px; margin-left:0px\"\u003eSARL U\u003c/td\u003e\n        \u003c/tr\u003e\n    \u003c/tbody\u003e\n\u003c/table\u003e\n```\n\nYou just have to describe your `parser.yml`:\n\n```yml\nsite:\n  - cepici\ncepici:\n  url: https://cepici.ci/views/annonces_legales/Affichage_ajax/SearchRS.php?countInit=0\n  request_type: GET\n  parameters:\n    - search_rs\n  parser:\n    name: tr#contenu a\n    legal_form: tr#contenu \u003e td:nth-child(4)\n    rccm_number: tr#contenu \u003e td:nth-child(2)\n    date_of_creation: tr#contenu \u003e td:nth-child(1)\n```\nOutput response you will get with that `parser`\n\n```json\n[\n    {\n        \"name\": \"AMADEUS ABIDJANAIS\",\n        \"legal_form\": \"SARL U\",\n        \"rccm_number\": \"CI-ABJ-2019-B-10428\",\n        \"date_of_creation\": \"08/05/2019\"\n    }\n]\n```\n## Contributing 🤝\n\u003e Feel free to follow the procedure to make it even more awesome!\n\n1. Create an `issue` so we `get the discussion started`\n2. Fork it!\n3. Create your feature branch: `git checkout -b my-new-feature`\n4. Commit your changes: `git commit -am 'Add some feature'`\n5. Push to the branch: `git push origin my-new-feature`\n6. Submit a pull request","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkkiapay%2Flulz-scraping","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkkiapay%2Flulz-scraping","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkkiapay%2Flulz-scraping/lists"}