{"id":13826160,"url":"https://github.com/owainlewis/falkor","last_synced_at":"2025-04-14T10:22:30.308Z","repository":{"id":33728599,"uuid":"37382926","full_name":"owainlewis/falkor","owner":"owainlewis","description":"Open Source web scraping API. Falkor turns web pages into queryable JSON","archived":false,"fork":false,"pushed_at":"2016-02-12T20:40:43.000Z","size":22,"stargazers_count":188,"open_issues_count":1,"forks_count":7,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-03-27T23:33:05.774Z","etag":null,"topics":["webscraping","webscrapper"],"latest_commit_sha":null,"homepage":"","language":"Clojure","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"epl-1.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/owainlewis.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2015-06-13T18:27:42.000Z","updated_at":"2025-02-23T11:52:05.000Z","dependencies_parsed_at":"2022-09-13T18:00:56.516Z","dependency_job_id":null,"html_url":"https://github.com/owainlewis/falkor","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/owainlewis%2Ffalkor","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/owainlewis%2Ffalkor/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/owainlewis%2Ffalkor/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/owainlewis%2Ffalkor/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/owainlewis","download_url":"https://codeload.github.com/owainlewis/falkor/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":248860184,"owners_count":21173381,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["webscraping","webscrapper"],"created_at":"2024-08-04T09:01:33.093Z","updated_at":"2025-04-14T10:22:30.280Z","avatar_url":"https://github.com/owainlewis.png","language":"Clojure","funding_links":[],"categories":["Clojure"],"sub_categories":[],"readme":"# Falkor\n\nA web service for turning HTML pages into traversable JSON documents\n\nVery early stage development. If you have any feature requests just create an issue on the project\n\n## Getting started\n\nRunning the server locally\n\n```\nlein uberjar\ndocker build -t falkor .\ndocker run -t falkor\n\n# Visit http://localhost:5000\n```\n\n## Comming soon\n\n+ Better error handling\n+ CORS\n+ Query filtering (return only certain attributes)\n+ Fetching multiple elements in a single request ( e.g [h1 \u003e a, .subtitle] )\n\n## Usage\n\nGet all the title links from the Reddit.com home page\n\nhttps://falkor-api.herokuapp.com/api/query?url=http://reddit.com\u0026query=a.title\n\nGrab all the news stories from Digg.com\n\nhttps://falkor-api.herokuapp.com/api/query?url=http://digg.com\u0026query=.story-title%20a\n\nExtract all the images from Digg.com\n\nhttps://falkor-api.herokuapp.com/api/query?url=http://digg.com\u0026query=img[src]\n\n## TODO\n\nFilters to remove some of the attribute cruft\n\nFor example if we just want to extract the text for an element and ignore the other attributes\n\n```\n\u0026filter=[text]\n```\n\n## License\n\nCopyright © 2015 Forward Digital Limited\n\nDistributed under the Eclipse Public License either version 1.0 or (at\nyour option) any later version.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fowainlewis%2Ffalkor","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fowainlewis%2Ffalkor","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fowainlewis%2Ffalkor/lists"}