{"id":23265663,"url":"https://github.com/html-extract/hext-on-websockets","last_synced_at":"2025-07-10T22:41:43.912Z","repository":{"id":120552379,"uuid":"176572254","full_name":"html-extract/hext-on-websockets","owner":"html-extract","description":"Websocket Server for Hext. Hext is a domain-specific language for extracting structured data from HTML documents.","archived":false,"fork":false,"pushed_at":"2024-06-04T12:43:56.000Z","size":39,"stargazers_count":3,"open_issues_count":0,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-06-04T14:20:36.601Z","etag":null,"topics":["async","beast","boost","boost-asio","cpp","cpp17","seccomp","ssl","websockets"],"latest_commit_sha":null,"homepage":"https://hext.thomastrapp.com","language":"C++","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/html-extract.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null}},"created_at":"2019-03-19T18:12:08.000Z","updated_at":"2024-06-04T12:43:59.000Z","dependencies_parsed_at":"2024-03-23T20:22:42.301Z","dependency_job_id":"e58b6378-e723-4b24-b3d9-d49cb580b9af","html_url":"https://github.com/html-extract/hext-on-websockets","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/html-extract%2Fhext-on-websockets","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/html-extract%2Fhext-on-websockets/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/html-extract%2Fhext-on-websockets/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/html-extract%2Fhext-on-websockets/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/html-extract","download_url":"https://codeload.github.com/html-extract/hext-on-websockets/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":230457459,"owners_count":18229028,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["async","beast","boost","boost-asio","cpp","cpp17","seccomp","ssl","websockets"],"created_at":"2024-12-19T15:31:11.362Z","updated_at":"2024-12-19T15:31:12.044Z","avatar_url":"https://github.com/html-extract.png","language":"C++","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Hext on Websockets\n\n[Hext](https://hext.thomastrapp.com/) is a domain-specific language for extracting structured data from HTML documents.\n\nThis is the websocket server behind the \"[Try Hext in your Browser!](https://hext.thomastrapp.com)\" section.\n\n\n## Dependencies\n\n* CMake\n* OpenSSL\n* Boost\n* RapidJSON\n* [Hext](https://github.com/html-extract/hext)\n* Optional: libseccomp\n* Testing: websocat, jq\n\n\n## Build\n\n```\ncd build\ncmake -DCMAKE_BUILD_TYPE=Release .. \u0026\u0026 make -j8\nls -lh ./hextws\n```\n\nBuild with seccomp enabled (see the list of allowed system calls in [SetupSeccomp.cpp](ws/SetupSeccomp.cpp), YMMV):\n\n```\ncd build\ncmake -DCMAKE_BUILD_TYPE=Release -DHEXTWS_SECCOMP=On .. \u0026\u0026 make -j8\nls -lh ./hextws\n```\n\n\n## Usage\n\n```\n./hextws \u003caddress\u003e \u003cport\u003e \u003cnum-threads\u003e \u003cmax-searches\u003e \\\n    \u003cpath-to-cert\u003e \u003cpath-to-key\u003e \u003cpath-to-dhparam\u003e\n```\n\n\n## License\n\nAll source code of this project is released under the Apache License v2.0.\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhtml-extract%2Fhext-on-websockets","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhtml-extract%2Fhext-on-websockets","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhtml-extract%2Fhext-on-websockets/lists"}