{"id":13726293,"url":"https://github.com/aantron/lambdasoup","last_synced_at":"2025-10-10T07:16:13.581Z","repository":{"id":2254448,"uuid":"46012901","full_name":"aantron/lambdasoup","owner":"aantron","description":"Functional HTML scraping and rewriting with CSS in OCaml","archived":false,"fork":false,"pushed_at":"2024-11-18T19:27:20.000Z","size":598,"stargazers_count":397,"open_issues_count":7,"forks_count":31,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-05-23T02:06:42.158Z","etag":null,"topics":["css","html","ocaml","scraping","soup"],"latest_commit_sha":null,"homepage":"https://aantron.github.io/lambdasoup","language":"OCaml","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/aantron.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"docs/CONTRIBUTING.md","funding":".github/FUNDING.yml","license":"LICENSE.md","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"aantron"}},"created_at":"2015-11-11T22:09:13.000Z","updated_at":"2025-05-19T14:40:16.000Z","dependencies_parsed_at":"2023-01-13T11:44:19.340Z","dependency_job_id":"ed4db7b6-92db-4496-afe4-a50cf3642fe2","html_url":"https://github.com/aantron/lambdasoup","commit_stats":{"total_commits":145,"total_committers":12,"mean_commits":"12.083333333333334","dds":"0.13103448275862073","last_synced_commit":"4ba3b91dd681c82ce4b7dae8f54594a86da42053"},"previous_names":[],"tags_count":14,"template":false,"template_full_name":null,"purl":"pkg:github/aantron/lambdasoup","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aantron%2Flambdasoup","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aantron%2Flambdasoup/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aantron%2Flambdasoup/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aantron%2Flambdasoup/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/aantron","download_url":"https://codeload.github.com/aantron/lambdasoup/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aantron%2Flambdasoup/sbom","scorecard":{"id":158942,"data":{"date":"2025-08-11","repo":{"name":"github.com/aantron/lambdasoup","commit":"4ba3b91dd681c82ce4b7dae8f54594a86da42053"},"scorecard":{"version":"v5.2.1-40-gf6ed084d","commit":"f6ed084d17c9236477efd66e5b258b9d4cc7b389"},"score":3.7,"checks":[{"name":"Code-Review","score":3,"reason":"Found 9/30 approved changesets -- score normalized to 3","details":null,"documentation":{"short":"Determines if the project requires human code review before pull requests (aka merge requests) are merged.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#code-review"}},{"name":"Dangerous-Workflow","score":10,"reason":"no dangerous workflow patterns detected","details":null,"documentation":{"short":"Determines if the project's GitHub Action workflows avoid dangerous patterns.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#dangerous-workflow"}},{"name":"Maintained","score":0,"reason":"0 commit(s) and 0 issue activity found in the last 90 days -- score normalized to 0","details":null,"documentation":{"short":"Determines if the project is \"actively maintained\".","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#maintained"}},{"name":"Packaging","score":-1,"reason":"packaging workflow not detected","details":["Warn: no GitHub/GitLab publishing workflow detected."],"documentation":{"short":"Determines if the project is published as a package that others can easily download, install, easily update, and uninstall.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#packaging"}},{"name":"Binary-Artifacts","score":10,"reason":"no binaries found in the repo","details":null,"documentation":{"short":"Determines if the project has generated executable (binary) artifacts in the source repository.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#binary-artifacts"}},{"name":"Pinned-Dependencies","score":0,"reason":"dependency not pinned by hash detected -- score normalized to 0","details":["Warn: GitHub-owned GitHubAction not pinned by hash: .github/workflows/test.yml:16: update your workflow using https://app.stepsecurity.io/secureworkflow/aantron/lambdasoup/test.yml/master?enable=pin","Warn: third-party GitHubAction not pinned by hash: .github/workflows/test.yml:17: update your workflow using https://app.stepsecurity.io/secureworkflow/aantron/lambdasoup/test.yml/master?enable=pin","Info:   0 out of   1 GitHub-owned GitHubAction dependencies pinned","Info:   0 out of   1 third-party GitHubAction dependencies pinned"],"documentation":{"short":"Determines if the project has declared and pinned the dependencies of its build process.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#pinned-dependencies"}},{"name":"Token-Permissions","score":0,"reason":"detected GitHub workflow tokens with excessive permissions","details":["Warn: no topLevel permission defined: .github/workflows/test.yml:1","Info: no jobLevel write permissions found"],"documentation":{"short":"Determines if the project's workflows follow the principle of least privilege.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#token-permissions"}},{"name":"CII-Best-Practices","score":0,"reason":"no effort to earn an OpenSSF best practices badge detected","details":null,"documentation":{"short":"Determines if the project has an OpenSSF (formerly CII) Best Practices Badge.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#cii-best-practices"}},{"name":"Security-Policy","score":0,"reason":"security policy file not detected","details":["Warn: no security policy file detected","Warn: no security file to analyze","Warn: no security file to analyze","Warn: no security file to analyze"],"documentation":{"short":"Determines if the project has published a security policy.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#security-policy"}},{"name":"Vulnerabilities","score":10,"reason":"0 existing vulnerabilities detected","details":null,"documentation":{"short":"Determines if the project has open, known unfixed vulnerabilities.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#vulnerabilities"}},{"name":"Fuzzing","score":0,"reason":"project is not fuzzed","details":["Warn: no fuzzer integrations found"],"documentation":{"short":"Determines if the project uses fuzzing.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#fuzzing"}},{"name":"License","score":10,"reason":"license file detected","details":["Info: project has a license file: LICENSE.md:0","Info: FSF or OSI recognized license: MIT License: LICENSE.md:0"],"documentation":{"short":"Determines if the project has defined a license.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#license"}},{"name":"Signed-Releases","score":-1,"reason":"no releases found","details":null,"documentation":{"short":"Determines if the project cryptographically signs release artifacts.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#signed-releases"}},{"name":"Branch-Protection","score":0,"reason":"branch protection not enabled on development/release branches","details":["Warn: branch protection not enabled for branch 'master'"],"documentation":{"short":"Determines if the default and release branches are protected with GitHub's branch protection settings.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#branch-protection"}},{"name":"SAST","score":0,"reason":"SAST tool is not run on all commits -- score normalized to 0","details":["Warn: 0 commits out of 9 are checked with a SAST tool"],"documentation":{"short":"Determines if the project uses static code analysis.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#sast"}}]},"last_synced_at":"2025-08-16T12:47:22.205Z","repository_id":2254448,"created_at":"2025-08-16T12:47:22.205Z","updated_at":"2025-08-16T12:47:22.205Z"},"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279003168,"owners_count":26083533,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-10T02:00:06.843Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["css","html","ocaml","scraping","soup"],"created_at":"2024-08-03T01:02:58.293Z","updated_at":"2025-10-10T07:16:13.565Z","avatar_url":"https://github.com/aantron.png","language":"OCaml","readme":"\u003ch1 align=\"center\"\u003e\n\u003cimg alt=\"Lambda Soup\" src=\"https://raw.githubusercontent.com/aantron/lambdasoup/master/docs/logo.png\" width=\"250\"\u003e\n\u003c/img\u003e\n\u003cbr\u003e\nLambda Soup\n\u003c/h1\u003e\n\n[coveralls]:     https://coveralls.io/github/aantron/lambdasoup?branch=master\n[coveralls-img]: https://img.shields.io/coveralls/aantron/lambdasoup/master.svg\n\n**Lambda Soup** is a functional HTML scraping and manipulation library for OCaml\naimed at being easy to use.\n\n\u003cbr\u003e\u003cbr\u003e\n\n\u003cp align=\"center\"\u003e\n\u003cimg alt=\"Lambda Soup usage example\" src=\"https://raw.githubusercontent.com/aantron/lambdasoup/master/docs/sample.gif\"\u003e\n\u003c/img\u003e\n\u003c/p\u003e\n\n[sample]: https://raw.githubusercontent.com/aantron/lambdasoup/master/docs/sample.gif\n\n\u003cbr\u003e\u003cbr\u003e\n\nLambda Soup is *simple*. It provides a set of\n[elementary traversals][traversals] for getting from node to node, familiar\nfunctional [combinators][combinators] such as `filter`, `map`, and `fold`, and\nsupport for all CSS selectors that still make sense when not running in a\nbrowser (and a few obvious [extensions][extracss] on top of that).\n\nHere is a trivial self-contained example:\n\n```ocaml\n(parse \"\u003cp class='Hello'\u003eWorld!\u003c/p\u003e\") $ \".Hello\" |\u003e R.leaf_text;;\n- : string = \"World!\"\n```\n\nAnd, a mutation:\n\n```ocaml\nlet soup = parse \"\u003cp class='Hello'\u003eWorld!\u003c/p\u003e\" in\nwrap (soup $ \".Hello\" |\u003e R.child) (create_element \"strong\");\nsoup |\u003e to_string;;\n- : string = \"\u003cp class=\\\"Hello\\\"\u003e\u003cstrong\u003eWorld!\u003c/strong\u003e\u003c/p\u003e\"\n```\n\nFor some more examples, see the Lambda Soup [postprocessor][postprocess] that\nruns on Lambda Soup's own [documentation][docs] after it is generated by\n`ocamldoc`.\n\nThe library is [tested][tests] thoroughly.\n\nLambda Soup is based on [Markup.ml][markupml]. As a consequence, it resolves\nentity references, detects character encodings automatically, and converts\neverything to UTF-8. And, you can use Lambda Soup on XML, by\n[parsing][parse_xml] the XML with Markup.ml and [feeding][from_signals] the\nsignals to Lambda Soup.\n\n[parse_xml]:    http://aantron.github.io/markup.ml/#VALparse_xml\n[from_signals]: http://aantron.github.io/lambdasoup/#2_Parsingsignals\n\n\u003cbr/\u003e\n\n## Installing\n\n    opam install lambdasoup\n\n[contributing-install]: https://github.com/aantron/lambdasoup/blob/master/docs/CONTRIBUTING.md#developing\n\n\u003cbr/\u003e\n\n## Starting from scratch\n\nTo use Lambda Soup interactively as in the GIF at the top of this README, you\nneed to have done something like this:\n\n```sh\nyour-package-manager install ocaml opam\nopam init\neval `opam config env`          # Or restart your shell\nopam install lambdasoup\n```\n\nand make sure your `~/.ocamlinit` file looks something like this:\n\n```ocaml\nlet () =\n  try Topdirs.dir_directory (Sys.getenv \"OCAML_TOPLEVEL_PATH\")\n  with Not_found -\u003e ()\n;;\n\n#use \"topfind\";;\n```\n\nThen, run `ocaml -short-paths` to start the top-level, and scrape away!\n\n\u003cbr/\u003e\n\n## Depending\n\nLambda Soup uses semantic versioning, but is currently in `0.x.x`. For now, the\nminor version number will be incremented on breaking changes. So, to give\nyourself a chance to review the changelog before your code breaks, put the\nfollowing constraint on Lambda Soup: `lambdasoup {\u003c \"0.7.0\"}`.\n\n\u003cbr/\u003e\n\n## Documentation\n\nLambda Soup's interface consists of one module `Soup`, whose signature is\ndocumented [here][docs].\n\n\u003cbr/\u003e\n\n## Developing\n\nSee [`CONTRIBUTING`][contributing]. All feedback is welcome – open an issue on\nGitHub, or send me an email at [antonbachin@yahoo.com][email]. If you find\nyourself repeatedly writing the same helper on top of Lambda Soup's functions,\nperhaps we should add it to Lambda Soup.\n\n\u003cbr/\u003e\n\n## History\n\nLambda Soup was originally written to answer a [Stack Overflow question][so] in\nNovember 2015.\n\n[docs]:         http://aantron.github.io/lambdasoup\n[postprocess]:  https://github.com/aantron/lambdasoup/blob/master/docs/postprocess.ml\n[tests]:        https://github.com/aantron/lambdasoup/blob/master/test/test.ml\n[contributing]: https://github.com/aantron/lambdasoup/blob/master/docs/CONTRIBUTING.md\n[email]:        mailto:antonbachin@yahoo.com\n[extracss]:     http://aantron.github.io/lambdasoup#VALselect\n[traversals]:   http://aantron.github.io/lambdasoup#2_Elementarytraversals\n[combinators]:  http://aantron.github.io/lambdasoup#2_Combinators\n[markupml]:     https://github.com/aantron/markup.ml\n[so]: https://stackoverflow.com/questions/33489575/parsing-html-with-ocaml\n","funding_links":["https://github.com/sponsors/aantron"],"categories":["OCaml"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faantron%2Flambdasoup","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Faantron%2Flambdasoup","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faantron%2Flambdasoup/lists"}