{"id":27766222,"url":"https://github.com/kepano/defuddle-cli","last_synced_at":"2025-04-29T17:19:22.270Z","repository":{"id":284241912,"uuid":"954183083","full_name":"kepano/defuddle-cli","owner":"kepano","description":"Command line utility to extract clean html, markdown and metadata from web pages.","archived":false,"fork":false,"pushed_at":"2025-04-16T20:34:27.000Z","size":68,"stargazers_count":243,"open_issues_count":0,"forks_count":8,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-29T17:19:15.471Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"JavaScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/kepano.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-03-24T17:28:52.000Z","updated_at":"2025-04-27T10:47:59.000Z","dependencies_parsed_at":"2025-04-10T20:39:20.080Z","dependency_job_id":null,"html_url":"https://github.com/kepano/defuddle-cli","commit_stats":null,"previous_names":["kepano/defuddle-cli"],"tags_count":11,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kepano%2Fdefuddle-cli","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kepano%2Fdefuddle-cli/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kepano%2Fdefuddle-cli/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/kepano%2Fdefuddle-cli/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/kepano","download_url":"
https://codeload.github.com/kepano/defuddle-cli/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":251546717,"owners_count":21606892,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-04-29T17:19:21.168Z","updated_at":"2025-04-29T17:19:22.093Z","avatar_url":"https://github.com/kepano.png","language":"JavaScript","readme":"# Defuddle CLI\n\nCommand line interface for [Defuddle](https://github.com/kepano/defuddle). Extract clean HTML or Markdown from pages.\n\n## Installation\n\n```bash\nnpm install -g defuddle-cli\n```\n\n## Usage\n\n```bash\ndefuddle parse \u003csource\u003e [options]\n```\n\n### Arguments\n\n- `source`: HTML file path or URL to parse\n\n### Options\n\n- `-o, --output \u003cfile\u003e`: Output file path (default: stdout)\n- `-m, --markdown, --md`: Convert content to markdown\n- `-j, --json`: Output as JSON with both HTML and markdown content\n- `-p, --property \u003cname\u003e`: Extract a specific property (e.g., title, description, domain)\n- `--debug`: Enable debug mode\n- `-h, --help`: Display help for command\n\n### Examples\n\nParse a local HTML file (outputs HTML):\n```bash\ndefuddle parse article.html\n```\n\nParse a URL and convert to markdown:\n```bash\ndefuddle parse https://example.com/article --md\n```\n\nParse and get the full JSON response from Defuddle:\n```bash\ndefuddle parse article.html --json\n```\n\nSave markdown output to a file:\n```bash\ndefuddle parse article.html --md -o output.md\n```\n\nExtract specific properties:\n```bash\n# Get just the 
title\ndefuddle parse article.html --property title\n\n# Get the description\ndefuddle parse article.html -p description\n\n# Get the domain\ndefuddle parse article.html --property domain\n```\n\n## Development\n\n```bash\n# Install dependencies\nnpm install\n\n# Build\nnpm run build\n\n# Run in development mode\nnpm run dev\n```\n","funding_links":[],"categories":["JavaScript","Network Information Services"],"sub_categories":["Web Crawlers"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkepano%2Fdefuddle-cli","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fkepano%2Fdefuddle-cli","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fkepano%2Fdefuddle-cli/lists"}