{"id":19703449,"url":"https://github.com/levysoft/rss-finder","last_synced_at":"2025-10-07T15:54:58.341Z","repository":{"id":217177650,"uuid":"743244895","full_name":"levysoft/RSS-Finder","owner":"levysoft","description":"This Python script is designed to find RSS or Atom feeds on a given website","archived":false,"fork":false,"pushed_at":"2024-01-14T19:13:50.000Z","size":9,"stargazers_count":4,"open_issues_count":0,"forks_count":2,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-31T09:37:46.343Z","etag":null,"topics":["atom","feed","feedparser","html5lib","open-source","requests-library","rss","web-scraping"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/levysoft.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null}},"created_at":"2024-01-14T18:57:41.000Z","updated_at":"2024-11-24T16:44:48.000Z","dependencies_parsed_at":"2024-01-14T23:27:59.879Z","dependency_job_id":"beeb7457-71ff-4e1d-a2f9-bd27ec83b660","html_url":"https://github.com/levysoft/RSS-Finder","commit_stats":null,"previous_names":["levysoft/rss-finder"],"tags_count":1,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/levysoft%2FRSS-Finder","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/levysoft%2FRSS-Finder/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/levysoft%2FRSS-Finder/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/levysoft%2FRSS-Finder/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/levysoft","download_url":"https://codeload.github.com/levysoft/RSS-Finder/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252866124,"owners_count":21816397,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["atom","feed","feedparser","html5lib","open-source","requests-library","rss","web-scraping"],"created_at":"2024-11-11T21:17:53.826Z","updated_at":"2025-10-07T15:54:53.322Z","avatar_url":"https://github.com/levysoft.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# RSS Finder\n\n## Description\nThis Python script (`rss_finder.py`) is designed to find RSS or Atom feeds on a given website. It tries multiple strategies, including looking for `\u003clink\u003e` tags in the HTML head that point to a feed, as well as trying common feed URL patterns.\n\n## Requirements\n- Python 3\n- Libraries: `urllib.parse`, `html5lib`, `feedparser`, `requests`. You can install these libraries using pip:\n  ```bash\n  pip install html5lib feedparser requests\n\n  or install dependencies:\n\n  `pip install -r requirements.txt`\n\n## Installation\nTo set up the development environment:\n\n1. Clone the repository:\n\n   `git clone https://github.com/levysoft/RSS-Finder`\n\n2. Enter the project directory:\n\n   `cd RSS-Finder`\n\n3. Create a Python virtual environment:\n\n   `python3 -m venv venv`\n\n4. Activate the virtual environment:\n\n   - On Windows:\n     ```\n     .\\venv\\Scripts\\activate\n     ```\n   - On Unix or MacOS:\n     ```\n     source venv/bin/activate\n     ```\n5. Install dependencies:\n\n   `pip install -r requirements.txt`\n\n## Usage\nTo use the script, run it with a single argument: the URL of the website you want to find the feed for. For example:\n\n`python3 rss_finder.py https://example.com`\n\n## How it Works\n\nThe script makes a request to the given URL and parses the HTML response. It then searches for \u003clink\u003e elements that point to Atom or RSS feeds. If none are found, it tries appending common feed paths to the base URL to see if any valid feeds exist.\n\n## Attribution\nThis script is inspired by and based on the autodiscovery functionality found in the RSS aggregator Temboz, specifically in the autodiscovery module: [Temboz Autodiscovery Module](https://github.com/fazalmajid/temboz/blob/master/tembozapp/autodiscovery.py).\n\n## Author\nAntonio Troise\n\n## License\nThis project is released under the MIT License. See the LICENSE file for more details.\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flevysoft%2Frss-finder","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flevysoft%2Frss-finder","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flevysoft%2Frss-finder/lists"}