https://github.com/litui/rss-rag-ingest
RSS RAG Ingest Tool for Open-WebUI
https://github.com/litui/rss-rag-ingest
Last synced: 11 months ago
JSON representation
RSS RAG Ingest Tool for Open-WebUI
- Host: GitHub
- URL: https://github.com/litui/rss-rag-ingest
- Owner: litui
- Created: 2025-02-05T22:23:57.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-06T03:09:27.000Z (over 1 year ago)
- Last Synced: 2025-06-27T00:11:50.688Z (11 months ago)
- Language: Go
- Size: 358 KB
- Stars: 3
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# RSS RAG Ingest Tool

This is a tool for pulling content from RSS feeds and submitting articles to an Open-WebUI Knowledge Base.
Those articles can then be queried. The quality of information produced is entirely dependent on your embedding model and the model you choose to query.
## Installation/Building
This installation makes use of CGO and therefore needs a C build environment available. Check out `build-crosscompile.sh` for an example of how to build for a different platform from the one you're using.
Further details on dependencies and prerequisites can be found in the README at https://github.com/mattn/go-sqlite3.
## Configuration
The config file is hard-coded for the moment to be at `./config.yml`. Copy `config-example.yml` to `config.yml` and modify the configuration to suit your needs.
The `rss` items are, I hope, pretty straightforward but I'll spell out schema here:
* `id`: lowercase, no spaces ID of the rss feed for purposes of tracking in the database. Do not change this after initial setup or you'll get repeat file uploads to Open-WebUI.
* `name`: plaintext name for the feed as it appears in the uploaded filename. No special characters allowed.
* `url`: Direct URL for the RSS feed.
* `data_in_link`: This is a boolean value that tells the tool whether to follow the link in the RSS. If `false` the tool will generate a markdown file containing details about the RSS entry and submit that (useful for microblogs) instead of following the link.
* `author_override` (optional): If entered, this will override the value in the RSS author tags. Useful if the author tags are empty and need to be attributed.
* `html_to_markdown`: Using [jaytaylor](https://github.com/jaytaylor)'s amazing [html2text](https://github.com/jaytaylor/html2text) library, this setting converts an html document to nicely formatted markdown.
* `owui_knowledge_base`: The ID of your knowledge base in Open-WebUI. This is the UUID string as it appears in the URL, not the name.