https://github.com/fuzzy-search/qa-dataset-generator
https://github.com/fuzzy-search/qa-dataset-generator
Last synced: 9 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/fuzzy-search/qa-dataset-generator
- Owner: Fuzzy-Search
- License: gpl-3.0
- Created: 2023-11-01T16:36:48.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2023-11-04T20:24:34.000Z (over 2 years ago)
- Last Synced: 2025-01-21T21:07:32.064Z (over 1 year ago)
- Language: Python
- Size: 25.4 KB
- Stars: 1
- Watchers: 0
- Forks: 1
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Welcome to qa-dataset-generator
Here is the dataset we are using to fine-tune our chat-with-page local LLM. We create our dataset by gpt-3.5, covering following topics. Feel free to revise and add up new topics.
## Topics for fine-tune
We currently build our dataset by the following topics. Feel free to add.
['technology', 'politics', 'science', 'health', 'environment', 'business', 'education', 'travel', 'entertainment', 'sports']