Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ajanibilby/wiki-thesaurus
Attempt to generate thesaurus results based on Wikipedia as a dataset
https://github.com/ajanibilby/wiki-thesaurus
Last synced: about 1 month ago
JSON representation
Attempt to generate thesaurus results based on Wikipedia as a dataset
- Host: GitHub
- URL: https://github.com/ajanibilby/wiki-thesaurus
- Owner: AjaniBilby
- License: mit
- Created: 2023-04-04T05:52:37.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-07-23T02:28:28.000Z (6 months ago)
- Last Synced: 2024-10-15T02:50:21.535Z (3 months ago)
- Language: Python
- Size: 18.6 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: readme.md
- License: LICENSE
Awesome Lists containing this project
README
# Wiki Thesaurus
This CLI tool attempts to get similar words to any given input based on the graph structure of how Wikipedia pages are linked together.
## Installation
### Using pre-built
Clone this repository then extract the contents of [simplewiki.rar](https://github.com/AjaniBilby/wiki-thesaurus/releases/tag/v0.0.0) into the `./data/` folder.
### Building from source
Downloads and depacks the dataset of all english wikipedia articles
```
python download.py
```Generate the article graph as a database
```
python ingest.py
```## Use
Start-up
```
python search.py
```Enter any text to perform a search, and start with a `.` to perform a command
| Command | Action
:-|:-
`.next` | Will show the next `x` results from the previous search
`.algo xxx` | Changes the search algorithm used, with the name corresponding to any algorithm in the folder `./algorithm/` (i.e. `.algo intersection` )
`.limit xxx` | Will change the number of results shown per search based on the number used in place of the `xxx` (i.e. `.limit 20`)
`.exit` | Ends the program