https://github.com/vkitchen/pysearch
Proof of concept simple search engine in Python3
- Host: GitHub
- URL: https://github.com/vkitchen/pysearch
- Owner: vkitchen
- License: ISC
- Created: 2019-05-07T04:47:30.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2024-04-04T01:56:22.000Z (9 months ago)
- Last Synced: 2024-10-29T16:59:54.010Z (about 2 months ago)
- Language: Python
- Size: 3.91 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# pysearch
## About
A simple search engine written in Python, built to test whether a high-level language could search the WSJ collection in under a second.
## Usage
Make sure `wsj.xml` is in the same directory as `index.py` and `search.py`, then run `index.py` to build the index. `search.py` reads one line of search terms from standard input, runs a query for those terms, prints the results, and exits.
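
For readers unfamiliar with the index-then-search workflow, here is a minimal sketch of the general idea using an in-memory inverted index. It is not the code in `index.py` or `search.py`: the function names, the sample documents, and the choice to require every query term (AND semantics) are all assumptions made for illustration, and it does not parse `wsj.xml`.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return the sorted ids of documents containing every query term."""
    terms = tokenize(query)
    if not terms:
        return []
    # index.get avoids creating empty entries for unknown terms.
    postings = [index.get(term, set()) for term in terms]
    return sorted(set.intersection(*postings))


# Hypothetical sample documents, not taken from wsj.xml.
docs = {
    "doc1": "Stocks rallied as oil prices fell",
    "doc2": "Oil prices rose on supply fears",
    "doc3": "Bond markets were quiet",
}
index = build_index(docs)
print(search(index, "oil prices"))
```

Building the index is the expensive step because every document is scanned once; a query then only touches the posting sets for its own terms, which is why searching can be much faster than indexing.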
## Todo
* TF-IDF ranking
* Speed improvements? (currently 100 s to index and 0.25 s to search)
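
As a rough illustration of the ranking item above, the sketch below scores documents with one common TF-IDF variant: raw term frequency multiplied by `log(N / df)`. This is an assumption about what such ranking could look like, not the author's planned implementation; the function name `tf_idf_scores` and the sample data are hypothetical.

```python
import math
from collections import Counter


def tf_idf_scores(docs, query_terms):
    """Rank documents by the sum of tf * idf over the query terms.

    docs maps a document id to its list of tokens.
    """
    n_docs = len(docs)
    term_counts = {doc_id: Counter(tokens) for doc_id, tokens in docs.items()}
    scores = {}
    for term in query_terms:
        # Document frequency: number of documents containing the term.
        df = sum(1 for counts in term_counts.values() if term in counts)
        if df == 0:
            continue  # A term that appears nowhere contributes nothing.
        idf = math.log(n_docs / df)
        for doc_id, counts in term_counts.items():
            if term in counts:
                scores[doc_id] = scores.get(doc_id, 0.0) + counts[term] * idf
    # Highest score first.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical tokenized documents.
sample_docs = {
    "a": ["oil", "oil", "prices"],
    "b": ["oil", "bonds"],
    "c": ["bonds"],
}
for doc_id, score in tf_idf_scores(sample_docs, ["oil"]):
    print(doc_id, round(score, 3))
```

With this weighting, a term that occurs in every document gets an IDF of zero, so very common words stop influencing the order of results.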