https://github.com/domarps/spatial_hashing
Simultaneous searching with text + longitudinal co-ordinates.
https://github.com/domarps/spatial_hashing
Last synced: about 2 months ago
JSON representation
Simultaneous searching with text + longitudinal co-ordinates.
- Host: GitHub
- URL: https://github.com/domarps/spatial_hashing
- Owner: domarps
- License: mit
- Created: 2015-05-06T09:20:37.000Z (about 10 years ago)
- Default Branch: master
- Last Pushed: 2015-06-06T09:49:59.000Z (almost 10 years ago)
- Last Synced: 2025-01-30T23:52:16.338Z (4 months ago)
- Language: Python
- Homepage:
- Size: 164 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Spatial-Hashing
---
Encoding geo-locations as spatial-indices for simultaneous text and location searching1. Read data from CSV
* Each row is article text (string), X coord (double), Y coord (double)
* Store rows as objects2. Create mapping from words to articles
* Words are tokens from article text, as well as strings corresponding to location E.g., (1.23,4.56) yields strings “1_4”, “12_45”, “123_456”
* Use stop words, stemming rules (code for Porter stemmer is readily available)
* Mapping should be in the form of skip lists from words to list of articles (“posting lists”)3. Implement search engine: inputs is location and words
* Return articles that are near that location and contain those words
* Key computation: intersect posting lists