Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/divithraju/divith-raju-searchengine-wikipedia
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
https://github.com/divithraju/divith-raju-searchengine-wikipedia
algorithms data dataengineering inverted-index linux merge-sort nlp project project-repository python3 serchengine software-engineering ubuntu wikipedia
Last synced: 5 days ago
JSON representation
search engine optimizationA complete search engine experience built on top of 75 GB Wikipedia corpus with subsecond latency for searches. Results contain wiki pages ordered by TF/IDF relevance based on given search word/s. From an optimized code to the K-Way mergesort algorithm, this project addresses latency, indexing, and big data challenges.
- Host: GitHub
- URL: https://github.com/divithraju/divith-raju-searchengine-wikipedia
- Owner: divithraju
- Created: 2022-12-24T21:33:21.000Z (about 2 years ago)
- Default Branch: main
- Last Pushed: 2022-12-24T22:14:43.000Z (about 2 years ago)
- Last Synced: 2024-11-08T06:43:21.967Z (about 2 months ago)
- Topics: algorithms, data, dataengineering, inverted-index, linux, merge-sort, nlp, project, project-repository, python3, serchengine, software-engineering, ubuntu, wikipedia
- Language: Python
- Homepage: https://linktr.ee/divithraju
- Size: 16.6 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# search-engine
search engine optimization