An open API service indexing awesome lists of open source software.

https://github.com/davidulloa6310/search-engine

Search engine implemented in Golang using Postgres for storing documents and computing TF-IDF values
https://github.com/davidulloa6310/search-engine

go golang nlp postgres tfidf

Last synced: 9 months ago
JSON representation

Search engine implemented in Golang using Postgres for storing documents and computing TF-IDF values

Awesome Lists containing this project

README

          

# Search-Engine (Golang)
This project implements a simple search engine in Go using TF-IDF and Vector similarity search with Postgres. The initial implementation included using Cassandra as the main database, but was switched to Postgres for flexibility. Learned:

* Golang project structure and langauge details
* Cassandra database constraints / requirements
* Information Retrieval from scratch, including web crawling, HTML parsing, TF-IDF formulation.

# What's next?
Next is to...
* use a local LLM model to create document embeddings to further improve the accuracy of the engine
* use a graph database like Neo4j to implement ranking algorithms to supplement vector similarity and TF-IDF
* implement a frontend that allows users to make search queries