https://github.com/kreeben/resin

Vector space index based search engine that's available as a HTTP service or as an embedded library.
https://github.com/kreeben/resin

information-retrieval language-model machine-learning nlu nlu-engine resin search search-algorithms search-engine vector-space vector-space-model

Last synced: 3 months ago
JSON representation

Vector space index based search engine that's available as a HTTP service or as an embedded library.

Host: GitHub
URL: https://github.com/kreeben/resin
Owner: kreeben
License: mit
Created: 2016-03-14T08:01:57.000Z (over 9 years ago)
Default Branch: master
Last Pushed: 2024-05-14T09:51:16.000Z (about 1 year ago)
Last Synced: 2024-05-29T05:12:29.983Z (about 1 year ago)
Topics: information-retrieval, language-model, machine-learning, nlu, nlu-engine, resin, search, search-algorithms, search-engine, vector-space, vector-space-model
Language: C#
Homepage:
Size: 63.4 MB
Stars: 564
Watchers: 22
Forks: 39
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-dotnet-core - resin - 16-bit wide vector space search engine with HTTP API and pluggable read/write pipelines. (Frameworks, Libraries and Tools / Application Frameworks)
fucking-awesome-dotnet-core - resin - 16-bit wide vector space search engine with HTTP API and pluggable read/write pipelines. (Frameworks, Libraries and Tools / Application Frameworks)
awesome-dotnet-core - resin - 16-bit wide vector space search engine with HTTP API and pluggable read/write pipelines. (Frameworks, Libraries and Tools / Application Frameworks)
awesome-dotnet-core - resin - 面向文档的搜索引擎，具有列索引，多重集合查询，基于JSON的查询语言和HTTP API。 (框架, 库和工具 / 应用程序框架)

README

        # ⍼ Resin Search Engine

Overview | [How to install](https://github.com/kreeben/resin/blob/master/INSTALL.md) | [User guide](https://github.com/kreeben/resin/blob/master/USER-GUIDE.md) 

## Resin is a remote HTTP search engine and an embedded library

Resin is a vector space index based search engine that's available as a HTTP service or as an embedded library. 

### How to use

#### Write a document remotely

HTTP POST `[host]/write?collection=[collection]`  

(e.g. http://localhost/write?collection=mycollection)  

Content-Type: application/json  

```

[

	{

		"field1": "value1",

		"field2": "value2"

	}

]

```

#### Write a document locally

```

using (var database = new DocumentDatabase(_directory, collectionId, model, strategy))

{

    foreach (var document in documents)

    {

        database.Write(document);

    }

    database.Commit();

}

```

#### Query

##### GET query

HTTP GET `[host]/query/?collection=mycollection&q=[my_query]&field=field1&field=field2&select=field1&skip=0&take=10`  

(e.g. http://localhost/write?collection=mycollection&q=value1&field=field1&field=field2&select=field1&skip=0&take=10)  

Accept: application/json  

##### POST query

HTTP POST `[host]/query/?select=field1&skip=0&take=10`  

Content-Type: application/json  

Accept: application/json  

```

{

	"and":

	{

		"collection": "film,music",

		"title": "rocky eye of the tiger",

		"or":

		{

			"title": "rambo",

			"or": 

			{

				"title": "cobra"

				"or":

				{

					"cast": "antonio banderas"

				}			

			}	

		},

		"and":

		{

			"year": 1980,

			"operator": "gt"

		},

		"not":

		{

			"title": "first blood"

		}

	}

}

```

##### Local query

```

using (var database = new DocumentDatabase(_directory, collectionId, model, strategy))

{

    var queryParser = database.CreateQueryParser();

    var query = queryParser.Parse(collectionId, word, "title", "title", and:true, or:false, label:true);

    var result = database.Read(query, skip: 0, take: 1);

}

```

## Document database

Resin stores data as document collections. It applies your prefered IModeland indexing strategy onto your data while you write and query it. 

The write pipeline produces a set of indices (graphs), one for each document field, that you may interact with by using the Resin read/write JSON HTTP API or programmatically.

## Vector-based indices

Resin indices are binary search trees that create clusters of vectors that are similar to each other, as you populate them with your data. 

When a node is added to the graph its cosine angle, i.e. its similarity to other nodes, determine its position (path) within the graph.

## Performance

Currently, Wikipedia size data sets produce indices capable of sub-second phrase searching. 

## You may also  

- build, validate and optimize indices using the command-line tool [Sir.Cmd](https://github.com/kreeben/resin/blob/master/src/Sir.Cmd/README.md)

- read efficiently by specifying which fields to return in the JSON result

- implement messaging formats such as XML (or any other, really) if JSON is not suitable for your use case

- construct queries that join between fields and even between collections, that you may post as JSON to the read endpoint or create programatically.

- construct any type of indexing scheme that produces any type of embeddings with virtually any dimensionality using either sparse or dense vectors.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/kreeben/resin

Awesome Lists containing this project

README