https://github.com/davidwrossiter/langchunk

Source code for chunking code in multiple different languages

chunking chunking-algorithm embedding llm-context vectorization


# Langchunk

> I created langchunk to chunk entire codebases correctly.

RAG systems rely on data that has been properly chunked and embedded. Langchunk creates high-quality chunks from a codebase and attaches metadata about the directory's file structure and the content of each file.
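
To illustrate what file-structure metadata might look like, here is a minimal Python sketch. It is not taken from langchunk: the function name `collect_file_metadata` and the record fields (`path`, `directory`, `size_bytes`, `line_count`) are assumptions chosen for illustration.

```python
from pathlib import Path


def collect_file_metadata(root: str, suffixes=(".js",)) -> list[dict]:
    """Walk `root` and return one metadata record per matching source file.

    Hypothetical helper: the field names below are illustrative only.
    """
    root_path = Path(root)
    records = []
    # sorted() gives a stable order regardless of filesystem listing order.
    for path in sorted(root_path.rglob("*")):
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8")
        records.append({
            # Paths are stored relative to the root, with forward slashes.
            "path": path.relative_to(root_path).as_posix(),
            "directory": path.parent.relative_to(root_path).as_posix(),
            "size_bytes": path.stat().st_size,
            "line_count": len(text.splitlines()),
        })
    return records
```

Each record could then be stored alongside the chunks produced from that file, so a retrieved chunk can be traced back to its location in the project.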

---

**Current capabilities**

- Chunks JavaScript source code.
- Splits code at function declarations, class declarations, variable declarations, and expression statements.
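
The splitting rule above can be sketched as follows. This is an illustration under stated assumptions, not langchunk's actual implementation: it assumes the JavaScript has already been parsed into an ESTree-style AST whose nodes carry `start` and `end` character offsets (the shape a parser such as acorn produces), and the AST here is written by hand to keep the example self-contained. The names `CHUNK_NODE_TYPES` and `chunk_top_level` are hypothetical.

```python
# ESTree node types that correspond to the four chunk boundaries listed above.
CHUNK_NODE_TYPES = {
    "FunctionDeclaration",
    "ClassDeclaration",
    "VariableDeclaration",
    "ExpressionStatement",
}


def chunk_top_level(source: str, program: dict) -> list[dict]:
    """Return one chunk per top-level node whose type is in CHUNK_NODE_TYPES.

    Assumes each node has integer `start`/`end` offsets into `source`.
    """
    chunks = []
    for node in program["body"]:
        if node["type"] not in CHUNK_NODE_TYPES:
            continue  # e.g. import/export statements are skipped in this sketch
        chunks.append({
            "kind": node["type"],
            "start": node["start"],
            "end": node["end"],
            "text": source[node["start"]:node["end"]],
        })
    return chunks


# Hand-written AST for a three-statement program (offsets counted manually).
source = "const a = 1;\nfunction f() { return a; }\nf();\n"
program = {
    "type": "Program",
    "body": [
        {"type": "VariableDeclaration", "start": 0, "end": 12},
        {"type": "FunctionDeclaration", "start": 13, "end": 39},
        {"type": "ExpressionStatement", "start": 40, "end": 44},
    ],
}

for chunk in chunk_top_level(source, program):
    print(chunk["kind"], repr(chunk["text"]))
```

Chunking on syntactic boundaries keeps each function or class intact, which a fixed-size character split would not guarantee.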

**Steps for future versions**

- Support additional languages
- Generate embeddings for vector databases