{"id":21696664,"url":"https://github.com/patrickcurl/sanechain","last_synced_at":"2025-11-07T22:04:29.589Z","repository":{"id":162806027,"uuid":"637292821","full_name":"patrickcurl/sanechain","owner":"patrickcurl","description":"Filling in the missing gaps with langchain, and creating OO wrappers to simplify some workloads.","archived":false,"fork":false,"pushed_at":"2023-05-07T06:00:07.000Z","size":34235,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-07-04T23:11:23.872Z","etag":null,"topics":["agent","ai","artificial-intelligence","cohereai","gpt","gpt3","gpt4","inference-engine","langchain","language-model","llama","llama-index","llamacpp","llm","llmops","llms","openai","openai-api"],"latest_commit_sha":null,"homepage":"","language":"TypeScript","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/patrickcurl.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"github":"patrickcurl","open_collective":"sanechain","ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"custom":null}},"created_at":"2023-05-07T05:14:38.000Z","updated_at":"2025-02-15T12:32:11.000Z","dependencies_parsed_at":"2023-05-19T11:46:35.088Z","dependency_job_id":null,"html_url":"https://github.com/patrickcurl/sanechain","commit_stats":{"total_commits":5,"total_committers":1,"mean_commits":5.0,"dds":0.0,"last_synced_commit":"2927d7420868d3f2328f2698afc9d921fab3073a"},"previous_names":[],"tags_count":4,"template":false,"template_full
_name":null,"purl":"pkg:github/patrickcurl/sanechain","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickcurl%2Fsanechain","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickcurl%2Fsanechain/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickcurl%2Fsanechain/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickcurl%2Fsanechain/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/patrickcurl","download_url":"https://codeload.github.com/patrickcurl/sanechain/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickcurl%2Fsanechain/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":264502649,"owners_count":23618664,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["agent","ai","artificial-intelligence","cohereai","gpt","gpt3","gpt4","inference-engine","langchain","language-model","llama","llama-index","llamacpp","llm","llmops","llms","openai","openai-api"],"created_at":"2024-11-25T19:21:16.983Z","updated_at":"2025-11-07T22:04:29.525Z","avatar_url":"https://github.com/patrickcurl.png","language":"TypeScript","funding_links":["https://github.com/sponsors/patrickcurl","https://opencollective.com/sanechain"],"categories":[],"sub_categories":[],"readme":"# Sane Chain \u003c!-- omit in toc --\u003e\n\n## An attempt to make langchainjs easier to work with \u003c!-- omit in toc --\u003e\n\nWIP - ~~nothing works yet, just saving the 
name~~\nSome things work, just um - not tested, no warranties :1st_place_medal:\n\nAdds the following utility classes and loaders:\n\n1. [Utility Classes](#utility-classes)\n   1. [DocumentLoader](#documentloader)\n2. [Loaders](#loaders)\n   1. [ChatGPT Loader](#chatgpt-loader)\n   2. [Simpler GithubRepoLoader](#simpler-githubrepoloader)\n   3. [Roadmap](#roadmap)\n\n## Utility Classes\n\n### DocumentLoader\n\nThis class wraps the langchainjs loaders (plus sanechain's own) in a single class,\nDocumentLoader, that loads your documents regardless of type.\n\nExample:\n\n```typescript\nconst filesAndDirectories = [\n  'path/to/somefile.md',\n  'path/to/somefile.pdf',\n  'path/to/somefile.text',\n  'path/to/somefile.html',\n  'path/to/somedirectory',\n  'https://github.com/some/repo',\n  'https://github.com/some/other_repo',\n  'path/to/chatgpt.json'\n]\n\nconst documentLoader = new DocumentLoader(filesAndDirectories)\nconst documents = await documentLoader.loadDocuments()\nconst splitDocuments = await documentLoader.splitDocuments()\n// Might take time; a queue system is planned to speed things up (loading is already async).\n// @todo: add full parity with all langchain python loaders.\n```\n\n## Loaders\n\n### ChatGPT Loader\n\n```typescript\nimport { ChatGPTLoader } from './chat_gpt_loader.js';\n\nconst loader = new ChatGPTLoader('path/to/chat/log.json', 10);\nconst documents = await loader.load();\n```\n\n### Simpler GithubRepoLoader\n\nInsert a GitHub link, get the repo's documents.\n\n```typescript\n  import {GithubRepoLoader} from 'sanechain'\n  const loader = new GithubRepoLoader(\"https://github.com/owner/repo\", { /*params*/ });\n  const documents = await loader.load();\n```\n\n### Roadmap\n\n- [ ] Models\n  - [ ] General\n  - [ ] Chat\n  - [ ] Embeddings\n- [ ] Prompts\n  - [ ] General Templates\n  - [ ] Chat Template\n  - [ ] Example Selectors\n  - [ ] Output Parsers\n- [ ] Indexes (Primary focus at first)\n  - [ ] Document Loaders %%\n    - [ ] 
Airbyte JSON\n    - [ ] Apify Dataset\n    - [ ] Arxiv\n    - [ ] AWS S3\n    - [ ] AZLyrics\n    - [ ] Azure Blob Storage\n    - [ ] Bilibili\n    - [ ] Blackboard\n    - [ ] Blockchain\n    - [x] ChatGPT Data\n    - [ ] Confluence\n    - [ ] CoNLL-U\n    - [ ] Copy / Paste\n    - [x] CSV (langchainjs)\n    - [ ] Diffbot\n    - [ ] Discord\n    - [ ] DuckDB\n    - [ ] Email\n    - [x] EPub (langchainjs)\n    - [ ] EverNote\n    - [ ] Facebook Chat\n    - [ ] Figma\n    - [x] File Directory (langchainjs)\n    - [x] Git (langchainjs + custom url loader)\n    - [ ] GitBook\n    - [ ] Google BigQuery\n    - [ ] Google Cloud Storage\n    - [ ] Google Drive\n    - [ ] Gutenberg\n    - [ ] Hacker News\n    - [ ] HTML\n    - [ ] HuggingFace dataset\n    - [ ] iFixit\n    - [ ] Images\n    - [ ] Image captions\n    - [ ] IMDB\n    - [ ] JSON Files (langchain)\n    - [ ] Jupyter Notebook\n    - [x] Markdown (sorta, just parses using TextLoader)\n    - [ ] MediaWikiDump\n    - [ ] Microsoft OneDrive\n    - [ ] Microsoft PowerPoint\n    - [x] Microsoft Word (langchainjs)\n    - [ ] Modern Treasury\n    - [ ] Notion DB 1/2\n    - [ ] Notion DB 2/2\n    - [ ] Obsidian\n    - [ ] Pandas DataFrame\n    - [x] PDF (langchain)\n    - [ ] Using PyPDFium2\n    - [ ] ReadTheDocs Documentation\n    - [ ] Reddit\n    - [ ] Roam\n    - [ ] Sitemap\n    - [ ] Slack\n    - [ ] Spreedly\n    - [ ] Stripe\n    - [ ] Subtitle (langchain)\n    - [ ] Telegram\n    - [ ] TOML\n    - [ ] Twitter\n    - [ ] Unstructured File (half way)\n    - [x] URL (langchainjs via puppeteer, playwright, cheerio, etc.)\n    - [ ] Selenium URL Loader\n    - [x] Playwright URL Loader (langchainjs)\n    - [ ] WebBaseLoader\n    - [ ] WhatsApp Chat\n    - [ ] Wikipedia\n    - [ ] YouTube transcripts\n  - [ ] Text Splitters\n    - [ ] Character Text Splitter\n    - [ ] HuggingFace Length Function\n    - [ ] LaTeX Text Splitter\n    - [ ] Markdown Text Splitter\n    - [ ] NLTK Text Splitter\n    - [ ] 
RecursiveCharacterTextSplitter\n    - [ ] Spacy Text Splitter\n    - [ ] tiktoken (OpenAI) Length Function\n    - [ ] TiktokenTextSplitter\n  - [ ] Vector stores\n  - [ ] Retrievers\n- [ ] Memory (TBD)\n- [ ] Chains (TBD)\n- [ ] Agents\n  - [ ] Tools (TBD)\n  - [ ] Agents (TBD)\n  - [ ] Toolkits (TBD)\n  - [ ] AgentExecutors (TBD)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatrickcurl%2Fsanechain","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpatrickcurl%2Fsanechain","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatrickcurl%2Fsanechain/lists"}