https://github.com/lachlanjc/spatial-twitter-search

Prototype of embeddings-based search of Twitter likes/bookmarks on an infinite canvas
https://github.com/lachlanjc/spatial-twitter-search

chromadb infinite-canvas nextjs openai semantic-search twitter umap

Last synced: about 1 month ago
JSON representation

Prototype of embeddings-based search of Twitter likes/bookmarks on an infinite canvas

Host: GitHub
URL: https://github.com/lachlanjc/spatial-twitter-search
Owner: lachlanjc
Created: 2024-05-07T08:12:48.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-05-07T08:24:30.000Z (about 1 year ago)
Last Synced: 2025-03-29T20:51:09.062Z (about 2 months ago)
Topics: chromadb, infinite-canvas, nextjs, openai, semantic-search, twitter, umap
Language: TypeScript
Homepage: https://edu.lachlanjc.com/2024-05-07_shm_spatial_semantic_twitter_search
Size: 2.35 MB
Stars: 6
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Twitter Spatial Semantic Search

I’ve combined my 1) desire to re-surface content from my Twitter feed 2) embeddings & semantic search and 3) my love of infinite canvas UIs:

https://github.com/lachlanjc/spatial-twitter-search/assets/5074763/f7021482-7983-439e-bd9a-cd9a1b4d942f

It’s an OpenAI embeddings-powered search tool for my Twitter likes & bookmarks, with advanced filtering, on an infinite canvas with draggable results.

This codebase is rather messy, but I [documented the process](https://edu.lachlanjc.com/2024-05-07_shm_spatial_semantic_twitter_search) of building the project with more screenshots on my blog. Run free with it!

## Running it yourself

1. Clone repo
2. `bun i`

Gathering your data is the most hacky process ever:

1. Create an empty file at `lib/tasks/likes-urls.txt`
2. Open Twitter, open the network tab > XHR in devtools, go to your profile’s likes page, scroll down a bunch (Cmd-down repeatedly). Right click, `Copy all URLs`, paste them into the text file, filter for the ones matching `Likes?`
3. Create a file at `lib/tasks/fetch-options.js`
4. On your bookmarks page, right click the `Bookmarks?` request in your devtools, `Copy as fetch (Node.js)`, then paste all those headers into the `fetch-options` file, like this:

```js
export default const fetchOptions = {
headers: {
accept: "*/*",
"accept-language": "en-US,en;q=0.9",
authorization: "Bearer …"
```

5. In `.env`, set your `OPENAI_API_KEY` for embeddings
6. Install [ChromaDB](https://trychroma.com), in another terminal, `chroma run` (you’ll need to keep this running)
7. If you’re using [Zed](https://zed.dev), spawn the task for download, then for embedding. Otherwise, `bun run lib/tasks/download.js` then `bun run lib/tasks/embed.js`
8. `bun dev`

---

MIT License. The tweet data in `lib/db` is not mine.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/lachlanjc/spatial-twitter-search

Awesome Lists containing this project

README