Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ahmedbesbes/media-agent
Scrape data from social media and chat with it using Langchain
https://github.com/ahmedbesbes/media-agent
langchain large-language-models llms nlp nlproc python tweepy
Last synced: 2 months ago
JSON representation
Scrape data from social media and chat with it using Langchain
- Host: GitHub
- URL: https://github.com/ahmedbesbes/media-agent
- Owner: ahmedbesbes
- Created: 2023-06-11T09:02:35.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2023-07-23T08:29:15.000Z (over 1 year ago)
- Last Synced: 2024-05-18T19:26:13.879Z (8 months ago)
- Topics: langchain, large-language-models, llms, nlp, nlproc, python, tweepy
- Language: Python
- Homepage:
- Size: 587 KB
- Stars: 107
- Watchers: 3
- Forks: 16
- Open Issues: 7
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-langchain-zh - Twitter Agent - agent?style=social): 抓取twitter内容,并和twitter内容进行聊天 (开源项目 / 其他聊天机器人)
README
# Media Agent
Media Agent scrapes Twitter and Reddit submissions, summarizes them, and chats with them in an interactive terminal.
---
### Demo
https://www.loom.com/share/f4954e7d34ef4b7b8491e2bf910e8521### Functionalities
- Scrapes tweets/submissions on your behalf either from a list of
user accounts or a list of keywords.
- Embeds the tweets/submissions using OpenAI
- Indexes the embeddings (i.e. *vectors*) in ChromaDB
- Enriches the index with additional metadata
- Creates a summary of the tweets/submissions and provides potential questions to answer
- Opens a chat session on top of the tweets
- Saves the conversation with its metadata
- A rich terminal UI and logging features### Tools and libraries used
* **Langchain** 🦜 to build and compose LLMs
* **ChromaDB** to store vectors (a.k.a embeddings) and query them to build conversational bots
* **Tweepy** to connect to your the Twitter API and extract Tweets and metadata
* **Praw** to connect to Reddit API
* **Rich** to build a cool terminal UX/UI
* **Poetry** to manage dependencies### Third party services
* OpenAI (🔑 needed)
* Twitter (🔑 needed)
* Reddit (🔑 needed)### Run the app locally
* Install dependencies with poetry
```bash
poetry install --with dev
```* Add API credentials
Create .env file at the root of the project with the following keys:
```bash
OPENAI_API_KEY=
TWITTER_BEARER_TOKEN=
REDDIT_API_CLIENT_ID=
REDDIT_API_SECRET=
REDDIT_USER_AGENT=
```More info on these credentials [here](https://openai.com/), [here](https://developer.twitter.com/en/docs/apps/overview) and [here](https://www.geeksforgeeks.org/how-to-get-client_id-and-client_secret-for-python-reddit-api-registration/)
* Run the app
```bash
make run-media-agent
```## Star History
[![Star History Chart](https://api.star-history.com/svg?repos=ahmedbesbes/media-agent&type=Timeline)](https://star-history.com/#ahmedbesbes/media-agent&Timeline)
### Future features:
This is an on-going project, so feel free to contribute:
Here's what I plan in the near future:
* Add more data sources: substack, press, LinkedIN
* support open-source LLMs (Falcon?)
* support Pinecone in addition to ChromaDB
* add instructions to deployment on a cloud infrastructure
* improve the prompts to have a more engaging/enriching conversations
* add actions to open URLs and fetch content from it