https://github.com/danibcorr/wattpad-mate
Tool for collecting information from Wattpad and finding users who share your tastes and are open to interaction.
python scraping statistical-analysis streamlit wattpad
Last synced: about 1 month ago
- Host: GitHub
- URL: https://github.com/danibcorr/wattpad-mate
- Owner: danibcorr
- Created: 2024-05-04T13:39:10.000Z (8 months ago)
- Default Branch: main
- Last Pushed: 2024-09-17T12:49:31.000Z (4 months ago)
- Last Synced: 2024-09-18T12:15:28.008Z (4 months ago)
- Topics: python, scraping, statistical-analysis, streamlit, wattpad
- Language: Python
- Homepage:
- Size: 9.03 MB
- Stars: 1
- Watchers: 1
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
# **Wattpad Mate**
Welcome to **Wattpad Mate**! This tool helps you connect with users who share similar interests on Wattpad.
## **What is Wattpad Mate?**
Wattpad Mate is a Streamlit app that extracts data from Wattpad pages. It filters the results by views and votes, then displays them in a pandas DataFrame. You can browse the top-voted users in each category and reach out to them directly through their profiles. This project is for educational purposes only and does not store any user data.
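The filtering step described above can be illustrated with a minimal pandas sketch. The column names (`user`, `category`, `views`, `votes`), the helper functions, and the sample rows are all hypothetical; the project's real schema may differ.

```python
import pandas as pd


def filter_stories(df, min_views, max_views, min_votes, max_votes):
    """Keep rows whose views and votes fall inside the inclusive ranges."""
    mask = df["views"].between(min_views, max_views) & df["votes"].between(
        min_votes, max_votes
    )
    return df[mask]


def top_voted_by_category(df, n=3):
    """Return the n highest-voted rows within each category."""
    ranked = df.sort_values("votes", ascending=False)
    return ranked.groupby("category").head(n).reset_index(drop=True)


# Hypothetical sample data standing in for scraped results.
stories = pd.DataFrame(
    {
        "user": ["ana", "ben", "cleo", "dev"],
        "category": ["fantasy", "fantasy", "romance", "romance"],
        "views": [1200, 150, 9800, 40000],
        "votes": [300, 12, 750, 90],
    }
)

filtered = filter_stories(
    stories, min_views=1000, max_views=20000, min_votes=100, max_votes=1000
)
top = top_voted_by_category(filtered, n=1)
print(top)
```

`Series.between` is inclusive on both ends by default, which matches the idea of minimum and maximum filter values.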
## **Usage**
1. **Clone the Repository:** Start by cloning this repository. Make sure you have Docker installed on your machine to deploy this project.
2. **Set Up with Docker:** After cloning, launch the Docker Compose file with the command `docker compose up`. Docker will automatically set up the container and install all necessary libraries.
3. **Access the App:** Once the container is running, the service is available on port 8501 of the host machine's IP address. Open the app's web interface, enter the URL of the Wattpad page you want to scrape, and set the minimum and maximum values for the views and votes filters.
4. **Start Scraping:** Click the "Start Scraping" button to begin. The app will process the data and show you the results in a pandas DataFrame.
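If the web interface does not load, it can help to confirm that something is listening on port 8501 first. The following is a small standard-library sketch; the function name `is_port_open` is hypothetical and not part of the project, and the host you pass in is whatever IP your Docker host uses.

```python
import socket


def is_port_open(host, port=8501, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


# Example call (the host is an assumption; use your Docker host's IP):
# is_port_open("localhost")
```

A `True` result only shows that the port accepts connections; it does not confirm that the Streamlit app itself has finished starting.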
## **Technologies**
- **Streamlit:** For creating and deploying the web app.
- **Docker:** To containerize the project.
- **Poetry:** For managing project dependencies.
- **Black:** To automatically format and clean up Python code.
- **Pandas & NumPy:** For ETL (Extract, Transform, Load) processes of Wattpad data.
- **BeautifulSoup & Selenium:** For scraping data from Wattpad, as there is no API available.
- **Loguru:** For logging and managing application logs.
- **pytest:** For testing the application.
- **GitHub Actions:** For automating workflows and CI/CD processes.
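As an illustration of the "Transform" part of the ETL step listed above, the sketch below converts abbreviated display counts into integers before they are filtered. It assumes the scraped pages show counts such as `1.2K` or `3M`; that format and the helper name `parse_count` are assumptions, not something taken from the project's code.

```python
def parse_count(text):
    """Convert a display count such as "1.2K" or "3M" into an integer."""
    multipliers = {"K": 1_000, "M": 1_000_000, "B": 1_000_000_000}
    cleaned = text.strip().replace(",", "").upper()
    if cleaned and cleaned[-1] in multipliers:
        # Scale the numeric part by the suffix, rounding away float noise.
        return int(round(float(cleaned[:-1]) * multipliers[cleaned[-1]]))
    return int(cleaned)


print(parse_count("1.2K"))
print(parse_count("4,512"))
```

Normalizing counts to integers early keeps numeric comparisons such as minimum and maximum filters straightforward.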
## **Note**
This app is meant for educational use only. It does not store user data and should not be used to scrape data from Wattpad without Wattpad's explicit permission. Please respect Wattpad's terms of service and use this tool responsibly. We disclaim any responsibility for misuse or unauthorized data scraping.
Enjoy exploring Wattpad and connecting with like-minded users!