https://github.com/pythonicshariful/pythonicshariful

Python developer specializing in web scraping, automation, and machine learning. Experienced in building scalable scrapers, bypassing anti-bot systems, processing large datasets, and applying ML/NLP models to real-world problems.


---

## 🕸️ What I Do
- **Web scraping at scale** with Python (Selenium, Playwright, Scrapy, BeautifulSoup)
- **Automation pipelines** for data collection, cleaning, and storage
- **API integrations** (REST/GraphQL) and browser automation
- **Data wrangling** with Pandas, exporting to CSV/JSON/DB
- **Learning ML & AI** to build smarter data products
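
As a rough illustration of the collect, clean, and store flow listed above, here is a minimal sketch. It uses only the Python standard library as a stand-in for Pandas, and the `url`/`title` fields, the `pages` table, and all function names are hypothetical, not taken from any real project of mine.

```python
import csv
import io
import json
import sqlite3


def clean_records(rows):
    """Strip whitespace, drop rows without a URL, and de-duplicate by URL."""
    seen = set()
    cleaned = []
    for row in rows:
        url = (row.get("url") or "").strip()
        if not url or url in seen:
            continue
        seen.add(url)
        cleaned.append({"url": url, "title": (row.get("title") or "").strip()})
    return cleaned


def to_csv(rows):
    """Serialize cleaned rows to CSV text (hypothetical two-column layout)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["url", "title"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def to_json(rows):
    """Serialize cleaned rows to a JSON array."""
    return json.dumps(rows, ensure_ascii=False, indent=2)


def to_sqlite(rows, connection):
    """Upsert cleaned rows into a hypothetical `pages` table."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, title TEXT)"
    )
    connection.executemany(
        "INSERT OR REPLACE INTO pages (url, title) VALUES (:url, :title)", rows
    )
    connection.commit()
```

Keeping the cleaning step separate from the exporters means the same cleaned rows can feed any of the three output targets.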

### 🔧 Tech Stack












---

## ✨ Highlights
- Built bots that **extract thousands of pages/day** with rotating proxies & retries
- Designed resilient **anti-bot bypass** flows (stealth drivers, human-like waits, CAPTCHA-solving services)
- Delivered **clean datasets** ready for analysis & model training
- Currently exploring **feature engineering**, **vector databases**, and **LLM-powered** scraping assistants
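
The "rotating proxies & retries" idea above can be sketched roughly as follows. This is a simplified illustration, not my production code: `fetch_with_retries`, the injected `fetch(url, proxy)` callable, and the default delays are all assumptions chosen so the example stays self-contained.

```python
import itertools
import random
import time


def fetch_with_retries(url, fetch, proxies, max_attempts=4, base_delay=1.0,
                       sleep=time.sleep):
    """Call `fetch(url, proxy)` with a new proxy per attempt and exponential backoff.

    `fetch` is a hypothetical callable supplied by the caller (for example a thin
    wrapper around an HTTP client); it should raise an exception on failure.
    """
    if not proxies:
        raise ValueError("at least one proxy is required")
    proxy_cycle = itertools.cycle(proxies)
    last_error = None
    for attempt in range(max_attempts):
        proxy = next(proxy_cycle)
        try:
            return fetch(url, proxy)
        except Exception as error:  # real code should catch the client's own errors
            last_error = error
            if attempt < max_attempts - 1:
                # Exponential backoff plus jitter so retries do not arrive in lockstep.
                sleep(base_delay * (2 ** attempt) + random.uniform(0, 0.5))
    raise RuntimeError(f"all {max_attempts} attempts failed for {url}") from last_error
```

Passing `fetch` and `sleep` in as parameters keeps the retry logic independent of any particular HTTP library and makes it easy to test without network access.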

---


## 📊 GitHub Stats







---

## 🧪 ML & AI Learning Journey
- 🎯 Current focus: **data labeling, feature engineering, small ML models for classification/regression**
- 🧠 Next up: **LLM-assisted scraping**, **RAG for document-heavy sites**, **agent workflows**
- 📚 Notes & experiments live here → [`/labs`](https://github.com/pythonicshariful/labs)

---

## 🗂️ Example Services I Offer
- Full-site data extraction (anti-bot aware) → CSV/JSON/DB
- PDF/image capture & text extraction (OCR)
- API discovery & reverse engineering for private endpoints
- Dashboard/API to deliver data (FastAPI + simple UI)
- Ongoing monitoring for **price changes**, **stock**, **new listings**
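
For the monitoring service in the last bullet, the core step is comparing two scraped snapshots. A minimal sketch, assuming each snapshot is a dict keyed by a hypothetical item ID with `price` and `in_stock` fields (the names and layout are illustrative only):

```python
def diff_snapshots(previous, current):
    """Compare two {item_id: {"price": ..., "in_stock": ...}} snapshots.

    Returns new listings plus (item_id, old_value, new_value) tuples for
    price and stock changes. Items that disappeared are not reported here.
    """
    changes = {"new_listings": [], "price_changes": [], "stock_changes": []}
    for item_id, item in current.items():
        old = previous.get(item_id)
        if old is None:
            changes["new_listings"].append(item_id)
            continue
        if item["price"] != old["price"]:
            changes["price_changes"].append((item_id, old["price"], item["price"]))
        if item["in_stock"] != old["in_stock"]:
            changes["stock_changes"].append((item_id, old["in_stock"], item["in_stock"]))
    return changes
```

In practice the two snapshots would come from consecutive scraper runs, and the returned dict would drive alerts or a dashboard update.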

> 💌 **Need data?** Open an issue or reach out!

---

## 💬 Connect









---

## 🐍 Fun


snake animation

---

Made with ❤️, Python, and a lot of headless browsers.