An open API service indexing awesome lists of open source software.

https://github.com/fadilxcoder/web-crawler

Crawl website to scrap data (TV shows / Foot Stream)
https://github.com/fadilxcoder/web-crawler

notes php poc

Last synced: about 1 year ago
JSON representation

Crawl website to scrap data (TV shows / Foot Stream)

Awesome Lists containing this project

README

          

# Notes

- Install : `composer require fabpot/goutte`
- Crawlers
- - Alibaba website for watches names
- - Local car rental login success / failed (email dispatched in mailtrap)

----

- CLI : `php run` generate JSON db
- WEB GUI : `index.php` - for debugging
- API
- - https://my-json-server.typicode.com/fadilxcoder/web-crawler
- - https://my-json-server.typicode.com/fadilxcoder/web-crawler/db
- - https://my-json-server.typicode.com/fadilxcoder/web-crawler/person-of-interest-season-1
- Routes
- - https://my-json-server.typicode.com/fadilxcoder/streamer/movies
- - https://my-json-server.typicode.com/fadilxcoder/streamer/movies/1

## Docs

- https://github.com/FriendsOfPHP/Goutte (Github)
- https://symfony.com/doc/current/http_client.html (HTTP Client - Symfony components)
- https://symfony.com/doc/current/components/dom_crawler.html (The DomCrawler Component - Symfony components)
- https://symfony.com/components/BrowserKit (BrowserKit component)