https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
https://github.com/thingston/extractor
data html php
Last synced: 5 months ago
JSON representation
Collection of PHP classes to extract data from HTML pages.
- Host: GitHub
- URL: https://github.com/thingston/extractor
- Owner: thingston
- Created: 2018-11-18T18:26:13.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2019-11-26T15:15:10.000Z (over 6 years ago)
- Last Synced: 2025-07-30T12:43:00.274Z (11 months ago)
- Topics: data, html, php
- Language: PHP
- Size: 71.3 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
Thingston Extractor
===================
Collection of PHP classes to extract data from HTML pages.
Requirements
------------
Thingston Extractor requires:
- [PHP 7.1](https://secure.php.net/releases/7_1_0.php) or above.
Instalation
-----------
Add Thingston Extractor to any PHP project using [Composer](https://getcomposer.org/):
```bash
composer require thingston/extractor
```
Getting Started
---------------
```php
use Thingston\Extractor\Page;
$uri = 'https://google.com/';
$html = file_get_contents($uri);
$page = Page\PageExtractor::create($html, $uri)->extract();
```
Contributors
------------
Open Source is made of contribuition. If you want to contribute to Thingston please
follow these steps:
1. Fork latest version into your own repository.
2. Write your changes or additions and commit them.
3. Follow PSR-2 coding style standard.
4. Make sure you have unit tests with full coverage to your changes.
5. Go to Github Pull Requests at [https://github.com/thingston/extractor/pulls](https://github.com/thingston/extractor/pulls)
and create a new request.
Thank you!
Changes and Versioning
----------------------
All relevant changes on this code are logged in a separated [log](CHANGELOG.md) file.
Version numbers follow recommendations from [Semantic Versioning](http://semver.org/).
License
-------
Thingston code is maintained under [The MIT License](https://opensource.org/licenses/MIT).