https://github.com/dschreij/scholar_parser
Parses Google Scholar user profile information. Documentation can be found at
https://github.com/dschreij/scholar_parser
- Host: GitHub
- URL: https://github.com/dschreij/scholar_parser
- Owner: dschreij
- License: gpl-2.0
- Created: 2014-12-15T16:33:57.000Z (over 10 years ago)
- Default Branch: master
- Last Pushed: 2018-03-30T15:58:07.000Z (about 7 years ago)
- Last Synced: 2025-03-25T04:18:29.210Z (2 months ago)
- Language: PHP
- Homepage: http://dschreij.github.io/scholar_parser
- Size: 137 KB
- Stars: 3
- Watchers: 1
- Forks: 3
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# Scholar parser
(highly experimental)
This class parses a Google Scholar profile page for publication data and scientist stats. The page can be read directly from Google Scholar by supplying the user's profile ID, or by passing an HTML file saved from Scholar to the class.
## Installation
This module requires PHP-PhantomJS, which can be found at http://jonnnnyw.github.io/php-phantomjs/. For easy installation, I recommend using Composer (https://getcomposer.org/) with the included composer.json file, by running:
```
php composer.phar install
```

## Example usage
Below is a very basic example. For a more elaborate one, see the example.php file, which uses a basic caching mechanism to avoid querying Scholar on every page view.

```php
// Create a new instance of the parser class
require_once("scholar_profile_parser.class.php");
$parser = new ScholarProfileParser();

// The profile to parse (mine in this case)
$profile_id = "Pm3O_58AAAAJ&hl";

// Read the html from Scholar into a DOM object
$parser->read_html_from_scholar_profile($profile_id);
// Parse publication data from the DOM
$parser->parse_publications();
// Parse stats from the DOM (H-Index, citation count, i10 index)
$parser->parse_stats();

// Print the output
$parser->print_parsed_data_raw(); //Basic output as stored in JSON
echo $parser->format_publications_in_APA(); //Formatted as HTML table
```

## API Documentation
Soon to follow, once I figure out how to output nicely formatted Markdown documentation with Doxygen. For now, the documentation can be found in the doc/html folder inside the repository.
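
As a rough illustration of the caching idea mentioned under Example usage, here is a minimal sketch of a file-based cache. It is not the code from example.php: the helper name `cached_fetch`, the cache file location and the 24-hour lifetime are all assumptions made for this sketch, and the closure only stands in for the real parser calls.

```php
<?php
// Hypothetical helper, not part of scholar_parser: returns the cached
// payload while it is still fresh, otherwise calls $fetch and stores the result.
function cached_fetch(string $cache_file, int $max_age, callable $fetch): string
{
    // PHP caches stat() results, so drop any stale entry for this file first
    clearstatcache(true, $cache_file);

    // Serve from the cache if the file exists and is younger than $max_age seconds
    if (is_file($cache_file) && (time() - filemtime($cache_file)) < $max_age) {
        return file_get_contents($cache_file);
    }

    // Cache is missing or stale: do the expensive work and store the result
    $data = $fetch();
    file_put_contents($cache_file, $data, LOCK_EX);
    return $data;
}

// Usage: the closure is a placeholder for the parser calls shown earlier
$cache_file = sys_get_temp_dir() . "/scholar_profile_cache.html"; // assumed location
$html = cached_fetch($cache_file, 24 * 3600, function () {
    // In a real page this would run the parser and return, for example,
    // $parser->format_publications_in_APA();
    return "<table><tr><td>placeholder</td></tr></table>";
});
echo $html;
```

With this approach Scholar is only queried when the cache file is missing or older than the chosen lifetime. Every other page view is served from disk.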