https://github.com/rhysd/node-github-trend
node.js library for scraping GitHub trending repositories.
https://github.com/rhysd/node-github-trend
github nodejs scraping trend trending
Last synced: 6 months ago
JSON representation
node.js library for scraping GitHub trending repositories.
- Host: GitHub
- URL: https://github.com/rhysd/node-github-trend
- Owner: rhysd
- License: mit
- Created: 2015-09-04T11:56:58.000Z (about 10 years ago)
- Default Branch: master
- Last Pushed: 2018-04-04T08:34:01.000Z (over 7 years ago)
- Last Synced: 2025-03-29T00:25:40.860Z (7 months ago)
- Topics: github, nodejs, scraping, trend, trending
- Language: TypeScript
- Size: 73.2 KB
- Stars: 21
- Watchers: 3
- Forks: 3
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
Scraping GitHub Trending Repositories
=====================================
[](http://badge.fury.io/js/github-trend)
[](https://travis-ci.org/rhysd/node-github-trend)[github-trend](https://www.npmjs.com/package/github-trend) is a library for scraping GitHub trending repositories.
## Only scraping
```javascript
const Trending = require('github-trend');
const scraper = new Trending.Scraper();// Empty string means 'all languages'
scraper.scrapeTrendingReposFullInfo('').then(repos => {
for (const repo of repos) {
console.log(repo.owner);
console.log(repo.name);
console.log(repo.description);
console.log(repo.language);
console.log(repo.allStars);
console.log(repo.todaysStars);
console.log(repo.forks);
}
}).catch(err => {
console.error(err.message);
});// For other languages
scraper.scrapeTrendingReposFullInfo('rust');
scraper.scrapeTrendingReposFullInfo('vim');
scraper.scrapeTrendingReposFullInfo('go');
````Scraper` only scrapes GitHub trending page. This returns an array of repository information.
This method is relatively faster because of sending request only once per language.## Scraping and getting full repository information
```javascript
const Trending = require('github-trend');
const client = new Trending.Client();client.fetchTrending('').then(repos => {
for (const repo of repos) {
// Result of https://api.github.com/repos/:user/:name
console.log(repo);
}
}).catch(err => {
console.error(err.message);
});// Fetch all API call asynchronously
client.fetchTrendings(['', 'vim', 'go']).then(repos => {
for (const lang in repos) {
for (const repo of repos[lang]) {
// Result of https://api.github.com/repos/:user/:name
console.log(repo);
}
}
}).catch(err => {
console.error(err.message);
});
````Client` contains scraper and scrapes GitHub trending page, then gets all repositories' full information using GitHub `/repos/:user/:name` API.
This takes more time than only scraping, but all requests are sent asynchronously and in parallel.## Scraping language information
```javascript
const Trending = require('github-trend');
const scraper = new Trending.Scraper();scraper.scrapeLanguageYAML().then(langs => {
for (const name in langs) {
console.log(name);
console.log(langs[name]);
}
}).catch(err => {
console.error(err.message);
});
```This returns all languages information detected in GitHub by scraping [here](https://raw.githubusercontent.com/github/linguist/master/lib/linguist/languages.yml).
The result is cached and reused.## Scraping language colors
```javascript
const Trending = require('github-trend');
const scraper = new Trending.Scraper();scraper.scrapeLanguageColors().then(colors => {
for (const name in colors) {
console.log('name: ' + name);
console.log('color: ' + colors[name]);
}
}).catch(err => {
console.error(err.message);
});// If you want only language names:
scraper.scrapeLanguageNames().then(names => {
for (const name of names) {
console.log(name);
}
}).catch(err => {
console.error(err.message);
});
```## Collect trending repositories by scraping and GitHub API
By scraping GitHub Trending Repositories page, the information is restricted to the information
rendered in the page. This library also supports to getting information of trending repositories
using GitHub Repositories API.Although an API token (at the first parameter of `new Client`) is not mandatory, it is recommended
for avoiding API rate limit.```javascript
const {Client} = require('github-trend');
const client = new Client({token: 'API access token here'});client.fetchTrending('all').then(repos => {
for (const repo of repos) {
console.log('Name:', repo.full_name);
console.log('ID:', repo.id);
console.log('Topics:', repo.topics.join(' '));
console.log('License:', repo.license.name);
}
}).catch(console.error);
```## License
Distributed under [the MIT license](LICENSE.txt).