An open API service indexing awesome lists of open source software.

https://github.com/harrystevens/scraperama

Scrape html, csv, json files from the internet.
https://github.com/harrystevens/scraperama

Last synced: 3 months ago
JSON representation

Scrape html, csv, json files from the internet.

Awesome Lists containing this project

README

        

# scraperama
Scrape files from the internet.

## Installation

```bash
npm i scraperama -S
```

## Usage

```js
const scraperama = require("scraperama");

scraperama.csv("file.csv", json => {
console.log(json);
});

scraperama.html("file.html", $ => {
console.log($("body").html());
});

scraperama.json("file.json", json => {
console.log(json);
});

scraperama.text("file.txt", text => {
console.log(text);
});
```

To download any old file:
```js
scraperama.download(
"path/to/file.zip", // URL
`${__dirname}/local/file.zip`, // local file path
(pct) => { process.stdout.write(`\r${pct.toFixed(1)}%`); }, // log percentage downloaded
() => { console.log("Done!"); } // callback function
);
```

To get an object's file size:
```js
scraperama.filesize(object);
```

To throttle a function:
```js
const logThrottled = scraperama.throttle(console.log, 500);
Array.from({ length: 10 }).forEach((_, i) => logThrottled(i));
```