https://github.com/scrapfly/typescript-scrapfly

SDK for Scrapfly.io web scraping API
https://github.com/scrapfly/typescript-scrapfly

api saas scraping sdk webscraping

Last synced: 7 months ago
JSON representation

SDK for Scrapfly.io web scraping API

Host: GitHub
URL: https://github.com/scrapfly/typescript-scrapfly
Owner: scrapfly
License: other
Created: 2023-07-18T13:48:33.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-11-09T06:23:11.000Z (about 1 year ago)
Last Synced: 2025-04-14T17:49:17.924Z (9 months ago)
Topics: api, saas, scraping, sdk, webscraping
Language: TypeScript
Homepage: https://scrapfly.io/
Size: 1.41 MB
Stars: 10
Watchers: 6
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Scrapfly SDK

`npm install scrapfly-sdk`  

`deno add jsr:@scrapfly/scrapfly-sdk`  

`bun jsr add @scrapfly/scrapfly-sdk`  

Typescript/Javascript SDK for [Scrapfly.io](https://scrapfly.io/) web scraping API which allows to:

-   Scrape the web without being blocked.

-   Use headless browsers to access Javascript-powered page data.

-   Scale up web scraping.

-   ... and [much more](https://scrapfly.io/docs/scrape-api/getting-started)!

For web scraping guides see [our blog](https://scrapfly.io/blog/) and [#scrapeguide](https://scrapfly.io/blog/tag/scrapeguide/) tag for how to scrape specific targets.

The SDK is distributed through:

- [npmjs.com/package/scrapfly-sdk](https://www.npmjs.com/package/scrapfly-sdk)

- [jsr.io/@scrapfly/scrapfly-sdk](https://jsr.io/@scrapfly/scrapfly-sdk)

## Quick Intro

1. Register a [Scrapfly account for free](https://scrapfly.io/register)

2. Get your API Key on [scrapfly.io/dashboard](https://scrapfly.io/dashboard)

3. Start scraping: 🚀

```javascript

// node 

import { ScrapflyClient, ScrapeConfig } from 'scrapfly-sdk';

// bun

import { ScrapflyClient, ScrapeConfig} from '@scrapfly/scrapfly-sdk';

// deno: 

import { ScrapflyClient, ScrapeConfig } from 'jsr:@scrapfly/scrapfly-sdk';

const key = 'YOUR SCRAPFLY KEY';

const client = new ScrapflyClient({ key });

const apiResponse = await client.scrape(

    new ScrapeConfig({

        url: 'https://web-scraping.dev/product/1',

        // optional parameters:

        // enable javascript rendering

        render_js: true,

        // set proxy country

        country: 'us',

        // enable anti-scraping protection bypass

        asp: true,

        // set residential proxies

        proxy_pool: 'public_residential_pool',

        // etc.

    }),

);

console.log(apiResponse.result.content); // html content

// Parse HTML directly with SDK (through cheerio)

console.log(apiResponse.result.selector('h3').text());

```

For more see [/examples](/examples/) directory.  

For more on Scrapfly API see our [getting started documentation](https://scrapfly.io/docs/scrape-api/getting-started)

For Python see [Scrapfly Python SDK](https://github.com/scrapfly/python-scrapfly)

## Debugging

To enable debug logs set Scrapfly's log level to `"DEBUG"`:

```javascript

import { log } from 'scrapfly-sdk';

log.setLevel('DEBUG');

```

Additionally, set `debug=true` in `ScrapeConfig` to access debug information in [Scrapfly web dashboard](https://scrapfly.io/dashboard):

```typescript

import { ScrapflyClient } from 'scrapfly-sdk';

new ScrapeConfig({

    url: 'https://web-scraping.dev/product/1',

    debug: true,

    // ^ enable debug information - this will show extra details on web dashboard

});

```

## Development

This is a Deno Typescript project that builds to NPM through [DNT](https://github.com/denoland/dnt).

- `/src` directory contains all of the source code with `main.ts` being the entry point.

- `__tests__` directory contains tests for the source code.

- `deno.json` contains meta information

- `build.ts` is the build script that builds the project to nodejs ESM package.

- `/npm` directory will be produced when `built.ts` is executed for building node package.

```bash

# make modifications and run tests

$ deno task test

# format

$ deno fmt

# lint

$ deno lint

# publish JSR:

$ deno publish

# build NPM package:

$ deno task build-npm

# publish NPM:

$ cd npm && npm publish

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/scrapfly/typescript-scrapfly

Awesome Lists containing this project

README