Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/muhammedzohaib/drugscraper

A simple scrapy spider to scrape https://www.drugs.com
https://github.com/muhammedzohaib/drugscraper

Last synced: about 6 hours ago
JSON representation

A simple scrapy spider to scrape https://www.drugs.com

Awesome Lists containing this project

README

        

# DrugScraper

This Scrapy spider (`DrugsspiderSpider`) scrapes drug information from `www.drugs.com`. It extracts drug names, generic names, drug classes, and URLs directly from the website.

## Spider Details

- **Name:** `DrugsspiderSpider`
- **Allowed Domains:** `www.drugs.com`

## Fields Extracted

- **Name:** Drug name.
- **Generic Name:** Generic name of the drug.
- **Drug Class:** Class of the drug.
- **URL:** URL of the drug page.

## Usage

1. Clone the repository:

```bash
git clone https://github.com/MuhammedZohaib/DrugScraper.git
cd DrugScraper
```

2. Run the spider:

```bash
scrapy crawl drugsSpider
```

3. Output: Drug information (name, generic name, drug class, URL) in JSON and CSV format.

---