An open API service indexing awesome lists of open source software.

https://github.com/luminati-io/b2b-business-dataset-samples

A collection of multiple business dataset samples. Each sample contains more than 1,000 records. These datasets are perfect for sales, marketing, competitor mapping, and more.
https://github.com/luminati-io/b2b-business-dataset-samples

6sense b2b b2b-database business dataset datasets linkedin trustpilot web-scraping yelp zoominfo

Last synced: 2 months ago
JSON representation

A collection of multiple business dataset samples. Each sample contains more than 1,000 records. These datasets are perfect for sales, marketing, competitor mapping, and more.

Awesome Lists containing this project

README

        

# B2B-dataset-samples

A collection of sample B2B datasets, each one with over 1,000 records.

![B2B dataset header](https://github.com/luminati-io/B2B-business-dataset-samples/blob/main/b2b-datasets.PNG)

Business dataset samples with thousands of records in total. All the dataset were extracted using the Bright Data API.

Some of the data points include:

* ```url``` : The company's profile link
* ```id``` : Unique identifier for the company in the dataset
* ```name``` : The official name of the company
* ```description``` : A brief overview of the company's history, services, or products
* ```revenue``` : The company's total revenue
* ```revenue_currency``` : The currency used for the revenue figures
* ```stock_symbol``` : The company's stock ticker symbol
* ```website``` : Official website of the company
* ```employees``` : The number of employees working at the company
* ```industry``` : The primary industries the company operates in
* ```headquarters``` : The physical address of the company's headquarters
* ```phone_number``` : The company's main contact phone number
* ```total_funding_amount``` : The total amount of funding the company has received
* ```most_recent_funding_amount``` : The amount raised in the most recent funding round
* ```funding_currency``` : Currency for funding amounts
* ```funding_rounds``` : Number of funding rounds the company has participated in
* ```leadership``` : List of top executives and their titles
* ```popular_searches``` : Common terms associated with the company
* ```business_classification_codes``` : Business Classification Codes like SIC Code, NAICS Code, and Ticker
* ```ceo``` : Name and title of the company’s CEO
* ```total_employees``` : Total number of employees in the company
* ```c_level_employees``` : Number of C-level executives in the company
* ```vp_level_employees``` : Number of vice presidents in the company
* ```director_level_employees``` : Number of directors in the company
* ```manager_level_employees``` : Number of managers in the company
* ```non_manager_employees``` : Number of employees below managerial level
* ```top_contacts``` : Key contact persons at the company
* ```org_chart``` : The company’s organizational hierarchy
* ```social_media``` : The company's official social media profiles
* ```ceo_rating``` : Rating of the CEO’s performance
* ```enps_score``` : Employee Net Promoter Score, indicating employee satisfaction
* ```similar_companies``` : Companies similar to the one profiled
* ```email_formats``` : Common email address formats used by the company
* ```products_owned``` : Products the company offers
* ```tech_stack``` : Technologies used by the company
* ```recent_scoops``` : Recent updates, job openings, or notable company changes
* ```news_and_media``` : News articles and media coverage related to the company

And a lot more.

## Most popular business datasets

- [LinkedIn Datasets](https://brightdata.com/products/datasets/linkedin)
- [Glassdoor Datasets](https://brightdata.com/products/datasets/glassdoor)
- [Indeed Datasets](https://brightdata.com/products/datasets/indeed)
- [Google Maps Datasets](https://brightdata.com/products/datasets/google-maps)
- [Yelp Datasets](https://brightdata.com/products/datasets/yelp)
- [Zoominfo Datasets](https://brightdata.com/products/datasets/zoominfo)
- [G2 Datasets](https://brightdata.com/products/datasets/g2)
- [Trustpilot Datasets](https://brightdata.com/products/datasets/trustpilot)

And many more.

These are sample datasets and subsets which are derived from dozens of "B2B Datasets (public data)"
which include billions of records.

Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.

Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.

Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.

Data enrichment available as an addition to the data points extracted: Based on request.

[Get the full B2B/Business Datasets](https://brightdata.com/products/datasets/business).

What are the B2B datasets use cases?

1. Competitor Mapping


Enhance your competitive analysis by identifying key companies, professionals, and employee movements. Understand your competitor landscape, track company and career developments, and assess evolving market trends.

2. Investment Data


Gain a comprehensive view of company growth and industry shifts for data-driven decision-making. Perfect for hedge funds, VCs, and financial institutions looking to strengthen their investment strategies and uncover high-value opportunities.

3. Sales & Marketing


Supercharge your lead generation efforts with detailed company profiles and records. Develop your ideal customer profile and refine your sales funnel by curating targeted lead lists and custom audiences.

Free access to web scraping tools and datasets for academic researchers and NGOs

The Bright Initiative offers access to Bright Data's [Web Scraper APIs](https://brightdata.com/products/web-scraper) and [ready-to-use datasets](https://brightdata.com/products/datasets) to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application [here](https://brightinitiative.com).