https://github.com/srinivasreddy/companies-list
https://github.com/srinivasreddy/companies-list
csv datascraping extractor fortune-500 puppeter typescript
Last synced: 5 months ago
JSON representation
- Host: GitHub
- URL: https://github.com/srinivasreddy/companies-list
- Owner: srinivasreddy
- License: mit
- Created: 2019-12-02T11:37:49.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2025-11-19T08:47:15.000Z (7 months ago)
- Last Synced: 2025-11-19T10:20:45.692Z (7 months ago)
- Topics: csv, datascraping, extractor, fortune-500, puppeter, typescript
- Language: TypeScript
- Size: 133 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE.md
Awesome Lists containing this project
README
# About
This project extracts Fortune - 1000 companies data from a hidden url of the Fortune website.
# Usage
```typescript
import { Crawler } from "companies-list";
async function start() {
const crawler = new Crawler(2019, "./srinivas.csv");
await crawler.generateCSVFile();
}
start();
```
# TODO
1. Implement for any year. Does API support this? And year should be configureable too.
2. Option to implement proxy server(s)
3. csv fields generated should be configurable?
4. Implement persistance to crawl where it left off?
5. Dockerize this application
# License
MIT