https://github.com/codewritingcow/sec-web-scraper-13f
Python Web scraper for parsing 13F filings (mutual fund holdings) from SEC website
- Host: GitHub
- URL: https://github.com/codewritingcow/sec-web-scraper-13f
- Owner: CodeWritingCow
- License: mit
- Created: 2019-07-18T21:43:54.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2023-08-12T22:43:01.000Z (over 2 years ago)
- Last Synced: 2024-01-17T10:56:38.327Z (about 2 years ago)
- Topics: python, scraper
- Language: Python
- Homepage:
- Size: 24.4 KB
- Stars: 39
- Watchers: 2
- Forks: 18
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# SEC Python Web Scraper
This repository contains a Python web scraper that parses 13F filings (mutual fund holdings) from the SEC's [EDGAR](https://www.sec.gov/edgar/searchedgar/companysearch.html) website and writes the data to a .tsv file.
## Requirements
#### Getting Started
- `pip install -r requirements.txt` (or `pipenv install` if you are using pipenv)
- `python scraper.py` (or `pipenv run python scraper.py`)
- When prompted, enter the 10-digit CIK (Central Index Key) number of a mutual fund
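
As a rough illustration of the CIK prompt step, the sketch below validates a user-supplied CIK, zero-pads it to 10 digits, and builds an EDGAR search URL for 13F-HR filings. The helper names `normalize_cik` and `build_13f_search_url`, the `EDGAR_BROWSE_URL` endpoint, and the query parameters are assumptions made for this example. They are not taken from `scraper.py`, which may build its requests differently.

```python
import re
from urllib.parse import urlencode

# Assumed EDGAR company-browse endpoint; scraper.py may use a different URL.
EDGAR_BROWSE_URL = "https://www.sec.gov/cgi-bin/browse-edgar"


def normalize_cik(raw):
    """Return a 10-digit, zero-padded CIK string, or raise ValueError."""
    cik = raw.strip()
    if not re.fullmatch(r"[0-9]{1,10}", cik):
        raise ValueError(f"CIK must be 1 to 10 digits, got {raw!r}")
    return cik.zfill(10)


def build_13f_search_url(raw_cik):
    """Build a search URL listing 13F-HR filings for the given CIK."""
    params = {
        "action": "getcompany",
        "CIK": normalize_cik(raw_cik),
        "type": "13F-HR",
    }
    return f"{EDGAR_BROWSE_URL}?{urlencode(params)}"


if __name__ == "__main__":
    # "1166559" is an arbitrary example value, not a CIK named in this README.
    print(build_13f_search_url("1166559"))
```

Zero-padding lets a user type the CIK with or without leading zeros while the rest of the code always sees the 10-digit form.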
#### Key Dependencies
- [Requests](https://2.python-requests.org/en/master/), Python library for making HTTP requests
- [lxml](https://lxml.de/), Python library for processing XML and HTML
- [Beautiful Soup](https://pypi.org/project/beautifulsoup4/), Python library for scraping information from Web pages
- [re](https://docs.python.org/3/library/re.html), Python module for using regular expressions
- [csv](https://docs.python.org/3/library/csv.html), Python module for parsing and writing CSV and TSV files
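
To show how an XML parser and the `csv` module can fit together, here is a minimal sketch that reads holdings from a small XML snippet and writes them out as tab-separated values. It uses the standard library's `xml.etree.ElementTree` in place of lxml or Beautiful Soup so the example runs without extra installs. The sample data, the element names (`infoTable`, `nameOfIssuer`, `cusip`, `value`), and the helper functions are assumptions for illustration and may not match what `scraper.py` actually extracts.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Made-up sample data; element names are assumed, not taken from scraper.py.
SAMPLE_XML = """<informationTable>
  <infoTable>
    <nameOfIssuer>EXAMPLE CORP</nameOfIssuer>
    <cusip>000000000</cusip>
    <value>1234</value>
  </infoTable>
  <infoTable>
    <nameOfIssuer>SAMPLE INC</nameOfIssuer>
    <cusip>111111111</cusip>
    <value>567</value>
  </infoTable>
</informationTable>"""

FIELDS = ["nameOfIssuer", "cusip", "value"]


def local_name(tag):
    """Strip an XML namespace prefix such as '{uri}name' down to 'name'."""
    return tag.rsplit("}", 1)[-1]


def parse_holdings(xml_text):
    """Return one dict per <infoTable> element, limited to FIELDS."""
    root = ET.fromstring(xml_text)
    rows = []
    for node in root.iter():
        if local_name(node.tag) != "infoTable":
            continue
        values = {local_name(c.tag): (c.text or "").strip() for c in node}
        rows.append({field: values.get(field, "") for field in FIELDS})
    return rows


def write_tsv(rows, handle):
    """Write rows to a file-like object as tab-separated values with a header."""
    writer = csv.DictWriter(
        handle, fieldnames=FIELDS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)


# Write to an in-memory buffer; a real run would open a .tsv file instead.
buffer = io.StringIO()
write_tsv(parse_holdings(SAMPLE_XML), buffer)
tsv_text = buffer.getvalue()
```

Passing `delimiter="\t"` to `csv.DictWriter` is what turns the usual CSV output into TSV. To write to disk, open the file with `newline=""` as the `csv` documentation recommends.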
## Contributor
- [Gary Pang](https://github.com/CodeWritingCow)
## References
- [SEC: Frequently Asked Questions About Form 13F](https://www.sec.gov/divisions/investment/13ffaq.htm)