https://github.com/selfint/sec_html_parser
Simple SEC form (e.g. 10K) parser
https://github.com/selfint/sec_html_parser
parsing-library python3
Last synced: about 1 year ago
JSON representation
Simple SEC form (e.g. 10K) parser
- Host: GitHub
- URL: https://github.com/selfint/sec_html_parser
- Owner: selfint
- Created: 2021-07-06T19:39:42.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2021-07-13T06:37:38.000Z (almost 5 years ago)
- Last Synced: 2025-02-14T08:49:11.859Z (over 1 year ago)
- Topics: parsing-library, python3
- Language: Python
- Homepage:
- Size: 1.06 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 2
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# SEC HTML parser
SEC documents aren't formatted in a way that is easy for computers to understand. The
purpose of this library is to take the unstructured format of the SEC documents (e.g.
10K annual report forms) and convert them into a JSON format.
## Examples
The module can also be executed like so:
```sh
$ python -m sec_html_parser /path/to/10k/form.html -o /path/to/output.json
```