https://github.com/textcorpuslabs/edgar
Create a corpus from EDGAR data
https://github.com/textcorpuslabs/edgar
corpus edgar-scraper python3
Last synced: about 2 months ago
JSON representation
Create a corpus from EDGAR data
- Host: GitHub
- URL: https://github.com/textcorpuslabs/edgar
- Owner: TextCorpusLabs
- License: mit
- Created: 2023-12-21T01:35:05.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-03-20T17:32:02.000Z (over 2 years ago)
- Last Synced: 2025-03-22T07:14:49.140Z (over 1 year ago)
- Topics: corpus, edgar-scraper, python3
- Language: Jupyter Notebook
- Homepage:
- Size: 112 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# EDGAR
Access various parts of EDGAR of processing in other projects.
The jupyter notebooks are run in [VSCode](https://code.visualstudio.com/Download).
If you want to run them in [Google's CoLab](https://colab.research.google.com/), you may need to make modification.
## Workflow
Run the below scripts to generate the EDGAR corpus of your choice.
In the Interest of saving time, you can get the cached versions in GitHub's [releases](https://github.com/TextCorpusLabs/Edgar/releases) section.
We use [7-zip](https://www.7-zip.org) to accommodate GitHub's 2GB limitation.
| Date | Link |
| --- | --- |
| 2024/02/10 | [10-K](/Create%2010-K%20Corpus.ipynb) |
## Requirements
```{ps1}
pip install --upgrade pip
pip install -r "c:/repos/TextCorpusLabs/Edgar/requirements.txt"
```