An open API service indexing awesome lists of open source software.

https://github.com/textcorpuslabs/edgar

Create a corpus from EDGAR data
https://github.com/textcorpuslabs/edgar

corpus edgar-scraper python3

Last synced: about 2 months ago
JSON representation

Create a corpus from EDGAR data

Awesome Lists containing this project

README

          

# EDGAR

Access various parts of EDGAR of processing in other projects.

The jupyter notebooks are run in [VSCode](https://code.visualstudio.com/Download).
If you want to run them in [Google's CoLab](https://colab.research.google.com/), you may need to make modification.

## Workflow

Run the below scripts to generate the EDGAR corpus of your choice.
In the Interest of saving time, you can get the cached versions in GitHub's [releases](https://github.com/TextCorpusLabs/Edgar/releases) section.
We use [7-zip](https://www.7-zip.org) to accommodate GitHub's 2GB limitation.

| Date | Link |
| --- | --- |
| 2024/02/10 | [10-K](/Create%2010-K%20Corpus.ipynb) |

## Requirements

```{ps1}
pip install --upgrade pip
pip install -r "c:/repos/TextCorpusLabs/Edgar/requirements.txt"
```