https://github.com/cmblir/dinnovation
It was developed to facilitate the work of collecting, processing, and loading the data required for the Big Data Center. In addition, various libraries are used in the project, which are available under the Apache 2.0 license.
https://github.com/cmblir/dinnovation
beautifulsoup4 company-data financial-data pandas python selenium
Last synced: 2 months ago
JSON representation
It was developed to facilitate the work of collecting, processing, and loading the data required for the Big Data Center. In addition, various libraries are used in the project, which are available under the Apache 2.0 license.
- Host: GitHub
- URL: https://github.com/cmblir/dinnovation
- Owner: cmblir
- License: apache-2.0
- Created: 2022-08-06T08:43:42.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2023-08-29T06:35:27.000Z (almost 3 years ago)
- Last Synced: 2025-08-30T03:53:56.708Z (10 months ago)
- Topics: beautifulsoup4, company-data, financial-data, pandas, python, selenium
- Language: Python
- Homepage:
- Size: 112 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# Download market data from various information sites
*** Important Legal Disclaimer ***
---
Please note that dinnovation is not affiliated, endorsed, or vetted by any source sites. Use at your own risk and discretion.
**For more information about the rights to use the actual data you downloaded, see the Terms of Use for each site. dinnovation is for personal use only.**
---
---
## Digital Industry Innovation Data Platform Big data collection and processing, database loading, distribution
It was developed to facilitate the work of collecting, processing, and loading the data required for the Big Data Center.
In addition, various libraries are used in the project, which are available under the Apache 2.0 license.
## Requirements
**required python version**
```Python >= 3.9```
To install the related library, use the command below.
``` pip install requirements.txt ```
or
``` python setup.py install ```
To install the related libray
``` pip install dinnnovation ```
**required library**
```
pandas==1.5.3
numpy==1.24.2
tqdm==4.64.1
OpenDartReader==0.2.1
beautifulsoup4==4.11.2
urllib3==1.26.14
selenium==4.8.2
webdriver_manager==3.8.5
chromedriver_autoinstaller==0.4.0
psycopg2==2.9.5
sqlalchemy==2.0.4
cryptography==41.0.3
```
---
## Dinnovation Architecture

- Dinnovation's architecture is built on efficiency. Data from multiple sites can be collected, processed, and loaded with one module.
## Dinnovation Process

- Dinnovation's Auto Process automatically collects, processes, and loads data using the scheduler. This can be done more efficiently than simply using a module.
## Dinnovation Workflow

- Dinnovation's workflow was created by a single developer. This will automatically update the package and the release of Github when you modify the code and upload it to Git for the efficiency of your work. In addition, version management is also automatically.
## How to use
[Guide](./quick_start/README.md)