https://github.com/mesmacosta/hive-table-metadata-generator
This script generates random metadata for the Hive metastore.
https://github.com/mesmacosta/hive-table-metadata-generator
apache-hive bigdata datawarehouse metadata
Last synced: 9 months ago
JSON representation
This script generates random metadata for the Hive metastore.
- Host: GitHub
- URL: https://github.com/mesmacosta/hive-table-metadata-generator
- Owner: mesmacosta
- License: mit
- Created: 2019-11-15T16:31:40.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2019-11-15T16:37:33.000Z (over 6 years ago)
- Last Synced: 2025-02-28T01:51:59.492Z (over 1 year ago)
- Topics: apache-hive, bigdata, datawarehouse, metadata
- Language: Python
- Homepage:
- Size: 10.7 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# hive-table-metadata-generator
To test some Hive capabilities, it’s good to have a good number of tables with different complex column types inside the Hive metastore. This script generates random metadata for the Hive metastore.
## Activate your virtualenv if it’s not up
`source ./env/bin/activate`
## Install the requirements for the metadata generator
`pip install -r requirements.txt`
If you receive an error when installing the requirements, run (linux distro):
`sudo apt-get install python-dev libsasl2-dev gcc`
`sudo apt-get install sasl2-bin libsasl2-2 libsasl2-dev libsasl2-modules`
Then run the install requirements again.
## Run the script
`python hive_metastore_metadata_generator.py`