Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/delium/clickhouse-migrator
Easy data migrator to robust clickhouse
https://github.com/delium/clickhouse-migrator
Last synced: 3 months ago
JSON representation
Easy data migrator to robust clickhouse
- Host: GitHub
- URL: https://github.com/delium/clickhouse-migrator
- Owner: delium
- License: mit
- Created: 2020-09-24T16:38:24.000Z (about 4 years ago)
- Default Branch: master
- Last Pushed: 2021-12-27T23:13:40.000Z (almost 3 years ago)
- Last Synced: 2024-07-21T16:50:42.818Z (4 months ago)
- Language: Python
- Size: 515 KB
- Stars: 66
- Watchers: 6
- Forks: 3
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-clickhouse - delium/clickhouse-migrator - Clickhouse Migrator is an easy-to-use data migration tool designed for the ClickHouse database. (Integrations / ETL and Data Processing)
README
## Clickhouse Migrator
[![Build Status](https://app.travis-ci.com/delium/clickhouse-migrator.svg?branch=master)](https://app.travis-ci.com/delium/clickhouse-migrator)
[![Maintainability](https://api.codeclimate.com/v1/badges/0f3bdcfd7fbf643a0e7b/maintainability)](https://codeclimate.com/github/delium/clickhouse-migrator/maintainability)
[![Test Coverage](https://api.codeclimate.com/v1/badges/0f3bdcfd7fbf643a0e7b/test_coverage)](https://codeclimate.com/github/delium/clickhouse-migrator/test_coverage)[Clickhouse](https://clickhouse.tech/) is known for its scale to store and fetch large datasets.
Development and Maintenance of large-scale db systems many times requires constant changes to the actual DB system.
Holding off the scripts to migrate these will be painful.We found there is nothing existing earlier and developed one inspired by, [Flyway](https://flywaydb.org/), [Alembic](https://alembic.sqlalchemy.org/en/latest/)
This is a python library, which you can execute as a pre-hook using sys python.
Or as a migration framework before deployment/server-startup in your application as required.### Publishing to pypi
* python -m build
* python -m twine upload --verbose --repository pypi dist/*### Installation
You can install from pypi using `pip install clickhouse-migrator`.
### Usage
```python
# <= v1.0.4
from migration_lib.migrate import migratemigrate(db_name, migrations_home, db_host, db_user, db_password, create_db_if_no_exists)
``````python
# > v1.0.4
from clickhouse_migrate.migrate import migratemigrate(db_name, migrations_home, db_host, db_user, db_password, create_db_if_no_exists)
```Parameter | Description | Default
-------|-------------|---------
db_name| Clickhouse database name | None
migrations_home | Path to list of migration files |
db_host | Clickhouse database hostname | localhost
db_password | ***** | ****
create_db_if_no_exists | If the `db_name` is not present, enabling this will create the db | True
db_port | Database port incase your server runs on a non default port | None (defaults to 9000)
queue_exec | Command Pipelining (wait for system mutations if any to complete) for every command | True### Folder and Migration file patterns
The filenames are pretty similar to how `flyway` keeps it.
Your first version filename should be prefixed with `V1__` (double underscore)
These migrations are executed one by one, failures in between will stop and not further version files will be executed.#### Multi statement and single statement migrations
If your migration is a single statement, you can create a file in the migration folder using the .sql extension and push your migration statement in there.
If you want to execute more than one statement in your migration, you can use a json file using the array syntax. Note that when using a json file, contents should be a valid json array as show. Ensure to keep migrations logical. Its not a good practise to push all migrations to one json file and neither is it wise to in all cases have them each statement in one file.
```json
[
"CREATE TABLE pytest.sample1(id UInt32, name String) ENGINE MergeTree PARTITION BY tuple() ORDER BY tuple()",
"CREATE TABLE pytest.sample2(id UInt32, name String) ENGINE MergeTree PARTITION BY tuple() ORDER BY tuple()",
"CREATE TABLE pytest.sample3(id UInt32, name String) ENGINE MergeTree PARTITION BY tuple() ORDER BY tuple()"
]
```