Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/charlesfrye/7dbs-hbase
Updated code for HBase wikidump example from Seven Databases in Seven Weeks
https://github.com/charlesfrye/7dbs-hbase
Last synced: 27 days ago
JSON representation
Updated code for HBase wikidump example from Seven Databases in Seven Weeks
- Host: GitHub
- URL: https://github.com/charlesfrye/7dbs-hbase
- Owner: charlesfrye
- Created: 2023-01-19T06:33:13.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-01-19T07:55:20.000Z (almost 2 years ago)
- Last Synced: 2024-10-27T15:37:40.834Z (3 months ago)
- Language: Ruby
- Size: 3.91 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Setup
See [this medium post](https://sanjay-vishwakarma.medium.com/hbase-db-installation-and-data-read-write-via-hbase-shell-bulk-loading-and-non-bulk-loading-437026218d00)
for info on setting the Docker container up.That is the source of the `run-hbase.sh` script.
### Get an `hbase` shell:
```bash
docker exec -it hbase-docker hbase shell
```### Get a `bash` shell:
```bash
docker exec -it hbase-docker hbase shell
```# Wikipedia Dump
The `.rb` files have been modified to reflect updates to HBase
since `1.0`.In particular, they use proper `Connection`s to the tables
and switch to a `BufferedMutator` for batched writes,
instead of manually managing commit flushing.Copy the files into `./data`,
which will be generated by `run-hbase`,
to get access to them inside the Docker container.Hacky, I know.