Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sensiblecodeio/databaker
Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.
https://github.com/sensiblecodeio/databaker
convert-spreadsheets excel ons
Last synced: 24 days ago
JSON representation
Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.
- Host: GitHub
- URL: https://github.com/sensiblecodeio/databaker
- Owner: sensiblecodeio
- License: other
- Created: 2015-01-09T11:21:10.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2023-12-14T01:38:32.000Z (about 1 year ago)
- Last Synced: 2024-11-13T21:50:58.408Z (about 1 month ago)
- Topics: convert-spreadsheets, excel, ons
- Language: Jupyter Notebook
- Homepage:
- Size: 2.26 MB
- Stars: 78
- Watchers: 10
- Forks: 13
- Open Issues: 12
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- jimsghstars - sensiblecodeio/databaker - Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics. (Jupyter Notebook)
README
# Databaker
Jupyter notebook tool for converting data that is laid out in a formatted Excel
spreadsheet into a normalized form for use by databases.It depends on [okfn/messytables](https://github.com/okfn/messytables) and
[sensiblecodeio/xypath](https://github.com/sensiblecodeio/xypath)Python 3.4+ supported.
## Starting up
### For development
To install for development, the easiest way is create a virtualenv,
activate it:`source bin/activate`
and then type
`pip install -e git+https://github.com/sensiblecodeio/databaker.git#egg=databaker`
This will install the code into `src/databaker` where you can edit and commit it.
### For normal use
Install with `pip install databaker`
## Usage
Launch a Jupyter notebook:
`jupyter notebook`
and then follow the tutorials as described below.
## Documentation
The current documentation is in the form of Jupyter notebooks located
inside the [tutorial](databaker/tutorial) directory.You can access these directly by creating a new Jupyter notebook and
running the following in a Jupyter cell:```
from databaker.tutorial import tutorial
tutorial()
```which will copy the tutorials to your current directory and provide
links to these copied notebooks.## Authors
Made by the [Sensible Code Company](http://sensiblecode.io) on behalf of the
[Office of National Statistics](https://www.ons.gov.uk/) (UK).