https://github.com/ocha-dap/hdx-python-utilities
A range of helpful utilities for Python developers including streaming tabular data, date parsing, JSON and YAML handling, dictionary and list utilities
https://github.com/ocha-dap/hdx-python-utilities
datetime hdx json logging stream-processing tabular-data utilities yaml
Last synced: 4 months ago
JSON representation
A range of helpful utilities for Python developers including streaming tabular data, date parsing, JSON and YAML handling, dictionary and list utilities
- Host: GitHub
- URL: https://github.com/ocha-dap/hdx-python-utilities
- Owner: OCHA-DAP
- License: mit
- Created: 2017-10-10T14:47:02.000Z (over 8 years ago)
- Default Branch: main
- Last Pushed: 2026-01-15T04:48:43.000Z (5 months ago)
- Last Synced: 2026-01-18T15:21:09.533Z (5 months ago)
- Topics: datetime, hdx, json, logging, stream-processing, tabular-data, utilities, yaml
- Language: Python
- Homepage: https://hdx-python-utilities.readthedocs.io/en/latest/
- Size: 2.65 MB
- Stars: 10
- Watchers: 15
- Forks: 6
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
README
[](https://github.com/OCHA-DAP/hdx-python-utilities/actions/workflows/run-python-tests.yaml)
[](https://coveralls.io/github/OCHA-DAP/hdx-python-utilities?branch=main)
[](https://github.com/astral-sh/ruff)
[](https://pypistats.org/packages/hdx-python-utilities)
The HDX Python Utilities Library provides a range of helpful utilities for Python developers.
Note that these are not specific to HDX.
1. Easy downloading of files with support for authentication, streaming and hashing
1. Retrieval of data from url with saving to file or from data previously saved
1. Date utilities
1. Loading and saving JSON and YAML (maintaining order)
1. Loading and saving HXLated csv and/or JSON
1. Dictionary and list utilities
1. HTML utilities (inc. BeautifulSoup helper)
1. Compare files (eg. for testing)
1. Simple emailing
1. Easy logging setup and error logging
1. State utility
1. Path utilities
1. URL utilities
1. Text processing
1. Stable file hashing
1. Matching utilities
1. Encoding utilities
1. Check valid UUID
1. Easy building and packaging
For more information, please read the [documentation](https://hdx-python-utilities.readthedocs.io/en/latest/).
This library is part of the [Humanitarian Data Exchange](https://data.humdata.org/) (HDX) project. If you have
humanitarian related data, please upload your datasets to HDX.
# Development
## Environment
Development is currently done using Python 3.13. The environment can be created with:
```shell
uv sync
```
This creates a .venv folder with the versions specified in the project's uv.lock file.
### Pre-commit
pre-commit will be installed when syncing uv. It is run every time you make a git
commit if you call it like this:
```shell
pre-commit install
```
With pre-commit, all code is formatted according to
[ruff](https://docs.astral.sh/ruff/) guidelines.
To check if your changes pass pre-commit without committing, run:
```shell
pre-commit run --all-files
```
## Packages
[uv](https://github.com/astral-sh/uv) is used for package management. If
you’ve introduced a new package to the source code (i.e. anywhere in `src/`),
please add it to the `project.dependencies` section of `pyproject.toml` with
any known version constraints.
To add packages required only for testing, add them to the
`[dependency-groups]`.
Any changes to the dependencies will be automatically reflected in
`uv.lock` with `pre-commit`, but you can re-generate the files without committing by
executing:
```shell
uv lock --upgrade
```
## Project
[uv](https://github.com/astral-sh/uv) is used for project management. The project can be
built using:
```shell
uv build
```
Linting and syntax checking can be run with:
```shell
uv run ruff check
```
To run the tests and view coverage, execute:
```shell
uv run pytest
```
## Documentation
The documentation, including API documentation, is generated using ReadtheDocs and
MkDocs with Material. As you change the source code, remember to update the
documentation at `documentation/index.md`.