Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/abeusher/timehashlib

An algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.
https://github.com/abeusher/timehashlib

Last synced: 1 day ago
JSON representation

An algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.

Awesome Lists containing this project

README

        

# timehashlib
An algorithm for creating user configurable, variable-precision sliding windows of time. Useful for binning time values in large collections of data.

timehash
========

About
-----

timehash is an algorithm (with multiple reference implementations) for
calculating variable precision sliding windows of time. When performing
aggregations and correlations on large-scale data sets, the ability to
convert precise time values into 'malleable intervals' allows for many
novel analytics.

Using `sliding windows of
time `__
is a common practice in data analysis but prior to the timehash
algorithm it was more of an art than a science.

.. image:: timehash-shaded-partial.png

Features
--------

- convert epoch miliseconds into an interval of time, depicted by an
ASCII character 'hash' (a 'timehash')
- timehash values are well suited to referencing time intervals in
key-value stores (e.g. Hbase, Acculumo, Redis)
- The creation of a compound key of space and time (e.g.
geohash\_timehash) is a powerful primitive for understanding
geotemporal patterns

Implementations
---------------

- `python
timehash `__
- a reference implementation in pure python
- `perl
timehash `__
- a reference implementation in perl
- `java
timehash `__
- a reference implementation in java
- `javascript
time-hash `__
- port of reference implements
- `golang
timehash `__
- a reference implementation in Go

Usage
-----

Example of calculating a timehash value in python:

.. code:: python

import timehash
import time

rightnow = time.time()
rightnow60 = rightnow + 60.0

rightnow_hash = timehash.encode(rightnow, precision=10)
rightnow60_hash = timehash.encode(rightnow60, precision=10)

print 'timehash of right now: %s' % rightnow_hash
print 'timehash of now +60s: %s'% rightnow60_hash

% timehash of right now: ae0f0ba1fc
% timehash of now +60s: ae0f0baa1c

License
-------

`Modified BSD
License `__

Contact
-------

- TimeHash Guru: `AbeUsher `__
- Python Packager: `Kevin Dwyer / @pheared `__
- Golang implementation: `Justin Shelton / @kwonstant `__
- Javascript implementation: `Eric Xanderson `__