Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jorgenpt/public_liquor_data
Liquor data (bar codes, descriptions, volume, pricing) from public records w/ scripts to parse & massage
https://github.com/jorgenpt/public_liquor_data
Last synced: 22 days ago
JSON representation
Liquor data (bar codes, descriptions, volume, pricing) from public records w/ scripts to parse & massage
- Host: GitHub
- URL: https://github.com/jorgenpt/public_liquor_data
- Owner: jorgenpt
- License: mit
- Created: 2024-03-10T21:51:02.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2024-03-10T22:02:35.000Z (10 months ago)
- Last Synced: 2024-10-03T12:34:28.917Z (3 months ago)
- Language: Python
- Size: 18 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Public Liquor Data
In order to build tools that work with bar codes for liquor bottles, I've tried to collect and parse publicly available data.
These data dumps were received under public records requests or publicly available from US states' various liquor control bodies.All the scripts are [licensed under the MIT license](LICENSE), and the data is made available under the public record laws of the relevant state.
## Data sources
### [Oregon](https://github.com/jorgenpt/public_liquor_data/tree/main/data/oregon/)
These are records received from [email protected].
The "GTIN List With Price" is a slightly awkward Excel format, but parseable with [`xlrd`](https://xlrd.readthedocs.io/en/latest/) and some logic, [see oregon_gtin_list_to_json.py](oregon_gtin_list_to_json.py).
### [Utah](https://github.com/jorgenpt/public_liquor_data/tree/main/data/utah/)
These are public records published on the following URLs:
- https://abs.utah.gov/shop-products/interactive-product-list/
- https://abs.utah.gov/vendors/monthly-price-books/The "Product List" Excel spreadsheet is excellent and easily parseable (with [`pandas`](https://pandas.pydata.org/) in this case, [see utah_product_list_to_csv.py](utah_product_list_to_csv.py)), but does not contain any bar code information.
The "Numeric Price List" on the other hand has a mapping from Utah's "CSC" product codes to bar codes, but is only available as a PDF. While somewhat painful to parse, [`camelot`](https://camelot-py.readthedocs.io/) provides a lot of help.
### [Washington](https://github.com/jorgenpt/public_liquor_data/tree/main/data/washington/)
This is a dump received from [email protected], containing the latest data before the state stopped running their own liquor stores. Unclear if this will be useful, so it has not been parsed yet.