https://github.com/byramag/data_python_exercise
Interview project for Singlestone consulting.
- Host: GitHub
- URL: https://github.com/byramag/data_python_exercise
- Owner: byramag
- Created: 2020-09-29T23:49:49.000Z (almost 4 years ago)
- Default Branch: master
- Last Pushed: 2020-10-03T05:58:59.000Z (almost 4 years ago)
- Last Synced: 2024-06-05T05:32:04.930Z (3 months ago)
- Language: Python
- Size: 57.6 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Abigail Byram: Singlestone Data Python Exercise
## Purpose
This project showcases some simple data manipulation in a Python package using PySpark, with unit testing in pytest.

## Run Instructions
All instructions assume you are running from the project root.

To install the required dependencies, run the following:
```
pip install -r requirements.txt
```

There are two ways to run the app:
1. Direct Python package run:
```
python data_processor --out
```
2. Pipenv script:
```
pipenv run generate-report
```

Note that when an output path is not specified, the output file name defaults to `report.json`.
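
The default output name described above could be wired up roughly as follows. This is a minimal sketch, not the package's actual code: it assumes `--out` takes a file path as its value, and the names `parse_args` and `DEFAULT_OUTPUT` are hypothetical.

```python
import argparse

# Default file name taken from the note above; the constant name is hypothetical.
DEFAULT_OUTPUT = "report.json"


def parse_args(argv=None):
    """Parse command-line options (hypothetical helper, not from the project)."""
    parser = argparse.ArgumentParser(prog="data_processor")
    parser.add_argument(
        "--out",
        default=DEFAULT_OUTPUT,  # used when --out is not given
        help="path of the JSON report to write",
    )
    return parser.parse_args(argv)


# No --out given: falls back to the default name.
print(parse_args([]).out)  # report.json

# Explicit output path (the path itself is only an example).
print(parse_args(["--out", "results/summary.json"]).out)  # results/summary.json
```

Passing `argv` explicitly keeps the parser easy to exercise from tests without touching `sys.argv`.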
Finally, the unit tests can be run with the command:
```
pipenv run unit-test
```
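
For readers unfamiliar with pytest, a test in a project like this might look something like the sketch below. It is an illustration only: `write_report` is a hypothetical helper, not a function from this repository, and plain Python dictionaries stand in for PySpark data so the example runs without a Spark installation.

```python
import json
from pathlib import Path


def write_report(rows, out_path):
    """Hypothetical helper: write a list of dict rows to a JSON file."""
    out_path = Path(out_path)
    out_path.write_text(json.dumps(rows, indent=2))
    return out_path


def test_write_report_round_trips(tmp_path):
    # tmp_path is pytest's built-in per-test temporary directory fixture.
    rows = [{"name": "a", "count": 1}, {"name": "b", "count": 2}]
    out_file = write_report(rows, tmp_path / "report.json")

    # The file exists and its contents decode back to the original rows.
    assert out_file.exists()
    assert json.loads(out_file.read_text()) == rows
```

pytest collects functions whose names start with `test_` and supplies `tmp_path` automatically, so the test never writes into the project directory.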