Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cstorm125/curry
The Data-Driven Guide to Bangkok Prostitutes
https://github.com/cstorm125/curry
Last synced: about 1 month ago
JSON representation
The Data-Driven Guide to Bangkok Prostitutes
- Host: GitHub
- URL: https://github.com/cstorm125/curry
- Owner: cstorm125
- Created: 2016-07-04T15:47:07.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-06-19T10:29:04.000Z (over 6 years ago)
- Last Synced: 2023-03-04T03:22:20.717Z (almost 2 years ago)
- Language: HTML
- Size: 1.29 MB
- Stars: 10
- Watchers: 3
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# The Data-Driven Guide to Bangkok Prostitutes
This repository is a sandbox for a [Medium article](https://medium.com/p/af965fc55e4b/). You can also read the article [here](http://cstorm125.github.io/curry)
## Codebook
The ```curry dataset``` contains 693 observations of prostitues operating in Bangkok and nearby areas. The data were scraped from an online listing on May 21, 2016 and stored as XML in the ```data``` folder. The raw dataset (```big.rds```) contains such features as name, gender, age, price, range of services and locations, physical measurements, view counts, number of pictures, and contacts of all prostitutes (22 features in total). Furthermore, the processed dataset (```features.rds```) augmented such features as location clusters, body measurement ratios, and view counts per day listed, as well as transformed some features such as location and range of services for machine learning purposes (58 features in total). This report explains the process.## Processed
This folder contains processed files of the ```curry dataset``` as described in the codebook.