Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/darenr/spark-pca
Dimensional reduction, Scatter, Hexbin and kde plots
https://github.com/darenr/spark-pca
pca python spark
Last synced: 6 days ago
JSON representation
Dimensional reduction, Scatter, Hexbin and kde plots
- Host: GitHub
- URL: https://github.com/darenr/spark-pca
- Owner: darenr
- Created: 2016-03-17T20:31:07.000Z (almost 9 years ago)
- Default Branch: master
- Last Pushed: 2016-05-02T17:18:05.000Z (almost 9 years ago)
- Last Synced: 2024-12-11T17:06:10.195Z (2 months ago)
- Topics: pca, python, spark
- Language: Python
- Size: 1.95 KB
- Stars: 1
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
### spark-pca
#take
copy iris.json to /tmp
run with spark-submit pca.py > pca.out
python plotpca.pytakes the iris dataset, reduces the 4 dimensional form of the sepal/petal length/widths and
produces a PCA result in 2 dimensions. This is then plotted in python/matplotlib in three forms