Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
- Host: GitHub
- URL: https://github.com/janheinrichmerker/song-analysis
- Owner: janheinrichmerker
- License: mit
- Created: 2018-06-20T01:56:46.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2020-03-14T12:01:47.000Z (almost 5 years ago)
- Last Synced: 2024-12-22T05:09:45.421Z (7 days ago)
- Topics: big-data, data-analysis, data-science, hadoop, hadoop-mapreduce, java, kotlin, songs
- Language: Kotlin
- Homepage:
- Size: 10.8 MB
- Stars: 2
- Watchers: 3
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# Data Analytics Project: The Million Song Dataset
Analysing The Million Song Dataset.
## Installation
1. Download and install _HDF Java_ and _HDFView+Object_:
```bash
./lib/hdf/install-hdf.sh
```
1. Download and install _Hadoop_:
```bash
./lib/hadoop/install-hadoop.sh
```
1. Download the data sets:
```bash
./data/download.sh
```
## Configuration
### Log level
If you want Hadoop to only print warnings to the console,
add the following line to `lib/hadoop/hadoop-3.1.0/etc/hadoop/hadoop-env.sh`:
```bash
HADOOP_ROOT_LOGGER=WARN,DRFA
```
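To avoid editing the file by hand, the line above could be appended with a small helper. This is only a sketch: the function name `set_hadoop_root_logger` is hypothetical and not part of this repository, and it assumes that `hadoop-env.sh` already exists and ends with a newline.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (not part of this repository):
# append the logger setting to the given hadoop-env.sh file,
# unless exactly that line is already present.
set_hadoop_root_logger() {
  local env_file="$1"
  local setting="HADOOP_ROOT_LOGGER=WARN,DRFA"
  # -x matches whole lines only, -F treats the setting as a literal string.
  if ! grep -qxF "$setting" "$env_file"; then
    printf '%s\n' "$setting" >> "$env_file"
  fi
}
```

It would be called as `set_hadoop_root_logger lib/hadoop/hadoop-3.1.0/etc/hadoop/hadoop-env.sh`. Because of the `grep` check, running it more than once leaves only a single copy of the line in the file.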