Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/janheinrichmerker/song-analysis

Analysing the Million Song Dataset.
https://github.com/janheinrichmerker/song-analysis

big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs

Last synced: 4 days ago
JSON representation

Analysing the Million Song Dataset.

Awesome Lists containing this project

README

        

# Data Analytics Project: The Million Song Dataset

Analysing The Million Song Dataset.

## Installation

1. Download and install _HDF Java_ and _HDFView+Object_:
```bash
./lib/hdf/install-hdf.sh
```
1. Download and install _Hadoop_:
```bash
./lib/hadoop/install-hadoop.sh
```
1. Download the data sets:
```bash
./data/download.sh
```

## Configuration

### Log level

If you want Hadoop to only print warnings to the console,
add the following line to `lib/hadoop/hadoop-3.1.0/etc/hadoop/hadoop-env.sh`:
```bash
HADOOP_ROOT_LOGGER=WARN,DRFA
```