An open API service indexing awesome lists of open source software.

https://github.com/ojeda-e/nytrends

Data mining and time series: The New York Times Archive
https://github.com/ojeda-e/nytrends

Last synced: about 1 year ago
JSON representation

Data mining and time series: The New York Times Archive

Awesome Lists containing this project

README

          

# NYTrends
Data mining and time series: The New York Times Archive.

This repository access the [New York Times Archive API](https://developer.nytimes.com) to get NYT article metadata for a given year and month.

## Getting Started

More detail about NYT API [here](https://developer.nytimes.com/docs/archive-product/1/overview)

### Prerequisites

- Requests
- Calendar
- Operator

To install request

```
pip3.4 install requests
```

or using conda:

```
conda install -c anaconda requests
```

To analize extracted data the following modules are required:

- Matplotlib
- Numpy
- Scipy
- Pandas

All the notebooks here included use Python 3.7. The exercises are intended to be a visualization tool.


## The NYT Archive

### Extracting Data

### Analyzing Data

#### By Header
##### Word Frequency

##### Time Series Analysis

#### By Abstract

#### Sentiment Analysis - NLP

#### Keyword extraction - NLP

## Author

* **Estefania Barreto-Ojeda** - *Initial work* - [GitHub](https://github.com/ebojeda)