https://github.com/ojeda-e/nytrends
Data mining and time series: The New York Times Archive
- Host: GitHub
- URL: https://github.com/ojeda-e/nytrends
- Owner: ojeda-e
- Created: 2019-07-09T20:33:10.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2019-07-23T06:51:59.000Z (almost 7 years ago)
- Last Synced: 2025-02-17T06:25:52.342Z (over 1 year ago)
- Language: Jupyter Notebook
- Size: 3.53 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# NYTrends
Data mining and time series: The New York Times Archive.
This repository accesses the [New York Times Archive API](https://developer.nytimes.com) to retrieve NYT article metadata for a given year and month.
## Getting Started
More details about the NYT Archive API are available [here](https://developer.nytimes.com/docs/archive-product/1/overview).
### Prerequisites
- Requests
- Calendar (part of the Python standard library)
- Operator (part of the Python standard library)
To install Requests with pip:
```
pip3 install requests
```
or using conda:
```
conda install -c anaconda requests
```
To analyze the extracted data, the following modules are required:
- Matplotlib
- Numpy
- Scipy
- Pandas
All the notebooks included here use Python 3.7. The exercises are intended as a visualization tool.

## The NYT Archive
### Extracting Data
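As a minimal sketch of how article metadata for a given year and month might be requested, the snippet below builds the Archive API URL and downloads one month of records with Requests. The helper names (`build_archive_url`, `fetch_archive`, `describe_month`) are hypothetical and not taken from the notebooks, and the `response` / `docs` layout of the JSON reply is an assumption based on the public API documentation. A valid API key from developer.nytimes.com is needed to run the download step.

```python
import calendar

# Archive API endpoint pattern, as described on developer.nytimes.com.
ARCHIVE_URL = "https://api.nytimes.com/svc/archive/v1/{year}/{month}.json"


def build_archive_url(year, month):
    """Return the Archive API URL for one month (month is 1-12, not zero-padded)."""
    if not 1 <= month <= 12:
        raise ValueError("month must be between 1 and 12")
    return ARCHIVE_URL.format(year=year, month=month)


def fetch_archive(year, month, api_key, timeout=30):
    """Download one month of article metadata and return it as a list of dicts."""
    # Imported here so the URL helper can be used without Requests installed.
    import requests

    response = requests.get(
        build_archive_url(year, month),
        params={"api-key": api_key},
        timeout=timeout,
    )
    response.raise_for_status()
    # Assumed reply layout: {"response": {"docs": [...]}}
    return response.json()["response"]["docs"]


def describe_month(year, month):
    """Return a readable label such as 'July 2019' using the calendar module."""
    return "{} {}".format(calendar.month_name[month], year)
```

A call such as `fetch_archive(2019, 7, api_key="YOUR_API_KEY")` would then return the records for July 2019. A single month can contain several thousand articles, so saving the result to disk before analysis is usually worthwhile.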
### Analyzing Data
#### By Header
##### Word Frequency
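One way word frequencies in headlines could be computed is sketched below. It assumes each article record stores its headline under `headline` → `main`, and it uses a deliberately tiny, hypothetical stop-word list. The function names are illustrative and not taken from the notebooks.

```python
import operator
import re
from collections import Counter

# Hypothetical, minimal stop-word list; a real analysis would need a fuller one.
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "to", "for", "on", "is"}


def headline_word_counts(docs):
    """Count lowercase words across all headlines, skipping stop words."""
    counts = Counter()
    for doc in docs:
        # Assumed record layout: {"headline": {"main": "..."}}
        headline = (doc.get("headline") or {}).get("main") or ""
        words = re.findall(r"[a-z']+", headline.lower())
        counts.update(word for word in words if word not in STOP_WORDS)
    return counts


def top_words(counts, n=10):
    """Return the n most frequent (word, count) pairs; ties are alphabetical."""
    alphabetical = sorted(counts.items())
    # sorted() is stable, so equal counts keep their alphabetical order.
    ranked = sorted(alphabetical, key=operator.itemgetter(1), reverse=True)
    return ranked[:n]


sample_docs = [
    {"headline": {"main": "Trade Talks Resume in Shanghai"}},
    {"headline": {"main": "The Trade War and the Markets"}},
    {"headline": {"main": "Markets Rally"}},
]
sample_top = top_words(headline_word_counts(sample_docs), n=2)
print(sample_top)  # [('markets', 2), ('trade', 2)]
```

The resulting pairs can be passed to Matplotlib for a bar chart or loaded into a Pandas `DataFrame` for further filtering.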
##### Time Series Analysis
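A simple starting point for a time series is the number of articles published per day. The sketch below assumes each record carries an ISO-style `pub_date` string whose first ten characters are `YYYY-MM-DD`; the function name `articles_per_day` is hypothetical and not taken from the notebooks.

```python
from collections import Counter
from datetime import date


def articles_per_day(docs):
    """Return a date-sorted list of (date, article_count) pairs."""
    counts = Counter()
    for doc in docs:
        pub_date = doc.get("pub_date") or ""
        try:
            # Keep only the calendar date and ignore the time of day.
            day = date.fromisoformat(pub_date[:10])
        except ValueError:
            continue  # skip records with a missing or malformed date
        counts[day] += 1
    return sorted(counts.items())


sample_docs = [
    {"pub_date": "2019-07-01T09:00:00+0000"},
    {"pub_date": "2019-07-01T12:30:00+0000"},
    {"pub_date": "2019-07-02T08:15:00+0000"},
    {"pub_date": None},
]
daily_counts = articles_per_day(sample_docs)
```

The date and count columns map directly onto a Pandas `Series` indexed by date, which can then be resampled or plotted with Matplotlib.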
#### By Abstract
#### Sentiment Analysis - NLP
#### Keyword extraction - NLP
## Author
* **Estefania Barreto-Ojeda** - *Initial work* - [GitHub](https://github.com/ebojeda)