https://github.com/cawfree/reuters-dataset
🗞️ A tool for downloading and parsing Reuters-21578. These are a collection of documents that appeared on Reuters newswire back in 1987.
https://github.com/cawfree/reuters-dataset
21578 dataset nlp nodejs reuters tensorflow
Last synced: 4 months ago
JSON representation
🗞️ A tool for downloading and parsing Reuters-21578. These are a collection of documents that appeared on Reuters newswire back in 1987.
- Host: GitHub
- URL: https://github.com/cawfree/reuters-dataset
- Owner: cawfree
- License: mit
- Created: 2020-01-11T23:06:44.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2023-01-05T05:10:54.000Z (over 3 years ago)
- Last Synced: 2025-11-27T10:18:02.872Z (7 months ago)
- Topics: 21578, dataset, nlp, nodejs, reuters, tensorflow
- Language: JavaScript
- Size: 8.71 MB
- Stars: 3
- Watchers: 0
- Forks: 0
- Open Issues: 14
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# reuters-dataset
🗞️ A tool for downloading and parsing Reuters-21578. These are a collection of documents that appeared on Reuters newswire back in 1987.
[](https://github.com/prettier/prettier)
#### 🔥 Features
- Asynchronously caches the full dataset to your temporary directory.
- This reduces your project size.
- Prettifies the results.
- Uses proper JSON naming conventions and common-sense values.
## 🚀 Getting Started
Using [`npm`]():
```bash
npm install --save reuters-dataset
```
Using [`yarn`]():
```bash
yarn add reuters-dataset
```
## ✍️ Usage
```javascript
import getReutersDataset from 'reuters-dataset';
(
async () => {
const { exchanges, orgs, people, places, topics, articles } = await getReutersDataset();
}
)();
```
## 📌 Example
```json
{
"$": {
"topics": true,
"lewissplit": "TRAIN",
"cgisplit": "TRAINING-SET",
"oldid": "5544",
"newid": "1"
},
"topics": ["cocoa"],
"places": ["el-salvador", "usa", "uruguay"],
"people": [],
"orgs": [],
"exchanges": [],
"companies": [],
"text": {
"title": "BAHIA COCOA REVIEW",
"dateline": "SALVADOR, Feb 26 -",
"body": "Showers continued throughout [...]"
},
"date": "1987-02-26T15:01:01.790Z"
}
```
## ✌️ License
[MIT](https://opensource.org/licenses/MIT)