https://github.com/glencrawford/tensorflow_imdb_reviews_text_classification

Text sentiment classification with a TensorFlow 2 and Keras neural network.

keras machine-learning neural-network python tensorflow text-classification

Host: GitHub
URL: https://github.com/glencrawford/tensorflow_imdb_reviews_text_classification
Owner: GlenCrawford
Created: 2019-11-17T11:07:33.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2019-11-17T11:45:45.000Z (over 6 years ago)
Last Synced: 2025-07-17T19:30:19.550Z (12 months ago)
Topics: keras, machine-learning, neural-network, python, tensorflow, text-classification
Language: Python
Homepage:
Size: 6.84 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

# IMDB movie review sentiment classification with a TensorFlow and Keras neural network

A TensorFlow/Keras neural network that trains on the [IMDB dataset of 50,000 movie reviews](http://ai.stanford.edu/%7Eamaas/data/sentiment/) and classifies each review as positive or negative (binary sentiment classification) with 87% accuracy.

The dataset contains 25,000 reviews for training and 25,000 for testing. Each review is stored as an array of "words", where each word is an integer that maps to an entry in the word index. The word index is a dictionary of nearly a hundred thousand words.

Each review has an associated label, which is a binary integer representing whether the review is positive or negative.
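
To make the integer encoding concrete, here is a minimal sketch using a tiny, hypothetical word index. The four-word vocabulary, the special tokens (`<PAD>`, `<START>`, `<UNK>`, `<UNUSED>`), and the offset of 3 are assumptions for illustration; they follow a common convention for this dataset, and the repository's own code may differ.

```python
# Hypothetical toy word index; the real one has nearly a hundred thousand entries.
word_index = {"the": 1, "movie": 2, "was": 3, "great": 4}

# Assumption: the first four ids are reserved for special tokens,
# so every real word is shifted up by 3.
token_to_id = {word: rank + 3 for word, rank in word_index.items()}
token_to_id.update({"<PAD>": 0, "<START>": 1, "<UNK>": 2, "<UNUSED>": 3})

# Reverse mapping, used to turn integers back into readable text.
id_to_token = {idx: token for token, idx in token_to_id.items()}


def encode_review(text):
    """Turn raw text into a list of integers, using <UNK> for unknown words."""
    words = text.lower().split()
    unknown = token_to_id["<UNK>"]
    return [token_to_id["<START>"]] + [token_to_id.get(w, unknown) for w in words]


def decode_review(ids):
    """Turn a list of integers back into a space-separated string."""
    return " ".join(id_to_token.get(i, "<UNK>") for i in ids)


encoded = encode_review("The movie was great")
decoded = decode_review(encoded)
print(encoded)
print(decoded)
```

Decoding is mainly useful for inspecting what a review actually says; the network itself only ever sees the integers.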

Adapted from a [tutorial](https://www.youtube.com/watch?v=6g4O5UOH304) by [@TechWithTimm](https://twitter.com/TechWithTimm), with fixes, modifications, and annotations.

## Requirements

Python version: 3.7.4

See `dependencies.txt` for packages and versions; installation instructions are in the Setup section below.

## Architecture of the neural network

Each review input, after preprocessing, is an array of "words", represented as integers that map to a word in the word index, truncated/padded as necessary to 250 words.
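
The truncate/pad step can be sketched in plain Python as follows. The padding id of 0 and the choice to pad and truncate at the end of the review are assumptions; Keras also provides a `pad_sequences` utility for this, and the repository's actual settings are in its code.

```python
MAX_LEN = 250  # fixed review length described above
PAD_ID = 0     # assumption: id 0 is reserved for padding


def pad_or_truncate(ids, max_len=MAX_LEN, pad_id=PAD_ID):
    """Return exactly max_len integers: cut long reviews, pad short ones."""
    trimmed = list(ids[:max_len])
    return trimmed + [pad_id] * (max_len - len(trimmed))


short_review = pad_or_truncate([1, 4, 5, 6, 7])      # 5 words, gets padded
long_review = pad_or_truncate(list(range(1, 301)))   # 300 words, gets cut

print(len(short_review), len(long_review))
```

Fixing the length lets every review in a batch share the same shape, which the network's input requires.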

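__Embedding and GlobalAveragePooling1D layers:__ The Embedding layer maps each word integer to a dense vector, learned during training so that words used in similar contexts end up with similar vectors. GlobalAveragePooling1D then averages those vectors across the review, producing one fixed-length vector per review.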

__Hidden layer:__ 16 neurons.

__Output layer:__ 1 neuron with a value between 0 and 1 (squashed using the sigmoid function) denoting whether the review is positive or negative.

For more details of the model's architecture, refer to the comment annotations in the code.
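
As a rough illustration of how data flows through the layers described above, here is a forward pass in plain Python with random, untrained weights. It is a sketch, not the repository's Keras model: the toy vocabulary size, the embedding dimension, the ReLU activation in the hidden layer, and the 0.5 decision threshold (with scores near 1 read as positive) are all assumptions.

```python
import math
import random

random.seed(0)

VOCAB_SIZE = 20    # hypothetical toy vocabulary
EMBED_DIM = 4      # hypothetical embedding size
HIDDEN_UNITS = 16  # hidden layer size from the description above


def rand_matrix(rows, cols):
    """Random weights standing in for trained parameters."""
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


embedding = rand_matrix(VOCAB_SIZE, EMBED_DIM)    # one vector per word id
w_hidden = rand_matrix(EMBED_DIM, HIDDEN_UNITS)   # pooled vector -> hidden layer
b_hidden = [0.0] * HIDDEN_UNITS
w_out = [random.uniform(-0.5, 0.5) for _ in range(HIDDEN_UNITS)]
b_out = 0.0


def sigmoid(x):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def average_pool(vectors):
    """Average a list of equal-length vectors element by element."""
    count = len(vectors)
    return [sum(column) / count for column in zip(*vectors)]


def forward(word_ids):
    """Embedding lookup -> average pooling -> hidden layer -> sigmoid output."""
    vectors = [embedding[i] for i in word_ids]
    pooled = average_pool(vectors)
    hidden = [
        # ReLU activation is an assumption; the README does not name one.
        max(0.0, sum(p * w_hidden[j][k] for j, p in enumerate(pooled)) + b_hidden[k])
        for k in range(HIDDEN_UNITS)
    ]
    logit = sum(h * w for h, w in zip(hidden, w_out)) + b_out
    return sigmoid(logit)


score = forward([1, 4, 5, 6, 7, 0, 0, 0])
label = "positive" if score >= 0.5 else "negative"  # assumed threshold
print(score, label)
```

With untrained weights the score is meaningless; training adjusts the embedding and layer weights so that the output moves toward each review's label.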

## Setup

Clone the Git repo.

Install the dependencies:

```bash
pip install -r dependencies.txt
```

## Run

```bash
python main.py
```
