https://github.com/juwono136/indonesia-negative-comment-lstm
Sentiment Analysis of Indonesian Negative Comments on Social Media Using Long Short-Term Memory (LSTM)
https://github.com/juwono136/indonesia-negative-comment-lstm
deep-learning-algorithms gradio-interface hugging-face lstm-sentiment-analysis numpy python3 tensorflow
Last synced: about 2 months ago
JSON representation
Sentiment Analysis of Indonesian Negative Comments on Social Media Using Long Short-Term Memory (LSTM)
- Host: GitHub
- URL: https://github.com/juwono136/indonesia-negative-comment-lstm
- Owner: Juwono136
- Created: 2023-08-30T08:47:08.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2023-10-16T00:06:28.000Z (over 1 year ago)
- Last Synced: 2023-10-16T19:50:18.632Z (over 1 year ago)
- Topics: deep-learning-algorithms, gradio-interface, hugging-face, lstm-sentiment-analysis, numpy, python3, tensorflow
- Language: Jupyter Notebook
- Homepage: https://huggingface.co/spaces/juwono136/indonesia-negative-comment-classification
- Size: 2.51 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Sentiment Analysis of Indonesian Negative Comments on Social Media Using LSTM

### Demo or Live Website
➡️ Project Demo: https://huggingface.co/spaces/juwono136/indonesia-negative-comment-classification
➡️ Journal Publication: https://journal.formosapublisher.org/index.php/ijis/article/view/4990## Introduction
Currently, in addition to discussing in the real world, most people also engage in discussions online, one of which is by commenting on a post on social media. Sometimes, the quality of the discussion deteriorates due to the emergence of various negative comments, especially among Indonesian people who frequently comment on social media. This sentiment analysis aims to classify and make predictions about a comment so that in the future, the discussion space on social media can be improved and unnecessary conflicts can be avoided.
The Long-Short Term Memory (LSTM) method has been widely used, especially in NLP technology, as it has the advantage of understanding context or words in long-term memory. This sentiment analysis is still in the development stage, with the continuous addition of diverse datasets containing a wider range of sentences.
### 📑 Dataset
➡️ The dataset originates from one of the GitHub users, namely [Ahmad Izzan and team](https://github.com/ahmadizzan/netifier), inspired by the [Toxic Comment Classification Challenge](https://www.kaggle.com/c/jigsaw-toxic-comment-classification-challenge).
This dataset consists of approximately 7,773 example sentences (raw_data) categorized into positive and negative sentences in the Indonesian language.
Negative sentences are divided into 4 types: Pornography, Hate Speech (SARA), Radicalism, and Defamation, whereas positive sentences are those that do not carry negative meanings.### 🎯 Key Features:
- ➡️ Sentiment Analysis Using LSTM Method with TensorFlow Framework
- ➡️ Text classification to predict negative comments based on 4 categories: Pornography, Hate Speech (SARA), Radicalism, and Defamation### 🧑💻 Technologies:
- ➡️ Python 3.x
- ➡️ TensorFlow Framework
- ➡️ Hugging Face for Deploy (https://huggingface.co/)
- ➡️ Numpy
- ➡️ Gradio Interface
- ➡️ Long-Short Term Memory (LSTM)
- ➡️ TensorFlow Embedding Projector## Project Overview
- ➡️ Label Distribution Graph for Sentences in the Raw Data Dataset

- ➡️ Correlation Matrix Between Labels

- ➡️ Word Cloud from SARA label

- ➡️ Workflow
The core steps of this workflow are data construction which includes the stages of data collection from social media, data analysis, preprocessing and create a model.
- ➡️ TensorFlow Data Pipelines

- ➡️ Network Architecture Models

- ➡️ Training and Validation Accuracy Values

- ➡️ Training and Validation Loss Values

- ➡️ Embedding Projector in Positive Class

- ➡️ Embedding Projector in Negative Class

### 🤝 Project Members
- Juwono