{"id":15453557,"url":"https://github.com/shamil-t/sentiment-analysis-using-python-imdb-","last_synced_at":"2025-10-12T09:15:37.713Z","repository":{"id":134798222,"uuid":"275952970","full_name":"shamil-t/sentiment-analysis-using-python-IMDB-","owner":"shamil-t","description":"Sentiment Analysis is to find the perspective ,view ,attitude or feeling of a speaker on a particular topic, event or interactionBasicaly its the analysis of an emotionally cahrged text. Here we try to analyzethe reviewsposted by people at Imdb. Further the reviews are processed analyzed using machine learning procedures, algorithms and other related aspets.","archived":false,"fork":false,"pushed_at":"2020-08-26T13:24:29.000Z","size":137,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-03-28T09:43:40.698Z","etag":null,"topics":["ai","bagging","imdb-dataset","machine-learning","naive-bayes-classifier","python","random-forest","sentiment-analysis","support-vector-machines"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/shamil-t.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":"Support_Machine_Vector_Classifier/imdbReview.py","governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-06-30T00:02:31.000Z","updated_at":"2020-08-26T13:24:31.000Z","dependencies_parsed_at":null,"dependency_job_id":"3c65b850-af29-442c-ac83-5187d2fbdcee","html_url":"https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_
full_name":null,"purl":"pkg:github/shamil-t/sentiment-analysis-using-python-IMDB-","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shamil-t%2Fsentiment-analysis-using-python-IMDB-","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shamil-t%2Fsentiment-analysis-using-python-IMDB-/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shamil-t%2Fsentiment-analysis-using-python-IMDB-/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shamil-t%2Fsentiment-analysis-using-python-IMDB-/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/shamil-t","download_url":"https://codeload.github.com/shamil-t/sentiment-analysis-using-python-IMDB-/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/shamil-t%2Fsentiment-analysis-using-python-IMDB-/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279010965,"owners_count":26084837,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-12T02:00:06.719Z","response_time":53,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","bagging","imdb-dataset","machine-learning","naive-bayes-classifier","python","random-forest","sentiment-analysis","support-vector-machines"],"created_at":"2024-10-01T22:00:49.455Z","updated_at":"2025-10-12T09:15:37.6
79Z","avatar_url":"https://github.com/shamil-t.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# 🆂🅴🅽🆃🅸🅼🅴🅽🆃-🅰🅽🅰🅻🆈🆂🅸🆂-🆄🆂🅸🅽🅶-🅿🆈🆃🅷🅾🅽\n\nThe principal task of Sentiment Analysis is to find the perspective, view, attitude, or feeling of a speaker on a particular topic, event, or interaction. Basically, it is the analysis of emotionally charged text.\n\tHere we analyze the reviews posted by people on IMDb. The reviews are then processed and\nanalyzed using machine learning algorithms and related techniques.\n\n### Algorithms Used\n\t\n\t* Support Vector Machine Classifier - `LinearSVC`\n\t* Random Forest Classifier\n\t* AdaBoost Classifier\n\t* Naive Bayes Classifier - `MultinomialNB`\n\t* Bagging Classifier\n\n### Steps in Sentiment Analysis\n\n\t1. Formation of Dataset\n\t2. Processing of Data\n\t3. Creation of Feature Vector\n\t4. Classification\n\n\n## 🅿🆁🅴🆁🅴🆀🆄🅸🆂🅸🆃🅴🆂\n\n\n### Environment Setup\n \n         -:\u003e python 2.8 or above 3.x recommended\n\n### Dataset\n\n[Download the dataset from here](http://ai.stanford.edu/~amaas/data/sentiment/aclImdb_v1.tar.gz)\n then put the `aclImdb` folder in the parent directory.\n#### File structure\n ![File structure](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/Files.png?raw=true)\n\n### Install modules\n\n1. scikit-learn\n```bash\npip install scikit-learn\n```\n2. pickle\n```bash\npip install pickle-mixin\n```\n3. nltk\n```bash\npip install nltk\n```\nThen, in Python IDLE:\n\n```python\nimport nltk\nnltk.download(\"stopwords\")\n```\n\n4. numpy\n\n```bash\npip install numpy\n```\n\n\n\n## 🅷🅾🆆 🆃🅾 🆁🆄🅽\n\n`imdbReviews.py` generates the `*.pkl` files that hold the training and testing datasets.\nFirst, set the dataset directory in `imdbReviews.py`, then run it.\n```\npython imdbReviews.py\n```\nYou will now have two new `.pkl` files, `test.pkl` \u0026 `train.pkl`,\nwhich are needed by `naive.py`, `svm.py`, `rfc.py`, `bagging.py`, and `adaboost.py`.\n\n#### To do 
prediction, run the following command.\n\n```bash\npython filename.py\n```\n\n`e.g.:`\n\n```bash\npython naive.py\n```\n\n## Screenshots\n\n\n#### Ada Boost Classifier\n![Ada Boost Classifier](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/adaboost.png?raw=true)\n\n\n#### Bagging Classifier\n![Bagging Classifier](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/bagging.png?raw=true)\n\n\n#### Naive Bayes - MultinomialNB()\n![Naive Bayes Classifier](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/naive.png?raw=true)\n\n\n#### Random Forest Classifier\n![Random Forest Classifier](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/randomforest.png?raw=true)\n\n\n#### Support Vector Machine - LinearSVC()\n![Support Vector Machine](https://github.com/shamil-t/sentiment-analysis-using-python-IMDB-/blob/master/images/svm.png?raw=true)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshamil-t%2Fsentiment-analysis-using-python-imdb-","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fshamil-t%2Fsentiment-analysis-using-python-imdb-","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fshamil-t%2Fsentiment-analysis-using-python-imdb-/lists"}