{"id":15103649,"url":"https://github.com/ianhaggerty/final-capstone","last_synced_at":"2026-01-19T03:33:42.568Z","repository":{"id":226056922,"uuid":"767038260","full_name":"ianhaggerty/final-capstone","owner":"ianhaggerty","description":"This represents the final capstone project in my HyperionDev Data Science (fundamentals) course. A dataset of Amazon customer reviews is analysed using natural language processing.","archived":false,"fork":false,"pushed_at":"2024-03-08T13:51:52.000Z","size":7943,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-10T23:51:14.576Z","etag":null,"topics":["amazon","data-analytics","data-science","data-visualization","dataset","matplotlib","nlp","nlp-machine-learning","numpy","pandas","plotly","reviews","seaborn","spacy","tabulate","textblob","wordcloud"],"latest_commit_sha":null,"homepage":"https://www.linkedin.com/in/ihaggerty/","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ianhaggerty.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-03-04T15:40:01.000Z","updated_at":"2024-03-08T14:48:37.000Z","dependencies_parsed_at":"2024-09-25T19:41:23.375Z","dependency_job_id":"6de618c5-3f78-4a0b-8c5a-4fec5c736eb1","html_url":"https://github.com/ianhaggerty/final-capstone","commit_stats":{"total_commits":15,"total_committers":1,"mean_commits":15.0,"dds":0.0,"last_synced_commit":"d35701f4492eb48808c9f1de8ff441fd3fa0f6aa"},"previous_names":["ianhaggerty/finalcapstone"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ianhaggerty%2Ffinal-capstone","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ianhaggerty%2Ffinal-capstone/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ianhaggerty%2Ffinal-capstone/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ianhaggerty%2Ffinal-capstone/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ianhaggerty","download_url":"https://codeload.github.com/ianhaggerty/final-capstone/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247335565,"owners_count":20922457,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["amazon","data-analytics","data-science","data-visualization","dataset","matplotlib","nlp","nlp-machine-learning","numpy","pandas","plotly","reviews","seaborn","spacy","tabulate","textblob","wordcloud"],"created_at":"2024-09-25T19:41:14.911Z","updated_at":"2026-01-19T03:33:42.562Z","avatar_url":"https://github.com/ianhaggerty.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# [Capstone Project](https://github.com/ianhaggerty/finalCapstone/tree/master) — [NLP Applications](https://medium.com/@mohamadhasan.sarvandani/top-applications-of-nlp-54834ca9c8ca)\n\n## [CoGrammar](https://skills.cogrammar.com/) / [HyperionDev](https://www.hyperiondev.com/) / [Portfolio](https://www.hyperiondev.com/portfolio/IH23120012475/)\n\n## Introduction\n\nThis repository represents the final capstone project in my _HyperionDev Data Science (Fundamentals)_ course. The main script can be found [here](https://github.com/ianhaggerty/finalCapstone/blob/master/sentiment_analysis.py) and corresponding\nnotebook [here](https://github.com/ianhaggerty/finalCapstone/blob/master/notebooks/sentiment_analysis.ipynb).\n\nThe objective of this project was to discover interesting and actionable insights within a dataset of [amazon reviews](https://www.kaggle.com/datasets/datafiniti/consumer-reviews-of-amazon-products).\n\n[spaCy](https://spacy.io/usage) was used to conduct NLP analysis, in conjunction with\n[TextBlob](https://textblob.readthedocs.io/en/dev/) to determine the sentiment and subjectivity.\n\nPlease see the [project requirements](https://github.com/ianhaggerty/finalCapstone/blob/master/reports/project_desc.pdf) and corresponding [report](https://github.com/ianhaggerty/finalCapstone/blob/master/reports/sentiment_analysis_report.pdf).\n\nOffical grading, generously provided by Kenneth Mlimi, can be found [here](https://hyperiondev.cogrammar.com/review/acab9271caa0636a51999e66651876b5c291f15c631ac3141e313abae0620aa8/).\n\n## Images\n\n![Most dissimilar reviews.](images/dissimilar.png)\n![Most similar reviews.](images/similar.png)\n![Review similarity matrix.](images/matrix.png)\n\n## Usage\n\n1.  Setup the virtual environment.\n\n    ```bash\n    python -m venv venv\n    ```\n\n2.  Activate the virtual environment.\n\n- Windows:\n\n  ```shell\n  venv\\Scripts\\activate\n  ```\n\n- Unix or MacOS:\n  ```bash\n  source venv/bin/activate\n  ```\n\n3. Install the requirements.\n\n   ```bash\n   pip install -r requirements.txt\n   ```\n\n4. Run the main script.\n   ```bash\n   python sentiment_analysis.py\n   ```\n   (Optional) run the notebook.\n   ```bash\n   jupyter nbconvert --execute --to notebook --inplace notebooks/sentiment_analysis.ipynb\n   ```\n\n## A Note from the Author\n\nThanks for your interest! Please see my [GitHub](https://github.com/ianhaggerty) or [Linkedin](https://www.linkedin.com/in/ihaggerty/) to stay updated.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fianhaggerty%2Ffinal-capstone","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fianhaggerty%2Ffinal-capstone","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fianhaggerty%2Ffinal-capstone/lists"}