{"id":15407234,"url":"https://github.com/ltfschoen/aind-nlp","last_synced_at":"2026-04-16T22:31:12.001Z","repository":{"id":76619915,"uuid":"95965460","full_name":"ltfschoen/AIND-NLP","owner":"ltfschoen","description":null,"archived":false,"fork":false,"pushed_at":"2017-07-02T23:51:54.000Z","size":635,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-23T06:46:57.880Z","etag":null,"topics":["artificial-intelligence","beautifulsoup","feature-extraction","machine-learning","modelling","nanodegree","natural-language-processing","natural-language-toolkit","nlp","nltk","text-processing","udacity","web-scraping"],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ltfschoen.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-07-01T13:21:27.000Z","updated_at":"2017-07-02T03:17:54.000Z","dependencies_parsed_at":null,"dependency_job_id":"459b98e2-3fa3-49e0-9dc8-d1f8a3e1c29a","html_url":"https://github.com/ltfschoen/AIND-NLP","commit_stats":{"total_commits":14,"total_committers":2,"mean_commits":7.0,"dds":0.4285714285714286,"last_synced_commit":"b688eaed77a179a1c6810f8c04ff35b35eda0df1"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ltfschoen/AIND-NLP","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ltfschoen%2FAIND-NLP","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ltfschoen%2FAIND-NLP/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ltfschoen%2FAIND-NLP/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ltfschoen%2FAIND-NLP/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ltfschoen","download_url":"https://codeload.github.com/ltfschoen/AIND-NLP/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ltfschoen%2FAIND-NLP/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31907424,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-16T18:22:33.417Z","status":"ssl_error","status_checked_at":"2026-04-16T18:21:47.142Z","response_time":69,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.6:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["artificial-intelligence","beautifulsoup","feature-extraction","machine-learning","modelling","nanodegree","natural-language-processing","natural-language-toolkit","nlp","nltk","text-processing","udacity","web-scraping"],"created_at":"2024-10-01T16:27:52.406Z","updated_at":"2026-04-16T22:31:11.978Z","avatar_url":"https://github.com/ltfschoen.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# AIND: Natural Language Processing\n\nCoding exercises for the Natural Language Processing concentration, part of Udacity's Artificial Intelligence Nanodegree program.\n\n## Setup\n\nYou need Python 3.6+, and the packages mentioned in `requirements.txt`. You can install them using:\n\n```bash\npip install -r requirements.txt\n```\n\n## Data\n\nData files for exercises are included under `data/`, but some of the NLP libraries require additional data for performing tasks like \nPoS tagging, lemmatization, etc. Specifically, `nltk` will throw an error if the required data is not installed. You can use the \nfollowing Python statement to open the NLTK downloader and select the desired package(s) to install:\n\n```python\nimport nltk\nnltk.download()\n```\n\nThis opens a GUI. DO NOT download everthing. Required files include:\n\n* Models \u003e punkt   (13MB)\n* Corpora \u003e stopwords  (11kB)\n* All Packages \u003e averaged_perceptron_taggers  (2.4MB)\n* All Packages \u003e maxent_ne_chunkers\"  (12.8MB)\n* Corpora \u003e Words (740kB)\n* Corpora \u003e wordnet  (10.3MB)\n\nFor each of the above, select it and click \"Download\" ([explained here](https://stackoverflow.com/questions/26693736/nltk-and-stopwords-fail-lookuperror))\n\nYou can also download all available NLTK data packages, which includes a number of sample corpora as well, but that may take a while \n(10+GB).\n\nNote: Install GhostScript: `brew install ghostscript` to avoid error `NLTK was unable to find the gs file!` (reference: https://stackoverflow.com/questions/36942270/nltk-was-unable-to-find-the-gs-file)\n\n## Run\n\nTo run any script file, use:\n\n```bash\npython \u003cscript.py\u003e\n```\n\nTo open a notebook, use:\n\n```bash\njupyter notebook \u003cnotebook.ipynb\u003e\n```\n\n\u003ca rel=\"license\" href=\"http://creativecommons.org/licenses/by-nc-nd/4.0/\"\u003e\u003cimg alt=\"Creative Commons License\" style=\"border-width:0\" src=\"https://i.creativecommons.org/l/by-nc-nd/4.0/88x31.png\" /\u003e\u003c/a\u003e\u003cbr /\u003eThis work is licensed under a \u003ca rel=\"license\" href=\"http://creativecommons.org/licenses/by-nc-nd/4.0/\"\u003eCreative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License\u003c/a\u003e. Please refer to [Udacity Terms of Service](https://www.udacity.com/legal) for further information.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fltfschoen%2Faind-nlp","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fltfschoen%2Faind-nlp","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fltfschoen%2Faind-nlp/lists"}