Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shendrew/coursescope
Machine learning for course classification by topics
beautifulsoup machine-learning nlp scikit-learn
- Host: GitHub
- URL: https://github.com/shendrew/coursescope
- Owner: shendrew
- Created: 2023-12-26T02:53:32.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-01-26T05:24:41.000Z (12 months ago)
- Last Synced: 2024-11-19T17:55:54.187Z (2 months ago)
- Topics: beautifulsoup, machine-learning, nlp, scikit-learn
- Language: Python
- Homepage:
- Size: 1.06 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# CourseScope
Topic modelling for university courses using clustering techniques, intended to provide intuitive analytics of course schedules. This NLP model groups courses into similar areas of interest based on significant keywords extracted using TF-IDF vectors.
The included sample dataset was scraped from Stanford's CS course listings using Beautiful Soup and is used for non-commercial purposes. (https://explorecourses.stanford.edu/search?page=0&catalog=&q=CS&view=catalog&filter-catalognumber-CS=on)
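As a rough illustration of the scraping step, the sketch below parses course entries out of an HTML snippet with Beautiful Soup. The markup, the class names (`courseInfo`, `courseNumber`, `courseTitle`, `courseDescription`) and the `parse_courses` helper are assumptions made for this example, not taken from the repository; the real page structure may differ. It works on an inline string so it runs without network access.

```python
from bs4 import BeautifulSoup

# Hypothetical markup standing in for a catalog results page.
SAMPLE_HTML = """
<div class="courseInfo">
  <span class="courseNumber">CS 101:</span>
  <span class="courseTitle">Example Course A</span>
  <div class="courseDescription">Basics of programming and algorithms.</div>
</div>
<div class="courseInfo">
  <span class="courseNumber">CS 102:</span>
  <span class="courseTitle">Example Course B</span>
  <div class="courseDescription">Data structures and their analysis.</div>
</div>
"""


def parse_courses(html):
    """Return a list of dicts with number, title and description."""
    soup = BeautifulSoup(html, "html.parser")
    courses = []
    for block in soup.find_all("div", class_="courseInfo"):
        number = block.find(class_="courseNumber")
        title = block.find(class_="courseTitle")
        description = block.find(class_="courseDescription")
        # Skip entries that are missing any of the expected fields.
        if number is None or title is None or description is None:
            continue
        courses.append(
            {
                "number": number.get_text(strip=True).rstrip(":"),
                "title": title.get_text(strip=True),
                "description": description.get_text(" ", strip=True),
            }
        )
    return courses


courses = parse_courses(SAMPLE_HTML)
```

Each resulting dict carries the description text that the later vectorization step would consume.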
After common stop words are removed from the course descriptions, scikit-learn vectorizes the text, and the resulting TF-IDF vectors are clustered by cosine distance.