Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/michabirklbauer/hgb_dse_text_mining

Contents for the practical part of the lecture Text Mining
https://github.com/michabirklbauer/hgb_dse_text_mining

deep-learning educational how-to keras machine-learning nlp python spacy tensorflow text-classification text-clustering text-mining

Last synced: about 2 months ago
JSON representation

Contents for the practical part of the lecture Text Mining

Awesome Lists containing this project

README

        

# Introduction to Natural Language Processing

Contents for the practical part of the lecture Text Mining @ FH Hagenberg.

## Requirements

Language:
- [Python 3.12](https://www.python.org/downloads/)

If you want to run the notebooks locally please install the requirements noted in `requirements.txt`:
- `pip install -r requirements.txt`

For chapters 5 and 6 you will additionally need `tensorflow` and `transformers`:
- `pip install tensorflow transformers`

## Chapters

- Chapter 1: spaCy -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/spaCy.ipynb)
- Chapter 2: NLTK and Gensim -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/NLTK_Gensim.ipynb)
- Chapter 3: Clustering -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/Features_Clustering.ipynb)
- Chapter 4: Classification -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/Classification.ipynb)
- Chapter 4.1: RF Classification -> [open in RStudio Cloud](https://rstudio.cloud/content/4961423)
- Chapter 5: Sentiment Analysis -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/Sentiment.ipynb)
- Chapter 6: Image Captioning -> [open in Google Colab](https://colab.research.google.com/github/michabirklbauer/hgb_dse_text_mining/blob/master/Captioning.ipynb)

Solutions for the exercises will be available at [michabirklbauer/hgb_dse_text_mining_solutions](https://github.com/michabirklbauer/hgb_dse_text_mining_solutions) *after* the lectures.

## References

- A lot of the neural net slides is taken from [DeepMind's 2020 Deep Learning Lecture Series](https://www.youtube.com/playlist?list=PLqYmG7hTraZCDxZ44o4p3N5Anz3lLRVZF).
- Chapter 5 is an adaptation of [Google Developers' Machine Learning Foundations](https://colab.research.google.com/github/lmoroney/dlaicourse/blob/master/TensorFlow%20In%20Practice/Course%203%20-%20NLP/Course%203%20-%20Week%202%20-%20Lesson%202.ipynb).
- Chapter 6 is an adaptation of the official [Keras image captioning example](https://keras.io/examples/vision/image_captioning/).

## Contact

- [[email protected]](mailto:[email protected])
- [[email protected]](mailto:[email protected])