https://github.com/elena563/wordviz
WordViz is a Python visualization library designed for exploring and visualizing word embeddings.
https://github.com/elena563/wordviz
bert clustering contextual-embeddings cosine-similarity data-visualization nlp python3 scatterplot word-embeddings
Last synced: 1 day ago
JSON representation
WordViz is a Python visualization library designed for exploring and visualizing word embeddings.
- Host: GitHub
- URL: https://github.com/elena563/wordviz
- Owner: elena563
- License: mit
- Created: 2025-05-01T14:05:54.000Z (11 months ago)
- Default Branch: master
- Last Pushed: 2025-09-19T09:04:48.000Z (6 months ago)
- Last Synced: 2025-09-19T11:27:31.352Z (6 months ago)
- Topics: bert, clustering, contextual-embeddings, cosine-similarity, data-visualization, nlp, python3, scatterplot, word-embeddings
- Language: Python
- Homepage:
- Size: 12 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project
README

**WordViz** is a Python visualization library designed for exploring and visualizing word embeddings. Built on top of popular libraries such as `matplotlib`, `plotly`, and `gensim`, WordViz provides intuitive tools for analyzing embeddings through clustering, similarity exploration, and dimensionality reduction, all wrapped in interactive and customizable plots.
With WordViz, users can gain insights into the structure of their word embeddings, making it a valuable tool for researchers and practitioners in natural language processing.
This project was created as part of my Bachelor's Degree thesis in Statistics and Information Management with title (translated): "Word Embeddings in Practice: Designing a Library for Visualization and Operations"
version 0.3.0
PyPi Page: https://pypi.org/project/wordviz/
Documentation: https://wordviz.readthedocs.io/
## Last Version Updates
### Added
- Support for contextual embeddings with two modes:
* `sentences`: visualize entire sentences
* `word_contexts`: visualize and compare multiple embeddings of the same word in different contexts
- New `encoding` module to embed sentences and words in different contexts, supported by Transformers and PyTorch (optional requirements)
- `load_contextual` method for `EmbeddingLoader` class
- New `type` property for `EmbeddingLoader` class
### Deprecated
- From `/plotting`:
* `interactive_embeddings` will change name to `plot_interactive` (FutureWarning added)
* `similarity_heatmap` will change name to `plot_similarity_heatmap` (FutureWarning added)
- Warnings added for imminent property name changes in similarity module and `plot_similarity` (no breaking changes yet)
### Fixed
- Fixed doubled parameter bug in MDS dimensionality reduction
- Fixed support to pairwise distances for all distance types
See more about previous changes in [CHANGELOG.md](CHANGELOG.md)
## Main Features
- Load and explore pretrained embeddings (e.g., GloVe, FastText)
- Select from a variety of available embeddings
- Visualize embeddings in 2D or 3D with flexible dimensionality reduction options
- Identify and plot the most similar words to a given token
- Visualize clusters of related words
- Interactive plots powered by `plotly`
- Support for many light and dark themes
## Installation
Install the latest version from PyPI:
```bash
pip install wordviz
```
### Notes: Python version compatibility
Currently, wordviz is not compatible with Python 3.13, due to limitations of some key dependencies:
gensim, one of the core libraries used by wordviz, does not yet provide official support or precompiled wheels for Python 3.13.
For proper installation installation, we recommend that you create a virtual environment with Python 3.12, or just use uv:
```bash
uv init --python 3.12
```
The package will be updated as soon as the dependencies are compatible with Python 3.13.
## Usage
You can load and manage embeddings though the `EmbeddingLoader` class, and then visualize them with the `Visualizer` (or `Visualizer3D`) class.
```python
from wordviz.loading import EmbeddingLoader
from wordviz.plotting import Visualizer
loader = EmbeddingLoader()
loader.load_from_file('path/to/your/embedding/file', 'word2vec')
vis = Visualizer(loader)
vis.plot_embeddings()
```
You can explore all functionalities through the example notebook provided in the `docs/` folder:
👉 [View example notebook](docs/example.ipynb)
## Contributing
Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are **greatly appreciated**.
If you have a suggestion that would make this better, please fork the repo and create a pull request. You can also simply open an issue with the tag "enhancement".
Don't forget to give the project a star! Thanks again!
1. Fork the Project
2. Create your Feature Branch (`git checkout -b feature/AmazingFeature`)
3. Commit your Changes (`git commit -m 'Add some AmazingFeature'`)
4. Push to the Branch (`git push origin feature/AmazingFeature`)
5. Open a Pull Request
## License
This project is licensed under the MIT License.
## Contacts
Elena Zen - [My Portfolio Website](https://elenazen.it) - info.elenazen@gmail.com