https://github.com/biolab/orange-scripts

Scripts for the Python Script Orange widget
https://github.com/biolab/orange-scripts

Last synced: 12 months ago
JSON representation

Scripts for the Python Script Orange widget

Host: GitHub
URL: https://github.com/biolab/orange-scripts
Owner: biolab
Created: 2019-12-20T10:13:53.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2025-07-18T11:46:40.000Z (about 1 year ago)
Last Synced: 2025-07-18T15:44:41.450Z (about 1 year ago)
Language: Python
Size: 18.6 KB
Stars: 8
Watchers: 12
Forks: 5
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Orange Scripts

Scripts for the Python Script Orange widget.

## Orange

- log-attributes.py: Log-transform the data.

## Documentation examples
- batch-filtering.py: Filter variables on condition.
- custom-preprocessing.py: Custom preprocessing for text data (requires Text add-on).
- gaussian-noise.py: Introduce Gaussian noise.
- round-values.py: Round feature values.

## Text add-on

- bigram-collocations.py: Compute and output a table of bigrams from the input corpus.
- custom-tokenization.py: Tokenize data by splitting by semicolon and keep the most frequent 100 tokens.
- extract-url.py: Find url in the text and add it as an additional column.
- filter-pos-tags.py: Keep only certain POS tags in tokens.
- sentence-to-corpus.py: Use sentences as documents and output the new corpus.
- to_dense.py: Transform sparse data to dense.
- remove_low_tfidf_values.py: Filters columns with low bag-of-words count or tf-idf
- bow-to-sparse.py: Transforms an existing bow matrix (document-term), builds tokens, and turns it into sparse matrix.
- best-of-clusters.py: Compute ranking of clusters from the Annotated Corpus Map widget.

## Timeseries add-on

- timeseries-alignment.py: align timeseries at value n.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/biolab/orange-scripts

Awesome Lists containing this project

README