https://github.com/fititnt/orange3-hxl
[early draft] HXL visual ETL (Orange Data Mining add-on). See https://github.com/biolab/orange3/discussions/6092
https://github.com/fititnt/orange3-hxl
orange3
Last synced: 3 months ago
JSON representation
[early draft] HXL visual ETL (Orange Data Mining add-on). See https://github.com/biolab/orange3/discussions/6092
- Host: GitHub
- URL: https://github.com/fititnt/orange3-hxl
- Owner: fititnt
- Created: 2022-08-07T03:15:42.000Z (almost 3 years ago)
- Default Branch: main
- Last Pushed: 2022-08-25T07:23:21.000Z (almost 3 years ago)
- Last Synced: 2025-02-27T16:02:25.037Z (4 months ago)
- Topics: orange3
- Language: Python
- Homepage:
- Size: 325 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 3
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# HXL visual ETL (Orange3 add-on)
[](https://github.com/fititnt/orange3-hxl)
[](https://pypi.org/project/Orange3-HXLvisualETL)This is an early draft of [Orange3](http://orange.biolab.si) add-on with minimal
awareness of data labeled with [HXL](https://hxlstandard.org/).To install this package, use
```bash
pip install Orange3-HXLvisualETL
```## Features
### Data Vault Conf
[WORKING DRAFT] Configure active local data vault configurations. This allows overriding defaults.### Download Raw File
Download remote resource into a local FileRAW
### Unzip Raw File
[WORKING DRAFT] Unzip (zip, gzip, bzip, ...) an FileRAW into an FileRAWCollection### Select Raw File
[DRAFT] From a local FileRAWCollection, select an FileRAW### Load Raw File
Convert a local FileRAW into Orange3 Data / DataFrame.
Required to allow use with other widgets.Supported features (*):
- `pandas.read_table`
- `pandas.read_csv`
- `pandas.read_excel`
- `pandas.read_feather`
- `pandas.read_fwf`
- `pandas.read_html`
- `pandas.read_json`
- `pandas.json_normalize`
- `pandas.read_orc`
- `pandas.read_parquet`
- `pandas.read_sas`
- `pandas.read_spss`
- `pandas.read_stata`
- `pandas.read_xml`_(*) Some features will require additional python packages which are not installed by default with this add-on. The user will be warned about this._
### Statistical Role
Change statistical role (the "feature", "target", "meta", "ignore")
using HXL patterns instead of stric exact names for the data variables.### Data Type
[DRAFT] Change the computational data type (the "numeric", "categorical" "text", "datetime") using HXL patterns instead of stric exact names for the data variables.
### HXL short names
[EARLY DRAFT] Make HXLated input data with shorter variable names.
### RAW Info
[DRAFT] Inspect a FileRAW or FileRAWCollection## Installation
### From Pypi (recommended)
pip install Orange3-HXLvisualETL
### From source
To install the add-on from source run
pip install .
To register this add-on with Orange, but keep the code in the development directory (do not copy it to
Python's site-packages directory), runpip install -e .
Documentation / widget help can be built by running
make html htmlhelp
from the doc directory.
## Usage
After the installation, the widget from this add-on is registered with Orange. To run Orange from the terminal,
useorange-canvas
or
python -m Orange.canvas
The new widget appears in the toolbox bar under the section Example.
