Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/manumerous/vpselector
Visual Pandas Selector: Visualize and interactively select time-series data
https://github.com/manumerous/vpselector
data-science data-visualization pandas python selector
Last synced: 3 months ago
JSON representation
Visual Pandas Selector: Visualize and interactively select time-series data
- Host: GitHub
- URL: https://github.com/manumerous/vpselector
- Owner: manumerous
- License: apache-2.0
- Created: 2021-04-26T14:47:22.000Z (over 3 years ago)
- Default Branch: master
- Last Pushed: 2023-11-30T17:39:14.000Z (about 1 year ago)
- Last Synced: 2024-09-22T08:05:13.413Z (4 months ago)
- Topics: data-science, data-visualization, pandas, python, selector
- Language: Python
- Homepage:
- Size: 859 KB
- Stars: 74
- Watchers: 10
- Forks: 5
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
README
# vpselector
The Visual Pandas Selector is a tool to visually select portions of numeric time-series data from a pandas dataframe. The tool is intended to provide an fast interactive way for manual data selection, as can be very useful in for example machine learning, regression or system identification.
Easily configure the tool to plot dataframe columns in vertically stacked subplots and view data distributions with the included histogram feature. With a simple click and drag, you can then select horizontal data windows, and let the tool automatically combine them into a new dataframe.
The user can subsequentially select different horizontal data windows via click and drag and he tool then automatically combines the visually selected sections into a new dataframe.
![ezgif com-gif-maker(1)](https://github.com/manumerous/visual-pandas-curator/assets/18735094/b5ebbb99-d2f7-4901-b101-cbeed6c230aa)
## Install
Install the package using:
```bash
pip install vpselector
```## Use in your project
Then simply import it using `import vpselector`. Then simply use:
- If your project does not contain a pyqt application: `vpselector.select_visual_data(data : pd.DataFrame, plot_config : dict)`
- To add the vpselector to an existing pyqt application: `vpselector.select_visual_data_in_pyqt_app(data : pd.DataFrame, plot_config : dict, pyqt_app)`
## Run the Example
```bash
python3 vpselector_example.py
```#### Use the Tool
- Left click with your mouse and drag to define the desired horizontal window of the data to be selected.
- The current selection distribution is now visualized in the histogram plot on the right.
- Confirm or cancel data selection.
- The already selected data is now marked by a grey span in the plot on the left.
- The plot on the right contains now the histogram of all selected data.
- repeat as many times as needed.
- Once you could select all desired horizontal data windows click "Done selecting"