Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ELToulemonde/dataPreparation
Data preparation for data science projects.
data-preparation data-preprocessing data-science date-conversion r speed variable-elimination variable-selection
Last synced: 10 Jun 2024
https://github.com/nisheethjaiswal/Data-Annotator-for-SpaCy
🚀SpAnnor annotator for Named Entity Recognition easy to use tool. The annotator allows users to quickly assign custom labels to one or more entities in the text. Easy to setup for Data Training for SpaCy 🔥.
data-annotation data-annotation-tools data-labeling data-preparation named-entity-recognition nlp spacy-nlp text-labeling
Last synced: 09 Jun 2024
https://github.com/skrub-data/skrub
Prepping tables for machine learning
data data-analysis data-cleaning data-preparation data-preprocessing data-science data-wrangling dirty-data machine-learning
Last synced: 31 May 2024
https://github.com/developmentseed/label-maker
Data Preparation for Satellite Machine Learning
computer-vision data-preparation deep-learning keras remote-sensing satellite-imagery
Last synced: 16 May 2024
https://github.com/hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
big-data-cleaning bigdata cudf dask dask-cudf data-analysis data-cleaner data-cleaning data-cleansing data-exploration data-extraction data-preparation data-profiling data-science data-transformation data-wrangling machine-learning pyspark spark
Last synced: 28 Apr 2024
https://github.com/kozodoi/dptools
Python package with utilities for data processing, aggregation, feature engineering and data versioning
aggregation data-preparation data-preprocessing data-science feature-engineering python
Last synced: 16 Apr 2024
https://github.com/sbcgua/mockup_loader
ABAP unit testing framework, prepare in Excel, reuse in abap code
abap data-preparation hacktoberfest mockup-loader sap test-automation testing-tools unit-testing
Last synced: 08 Apr 2024
https://github.com/iTechArt/convtools-ita
convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.
code-generation conversions data-preparation data-preprocessing data-processing functional-programming python transformations
Last synced: 01 Apr 2024
https://github.com/salehjg/Shapenet2_Preparation
A python script to convert and down-sample mesh data into pointclouds using FPS algorithm.
data-preparation dataset farthest-point-sampling hdf5 python shapenet-dataset shapenetcore
Last synced: 26 Mar 2024
https://github.com/asavinov/prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
business-intelligence data-preparation data-preprocessing data-processing data-science data-wrangling feature-engineering map-reduce olap pandas python spark workflow
Last synced: 18 Mar 2024