Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ExtractTable/ExtractTable-py
Python library to extract tabular data from images and scanned PDFs
extracttable image-table-recognition ocr pdf-table-extract table-extraction tabular-data
Last synced: 30 Jun 2024
https://github.com/kennethreitz/tablib
Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
csv json reports sqlalchemy tabular-data yaml
Last synced: 29 Jun 2024
https://github.com/microsoft/FLAML
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
automated-machine-learning automl classification data-science deep-learning finetuning hyperparam hyperparameter-optimization jupyter-notebook machine-learning natural-language-generation natural-language-processing python random-forest regression scikit-learn tabular-data timeseries-forecasting tuning
Last synced: 27 Jun 2024
https://github.com/wwweiwei/awesome-self-supervised-learning-for-tabular-data
A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)
artificial-intelligence deep-learning machine-learning self-supervised-learning tabular-data
Last synced: 25 Jun 2024
https://github.com/adrienjoly/npm-pdfreader
🚜 Parse text and tables from PDF files.
data-extraction javascript parse-tables parsing pdf-converter pdf-reader rule-based-parsing tabular-data
Last synced: 24 Jun 2024
https://github.com/inokawa/virtua
A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue and Solid.
headlessui infinite-scroll performance react react-component react-server-components scrolling solid tabular-data virtual-scroll virtualization virtualized vue windowing
Last synced: 23 Jun 2024
https://github.com/NVIDIA-Merlin/Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet
Last synced: 22 Jun 2024
https://microsoft.github.io/FLAML/
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
automated-machine-learning automl classification data-science deep-learning finetuning hyperparam hyperparameter-optimization jupyter-notebook machine-learning natural-language-generation natural-language-processing python random-forest regression scikit-learn tabular-data timeseries-forecasting tuning
Last synced: 21 Jun 2024
https://github.com/wq/itertable
⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON.
csv data-processing excel export import iterable json openpyxl pandas pythonic spreadsheet tabular-data xml
Last synced: 20 Jun 2024
https://github.com/DataCanvasIO/HyperGBM
A full pipeline AutoML tool for tabular data
adversarial-validation automl catboost dask dask-distributed datacleaning distributed-training ensemble-learning fullpipeline gbm gpu-acceleration lightgbm preprocessing pseudo-labeling rapidsai semi-supervised-learning sklearn tabular-data xgboost
Last synced: 13 Jun 2024
https://github.com/wwweiwei/DoRA
DoRA: Domain-Based Self-Supervised Learning Framework for Low-Resource Real Estate Appraisal (CIKM-23)
few-shot-learning self-supervised-learning tabular-data
Last synced: 11 Jun 2024
https://github.com/RyanWangZf/transtab
NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables
data-mining data-science machine-learning tabular-data
Last synced: 11 Jun 2024
https://github.com/somepago/saint
The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training
deep-learning tabular-data transformer
Last synced: 11 Jun 2024
https://github.com/AstraZeneca/SubTab
The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"
contrastive-learning multi-view-learning representation-learning self-supervised-learning tabular-data
Last synced: 11 Jun 2024
https://github.com/r-rudra/tidycells
Automatic transformation of untidy spreadsheet-like data into tidy form
cran data-wrangling heuristic heuristic-algorithm r r-package r-stats spreadsheets tabular-data tidy
Last synced: 10 Jun 2024
https://github.com/firmai/deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
augmentation data-augmentation data-science feature-engineering feature-extraction finance machine-learning tabular-data time-series
Last synced: 07 Jun 2024
https://github.com/dreamquark-ai/tabnet
PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
deep-neural-networks machine-learning-library pytorch pytorch-tabnet research-paper tabnet tabular-data
Last synced: 07 Jun 2024
https://github.com/nimblelearn/datapackage-m
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
csv-files data-acquisition data-analysis data-analytics data-package data-transformation data-visualisation data-visualization datapackage excel frictionlessdata json-table-schema open-data power-bi power-query powerbi tabular-data tabular-data-package
Last synced: 04 Jun 2024
https://github.com/approximatelabs/sketch
AI code-writing assistant that understands data content
ai codex copilot data data-science dataframe datasketch datasketches df ds gpt3 lambdaprompt pandas python sketches tabular-data
Last synced: 02 Jun 2024
https://github.com/aws-samples/aws-machine-learning-university-dte
Machine Learning University: Decision Trees and Ensemble Methods
boosting catboost decision-trees lightgbm machine-learning random-forest tabular-data xgboost
Last synced: 23 May 2024
https://github.com/aws-samples/aws-machine-learning-university-accelerated-tab
Machine Learning University: Accelerated Tabular Data Class
deep-learning gluon machine-learning mxnet python sklearn tabular-data
Last synced: 23 May 2024
https://github.com/capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
avro csv data-analysis data-labels data-science dataprofiling dataset gdpr graph-data machine-learning network-data nlp npi pandas pii privacy python security sensitive-data tabular-data
Last synced: 22 May 2024
https://github.com/saulpw/visidata
A terminal spreadsheet multitool for discovering and arranging data
cli csv datajournalism datawrangling devops-tools eda hdf5 json opendata pandas python reconciliation spreadsheet sqlite tabular-data tsv tui unix-toolkit
Last synced: 21 May 2024
https://github.com/PolyMathOrg/DataFrame
DataFrame in Pharo - tabular data structures for data analysis
data-analysis data-frame data-science data-visualization gsoc hacktoberfest pharo pharo-smalltalk smalltalk statistics tabular-data
Last synced: 19 May 2024
https://github.com/turicas/rows
A common, beautiful interface to tabular data, no matter the format
convert-data csv data data-science excel hacktoberfest python table tabular-data xls xlsx
Last synced: 17 May 2024
https://github.com/jrieke/fastapi-csv
🏗️ Create APIs from CSV files within seconds, using fastapi
api csv csv-api excel fastapi fastapi-template google-sheets python table tabular-data
Last synced: 15 May 2024
https://github.com/naity/image_tabular
Integrate image and tabular data for deep learning
deep-learning fastai image-classification pytorch tabular-data
Last synced: 14 May 2024
https://github.com/jianzhnie/AutoTabular
Automatic machine learning for tabular data. ⚡🔥⚡
automl catboost data-science deep-learning feature-engineering hpo lightgbm machine-learning pytorch-lightning scikit-learn structured-data tabular-data xgboost
Last synced: 13 May 2024
https://github.com/amaiya/ktrain
ktrain is a Python library that makes deep learning and AI more accessible and easier to apply
computer-vision deep-learning graph-neural-networks keras machine-learning nlp python tabular-data tensorflow
Last synced: 13 May 2024
https://github.com/JuliaData/DataFrames.jl
In-memory tabular data in Julia
data data-frame dataframes datasets hacktoberfest julia tabular-data
Last synced: 11 May 2024
https://github.com/JuliaData/DataFramesMeta.jl
Metaprogramming tools for DataFrames
data data-frame dataframes dataframesmeta datasets hacktoberfest julia tabular-data
Last synced: 11 May 2024
https://github.com/bvaughn/react-virtualized
React components for efficiently rendering large lists and tabular data
grid list listview performance react react-components tabular-data virtualization windowing
Last synced: 10 May 2024
https://github.com/ncss-tech/stats_for_soil_survey
S4SS: Statistics for Soil Survey
digital-soil-mapping eda ncss nrcs pedology pedometrics s4ss soil soil-survey spatial-data statistics tabular-data usda
Last synced: 09 May 2024
https://ncss-tech.github.io/stats_for_soil_survey/
S4SS: Statistics for Soil Survey
digital-soil-mapping eda ncss nrcs pedology pedometrics s4ss soil soil-survey spatial-data statistics tabular-data usda
Last synced: 09 May 2024
https://github.com/nhn/tui.grid
🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!
datagrid datatable excel grid javascript preact reactivity spreadsheet tabular-data toast-ui treegrid typescript
Last synced: 07 May 2024
https://sentinel-energy.github.io/friendly_data/
Data format to interoperate between models and frameworks
analysis datapackage python tabular-data
Last synced: 07 May 2024
https://github.com/johnkerl/miller
Miller is like awk, sed, cut, join, and sort for name-indexed data such as CSV, TSV, and tabular JSON
command-line command-line-tools csv csv-format data-cleaning data-processing data-reduction data-regression devops devops-tools json json-data miller statistical-analysis statistics streaming-algorithms streaming-data tabular-data tsv unix-toolkit
Last synced: 07 May 2024
https://github.com/Meteor-Community-Packages/meteor-tabular
Reactive datatables for large or small datasets
blaze datatable meteorjs tabular tabular-data
Last synced: 03 May 2024
https://github.com/aerosol/tabula
:u7533: Pretty printer for maps/structs collections (Elixir)
elixir pretty-printer tabular-data
Last synced: 01 May 2024
https://github.com/rubycocos/csvreader
csvreader library / gem - read tabular data in the comma-separated values (csv) format the right way (uses best practices out-of-the-box with zero-configuration)
csv csv11 csvhash csvrecord export import json numerics tab tabular tabular-data
Last synced: 01 May 2024
https://github.com/vaexio/vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
bigdata data-science dataframe hdf5 machine-learning machinelearning memory-mapped-file pyarrow python tabular-data visualization
Last synced: 28 Apr 2024
https://github.com/reubano/meza
A Python toolkit for processing tabular data
csv data excel featured functional-programming library pandas tabular-data xlsx xml
Last synced: 28 Apr 2024
https://github.com/automl/Auto-PyTorch
Automatic architecture search and hyperparameter optimization for PyTorch
automl deep-learning pytorch tabular-data
Last synced: 28 Apr 2024
https://github.com/autogluon/autogluon
AutoGluon: Fast and Accurate ML in 3 Lines of Code
autogluon automated-machine-learning automl computer-vision data-science deep-learning ensemble-learning forecasting gluon hyperparameter-optimization machine-learning natural-language-processing object-detection python pytorch scikit-learn structured-data tabular-data time-series transfer-learning
Last synced: 28 Apr 2024
https://github.com/scottrhoyt/SwiftyTextTable
A lightweight library for generating text tables.
carthage cocoapods command-line linux macos swift swift-package-manager tabular-data
Last synced: 27 Apr 2024
https://github.com/kathrinse/be_great
A novel approach for synthesizing tabular data using pretrained large language models
data-generation deep-learning tabular-data transformers
Last synced: 27 Apr 2024
https://github.com/rajasegar/htmx-tabular
Tabular Data with htmx
htmx pagination search sorting tabular-data
Last synced: 23 Apr 2024
https://github.com/Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data
Last synced: 21 Apr 2024
https://github.com/eBay/tsv-utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
cli command-line csv d data-mining data-science delimited-files dlang reservoir-sampling sampling shuffle statistics tabular-data tsv uniq
Last synced: 20 Apr 2024
https://github.com/keithknott26/datadash
Visualize and graph data in the terminal
chart charting csv go golang graph graphing graphing-application streaming-data tabular-data terminal-based terminal-ui tsv
Last synced: 20 Apr 2024
https://github.com/alexhallam/tv
📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.
cli column command-line command-line-tool csv csv-cat csv-column csv-pretty-print csv-viewer csv-visualization data-science dataframe datatable pretty-print pretty-printer rust tabular-data terminal tibble
Last synced: 20 Apr 2024
https://github.com/Lightning-Universe/lightning-flash
Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains
classification deep-learning fiftyone icevision machine-learning object-detection open3d pytorch pytorch-lightning pytorch-video tabular-data tasks-flash torch-geometric
Last synced: 19 Apr 2024
https://github.com/ropensci/tabulapdf
Bindings for Tabula PDF Table Extractor Library
java pdf pdf-document peer-reviewed r r-package ropensci rstats tabula tabular-data
Last synced: 13 Apr 2024
https://github.com/manujosephv/pytorch_tabular
A standard framework for modelling Deep Learning Models for tabular data
deep-learning hacktoberfest machine-learning pytorch pytorch-lightning tabular-data
Last synced: 11 Apr 2024
https://github.com/microsoft/CASPR
CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.
attention-mechanism business deep-learning tabular-data transformer transformer-architecture transformer-encoder
Last synced: 08 Apr 2024
https://github.com/georgian-io/Multimodal-Toolkit
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
huggingface-transformers multimodal-learning natural-language-processing tabular-data transformer
Last synced: 08 Apr 2024
https://github.com/sdv-dev/SDGym
Benchmarking synthetic data generation methods.
benchmark deep-learning generative-adversarial-network generative-ai generative-models sdgym-synthesizers synthetic-data synthetic-data-vault tabular-data
Last synced: 08 Apr 2024
https://github.com/Baukebrenninkmeijer/On-the-Generation-and-Evaluation-of-Synthetic-Tabular-Data-using-GANs
Repository for the results of my master thesis, about the generation and evaluation of synthetic data using GANs
data-evaluation data-synthesis gan generative-adversarial-networks synthetic-data synthetic-dataset-generation tabular-data
Last synced: 08 Apr 2024
https://github.com/sdv-dev/TGAN
Generative adversarial training for generating synthetic tabular data.
generative-adversarial-network synthesizing-tabular-data synthetic-data tabular-data
Last synced: 08 Apr 2024
https://github.com/sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
data-generation generative-adversarial-network synthetic-data synthetic-data-generation tabular-data
Last synced: 08 Apr 2024
https://github.com/continuum/active_importer
Define importers that load tabular data from spreadsheets or CSV files into any ActiveRecord-like ORM.
activerecord csv-files data-import importer orm ruby spreadsheet tabular-data
Last synced: 06 Apr 2024
https://youngfish42.github.io/Awesome-FL/
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
artificial-intelligence awesome computer-vision data-mining database deep-learning efficiency federated-learning federated-learning-framework graph graph-neural-networks information-retrieval knowledge-graph machine-learning natural-language-processing paper privacy security system tabular-data
Last synced: 01 Apr 2024
https://github.com/PhoebusSi/Alpaca-CoT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!
alpaca chatglm chatgpt cot instruction-tuning llama llm lora moss p-tuning parameter-efficient pytorch tabul tabular-data tabular-model
Last synced: 01 Apr 2024
https://github.com/ropensci/tabulizer
Bindings for Tabula PDF Table Extractor Library
java pdf pdf-document peer-reviewed r r-package ropensci rstats tabula tabular-data
Last synced: 31 Mar 2024
https://github.com/juancarlospaco/faster-than-csv
Faster CSV for Python
csv csv-data csv-parser csv-parsing csv-to-html csv-to-json cython faster-than-csv process-csv python python3 speed speedup static-memory-allocation static-typing tabular-data tsv tsv-parser type-safe
Last synced: 28 Mar 2024
https://bvaughn.github.io/react-virtualized/
React components for efficiently rendering large lists and tabular data
grid list listview performance react react-components tabular-data virtualization windowing
Last synced: 27 Mar 2024
https://github.com/SeldonIO/alibi-detect
Algorithms for outlier, adversarial and drift detection
adversarial anomaly concept-drift data-drift detection drift-detection images outlier semi-supervised-learning tabular-data text time-series unsupervised-learning
Last synced: 25 Mar 2024
https://github.com/chartshq/datamodel
Relational algebra compliant in memory tabular data store.
data datagrid datamodel datatable datatables javascript relational-algebra rust rust-language schema tabular-data wasm webassembly
Last synced: 23 Mar 2024
https://github.com/youngfish42/Awesome-FL
Comprehensive and timely academic information on federated learning (papers, frameworks, datasets, tutorials, workshops)
artificial-intelligence awesome computer-vision data-mining database deep-learning efficiency federated-learning federated-learning-framework graph graph-neural-networks information-retrieval knowledge-graph machine-learning natural-language-processing paper privacy security system tabular-data
Last synced: 21 Mar 2024
https://github.com/nimh-dsst/dataset-phenotypes
Preparatory scripts for BIDS tabular phenotypic data in large neuroimaging datasets.
abcd abcd-study abide abide-data bids data-dictionary dataset datasets hbn hcp nki phenotype phenotypic pnc tabular tabular-data ukb ukbb ukbiobank
Last synced: 21 Mar 2024
https://github.com/antonycourtney/tad
A desktop application for viewing and analyzing tabular data
csv data-analysis data-science database desktop-application duckdb parquet-viewer pivot-tables pivots tabular-data
Last synced: 14 Mar 2024
https://github.com/openalloc/SwiftTabler
A multi-platform SwiftUI component for tabular data
coredata swift swift-coredata swift-lang swift-language swiftui swiftui-binding swiftui-components swiftui-grid swiftui-list swiftui-scrollview swiftui-tables tables tableview tabular-data tabular-editor
Last synced: 13 Mar 2024