Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/ExtractTable/ExtractTable-py

Python library to extract tabular data from images and scanned PDFs

extracttable image-table-recognition ocr pdf-table-extract table-extraction tabular-data

Last synced: 30 Jun 2024

https://github.com/kennethreitz/tablib

Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.

csv json reports sqlalchemy tabular-data yaml

Last synced: 29 Jun 2024

https://github.com/wwweiwei/awesome-self-supervised-learning-for-tabular-data

A collection of research materials on SSL for non-sequential tabular data (SSL4NSTD)

artificial-intelligence deep-learning machine-learning self-supervised-learning tabular-data

Last synced: 25 Jun 2024

https://github.com/inokawa/virtua

A zero-config, fast and small (~3kB) virtual list (and grid) component for React, Vue and Solid.

headlessui infinite-scroll performance react react-component react-server-components scrolling solid tabular-data virtual-scroll virtualization virtualized vue windowing

Last synced: 23 Jun 2024

https://github.com/NVIDIA-Merlin/Transformers4Rec

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

bert gtp huggingface language-model nlp pytorch recommender-system recsys seq2seq session-based-recommendation tabular-data transformer xlnet

Last synced: 22 Jun 2024

https://github.com/wq/itertable

⇔ IterTable is a Pythonic API for iterating through tabular data formats, including CSV, XLSX, XML, and JSON.

csv data-processing excel export import iterable json openpyxl pandas pythonic spreadsheet tabular-data xml

Last synced: 20 Jun 2024

https://github.com/wwweiwei/DoRA

DoRA: Domain-Based Self-Supervised Learning Framework for Low-Resource Real Estate Appraisal (CIKM-23)

few-shot-learning self-supervised-learning tabular-data

Last synced: 11 Jun 2024

https://github.com/RyanWangZf/transtab

NeurIPS'22 | TransTab: Learning Transferable Tabular Transformers Across Tables

data-mining data-science machine-learning tabular-data

Last synced: 11 Jun 2024

https://github.com/somepago/saint

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

deep-learning tabular-data transformer

Last synced: 11 Jun 2024

https://github.com/AstraZeneca/SubTab

The official implementation of the paper, "SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning"

contrastive-learning multi-view-learning representation-learning self-supervised-learning tabular-data

Last synced: 11 Jun 2024

https://github.com/r-rudra/tidycells

Automatic transformation of untidy spreadsheet-like data into tidy form

cran data-wrangling heuristic heuristic-algorithm r r-package r-stats spreadsheets tabular-data tidy

Last synced: 10 Jun 2024

https://github.com/dreamquark-ai/tabnet

PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf

deep-neural-networks machine-learning-library pytorch pytorch-tabnet research-paper tabnet tabular-data

Last synced: 07 Jun 2024

https://github.com/turicas/rows

A common, beautiful interface to tabular data, no matter the format

convert-data csv data data-science excel hacktoberfest python table tabular-data xls xlsx

Last synced: 17 May 2024

https://github.com/jrieke/fastapi-csv

🏗️ Create APIs from CSV files within seconds, using fastapi

api csv csv-api excel fastapi fastapi-template google-sheets python table tabular-data

Last synced: 15 May 2024

https://github.com/naity/image_tabular

Integrate image and tabular data for deep learning

deep-learning fastai image-classification pytorch tabular-data

Last synced: 14 May 2024

https://github.com/amaiya/ktrain

ktrain is a Python library that makes deep learning and AI more accessible and easier to apply

computer-vision deep-learning graph-neural-networks keras machine-learning nlp python tabular-data tensorflow

Last synced: 13 May 2024

https://github.com/bvaughn/react-virtualized

React components for efficiently rendering large lists and tabular data

grid list listview performance react react-components tabular-data virtualization windowing

Last synced: 10 May 2024

https://github.com/nhn/tui.grid

🍞🔡 The Powerful Component to Display and Edit Data. Experience the Ultimate Data Transformer!

datagrid datatable excel grid javascript preact reactivity spreadsheet tabular-data toast-ui treegrid typescript

Last synced: 07 May 2024

https://sentinel-energy.github.io/friendly_data/

Data format to interoperate between models and frameworks

analysis datapackage python tabular-data

Last synced: 07 May 2024

https://github.com/cldf/csvw

CSV on the web

csv csvw python tabular-data

Last synced: 04 May 2024

https://github.com/Meteor-Community-Packages/meteor-tabular

Reactive datatables for large or small datasets

blaze datatable meteorjs tabular tabular-data

Last synced: 03 May 2024

https://github.com/aerosol/tabula

:u7533: Pretty printer for maps/structs collections (Elixir)

elixir pretty-printer tabular-data

Last synced: 01 May 2024

https://github.com/rubycocos/csvreader

csvreader library / gem - read tabular data in the comma-separated values (csv) format the right way (uses best practices out-of-the-box with zero-configuration)

csv csv11 csvhash csvrecord export import json numerics tab tabular tabular-data

Last synced: 01 May 2024

https://github.com/vaexio/vaex

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

bigdata data-science dataframe hdf5 machine-learning machinelearning memory-mapped-file pyarrow python tabular-data visualization

Last synced: 28 Apr 2024

https://github.com/reubano/meza

A Python toolkit for processing tabular data

csv data excel featured functional-programming library pandas tabular-data xlsx xml

Last synced: 28 Apr 2024

https://github.com/automl/Auto-PyTorch

Automatic architecture search and hyperparameter optimization for PyTorch

automl deep-learning pytorch tabular-data

Last synced: 28 Apr 2024

https://github.com/scottrhoyt/SwiftyTextTable

A lightweight library for generating text tables.

carthage cocoapods command-line linux macos swift swift-package-manager tabular-data

Last synced: 27 Apr 2024

https://github.com/kathrinse/be_great

A novel approach for synthesizing tabular data using pretrained large language models

data-generation deep-learning tabular-data transformers

Last synced: 27 Apr 2024

https://github.com/rajasegar/htmx-tabular

Tabular Data with htmx

htmx pagination search sorting tabular-data

Last synced: 23 Apr 2024

https://github.com/Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data

Last synced: 21 Apr 2024

https://github.com/eBay/tsv-utils

eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.

cli command-line csv d data-mining data-science delimited-files dlang reservoir-sampling sampling shuffle statistics tabular-data tsv uniq

Last synced: 20 Apr 2024

https://github.com/alexhallam/tv

📺(tv) Tidy Viewer is a cross-platform CLI csv pretty printer that uses column styling to maximize viewer enjoyment.

cli column command-line command-line-tool csv csv-cat csv-column csv-pretty-print csv-viewer csv-visualization data-science dataframe datatable pretty-print pretty-printer rust tabular-data terminal tibble

Last synced: 20 Apr 2024

https://github.com/Lightning-Universe/lightning-flash

Your PyTorch AI Factory - Flash enables you to easily configure and run complex AI recipes for over 15 tasks across 7 data domains

classification deep-learning fiftyone icevision machine-learning object-detection open3d pytorch pytorch-lightning pytorch-video tabular-data tasks-flash torch-geometric

Last synced: 19 Apr 2024

https://github.com/ropensci/tabulapdf

Bindings for Tabula PDF Table Extractor Library

java pdf pdf-document peer-reviewed r r-package ropensci rstats tabula tabular-data

Last synced: 13 Apr 2024

https://github.com/manujosephv/pytorch_tabular

A standard framework for modelling Deep Learning Models for tabular data

deep-learning hacktoberfest machine-learning pytorch pytorch-lightning tabular-data

Last synced: 11 Apr 2024

https://github.com/microsoft/CASPR

CASPR is a deep learning framework applying transformer architecture to learn and predict from tabular data at scale.

attention-mechanism business deep-learning tabular-data transformer transformer-architecture transformer-encoder

Last synced: 08 Apr 2024

https://github.com/georgian-io/Multimodal-Toolkit

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

huggingface-transformers multimodal-learning natural-language-processing tabular-data transformer

Last synced: 08 Apr 2024

https://github.com/sdv-dev/TGAN

Generative adversarial training for generating synthetic tabular data.

generative-adversarial-network synthesizing-tabular-data synthetic-data tabular-data

Last synced: 08 Apr 2024

https://github.com/sdv-dev/CTGAN

Conditional GAN for generating synthetic tabular data.

data-generation generative-adversarial-network synthetic-data synthetic-data-generation tabular-data

Last synced: 08 Apr 2024

https://github.com/continuum/active_importer

Define importers that load tabular data from spreadsheets or CSV files into any ActiveRecord-like ORM.

activerecord csv-files data-import importer orm ruby spreadsheet tabular-data

Last synced: 06 Apr 2024

https://github.com/PhoebusSi/Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台,我们欢迎开源爱好者发起任何有意义的pr!

alpaca chatglm chatgpt cot instruction-tuning llama llm lora moss p-tuning parameter-efficient pytorch tabul tabular-data tabular-model

Last synced: 01 Apr 2024

https://github.com/ropensci/tabulizer

Bindings for Tabula PDF Table Extractor Library

java pdf pdf-document peer-reviewed r r-package ropensci rstats tabula tabular-data

Last synced: 31 Mar 2024

https://bvaughn.github.io/react-virtualized/

React components for efficiently rendering large lists and tabular data

grid list listview performance react react-components tabular-data virtualization windowing

Last synced: 27 Mar 2024

https://github.com/nimh-dsst/dataset-phenotypes

Preparatory scripts for BIDS tabular phenotypic data in large neuroimaging datasets.

abcd abcd-study abide abide-data bids data-dictionary dataset datasets hbn hcp nki phenotype phenotypic pnc tabular tabular-data ukb ukbb ukbiobank

Last synced: 21 Mar 2024

https://github.com/antonycourtney/tad

A desktop application for viewing and analyzing tabular data

csv data-analysis data-science database desktop-application duckdb parquet-viewer pivot-tables pivots tabular-data

Last synced: 14 Mar 2024