An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by VIDA-NYU

A curated list of projects in awesome lists by VIDA-NYU .

https://github.com/vida-nyu/reprozip

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

archiving computational-science docker hacktoberfest linux nyu ptrace python reproducibility reproducible-research reproducible-science reprounzip reprozip science scientific-computing vagrant

Last synced: 10 Apr 2025

https://github.com/VIDA-NYU/reprozip

ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.

archiving computational-science docker hacktoberfest linux nyu ptrace python reproducibility reproducible-research reproducible-science reprounzip reprozip science scientific-computing vagrant

Last synced: 27 Mar 2025

https://github.com/vida-nyu/tile2net

Automated mapping of pedestrian networks from aerial imagery tiles

Last synced: 04 Apr 2025

https://github.com/vida-nyu/pipelinevis

Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.

automl jupyter machine-learning visualization

Last synced: 07 May 2025

https://github.com/VIDA-NYU/PipelineVis

Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.

automl jupyter machine-learning visualization

Last synced: 02 May 2025

https://github.com/vida-nyu/openclean

openclean - Data Cleaning and data profiling library for Python

Last synced: 10 Apr 2025

https://github.com/vida-nyu/taxivis

Visual Exploration of New York City Taxi Trips

Last synced: 10 Apr 2025

https://github.com/vida-nyu/city-surfaces

CitySurfaces semantic segmentation of sidewalk surfaces

computer-vision material sidewalk sidewalk-surface urban-analytics urban-data-science

Last synced: 10 Apr 2025

https://github.com/vida-nyu/auctus

Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index

crawling data-profiling dataset dataset-search index search search-engine

Last synced: 10 Apr 2025

https://github.com/vida-nyu/data-polygamy

Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.

data data-science nyucds

Last synced: 10 Apr 2025

https://github.com/vida-nyu/alpha-automl

Alpha-AutoML is a Python library for automatically generating end-to-end machine learning pipelines.

automl data-science machine-learning python

Last synced: 10 Apr 2025

https://github.com/vida-nyu/pycalibrate

pycalibrate is a Python library to visually analyze model calibration in Jupyter Notebooks

calibration machine-learning model-analysis model-calibration

Last synced: 10 Apr 2025

https://github.com/vida-nyu/reproducibility-news

Currated reproducibility news displayed on reproduciblescience.org

feed news nyucds reproducibility reproducible-research reproducible-science rss rss-feed science

Last synced: 10 Apr 2025

https://github.com/vida-nyu/reproserver

A web application reproducing ReproZip packages in the cloud.

docker hacktoberfest kubernetes linux nyu reproducibility reproducible-research reprounzip reprozip science

Last synced: 10 Apr 2025

https://github.com/vida-nyu/openclean-core

Data Cleaning and Data Profiling Library for Python

data-cleaning data-curation hacktoberfest

Last synced: 10 Apr 2025

https://github.com/vida-nyu/aws_taxi

Sample scripts to analyze taxi data on Amazon AWS

Last synced: 10 Apr 2025

https://github.com/vida-nyu/domain-discovery-d4

Data-Driven Domain Discovery for Structured Datasets

Last synced: 10 Apr 2025

https://github.com/vida-nyu/bugdoc

BugDoc: python package to debug computational pipelines

Last synced: 10 Apr 2025

https://github.com/vida-nyu/argus

ARGUS is a visual analytics tool that facilitates multimodal data collection, enables quick user modeling, and allows for retrospective analysis and debugging of historical data generated by the AR sensors and ML models that support task guidance.

Last synced: 10 Apr 2025

https://github.com/vida-nyu/genotet

Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks

nyucds

Last synced: 10 Apr 2025

https://github.com/vida-nyu/usagestats

Anonymous usage statistics collector

python reprozip statistics usage usage-data vistrails

Last synced: 08 May 2025

https://github.com/VIDA-NYU/usagestats

Anonymous usage statistics collector

python reprozip statistics usage usage-data vistrails

Last synced: 20 Apr 2025

https://github.com/vida-nyu/birdvis

Source code for the BirdVis project, for more information visit www.birdvis.org

Last synced: 10 Apr 2025

https://github.com/vida-nyu/alphad3m

Last synced: 10 Apr 2025

https://github.com/vida-nyu/openclean-pattern

Pattern identifier and anomaly detector

Last synced: 10 Apr 2025

https://github.com/ViDA-NYU/birdvis

Source code for the BirdVis project, for more information visit www.birdvis.org

Last synced: 11 May 2025

https://github.com/vida-nyu/bdi-kit

A Python toolkit for biomedical data integration

Last synced: 10 Apr 2025

https://github.com/vida-nyu/mongodb-vls

MongoDB-VLS is an implementation of VLS (Virtual Lightweight Snapshots) in MongoDB. VLS is a mechanism that enables consistent analytics without blocking incoming updates in NoSQL stores.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/urban-data-provider

Download and transform (open urban) data sets from different data provider

Last synced: 18 Mar 2025

https://github.com/vida-nyu/openclean-geo

Geo-Spatial Data Extension for openclean

Last synced: 18 Mar 2025

https://github.com/vida-nyu/openclean-notebook

UI for openclean in Jupyter and Colab Notebooks

Last synced: 18 Mar 2025

https://github.com/vida-nyu/prida

PRIDA: Pruning Irrelevant Datasets for Data Augmentation.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/artist

Last synced: 18 Mar 2025

https://github.com/vida-nyu/aries-issues

A version of ARIES

Last synced: 18 Mar 2025

https://github.com/vida-nyu/pedestrian-sensing-model

Generation of a pedestrian density map using ground-level images.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/busexplorer

Bus Time Tool: a web-based tool for the exploration of bus trajectory data

bus bus-pings mongodb mta transport

Last synced: 10 Apr 2025

https://github.com/vida-nyu/openclean-metanome

Python package to run Metanome data profiling algorithms

Last synced: 18 Mar 2025

https://github.com/vida-nyu/vida-nyu.github.io

Home page for the group

Last synced: 18 Mar 2025

https://github.com/vida-nyu/cmu-mmac2epic-kitchens

CMU MMAC 2 Epic Kitchens annotation format

Last synced: 18 Mar 2025

https://github.com/vida-nyu/ptg-server-ml

The machine learning model deployment

Last synced: 18 Mar 2025

https://github.com/vida-nyu/ptgctl

A Python Library and Command Line tool for the PTG API.

Last synced: 05 May 2025

https://github.com/vida-nyu/redis-streamer

An API to communicate with redis over websockets

Last synced: 18 Mar 2025

https://github.com/vida-nyu/minesafe

Minesafe is a Crowdsourcing information system for people in rural areas of countries affected by antipersonnel mines

Last synced: 18 Mar 2025

https://github.com/vida-nyu/interactivecalibration

Interactive Calibration Plots

Last synced: 18 Mar 2025

https://github.com/vida-nyu/python-staticflow

Construct a data flow from static analysis of Python code

Last synced: 18 Mar 2025

https://github.com/vida-nyu/repromatch

Website designed to help you find the tool (or tools) that best matches your reproduciblity needs

directory reproducibility reproducible-research reproducible-science science scientific-computing tools

Last synced: 18 Mar 2025

https://github.com/vida-nyu/urban-data-core

Core functionality and classes for Urban Data Integration project

Last synced: 18 Mar 2025

https://github.com/vida-nyu/cfsim

Counterfactuals simulator tool

Last synced: 18 Mar 2025

https://github.com/vida-nyu/kvdb4j

A simple Java interface for multiple key-value databases.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/topomap-pp

TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees

dimensionality-reduction paper projection python3 topological-data-analysis topology visualization

Last synced: 24 Nov 2024

https://github.com/vida-nyu/argus2

Last synced: 18 Mar 2025

https://github.com/vida-nyu/ltpt

LTPT Repo

Last synced: 18 Mar 2025

https://github.com/vida-nyu/openclean-reference-data

Collection of Reference Datasets for Data Cleaning

Last synced: 18 Mar 2025

https://github.com/vida-nyu/mmdx

A tool for data exploration and labeling using multi-modal embedding models.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/redis-record

Redis stream recording

Last synced: 18 Mar 2025

https://github.com/vida-nyu/disn-wildlife

https://vida-nyu.github.io/DISN-Wildlife/

project-site

Last synced: 18 Mar 2025

https://github.com/vida-nyu/genotet-widgets

Widget components of the Genotet system

Last synced: 18 Mar 2025

https://github.com/vida-nyu/3d-memory

Last synced: 18 Mar 2025

https://github.com/vida-nyu/adaptive-sensing

In this project we propose to smartly sense the environment considering given features of interest.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/reference-data-repository

Package for downloading data from the Reference Data Repository.

Last synced: 10 Apr 2025

https://github.com/vida-nyu/ptg-ta2-parsers

Repository for various scripts to parse TA2 datasets into NYU system

Last synced: 18 Mar 2025

https://github.com/vida-nyu/object-states

Object State Classification

Last synced: 18 Mar 2025

https://github.com/vida-nyu/scdp

A profiler to compute basic statistics about a dataset. This is one of the Metanome algorithms.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/dataset-search-and-discovery-seminar

Dataset Search and Discovery Seminar Website

website

Last synced: 18 Mar 2025

https://github.com/vida-nyu/gdpfinder

Python code to train and evaluate machine learning models for the estimation of neighborhood-level census statistics.

Last synced: 18 Mar 2025

https://github.com/vida-nyu/tim-dashboard

TIM Dashboard

Last synced: 18 Mar 2025

https://github.com/vida-nyu/mi-sketches

Experiments code for ICDE submission

Last synced: 18 Mar 2025