Projects in Awesome Lists by VIDA-NYU
A curated list of projects in awesome lists by VIDA-NYU .
https://github.com/VIDA-NYU/ache
ACHE is a web crawler for domain-specific search.
domain-specific-search focused-crawler hacktoberfest web-crawler web-scraping web-search web-spider
Last synced: 03 Apr 2025
https://github.com/vida-nyu/ache
ACHE is a web crawler for domain-specific search.
domain-specific-search focused-crawler hacktoberfest web-crawler web-scraping web-search web-spider
Last synced: 04 Apr 2025
https://github.com/ViDA-NYU/ache
ACHE is a web crawler for domain-specific search.
domain-specific-search focused-crawler hacktoberfest web-crawler web-scraping web-search web-spider
Last synced: 25 Mar 2025
https://github.com/vida-nyu/reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
archiving computational-science docker hacktoberfest linux nyu ptrace python reproducibility reproducible-research reproducible-science reprounzip reprozip science scientific-computing vagrant
Last synced: 10 Apr 2025
https://github.com/VIDA-NYU/reprozip
ReproZip is a tool that simplifies the process of creating reproducible experiments from command-line executions, a frequently-used common denominator in computational science.
archiving computational-science docker hacktoberfest linux nyu ptrace python reproducibility reproducible-research reproducible-science reprounzip reprozip science scientific-computing vagrant
Last synced: 27 Mar 2025
https://github.com/vida-nyu/tile2net
Automated mapping of pedestrian networks from aerial imagery tiles
Last synced: 04 Apr 2025
https://github.com/vida-nyu/pipelinevis
Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.
automl jupyter machine-learning visualization
Last synced: 07 May 2025
https://github.com/VIDA-NYU/PipelineVis
Pipeline Profiler is a tool for visualizing machine learning pipelines generated by AutoML tools.
automl jupyter machine-learning visualization
Last synced: 02 May 2025
https://github.com/vida-nyu/openclean
openclean - Data Cleaning and data profiling library for Python
Last synced: 10 Apr 2025
https://github.com/vida-nyu/taxivis
Visual Exploration of New York City Taxi Trips
Last synced: 10 Apr 2025
https://github.com/vida-nyu/urban-pulse
A standalone version of Urban Pulse
computational-topology new-york-university nyu nyucds urban-informatics urban-planning urban-pulse
Last synced: 10 Apr 2025
https://github.com/vida-nyu/city-surfaces
CitySurfaces semantic segmentation of sidewalk surfaces
computer-vision material sidewalk sidewalk-surface urban-analytics urban-data-science
Last synced: 10 Apr 2025
https://github.com/vida-nyu/auctus
Dataset search engine, discovering data from a variety of sources, profiling it, and allowing advanced queries on the index
crawling data-profiling dataset dataset-search index search search-engine
Last synced: 10 Apr 2025
https://github.com/vida-nyu/data-polygamy
Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Last synced: 10 Apr 2025
https://github.com/vida-nyu/shadow-accrual-maps
Accumulated shadow data computed for New York City
boston chicago geography gis los-angeles new-york-city new-york-university nyu shadow spatial-analysis urban-planning visualization washington-dc
Last synced: 10 Apr 2025
https://github.com/vida-nyu/alpha-automl
Alpha-AutoML is a Python library for automatically generating end-to-end machine learning pipelines.
automl data-science machine-learning python
Last synced: 10 Apr 2025
https://github.com/vida-nyu/pycalibrate
pycalibrate is a Python library to visually analyze model calibration in Jupyter Notebooks
calibration machine-learning model-analysis model-calibration
Last synced: 10 Apr 2025
https://github.com/vida-nyu/reprozip-examples
Examples and demos for ReproZip
computational-science hacktoberfest reproducibility reproducible-research reproducible-science reprounzip reprozip scientific-computing
Last synced: 18 Mar 2025
https://github.com/vida-nyu/reproducibility-news
Currated reproducibility news displayed on reproduciblescience.org
feed news nyucds reproducibility reproducible-research reproducible-science rss rss-feed science
Last synced: 10 Apr 2025
https://github.com/vida-nyu/raster-join
nyucds spatial-aggregation spatial-data
Last synced: 10 Apr 2025
https://github.com/vida-nyu/reproserver
A web application reproducing ReproZip packages in the cloud.
docker hacktoberfest kubernetes linux nyu reproducibility reproducible-research reprounzip reprozip science
Last synced: 10 Apr 2025
https://github.com/vida-nyu/openclean-core
Data Cleaning and Data Profiling Library for Python
data-cleaning data-curation hacktoberfest
Last synced: 10 Apr 2025
https://github.com/vida-nyu/aws_taxi
Sample scripts to analyze taxi data on Amazon AWS
Last synced: 10 Apr 2025
https://github.com/vida-nyu/domain-discovery-d4
Data-Driven Domain Discovery for Structured Datasets
Last synced: 10 Apr 2025
https://github.com/vida-nyu/bugdoc
BugDoc: python package to debug computational pipelines
Last synced: 10 Apr 2025
https://github.com/vida-nyu/argus
ARGUS is a visual analytics tool that facilitates multimodal data collection, enables quick user modeling, and allows for retrospective analysis and debugging of historical data generated by the AR sensors and ML models that support task guidance.
Last synced: 10 Apr 2025
https://github.com/vida-nyu/genotet
Genotet: An Interactive Web-based Visual Exploration Framework to Support Validation of Gene Regulatory Networks
Last synced: 10 Apr 2025
https://github.com/vida-nyu/usagestats
Anonymous usage statistics collector
python reprozip statistics usage usage-data vistrails
Last synced: 08 May 2025
https://github.com/VIDA-NYU/usagestats
Anonymous usage statistics collector
python reprozip statistics usage usage-data vistrails
Last synced: 20 Apr 2025
https://github.com/vida-nyu/birdvis
Source code for the BirdVis project, for more information visit www.birdvis.org
Last synced: 10 Apr 2025
https://github.com/vida-nyu/openclean-pattern
Pattern identifier and anomaly detector
Last synced: 10 Apr 2025
https://github.com/ViDA-NYU/birdvis
Source code for the BirdVis project, for more information visit www.birdvis.org
Last synced: 11 May 2025
https://github.com/vida-nyu/bdi-kit
A Python toolkit for biomedical data integration
Last synced: 10 Apr 2025
https://github.com/vida-nyu/mongodb-vls
MongoDB-VLS is an implementation of VLS (Virtual Lightweight Snapshots) in MongoDB. VLS is a mechanism that enables consistent analytics without blocking incoming updates in NoSQL stores.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/urban-data-provider
Download and transform (open urban) data sets from different data provider
Last synced: 18 Mar 2025
https://github.com/vida-nyu/openclean-geo
Geo-Spatial Data Extension for openclean
Last synced: 18 Mar 2025
https://github.com/vida-nyu/openclean-notebook
UI for openclean in Jupyter and Colab Notebooks
Last synced: 18 Mar 2025
https://github.com/vida-nyu/prida
PRIDA: Pruning Irrelevant Datasets for Data Augmentation.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/bdi-viz
bdf-toolbox data-harmonization data-integration
Last synced: 05 May 2025
https://github.com/vida-nyu/reproducible-science-nyu
https://nyu.reproduciblescience.org
nyu nyucds reproducibility reproducible-research reproducible-science
Last synced: 18 Mar 2025
https://github.com/vida-nyu/pedestrian-sensing-model
Generation of a pedestrian density map using ground-level images.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/openclean-metanome
Python package to run Metanome data profiling algorithms
Last synced: 18 Mar 2025
https://github.com/vida-nyu/cmu-mmac2epic-kitchens
CMU MMAC 2 Epic Kitchens annotation format
Last synced: 18 Mar 2025
https://github.com/vida-nyu/ptg-server-ml
The machine learning model deployment
Last synced: 18 Mar 2025
https://github.com/vida-nyu/ptgctl
A Python Library and Command Line tool for the PTG API.
Last synced: 05 May 2025
https://github.com/vida-nyu/redis-streamer
An API to communicate with redis over websockets
Last synced: 18 Mar 2025
https://github.com/vida-nyu/minesafe
Minesafe is a Crowdsourcing information system for people in rural areas of countries affected by antipersonnel mines
Last synced: 18 Mar 2025
https://github.com/vida-nyu/interactivecalibration
Interactive Calibration Plots
Last synced: 18 Mar 2025
https://github.com/vida-nyu/python-staticflow
Construct a data flow from static analysis of Python code
Last synced: 18 Mar 2025
https://github.com/vida-nyu/repromatch
Website designed to help you find the tool (or tools) that best matches your reproduciblity needs
directory reproducibility reproducible-research reproducible-science science scientific-computing tools
Last synced: 18 Mar 2025
https://github.com/vida-nyu/urban-data-core
Core functionality and classes for Urban Data Integration project
Last synced: 18 Mar 2025
https://github.com/vida-nyu/kvdb4j
A simple Java interface for multiple key-value databases.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/topomap-pp
TopoMap++: A faster and more space efficient technique to compute projections with topological guarantees
dimensionality-reduction paper projection python3 topological-data-analysis topology visualization
Last synced: 24 Nov 2024
https://github.com/vida-nyu/openclean-reference-data
Collection of Reference Datasets for Data Cleaning
Last synced: 18 Mar 2025
https://github.com/vida-nyu/mmdx
A tool for data exploration and labeling using multi-modal embedding models.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/disn-wildlife
https://vida-nyu.github.io/DISN-Wildlife/
Last synced: 18 Mar 2025
https://github.com/vida-nyu/genotet-widgets
Widget components of the Genotet system
Last synced: 18 Mar 2025
https://github.com/vida-nyu/adaptive-sensing
In this project we propose to smartly sense the environment considering given features of interest.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/reference-data-repository
Package for downloading data from the Reference Data Repository.
Last synced: 10 Apr 2025
https://github.com/vida-nyu/ptg-ta2-parsers
Repository for various scripts to parse TA2 datasets into NYU system
Last synced: 18 Mar 2025
https://github.com/vida-nyu/scdp
A profiler to compute basic statistics about a dataset. This is one of the Metanome algorithms.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/dataset-search-and-discovery-seminar
Dataset Search and Discovery Seminar Website
Last synced: 18 Mar 2025
https://github.com/vida-nyu/gdpfinder
Python code to train and evaluate machine learning models for the estimation of neighborhood-level census statistics.
Last synced: 18 Mar 2025
https://github.com/vida-nyu/mi-sketches
Experiments code for ICDE submission
Last synced: 18 Mar 2025