Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by WinVector
A curated list of projects in awesome lists by WinVector .
https://github.com/WinVector/zmPDSwR
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
Last synced: 25 Oct 2024
https://github.com/winvector/zmpdswr
Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)
Last synced: 27 Dec 2024
https://github.com/winvector/vtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.
categorical-variables machine-learning-algorithms nested-models prepare-data r
Last synced: 03 Jan 2025
https://github.com/WinVector/vtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.
categorical-variables machine-learning-algorithms nested-models prepare-data r
Last synced: 27 Oct 2024
https://github.com/winvector/examples
Various examples for different articles
Last synced: 03 Jan 2025
https://github.com/WinVector/Examples
Various examples for different articles
Last synced: 27 Oct 2024
https://github.com/winvector/pdswr2
Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2
Last synced: 27 Dec 2024
https://github.com/winvector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 08 Jan 2025
https://github.com/WinVector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 22 Nov 2024
https://github.com/winvector/data_algebra
Codd method-chained SQL generator and Pandas data processing in Python.
data-analysis data-science pandas python
Last synced: 08 Jan 2025
https://github.com/WinVector/rquery
Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.
Last synced: 13 Nov 2024
https://github.com/winvector/rquery
Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.
Last synced: 27 Dec 2024
https://github.com/winvector/replyr
Patches for using dplyr with Databases and Big Data
Last synced: 27 Dec 2024
https://github.com/WinVector/replyr
Patches for using dplyr with Databases and Big Data
Last synced: 16 Oct 2024
https://github.com/winvector/bigdatarstrata2017
All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017.
Last synced: 27 Dec 2024
https://github.com/WinVector/BigDataRStrata2017
All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017.
Last synced: 25 Oct 2024
https://github.com/WinVector/seplyr
Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks
Last synced: 04 Dec 2024
https://github.com/winvector/seplyr
Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks
Last synced: 07 Nov 2024
https://github.com/WinVector/cdata
Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.
Last synced: 04 Dec 2024
https://github.com/winvector/cdata
Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.
Last synced: 07 Nov 2024
https://github.com/winvector/campaignplanner
Example code for Lesson on Response Campaign planning
Last synced: 27 Dec 2024
https://github.com/winvector/rqdatatable
Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license.
Last synced: 07 Nov 2024
https://github.com/winvector/logistic
Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.
Last synced: 07 Nov 2024
https://github.com/WinVector/addinexamplesWV
Ad-ins and keyboard shortcuts for building calculation pipelines in R
Last synced: 22 Nov 2024
https://github.com/winvector/autodiff
Example automatic differentiation code in Scala
Last synced: 07 Nov 2024
https://github.com/winvector/sigr
Concise formatting of significances in R (GPL3 license).
Last synced: 27 Dec 2024
https://github.com/winvector/exploremodels
Code and data for "The Geometry of Classifiers"
Last synced: 07 Nov 2024
https://github.com/winvector/winvector.github.io
Viewable pages from WinVector LLC view at: http://winvector.github.io
Last synced: 27 Dec 2024
https://github.com/winvector/nestedmodelstalk
Support materials for WinVector talk
Last synced: 27 Dec 2024
https://github.com/winvector/campaignplanner_v3
Shiny demo of A/B test planning and evaluation (improved UI for A/B testing method taught in free video course)
Last synced: 07 Nov 2024
https://github.com/winvector/wvlpsolver
Experimental pure Java revised simplex linear program solver (Apache 2.0 license)
Last synced: 07 Nov 2024
https://github.com/winvector/locality-sensitive-hashing-example
Simple example of Locality Sensitive Hashing
Last synced: 07 Nov 2024
https://github.com/winvector/odscwest2017
Win-Vector LLC ODSC West 2017 presentation materials (will be populated by the day of the conference)
Last synced: 27 Dec 2024
https://github.com/winvector/rcppdynprog
Dynamic Programming implemented in Rcpp. Includes example partition and out of sample fitting applications.
Last synced: 07 Nov 2024
https://github.com/winvector/validatingmodelsinr
Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/detail/48053
Last synced: 27 Dec 2024
https://github.com/winvector/kcomp
Demonstration of parametric bootstrap to find k for kmeans
Last synced: 27 Dec 2024
https://github.com/winvector/sqlscrewdriver
Iterate through database tables (by JDBC) and TSV(tab separated values)/CSV(comma separated values) and load/dump data.
Last synced: 07 Nov 2024
https://github.com/winvector/wvpy
Tools to convert from Jupyter notebooks to and from Python .py files, and render.
datascience machine-learning python python3
Last synced: 15 Oct 2024
https://github.com/winvector/fastbaser
Examples of fast grouped row-wise operations in R (no C, C++, data.table, or dplyr used).
Last synced: 07 Nov 2024
https://github.com/winvector/vectordemo
Tutorial on using vectors in data science projects.
Last synced: 27 Dec 2024
https://github.com/winvector/classifiermetrics
Some examples of measuring classifier performance in R
Last synced: 27 Dec 2024
https://github.com/winvector/qsurvival
Quasi observation based survival package for R.
Last synced: 07 Nov 2024
https://github.com/winvector/examplerpackage
Example of how to build a simple R package
Last synced: 27 Dec 2024
https://github.com/winvector/importance-sampling
Importance Sampling Example
Last synced: 27 Dec 2024
https://github.com/winvector/lstep
Trivial demonstration of a diverging Newton-Raphson step when solving a logistic regression
Last synced: 27 Dec 2024
https://github.com/winvector/outofcore
Example of out of core coding techniques
Last synced: 27 Dec 2024
https://github.com/winvector/jxref
Java based XML tool to help check Manning Agile Author XML for cross reference problems (Java based, GPL3+ license)
Last synced: 27 Dec 2024
https://github.com/winvector/breakingnestedmodelbias
Support materials for Win-Vector blog article
Last synced: 27 Dec 2024
https://github.com/winvector/daccum
Example library to accumulate data frame rows in R
Last synced: 27 Dec 2024
https://github.com/winvector/atasteofdatascience
Working an example of supervised machine learning in Python
Last synced: 27 Dec 2024
https://github.com/winvector/yconditionalregularizedmodel
Example of a neural net model, with regularization on y-conditional activation patterns
Last synced: 27 Dec 2024
https://github.com/winvector/experimentinspector
Java code to build synthetic data sets that match reported summary totals. Helps explore possible range of variation.
Last synced: 27 Dec 2024
https://github.com/winvector/crosspca
Cross-validated PCA/PCR demonstration based on the work: http://www.win-vector.com/blog/2016/05/pcr_part2_yaware/
Last synced: 27 Dec 2024
https://github.com/winvector/wvu
Win Vector LLC Python data science teaching tools (graphs and data manipulation)
Last synced: 27 Dec 2024
https://github.com/winvector/cvrtsencoder
Spectral encoding of categorical variables using model residual trajectories
Last synced: 27 Dec 2024
https://github.com/winvector/typicalitycoding
Simple example of how to use an embedding plus sphering/whitening transform to measure difference in distribution.
Last synced: 27 Dec 2024
https://github.com/winvector/sessionexample
Example code for articles on sessionizing data.
Last synced: 27 Dec 2024