Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by WinVector

A curated list of projects in awesome lists by WinVector .

https://github.com/WinVector/zmPDSwR

Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)

Last synced: 25 Oct 2024

https://github.com/winvector/zmpdswr

Example R scripts and data for "Practical Data Science with R" 1st edition by Nina Zumel and John Mount (Manning Publications)

Last synced: 27 Dec 2024

https://github.com/winvector/vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

categorical-variables machine-learning-algorithms nested-models prepare-data r

Last synced: 03 Jan 2025

https://github.com/WinVector/vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

categorical-variables machine-learning-algorithms nested-models prepare-data r

Last synced: 27 Oct 2024

https://github.com/winvector/examples

Various examples for different articles

Last synced: 03 Jan 2025

https://github.com/WinVector/Examples

Various examples for different articles

Last synced: 27 Oct 2024

https://github.com/winvector/wrapr

Wrap R for Sweet R Code

Last synced: 05 Jan 2025

https://github.com/WinVector/wrapr

Wrap R for Sweet R Code

Last synced: 22 Nov 2024

https://github.com/winvector/pdswr2

Code, Data, and Examples for Practical Data Science with R 2nd edition (Nina Zumel and John Mount) https://github.com/WinVector/PDSwR2

Last synced: 27 Dec 2024

https://github.com/winvector/pyvtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

data-science machine-learning pydata python

Last synced: 08 Jan 2025

https://github.com/WinVector/pyvtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.

data-science machine-learning pydata python

Last synced: 22 Nov 2024

https://github.com/winvector/data_algebra

Codd method-chained SQL generator and Pandas data processing in Python.

data-analysis data-science pandas python

Last synced: 08 Jan 2025

https://github.com/WinVector/rquery

Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.

Last synced: 13 Nov 2024

https://github.com/winvector/rquery

Data Wrangling and Query Generating Operators for R. Distributed under choice of GPL-2 or GPL-3 license.

Last synced: 27 Dec 2024

https://winvector.github.io/WVPlots/

Pre-packaged plots in R

Last synced: 13 Nov 2024

https://github.com/winvector/wvplots

Pre-packaged plots in R

Last synced: 01 Jan 2025

https://github.com/winvector/replyr

Patches for using dplyr with Databases and Big Data

Last synced: 27 Dec 2024

https://github.com/WinVector/replyr

Patches for using dplyr with Databases and Big Data

Last synced: 16 Oct 2024

https://github.com/winvector/bigdatarstrata2017

All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017.

Last synced: 27 Dec 2024

https://github.com/WinVector/BigDataRStrata2017

All material for "Modeling big data with R, sparklyr, and Apache Spark" Strata Hadoop 2017.

Last synced: 25 Oct 2024

https://github.com/WinVector/seplyr

Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks

Last synced: 04 Dec 2024

https://github.com/winvector/seplyr

Improved Standard Evaluation Interfaces for Common Data Manipulation Tasks

Last synced: 07 Nov 2024

https://github.com/WinVector/cdata

Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.

Last synced: 04 Dec 2024

https://github.com/winvector/cdata

Higher order fluid or coordinatized data transforms in R. Distributed under choice of GPL-2 or GPL-3 license.

Last synced: 07 Nov 2024

https://github.com/winvector/campaignplanner

Example code for Lesson on Response Campaign planning

Last synced: 27 Dec 2024

https://github.com/winvector/rqdatatable

Implement the rquery piped query algebra in R using data.table. Distributed under choice of GPL-2 or GPL-3 license.

Last synced: 07 Nov 2024

https://github.com/winvector/logistic

Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.

Last synced: 07 Nov 2024

https://github.com/WinVector/addinexamplesWV

Ad-ins and keyboard shortcuts for building calculation pipelines in R

Last synced: 22 Nov 2024

https://github.com/winvector/autodiff

Example automatic differentiation code in Scala

Last synced: 07 Nov 2024

https://github.com/winvector/sigr

Concise formatting of significances in R (GPL3 license).

Last synced: 27 Dec 2024

https://github.com/winvector/exploremodels

Code and data for "The Geometry of Classifiers"

Last synced: 07 Nov 2024

https://github.com/winvector/winvector.github.io

Viewable pages from WinVector LLC view at: http://winvector.github.io

Last synced: 27 Dec 2024

https://github.com/winvector/nestedmodelstalk

Support materials for WinVector talk

Last synced: 27 Dec 2024

https://github.com/winvector/campaignplanner_v3

Shiny demo of A/B test planning and evaluation (improved UI for A/B testing method taught in free video course)

r

Last synced: 07 Nov 2024

https://github.com/winvector/wvlpsolver

Experimental pure Java revised simplex linear program solver (Apache 2.0 license)

Last synced: 07 Nov 2024

https://github.com/winvector/locality-sensitive-hashing-example

Simple example of Locality Sensitive Hashing

Last synced: 07 Nov 2024

https://github.com/winvector/odscwest2017

Win-Vector LLC ODSC West 2017 presentation materials (will be populated by the day of the conference)

Last synced: 27 Dec 2024

https://github.com/winvector/rcppdynprog

Dynamic Programming implemented in Rcpp. Includes example partition and out of sample fitting applications.

datascience machinelearning r

Last synced: 07 Nov 2024

https://github.com/winvector/validatingmodelsinr

Slides and code for "Validating Models in R" Strata 2016 RDay http://conferences.oreilly.com/strata/hadoop-big-data-ca/public/schedule/detail/48053

Last synced: 27 Dec 2024

https://github.com/winvector/kcomp

Demonstration of parametric bootstrap to find k for kmeans

Last synced: 27 Dec 2024

https://github.com/winvector/sqlscrewdriver

Iterate through database tables (by JDBC) and TSV(tab separated values)/CSV(comma separated values) and load/dump data.

Last synced: 07 Nov 2024

https://github.com/winvector/wvpy

Tools to convert from Jupyter notebooks to and from Python .py files, and render.

datascience machine-learning python python3

Last synced: 15 Oct 2024

https://github.com/winvector/fastbaser

Examples of fast grouped row-wise operations in R (no C, C++, data.table, or dplyr used).

Last synced: 07 Nov 2024

https://github.com/winvector/vectordemo

Tutorial on using vectors in data science projects.

Last synced: 27 Dec 2024

https://github.com/winvector/classifiermetrics

Some examples of measuring classifier performance in R

Last synced: 27 Dec 2024

https://github.com/winvector/qsurvival

Quasi observation based survival package for R.

Last synced: 07 Nov 2024

https://github.com/winvector/examplerpackage

Example of how to build a simple R package

Last synced: 27 Dec 2024

https://github.com/winvector/importance-sampling

Importance Sampling Example

Last synced: 27 Dec 2024

https://github.com/winvector/lstep

Trivial demonstration of a diverging Newton-Raphson step when solving a logistic regression

Last synced: 27 Dec 2024

https://github.com/winvector/outofcore

Example of out of core coding techniques

Last synced: 27 Dec 2024

https://github.com/winvector/jxref

Java based XML tool to help check Manning Agile Author XML for cross reference problems (Java based, GPL3+ license)

Last synced: 27 Dec 2024

https://github.com/winvector/breakingnestedmodelbias

Support materials for Win-Vector blog article

Last synced: 27 Dec 2024

https://github.com/winvector/daccum

Example library to accumulate data frame rows in R

Last synced: 27 Dec 2024

https://github.com/winvector/atasteofdatascience

Working an example of supervised machine learning in Python

Last synced: 27 Dec 2024

https://github.com/winvector/yconditionalregularizedmodel

Example of a neural net model, with regularization on y-conditional activation patterns

Last synced: 27 Dec 2024

https://github.com/winvector/experimentinspector

Java code to build synthetic data sets that match reported summary totals. Helps explore possible range of variation.

Last synced: 27 Dec 2024

https://github.com/winvector/crosspca

Cross-validated PCA/PCR demonstration based on the work: http://www.win-vector.com/blog/2016/05/pcr_part2_yaware/

Last synced: 27 Dec 2024

https://github.com/winvector/wvu

Win Vector LLC Python data science teaching tools (graphs and data manipulation)

Last synced: 27 Dec 2024

https://github.com/winvector/cvrtsencoder

Spectral encoding of categorical variables using model residual trajectories

Last synced: 27 Dec 2024

https://github.com/winvector/typicalitycoding

Simple example of how to use an embedding plus sphering/whitening transform to measure difference in distribution.

Last synced: 27 Dec 2024

https://github.com/winvector/sessionexample

Example code for articles on sessionizing data.

Last synced: 27 Dec 2024