https://github.com/oelin/d3s
Your daily dose of data science 🧠.
data-science information-theory machine-learning probability-theory self-learning statistics
- Host: GitHub
- URL: https://github.com/oelin/d3s
- Owner: oelin
- Created: 2023-07-14T10:26:03.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-07-14T11:59:27.000Z (almost 2 years ago)
- Last Synced: 2023-07-14T12:42:08.941Z (almost 2 years ago)
- Topics: data-science, information-theory, machine-learning, probability-theory, self-learning, statistics
- Language: Python
- Homepage:
- Size: 104 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
# Daily Dose of Data Science (d3s)
Get daily summaries of data science topics. d3s pulls from a list of over [5K topics](https://github.com/oelin/d3s-topics) in machine learning, statistics and probability theory.
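Conceptually, a "daily dose" starts with picking one topic at random from that list. The snippet below is a minimal sketch of that step, not d3s's actual code: the `TOPICS` list and the `pick_topic` helper are hypothetical names introduced for illustration, and uniform sampling is an assumption.

```python
import random

# Hypothetical stand-in for the topic list; the real list lives in the
# d3s-topics repository and is much longer.
TOPICS = ["Polynomial Regression", "Raking", "Kernel Method"]


def pick_topic(topics, rng=None):
    """Return one topic chosen uniformly at random (assumed behavior)."""
    # Accept an optional random.Random instance so results can be reproduced.
    rng = rng if rng is not None else random
    return rng.choice(topics)


print(pick_topic(TOPICS))
```

Passing a seeded `random.Random` instance as `rng` makes the choice reproducible, which is convenient when testing.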
## Installation
You can install d3s with pip.
```sh
pip install d3s
```
## Usage
To get a random summary, just run `d3s` in your terminal.
```sh
d3s
```
To get summaries every time you open your terminal, you could, for example, append `d3s` to your `~/.bashrc` file.
```sh
echo d3s >> ~/.bashrc
```
## Examples
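The sample summaries in this section share one layout: a title, a dashed underline of the same length, and a paragraph wrapped to a fixed width. The snippet below is a rough sketch of how such output could be produced with Python's standard `textwrap` module. It is not taken from d3s itself: `render_summary` is a hypothetical helper, and the width of 70 columns is an assumption inferred from the samples.

```python
import textwrap


def render_summary(title, text, width=70):
    """Format a summary as: title, dashed underline, wrapped paragraph."""
    underline = "-" * len(title)  # one dash per character in the title
    body = textwrap.fill(text, width=width)  # wrap at word boundaries
    return f"{title}\n{underline}\n{body}"


print(render_summary(
    "Raking",
    'Raking (also called "raking ratio estimation" or "iterative '
    "proportional fitting\") is the statistical process of adjusting data "
    "sample weights of a contingency table to match desired marginal totals.",
))
```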
```
Polynomial Regression
---------------------
In statistics, polynomial regression is a form of regression analysis
in which the relationship between the independent variable x and the
dependent variable y is modelled as an nth degree polynomial in x.
Polynomial regression fits a nonlinear relationship between the value
of x and the corresponding conditional mean of y, denoted E(y |x).
```
```
Raking
------
Raking (also called "raking ratio estimation" or "iterative
proportional fitting") is the statistical process of adjusting data
sample weights of a contingency table to match desired marginal
totals.
```
```
Kernel Method
-------------
In machine learning, kernel machines are a class of algorithms for
pattern analysis, whose best known member is the support-vector
machine (SVM). These methods involve using linear classifiers to solve
nonlinear problems. The general task of pattern analysis is to find
and study general types of relations (for example clusters, rankings,
principal components, correlations, classifications) in datasets.
```
## Planned
* Configuration options.
* LLM-generated summaries.
* Improved LaTeX rendering.
* Refined topic lists.
* Code refactoring.