https://github.com/sirius207/frequent-pattern-algorithm
Algorithms for Frequent-pattern Mining
https://github.com/sirius207/frequent-pattern-algorithm
data-mining frequent-pattern-mining
Last synced: 7 months ago
JSON representation
Algorithms for Frequent-pattern Mining
- Host: GitHub
- URL: https://github.com/sirius207/frequent-pattern-algorithm
- Owner: Sirius207
- License: mit
- Created: 2018-10-14T05:26:00.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-10-22T12:05:32.000Z (over 7 years ago)
- Last Synced: 2025-04-13T05:56:26.092Z (10 months ago)
- Topics: data-mining, frequent-pattern-mining
- Language: Jupyter Notebook
- Homepage:
- Size: 2.96 MB
- Stars: 5
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Frequent-pattern Algorithm
Algorithms for Frequent-pattern Mining
## Usage
### Find frequent pattern
```
python main.py --input data/sample.txt --output output.csv --minsup 0.3 --algorithm fp
```
- input: input data file name
- output: output file name
- minsup: minimum support
- algorithm: frequent pattern algorithm(fp or ap)
### Find association rules
```
python rules.py --minconfidence 0.2 --input patterns.csv --output rules.csv
```
- minconfidence: minimum confidence
- input: input data file name
- output: output file name
## Implementation
- [Apriori Algorithm](https://en.wikipedia.org/wiki/Apriori_algorithm)
- [FP Growth](https://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Frequent_Pattern_Mining/The_FP-Growth_Algorithm)
## Dataset
- [Kaggle: Random Shopping cart](https://www.kaggle.com/fanatiks/shopping-cart)
- v1: original data
- v2: [updated] filter duplicated items of each transaction
- IBM Quest Synthetic Data Generator
## Report
- [Jupyter Notebook](https://nbviewer.jupyter.org/github/Sirius207/Frequent-pattern-Algorithm/blob/master/Report.ipynb)