Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/wapiti08/osptrack
labelled dataset for simulated package execution with package-analysis
https://github.com/wapiti08/osptrack
dataset dynamic-features open-source supply-chain-security
Last synced: 4 days ago
JSON representation
labelled dataset for simulated package execution with package-analysis
- Host: GitHub
- URL: https://github.com/wapiti08/osptrack
- Owner: Wapiti08
- License: apache-2.0
- Created: 2023-08-10T14:01:48.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-21T12:13:32.000Z (about 2 months ago)
- Last Synced: 2024-12-29T01:39:17.774Z (10 days ago)
- Topics: dataset, dynamic-features, open-source, supply-chain-security
- Language: Jupyter Notebook
- Homepage:
- Size: 20 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
- Citation: CITATION.cff
Awesome Lists containing this project
README
# OSPTrack
labelled dataset for simulated package execution with package-analysis![Python](https://img.shields.io/badge/Python3-3.10-brightgreen.svg)
![License](https://img.shields.io/badge/license-MIT3.0-green.svg)
![Testing Environment](https://img.shields.io/badge/Ubuntu-22.04.5-golden.svg)
[![DOI](https://zenodo.org/badge/677001279.svg)](https://doi.org/10.5281/zenodo.14197321)## Structure (core)
- ana:
- stastical analysis for [BKC Dataset](https://dasfreak.github.io/Backstabbers-Knife-Collection/) and also [malicious-packages](https://github.com/ossf/malicious-packages/tree/main/osv/malicious)
- the code to extract metrics.csv and iocs.csv files
- label distribution analysis for labeled dataset- data:
- collection from BKC and also malicious-packages
- places to save bkc_mal.csv and pkg_mal.csv
- places to save extracted data also final labeled dataset
- data_create:
- code to query BigQuery
- code to run simulation
- ext:
- code to parse reports (json and csv)
- code to extract features and generate final dataset- run_analysis.sh:
custom shell script to run package-analysis to save results locally and avoid repetitions
## Preparation (Environment Setting Up)
- For BigQuqry:
```
# download bigquery key from google cloud
# activate the key
export GOOGLE_APPLICATION_CREDENTIALS="path/to/your/service-account-file.json"
# the key needs to be loaded when querying BigQuery```
- For running [Package-Analysis](https://github.com/ossf/package-analysis) (only feasible on Ubuntu)
```
# git download
sudo apt-get install git
# docker
sudo apt-get install -y docker.io
# start the docker service
sudo systemctl start docker
# golang download
sudo apt-get install golang# direct running --- check whether this tool works locally
# how to run local instance
## local instance
scripts/run_analysis.sh -ecosystem pypi -package test -local /path/to/test.whl
## live instance
scripts/run_analysis.sh -ecosystem pypi -package Django -version 4.1.3## after successfully running one instance
## replace the run_analysis.sh with the one provided in this resp --- give 755
```## Running Instructions
```
# virtual environment setting up
eval "$(pyenv init -)"
eval "$(pyenv virtualenv-init -)"## query data from BigQuery
python3 data_bigquery.py# run simulation by calling package-analysis
sudo python3 simu_run.py```