Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/drill-n-bass/ovh-project
The goal of this task is to prepare statistical analysis of set of data from disks.
https://github.com/drill-n-bass/ovh-project
anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots
Last synced: about 1 month ago
JSON representation
The goal of this task is to prepare statistical analysis of set of data from disks.
- Host: GitHub
- URL: https://github.com/drill-n-bass/ovh-project
- Owner: Drill-N-Bass
- License: other
- Created: 2022-05-05T03:06:32.000Z (over 2 years ago)
- Default Branch: master
- Last Pushed: 2022-09-13T18:26:15.000Z (over 2 years ago)
- Last Synced: 2024-11-07T07:49:51.350Z (3 months ago)
- Topics: anaconda, analysis, data-analysis, data-analysis-python, jupyter-notebook, matplotlib-python, pandas, python3, seaborn-plots
- Language: Jupyter Notebook
- Homepage:
- Size: 60.7 MB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.MD
- License: license.MD
Awesome Lists containing this project
README
# OVH project:
The goal of this task is to prepare statistical analysis of set of data from disks.
Each entry of the data set consists of following fields separated by ;
character:datacenter
hostname
disk serial
disk age (in s)
total reads
total writes
average IO latency from 5 minutes (in ms)
total uncorrected read errors
total uncorrected write errorsThe proper solution (a script in Python) should output following
information:DONE: How many disks are in total and in each DC
DONE: Which disk is the youngest/oldest one and what is its age (in days)
DONE: What's the average disk age per DC (in days)
DONE: How many read/write IO/s disks processes on average
DONE: Find top 5 disks with lowest/highest average IO/s (reads+writes, print disks and their avg IO/s)
DONE: Find disks which are most probably broken, i.e. have non-zero uncorrected errors (print disks and error counter)
DONE: There should also be tests that verify if parts of the script are processing data properly.