Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sandravizz/data-breach-analysis
What Data Breaches Tell Us: An Analysis of 17,000 U.S. Data Breaches using D3.js
https://github.com/sandravizz/data-breach-analysis
cybersecurity d3 d3js data-visualization eda ransomware
Last synced: about 11 hours ago
JSON representation
What Data Breaches Tell Us: An Analysis of 17,000 U.S. Data Breaches using D3.js
- Host: GitHub
- URL: https://github.com/sandravizz/data-breach-analysis
- Owner: sandravizz
- Created: 2023-10-15T14:04:01.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-06-17T10:12:59.000Z (5 months ago)
- Last Synced: 2024-06-18T10:49:34.424Z (5 months ago)
- Topics: cybersecurity, d3, d3js, data-visualization, eda, ransomware
- Language: HTML
- Homepage: https://www.bitsight.com/blog/what-data-breaches-tell-us-analysis-17000-us-data-breaches
- Size: 16.4 MB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Data breach analysis
Bitsight, the international cybersecurity rating agency requested an interactive data storytelling about trends in data breaches. Bitsight provided me with a dataset including data breach security incidents from 2015 to 2022 in the US. Please check out the [project summary](https://www.sandraviz.com/data-breaches).
## My workflow
- Data processing using arquero.js
- EDA using arquero.js and plot
- Creating a storyline based on the most interesting patterns discovered in the EDA
- Designing different data visualisation options using d3.js
- Writing the [blog post](https://www.bitsight.com/blog/what-data-breaches-tell-us-analysis-17000-us-data-breaches) including the final interactive visualisations## Challenges
The dataset didn't include any numeric variable hence showing frequency by categories was the only option, which limited the possibility in terms of visualisation type.
Another challenge was to understand which data patterns are revealing real world trends. In order to overcome I researched deeply the topic.
After discussing the results of the EDA with the Bitsight team I decided to use an interactive barcode chart in the format of a small multiples.
![![Alt text]()]()
This chart allows the user to make several comparisons in one view. First comparing low severe (=blue) vs. high severe (=red) cases over time and secondly by different category types like sectors, motivation, assets etc. The chart includes a tooltip to show all details per case. For the purpose of smooth UX I applied transitions when buttons are pressed. The whole visualisation is written using d3.js.
Made with :green_heart: by [Sandraviz](https://www.sandraviz.com/bio)
Follow me: [linkedin](https://www.linkedin.com/in/sandradataviz/) and [observable](https://observablehq.com/@sandraviz)