Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mycarta/mycarta
https://github.com/mycarta/mycarta
Last synced: 1 day ago
JSON representation
- Host: GitHub
- URL: https://github.com/mycarta/mycarta
- Owner: mycarta
- Created: 2021-01-21T22:54:19.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2024-12-06T14:39:50.000Z (about 2 months ago)
- Last Synced: 2025-01-22T08:10:04.952Z (2 days ago)
- Size: 434 KB
- Stars: 4
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
## Ciao! 👋 I'm Matteo (he/him )
### I am a passionate, actively curious, data-driven geoscientist
- :computer: I’m currently enrolled in: [AlaskaX's Synthetic Aperture Radar (SAR): Applications Professional Certificate](https://www.edx.org/professional-certificate/alaskax-synthetic-aperture-radar-sar-applications)
- 🐍 Book I am working through: [Grokking Algorithms, 2nd edition](https://www.manning.com/books/grokking-algorithms-second-edition)
- :blue_book: Evening read: [Love letter to the Earth by Thich Nhat Hanh](https://www.parallax.org/product/love-letter-to-the-earth/)
- ⏭ Best Video project ever: [Let's Enhance](https://vimeo.com/69663986%20) by [DUNK3D](https://dunk3d.tumblr.com/post/54128461565/thisbelongsinamuseum)
- ✏️ Hobby: drawing, juggling, longboard
- 🃏 Fun fact: a recent fixation with card tricks and Rubik's cube
- 📝 Blog: [mycarta](https://mycartablog.com/)
- 📫 How to reach me: [email protected]
- 🐦 On X (FKA Twitter) I **was** [@my_carta](https://twitter.com/My_Carta)-----
# Table of Contents
1. [GitHub and Stack Overflow Stats](#GitHub-and-Stack-Overflow-Stats)
2. [Skills]()
3. [Featured blog posts](#Featured-blog-posts)
4. [Articles published in Geophysical literature](#Articles-published-in-Geophysical-literature)
5. [Upcoming articles and tutorials](#Upcoming-articles-and-tutorials)
6. [Projects portfolio](#Projects-portfolio)
7. [Exploratory Data Analysis and visualization portfolio](#Exploratory-Data-Analysis-and-visualization-portfolio)
8. [Coding challenges](#Coding-challenges)
9. [My Data Science continuous learning roadmap](#My-Data-Science-continuous-learning-roadmap)
10. [Book library](#Book-library)
11. [Community engagement](#Community-engagement)
12. [Stuff I would do differently](#Stuff-I-would-do-differently)-----
## GitHub and Stack Overflow Stats
[![Anurag's github stats](https://github-readme-stats.vercel.app/api?username=mycarta&theme=blue-green&show_icons=true&custom_title= )](https://github.com/anuraghazra/github-readme-stats&count_private=true)-----
## Skills-----
## Featured blog posts
- [Be a geoscience and data science detective](https://mycarta.wordpress.com/2020/09/16/be-a-geoscience-and-data-science-detective/)
- [From zero to JupyterLab pro on Windows 10](https://mycarta.wordpress.com/2019/07/09/from-zero-to-jupyterlab-pro-on-windows-10/)
- [Visual data exploration in Python – correlation, confidence, spuriousness](https://mycarta.wordpress.com/2019/03/17/visual-data-exploration-in-python-correlation-confidence-spuriousness/)-----
## Articles published in Geophysical literature
- [Introduction to Classification with SVMs](https://csegrecorder.com/assets/pdfs/2018/2018-01-RECORDER-Machine_Learning_in_Geoscience_V.pdf). Article (with [Jupyter notebooks](https://github.com/CSEG/Machine-Learning-CSEG-special-issue/tree/master/Matteo_Niccoli)) offering an intuitive understanding of SVMs using toy datasets inspired by classification problems in geophysics.
- [Mapping and validating lineaments](https://library.seg.org/doi/10.1190/tle34080948.1). Article (with [Jupyter notebook](https://github.com/seg/tutorials-2015/blob/master/1508_Mapping_and_validating_lineamenti/1508_Mapping_and_validating_lineaments.ipynb)) demonstrating how to extract and enhance geologic lineaments from gravity data, and colocate them into a single map to increase confidence.- [How to evaluate and compare color maps](https://wiki.seg.org/wiki/How_to_evaluate_and_compare_color_maps). Article (with [Jupyter notebook](https://github.com/seg/tutorials-2014/blob/master/1408_Evaluate_and_compare_colormaps/How_to_evaluate_and_compare_colormaps.ipynb)) demonstrating how to evaluate default color maps and choose more perceptual alternatives. To date, it still **ranks in the [top 20 most downloaded papers (2010-2020)](https://library.seg.org/action/doSearch?SeriesKey=leedff&sortBy=downloaded&startPage=0&pageSize=20&AfterYear=2010&BeforeYear=2020&queryID=15/392288702) from The Leading Edge (Society of Exploration Geophysicists)**. It is based on research first published on my blog, Perceptual rainbow palette, [part a](https://mycarta.wordpress.com/2013/02/21/perceptual-rainbow-palette-the-method/) and [part b](https://mycarta.wordpress.com/2013/03/06/perceptual-rainbow-palette-the-goodies/). **The two parts are top results for the Google search [perceptual rainbow](https://www.google.com/search?q=perceptual+rainbow) and [perceptual palette images](https://www.google.com/search?q=perceptual+palette+images)**, and are often used as reference in talks and other publications on colormaps (for example [this Matplotlib talk at SciPy 2014](https://www.youtube.com/watch?v=rkDgBvT-giw&t=855s)).
-----
## Upcoming articles and tutorials
##### For 52 Things You Should Know About Geocomputing: A crowdsourced collection of articles from practitioners, reviewed and edited by the Software Underground community:
- [Some advice on reproducing scientific figures](https://curvenote.com/@swung/52-things-geocomputing/niccoli-advice-on-reproducing-figures)
- [Keep on improving your Geocomputing projects](https://curvenote.com/@swung/52-things-geocomputing/niccoli-keep-on-improving-geocomputing-projects)
- [Prototype colourmaps for fault interpretation](https://curvenote.com/@swung/52-things-geocomputing/niccoli-prototype-colormaps-for-faults)-----
## Projects portfolio[![made-with-python](https://img.shields.io/badge/Made%20with-Python-1f425f.svg)](https://www.python.org/)
- **Web app**: [Busting bad colormaps](https://mybinder.org/v2/gh/mycarta/Colormap-distorsions-Panel-app/master?urlpath=%2Fpanel%2FDemonstrate_colormap_distortions_interactive_Panel) - a web-based `Panel` app to show the effects of using a bad colormap with Geophysical data. The first version was presented as a lightning talk at the [Transform 2020 virtual conference](https://transform2020.sched.com/) organized by [Software Underground](https://softwareunderground.org/); you can watch a [video recording of the presentation here](https://www.youtube.com/watch?v=rUbvueIF5f8&t=510s). Repo with code [here](https://github.com/mycarta/Colormap-distorsions-Panel-app).
- **Tutorial**: [Working with 3D seismic data in Python using segyio, numpy, Scikit-image](https://github.com/equinor/segyio-notebooks/blob/master/notebooks/basic/01_basic_tutorial.ipynb). A notebook to demonstrate how to:
- read a seismic amplitude volume and a seismic similarity volume as `numpy` arrays using `segyio`
- manipulate the similarity to create a discontinuity / fault volume
- create a fault mask and display a couple of amplitude time slices with superimposed faults
- write the fault volume to SEG-Y file re-using the headers from the input file
- **Image augmentation pipelines for Deep Learning**.
I've worked on two image augmentation pipelines to increase the size of training sets for classification of geologic edges in seismic data, a typically low bias / high variance problem.
- I developed the first pipeline to increase the size of training sets for classification of geologic edges in seismic data. The work was commissioned by a company (as a free-lance) and I am not able to share any detail on the image manipulations used. I have permission, however, to disclose that in production tests with Convolutional Neural Networks the approach helped improve performance, and that these efforts paved the path for the company's subsequent coding and investigations.
- I developed the second pipeline working with the images from the [2018 TGS Salt Identification Challenge on Kaggle](https://www.kaggle.com/c/tgs-salt-identification-challenge/data) in mind. Although I did not participate in the competition, having considere at length the problem of segmenting salt deposits, I came up with the idea of an "adversarial" augmentation strategy, in which not only the size of the training set is increased, but the task is rendered much harder with the image transformation. Some examples are shown below. Preliminar tests (unpublished) are encouraging and I am currently working at open-sourcing my functions by way of a PR to the [Augmentor package](https://augmentor.readthedocs.io/en/master/index.html), so that where operations can be added not only in a sequence, but also stochastically with a probability value assigned by the user for each operation.
- **Web app (Hackathon project)**: [Sketch2model](https://github.com/epsalt/sketch2model-web) - a browser-based app to turn back-of-the-napkin geological sketches into geologic models that can then be used to generate synthetic seismic sections. The web app was Built around a prototype put together at the [2015 Calgary Geoscience Hackathon](https://csegrecorder.com/articles/view/open-collaboration-hackathons-and-tomorrows-subsurface-software), organized by [Agile Scientific](https://github.com/agile-geoscience), with team mates [Evan Saltman](https://github.com/epsalt) and [Elwyn Galloway](https://github.com/scibbatical), and special guest [Ben Bougher](https://github.com/ben-bougher) from Agile. The original idea for the prototype was proposed by Elwyn at the Hackathon, was to make an app that would turn an image of geological sketch (for example one taken from a boardmeeting whiteboard) into a digital rock model. The implementation of the finished app involves using morphological filtering and other image processing methods to enhance the sketch and convert it into a model with discrete bodies to be passed to a tool akin to Agile's [modelr.io](https://www.modelr.io/) to create and investigate a synthetic seismic model.
Blog posts:
[sketch2model](https://mycarta.wordpress.com/2016/05/04/sketch2model)
[sketch2model – sketch image enhancements](https://mycarta.wordpress.com/2016/05/25/sketch2model-sketch-image-enhancements)
[sketch2model – linking edges with mathematical morphology](https://mycarta.wordpress.com/2016/07/22/sketch2model-linking-edges-with-mathematical-morphology)
Workflow:
App screenshot:
- **Web app (Hackathon project)** : FRIDA (_restore live app and add link here_) - a browser-based app for interactive removal of acquisition footprint noise removal from 3D seismic data.
Prior to the hackathon I had done a lot of work imagining and protptyping a tool to remove acquisition footprint from 2D seismic slices (this has been my longest-lived side project) and eventually asked [Elwyn](https://github.com/scibbatical) to participate. If you are curious about the whole background, please read [Chapter 39](https://curvenote.com/@swung/52-things-geocomputing/niccoli-keep-on-improving-geocomputing-projects) of the upcoming [52 Things You Should Know About Geocomputing](https://github.com/softwareunderground/52things) and then head over to my [Tutorial notebook](https://github.com/mycarta/t21-hack-footprint/blob/main/52-things_geocomputing_book_FFT_tutorial.ipynb). For a definition of acquisition footprint, please read the [SEG Wiki](https://wiki.seg.org/wiki/Dictionary:Acquisition_footprint_or_imprint), and to see some published industry examples, read this [blog post](https://mycarta.wordpress.com/2017/08/13/what-is-acquisition-footprint-noise-in-seismic-data).
At the [2021 Hackathon, Transform virtual conference](https://github.com/softwareunderground/transform-2021-hackathon/discussions) we took the existing tool to the next level with team mates [@leocd91](https://github.com/leocd91), [@WesleyTheGeolien](https://github.com/WesleyTheGeolien),and [@markogauk](markogauk). We spent significant efforts on brainstorming several approaches for how to turn the tool in a production caliber app; check this screen shot of our MIRO board:
And the we set out to work. We were really impressed with the final result, see animation below:
- **Collaboration project**: [Rainbow](https://agilescientific.com/blog/2017/5/31/unweaving-the-rainbow). For this project with [Matt Hall](https://github.com/kwinkunks/), I created a pipeline of image processing routines to enhance, clean-up, segment, and rectify the main map from published figures. This was a necessary preprocessing stage for Rainbow, an online tool for automagic recovery data from scientific images with unknown colourmaps. An example of the image processing work is shown is sketched below, with full list of operations in the intro to my blog series [Computer vision in Geoscience](https://mycarta.wordpress.com/2017/07/30/computer-vision-in-geoscience-recover-seismic-data-from-images-introduction/), and detailed methodology described in [Part I](https://mycarta.wordpress.com/2017/09/10/computer-vision-in-geoscience-recover-seismic-data-from-images-part-1/) and [Part II](https://mycarta.wordpress.com/2017/09/18/computer-vision-in-geoscience-recover-seismic-data-from-images-part-2/).
Also, watch Matt’s talk (very insightful and very entertaining) from the 2017 Calgary Geoconvention below:
- **Knowledge sharing**: [My answer](https://earthscience.stackexchange.com/a/15139/144) to the question **In the northern hemisphere only, what percentage of the surface is land?** on [Stack Exchange Earth Science Beta](https://earthscience.stackexchange.com). This was a lot of fun combining domain knowledge (about map projections) with a tiny bit of Python programming (using `numpy` arrays) to provide a quantitative solution. It is the mini-project I am proud the most of, and I did it out of curiosity and the pure joy of solving a problem!
-----
## Exploratory Data Analysis and visualization portfolio
- Enhanced `seaborn` pairgrid matrix: one of the things I always do when I start looking at a multivariate probelm is to explore possible associations between features and target; one way I've found really useful is to "enhance" the standard `Seaborn` pairgrid matrix by labeling each bivariate scatter plot with the [distance correlation](https://en.wikipedia.org/wiki/Distance_correlation) colored by p-value, and also rearrage the plots by the results of clustering analysis, like in the example below (you can read more about it [here](https://mycarta.wordpress.com/2019/04/10/data-exploration-in-python-distance-correlation-and-variable-clustering/)):
- At the [2020 FORCE Machine Predicted Lithology challenge](https://xeek.ai/challenges/force-well-logs/overview), I used Ipywidget's interactive and the awesome [`missingno`](https://github.com/ResidentMario/missingno) library to enable browsing exploration wells in the North Sea using a widget and check the chosen well's curves completeness, on the fly, as shown below (you can read more about it [in here](https://mycarta.wordpress.com/2020/09/19/geoscience-machine-learning-bits-and-bobs-data-completeness), and try the tool in [this Jupyter notebook](https://github.com/mycarta/Force-2020-Machine-Learning-competition_predict-lithology-EDA/blob/master/Interactive_data_inspection_and_visualization_by_well.ipynb)):
- 3D model of the ternary system quartz – nepheline – kalsilite, also called petrogeny’s “residua” system, which is used to describe the composition of many cooled residual magma. This was an advanced pet project while I was a student in Geology at the University of Rome, Italy. It involved using AutoCAD and 3D Studio Max to create a WRLM file. For the 3D sketchfab version of the model, I used a DXF export and re-rendered using Rhino3D. See the static snapshot below; navigate, inspet, and download the model on [Sketchfab](https://sketchfab.com/3d-models/ternary-system-quartz-nepheline-kalsilite-c8225185f7004ad0a23533ca191b0c6b)
- 3D model of the Yoder Tilley tetrahedron for basalt classification. In this visualization the tetrahedron isbroken to separate the Quartz Tholeiite, Oliving Tholeiite, and Alkali Basalt sub-volumes and show the critical planes of silica saturation and silica under-saturation. For the 3D sketchfab version of the model, I used a DXF export and re-rendered using Rhino3D. Check it out [on Sketchfab](https://sketchfab.com/3d-models/basalt-classification-yoder-tilley-tetrahedron-4d5b741603424850af8946c7316d059a)
- 3D paths of colormap in CIELAB color space. These are a very useful way to analyze a colormap as they show clearly where abrupt chages in contrast happen; these are responsible for artifacts in mapping when these colormaps are used (more details in [this blog post](https://mycarta.wordpress.com/2013/02/21/perceptual-rainbow-palette-the-method/) ). As example in the two animations below (produced using the [3D color inspector plugin](https://imagej.nih.gov/ij/plugins/color-inspector.html) for ImageJ), notice the may abrupt changes in the path for the classic Jet colormap (top) and compary to the regular piraling path of a more perceptual rainbow, called CubicYF (bottom), which I created:
-----
## Coding challenges
- Completed all the Python challenges at [Coding bat](https://codingbat.com/python) and earned an overall 10-Star badge.
-----
## My Data Science continuous learning roadmap
📕Click here to expand/collapse the roadmap section
### Management
- [x] [Project Management Skills for Leaders](https://www.linkedin.com/learning/certificates/712a6099da116abc52b2f66b4517b03c8883d61ed82ba27e11211b45fee03220), Linkedin Learning
- [x] [Decision Intelligence](https://www.linkedin.com/learning/certificates/63634e29508a927cd70844c71f66383b561224c664f2e26e16854bf36eb2d38d), Linkedin Learning### Python development, unit testing, and debugging
- [x] [Test-Driven Development With PyTest](https://realpython.com/certificates/96ecff0f-2d92-422f-b3d8-f6790a03f872/), Real Python
- [x] [Enhance your Python unit testing using Coverage](https://www.coursera.org/account/accomplishments/certificate/TGYBNFKE3DJW), Coursera Project Network### Foundations of Data Science [Professional Certificate](https://credentials.edx.org/credentials/3ff1fe712ebe4e6bb47093928883d765/) (edX)
- [x] [Computational Thinking with Python](https://courses.edx.org/certificates/a5d76bfe8eff4a43bfc70c356745b3f6)
- [x] [Inferential Thinking through Simulations](https://courses.edx.org/certificates/5ad5a7c792c04d5a9371b2f7fc97d253)
- [x] [Machine Learning and Predictions](https://courses.edx.org/certificates/02807f521af04f3db11ce95e01e62625)### Intermediate Data Science
- [x] [Pandas](https://www.kaggle.com/learn/certification/mycarta/pandas), Kaggle
- [x] [Intermediate Machine Learning](https://www.kaggle.com/learn/certification/mycarta/intermediate-machine-learning), Kaggle
- [x] Pandas short course (as a beta tester), DeepLearning.AI
- [x] [Intro to SQL](https://www.kaggle.com/learn/certification/mycarta/intro-to-sql), Kaggle
- [x] [Relational Databases Essential Training](https://www.linkedin.com/learning/certificates/55968c78f13a8f7a7cdd0ec188842aa8ee75fe956f17435565fcde9b374e1e51), LinkedIn Learning### Analytics
- [x] [Spotfire Essentials I](https://support.ruths.ai/support/solutions/articles/6000191001-kickstart-how-to-schedule-training), Ruths.ai (now Petro.ai)
- [x] [Spotfire Essentials II](https://support.ruths.ai/support/solutions/articles/6000191001-kickstart-how-to-schedule-training), Ruths.ai (now Petro.ai)
- [x] [15 Mistakes to Avoid in Data Science](https://www.linkedin.com/learning/certificates/80c3a9379c2a96b55083dab7f651cf99ea2b01d52c56ea4684e22befeb06cfdb?trk=share_certificate), LinkedIn Learning### GIS and Geospatial
- [x] [Climate Geospatial Analysis in Python with Xarray](https://www.coursera.org/account/accomplishments/certificate/VMKXGAQKGGFP), Coursera Project Network
- [x] ARSET - Fundamentals of Remote Sensing, NASA Applied Science
- [x] ARSET - Introduction to Synthetic Aperture Radar, NASA Applied Science
- [x] ARSET - Humanitarian Applications Using NASA Earth Observations, NASA Applied Science
- [x] [SAR-400: Synthetic Aperture Radar: Foundations](https://courses.edx.org/certificates/3055b8675c5047039e899f974504eaf0), edX
- [ ] Echoes in space Introduction into the principles and applications of radar remote sensing, EO College### AI, Deep Learning, Ethics, ML explainability
- [x] [AI for Everyone](https://www.coursera.org/account/accomplishments/certificate/W9V3WB6GADC4), Coursera
- [x] [Basics of AI Python Coding](https://learn.deeplearning.ai/accomplishments/22bc794a-e168-45d3-a778-03aab98f2d2f?), DeepLearning.AI
- [x] [Machine learning explainability](https://www.kaggle.com/learn/certification/mycarta/machine-learning-explainability), Kaggle
- [x] [Intro to AI Ethics](https://www.kaggle.com/learn/certification/mycarta/intro-to-ai-ethics), Kaggle
- [x] [AI and Public Health](https://www.coursera.org/account/accomplishments/certificate/HMG12IIO95GI) - AI for Good Coursera Specialization, DeepLearning.AI
- [x] [Ai and Climate Change](https://www.coursera.org/account/accomplishments/certificate/TU8C1SM71OIY) - AI for Good Coursera Specialization, DeepLearning.AI
- [x] [AI and Disaster Management](https://www.coursera.org/account/accomplishments/certificate/W80PC3ECESXF) - AI for Good Coursera Specialization, DeepLearning.AI
- [x] [TensorFlow: Neural Networks and Working with Tables](https://www.linkedin.com/learning/certificates/2c6a1947d69f738ac6afecc03635bd07b1a1aed31c39a2fa9a380fb574cf810c), LinkedIn Learning
- [x] [TensorFlow: Working with NLP](https://www.linkedin.com/learning/certificates/683eb4ef9b3dfa5a73cc8b97eb3381551a4a2ce19dcc1146edd8b866207416b1), LinkedIn Learning
- [x] [TensorFlow 2.0: Working with Images](https://www.linkedin.com/learning/certificates/5e57eadefad9f19661cc24118d326d794db853bf1b9c781bccff213312fe0001), LinkedIn Learning
- [x] [Advanced AI: Transformers for Computer Vision](https://www.linkedin.com/learning/certificates/a3cb2e6ddef5f76122b0703ba4db00de629391db254b6c45330bfab6170d51e1), LinkedIn Learning)### Diversity, Equity, Inclusion
- [x] [Understanding Gender Equity](https://courses.edx.org/certificates/a052cc54020b465096d2cf5c1c557877) - Race, Gender, and Workplace Equity certificate, edX
- [x] [Facing Racism and Emotional Tax in the Workplace](https://courses.edx.org/certificates/ab3dc247ac9149bfb20ead074daf22ed) - Race, Gender, and Workplace Equity certificate, edX
- [x] [Communication Skills for Dialoguing Across Difference](https://courses.edx.org/certificates/e73a19e937274de1b790cf3dcc02fc7a) - Race, Gender, and Workplace Equity certificate, edX
- [x] [Unconscious Bias: From Awareness to Action](https://courses.edx.org/certificates/62d405fb03794ae4bdc4d05bf53d757d) - Race, Gender, and Workplace Equity certificate, edX
- [x] [Understanding and Supporting LGBTQ+ Employees](https://www.linkedin.com/learning/certificates/e46cd51a3754b5f1578d625df5bbb4dfbe7c162749dc62d504ec483bab3e03ab?trk=share_certificate), LinkedIn Learning
- [x] [Understanding and Supporting Asian Employees](https://www.linkedin.com/learning/certificates/a5e8bedb31c98b833d8bfd9809e5da1dbe4d112bace930aa58970ac93c07d286?trk=share_certificate), LinkedIn Learning
- [x] [Dealing with Microaggression as an Employee](https://www.linkedin.com/learning/certificates/5b56f000c5043d7543ced9b437a89439a28ae007cf968c6b45bbd7087fc8d96f?trk=share_certificate), LinkedIn Learning
- [x] [The Value of Employee Resource Groups](https://www.linkedin.com/learning/certificates/ac28ce0382cd2ef1b6765c1ee0465e6c43b33911edc238a614b592cc9e4d5267?trk=share_certificate), LinkedIn Learning
### Communication tools
- [x] [Enhance Organizational Communications with Slack](https://www.coursera.org/account/accomplishments/certificate/JEBLDLJQLSZN), Coursera Project Network📕
-----
## Book library
It looks like it has become very trendy these days to post a picture your datascience library. I admit it, I could not resist doing it too! Here is my collection of books, with a focus on Machine Learning and Data Exploration
-----
## Community engagement
- Present: board member of [Software Underground](https://github.com/softwareunderground)- Past: Student mentor, Canadian Society of Exploration Geophysicists (volunteer); Co-founder and coordinator of the Calgary GeoPy group; Editor, Canadian Society of Exploration Geophysicists (THe Recorder publication).
-----
## Stuff I would do differently
- These days I would definitely build this one in Python: [A study of Ricker wavelets in MS Excel (2016)](https://mycarta.wordpress.com/2016/02/04/a-study-of-ricker-wavelets-in-ms-excel/)