An open API service indexing awesome lists of open source software.

https://github.com/g-schumacher44/analyst_resource_hub

A collection of guidebooks, quickref, and resources for data analysis
https://github.com/g-schumacher44/analyst_resource_hub

analytics bigquery data lookerstudio machine-learning model python sql yaml-configuration

Last synced: 2 months ago
JSON representation

A collection of guidebooks, quickref, and resources for data analysis

Awesome Lists containing this project

README

          





Knowledge Base & Resource Center


MIT License
Status
Version

# πŸ—‚οΈ Analyst Resource Hub: Reference Vault for Data Science & ML

This is my personal knowledge vault β€” a curated and structured collection of checklists, decision frameworks, modeling guides, and reusable scripts developed while studying and building skills in data science, machine learning, and analytics workflows.

Also published as a [MkDocs site](https://g-schumacher44.github.io/analyst_resource_hub/) for easy navigation and browsing.

## 🧩 TLDR;
- Built originally in Obsidian, published here as both a **quick-access reference** and a **public portfolio artifact**
- Focuses on real-world execution: cleaning, modeling, diagnostics, and pipeline structuring
- Includes:
- Python, SQL, and workflow sections
- βœ… Checklists & QA routines
- πŸ“‹ Decision cards for strategy selection
- πŸ“˜ Guidebooks by topic area
- 🧭 QuickRefs & visual companions

## 🧭 Orientation & Getting Started

🧠 Notes from the Vault Architect

This vault was designed to be modular, navigable, and deeply practical β€” a living resource that reflects how I think, work, and solve problems. It serves as a:
- Toolkit for day-to-day analysis
- Teaching aid for others and for myself
- Sandbox for workflows and automation ideas

πŸ«† Version Release Notes

**`v0.1.0` – Initial Public Release**

- Obsidian vault ported to GitHub
- Folder structure stabilized
- Markdown files cleaned and organized for public browsing

**`v0.2.0` – MkDocs site buildout**

- Adopted MkDocs + Material theme
- Added `docs/` site with section hubs: Python, SQL, Workflow & Projects
- Custom landing page with hero + action buttons (`docs/index.md`)
- Basic branding: logos, title, tagline, and skim-friendly emoji headers
- Navigation + metadata wired up (`mkdocs.yml`)
- Prepared for GitHub Pages deployment (local `mkdocs serve` ready)

**`v0.2.1` – Content structure refresh** *(current)*

- Tightened page hierarchy and filenames for clean URLs
- Added QuickRef, Guidebooks, and Scripts lanes under Python
- Consolidated BigQuery/Looker under SQL with patterns & dashboard guides
- Created Workflow hub for scaffolds, checklists, and delivery templates

**Upcoming Additions**

- Add reusable templates and starter kits
- Adding Screenshots and Visuals to Guidebooks and Visual Companions
- Expand Python and SQL script collections
- Incorporate references and workflows from related projects:
- [`analyst_toolkit`](https://github.com/G-Schumacher44/analyst_toolkit)
- [`model_evaluation_suite`](https://github.com/G-Schumacher44/model_evaluation_suite)

πŸ“Œ Emoji Codex

To make the vault easier to skim and navigate, each document uses an emoji prefix to signal its purpose or category.

- πŸ“Š Visual Companions & Evaluation Guides
- βœ… Execution Checklists
- πŸ“‹ Decision Strategy Cards
- πŸ“˜ Deep-Dive Guidebooks
- 🧭 Quick Reference Sheets

For a full legend, see the [πŸ“š Vault Emoji Codex](emoji_codex.md).

___

# πŸ—ΊοΈ Resource Map

```txt
🐍 Python Modules

Python/01 - QuickRef/
β”œβ”€β”€ 01 - Checklists/ βœ… Execution workflows
β”œβ”€β”€ 02 - Decision Cards/ πŸ“‹ Strategy selectors
└── 02 - Reference Guides/ 🧭 Quick references

Python/02 - Data Wrangling & EDA/
β”œβ”€β”€ Data Wrangling/ πŸ“˜ Feature transformation & validation
└── EDA/ πŸ“Š Exploratory workflows

Python/03 - Cleaning/ 🧼 Foundational and advanced cleaning guides

Python/04 - Machine Learning Models/
β”œβ”€β”€ 01 - Regression/ πŸ“˜ Linear & Logistic modeling resources
β”œβ”€β”€ 02 - Supervised/ πŸ“Š Classifier guidebooks and visuals
└── 03 - Unsupervised/ πŸ“‹ Clustering diagnostics and workflows

Python/05 - Scripts/
β”œβ”€β”€ 01 - Python/ πŸ§ͺ Cleaning, validation, modeling scripts
└── 02 - eda_toolkit/ 🧰 Modular tools for EDA diagnostics

πŸš› SQL Modules

SQL/01 - Guidebooks/ πŸ“˜ SQL basics to advanced playbooks

SQL/02 - BigQuery and Looker/
β”œβ”€β”€ 01 - BigQuery/ 🧱 Patterns, optimization, and pipelines
└── 02 - Looker Studio/ πŸ“Š Dashboard UX and parameter guides

πŸ–‡οΈ Workflow + Projects

WorkFlow+Projects/
β”œβ”€β”€ βœ… Notebook readiness checklist
β”œβ”€β”€ πŸ“˜ Project pipeline templates
└── πŸ₯‡ Gold standard scaffolds
```
___

## 🀝 On Generative AI Use

Generative AI tools (Gemini 2.5-PRO, ChatGPT 4o - 4.1) were used throughout this project as part of an integrated workflow β€” supporting code generation, documentation refinement, and idea testing. These tools accelerated development, but the logic, structure, and documentation reflect intentional, human-led design. This repository reflects a collaborative process: where automation supports clarity, and iteration deepens understanding.