https://github.com/madetech/data-101
- Host: GitHub
- URL: https://github.com/madetech/data-101
- Owner: madetech
- License: mit
- Created: 2021-12-08T11:36:05.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2024-11-27T11:41:23.000Z (over 1 year ago)
- Last Synced: 2025-01-25T14:11:37.963Z (over 1 year ago)
- Size: 5.58 MB
- Stars: 12
- Watchers: 6
- Forks: 9
- Open Issues: 5
Metadata Files:
- Readme: README.md
- License: LICENSE
README
## Data Learning Paths
Check out the Data Learning Paths below!
They progress from beginner to advanced data concepts to support self-development.
- :sparkle: [Data 101 Learning Objectives](data_learning_paths/data_101.md)
- :mechanical_arm: [Data 102 Learning Objectives](data_learning_paths/data_102.md)
- :crystal_ball: [Data 103 Learning Objectives](data_learning_paths/data_103.md)
- :floppy_disk: [Cert paths](data_learning_paths/certs.md)
If you are interested in learning about data science, we have provided the following document containing useful topics and resources to get you started:
# Data-101: An introduction to big data in Made Tech (basic tutorial)
Welcome to the data-101 intro section!
The core learning path below introduces some core data engineering concepts. It is a series of concise modules that aims to take an engineer with no data experience to feeling competent when presented with a data-focussed project.
To expand upon the core learning modules, check out the learning objectives folder, where you can see some of the key learning objectives for each data engineer level and resources to help you achieve them.

# Core
## Python
- :snake: [Python, pip, pyenv](modules/core/Python.md)
- :satellite: [API Frameworks - Flask, FastAPI, Django](modules/core/ApiFrameworks.md)
### Modules
- :panda_face: [Pandas - Introduction to data loading & transformations](modules/core/Python%20modules/Pandas.md)
- :earth_africa: [GeoPandas - Plotting spatial data](modules/core/Python%20modules/geopandas.md)
## Spark
- :sparkler: [Spark - Putting the **big** in *big data*](modules/core/Spark.md)
# Supplementary learning
- :scream_cat: [SQL](modules/supplementary/SQL.md)
- :computer: [Terraform](modules/supplementary/Terraform.md)
- :floppy_disk: [Databases](modules/supplementary/Databases.md)
# Guides
- :robot: [Machine Learning Workflow: A guide for DMs](guides/machine_learning_workflow.md)
- :book: [Jupyter Notebook Setup](guides/jupyter_setup.md)
- :pencil: [List of tooling](guides/tooling.md)
- :cat2: [Sphinx setup instructions](guides/sphinx_setup.md)
- :memo: [AWS Glue Features](guides/AWS_Glue_Features.md)
- :guardsman: [AWS TF Permissions](guides/aws_tf_permissions.md)
- :bookmark: [Poetry with Pyenv](guides/pyenv.md)
## Contributing to data-101
We welcome and encourage all contributions! This resource is built by the COP-data community for the COP-data community.
Please submit contributions via pull request.
Any questions can be posted to [COP-data](https://madetechteam.slack.com/archives/C01PTEPED6G).