Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists by dmarks84

A curated list of projects in awesome lists by dmarks84 .

https://github.com/dmarks84/dmarks84

Personal

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_california-housing-data--kaggle

Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.

cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_movie-database-sqlite

Independent Project - I joined and manipulated data from disparate tables of movie information using Python & SQLite; defined schema, created tables/views, queried data, etc. Utilized CTE's, Window Functions, and other DDL, DQL, DML, and DCL scripts.

advanced-sql cte databases dcl ddl dml dql group-by joins python query sql sqlite tables views window-functions

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_mall-customer-clustering--kaggle

Independent Project - Kaggle Dataset-- I worked with the Mall Customer Segmentation Dataset, which provided a various instances of shoppers of different ages, incomes, etc. I utilized unsupervised ML clustering algorithms to identify useful customer segments.

clustering dataframes dbscan kmeans-clustering market-segmentation mean-shift pandas python sklearn technical-analysis technical-communication unsupervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle

Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.

chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_new-topic-nlp-analysis-classification--kaggle

Independent Project - Kaggle Dataset-- I worked with the News Category Dataset, which provided a headline and description, etc. in .json format; used NLTK for NLP, tokenizing, lemmatizing, and finding part-of-speech; trained and tuned parameters on classifier models to predict news category based on headline text.

classification hyperparameter-tuning json lemmatization model-evaluation model-refinement nlp nltk pandas python sklearn supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle

Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.

data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle

Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.

classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle

Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9

classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml

Last synced: 05 Nov 2024

https://github.com/dmarks84/professional_certifications

A full set of the certificates achieved my the work I completed as part of various Professional Certifications, Specializations, Courses, and Projects.

independent-education

Last synced: 05 Nov 2024

https://github.com/dmarks84/coursework_capstone_full_data_engineering

Final Project for IBM Data Engineering & Python Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification

apache-airflow apache-hadoop apache-kafka apache-spark api beautifulsoup cassandra dags etl mongodb nosql pandas plotly postgresql python scipy seaborn sql

Last synced: 12 Oct 2024

https://github.com/dmarks84/ind_project_readme-generator

Independent (personal) project in which I automatically generate README files for each of my repositories from my coursework

dataframes etl numpy pandas programming python

Last synced: 05 Nov 2024

https://github.com/dmarks84/ind_project_docker-image-pnw-weather-app

Independent Project - I created a Docker image that stands up a website that live weather alerts on an interactive map.

api dash devops docker docker-images dockerfile folium geopandas json plotly python requests webapp websites

Last synced: 05 Nov 2024

https://github.com/dmarks84/ibm_ds

A temporary repository for the work I'm doing in the IBM Data Science course

Last synced: 05 Nov 2024

https://github.com/dmarks84/ibm-ds-capstone

Files for my capstone project for the IBM Data Science Professional Certificate

Last synced: 05 Nov 2024