Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists by dmarks84
A curated list of projects in awesome lists by dmarks84 .
https://github.com/dmarks84/ind_project_california-housing-data--kaggle
Independent Project - Kaggle Dataset-- I worked on the California Housing dataset, performing data cleaning and preparation; exploratory data analysis; feature engineering; regression model buildings; model evaluation.
cross-validation data-modeling data-reporting data-visualization eda folium grid-search matplotlib model-evaluation numpy pandas pca python seaborn sklearn statistics supervised-ml unsupervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_movie-database-sqlite
Independent Project - I joined and manipulated data from disparate tables of movie information using Python & SQLite; defined schema, created tables/views, queried data, etc. Utilized CTE's, Window Functions, and other DDL, DQL, DML, and DCL scripts.
advanced-sql cte databases dcl ddl dml dql group-by joins python query sql sqlite tables views window-functions
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_mall-customer-clustering--kaggle
Independent Project - Kaggle Dataset-- I worked with the Mall Customer Segmentation Dataset, which provided a various instances of shoppers of different ages, incomes, etc. I utilized unsupervised ML clustering algorithms to identify useful customer segments.
clustering dataframes dbscan kmeans-clustering market-segmentation mean-shift pandas python sklearn technical-analysis technical-communication unsupervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_superstore-sales-time-series-analysis--kaggle
Independent Project - Kaggle Dataset-- I worked on the Superstore Sales Dataset, performing (as Part 1) data cleaning and preparation and exploratory data analysis. The main task was to make predictions for future sales based on time-series analysis, which is found in Part 2.
chloropleth data-modeling data-visualization eda linear-regression matplotlib numpy pandas python seaborn sklearn statistics statsmodels supervised-ml time-series-analysis
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_new-topic-nlp-analysis-classification--kaggle
Independent Project - Kaggle Dataset-- I worked with the News Category Dataset, which provided a headline and description, etc. in .json format; used NLTK for NLP, tokenizing, lemmatizing, and finding part-of-speech; trained and tuned parameters on classifier models to predict news category based on headline text.
classification hyperparameter-tuning json lemmatization model-evaluation model-refinement nlp nltk pandas python sklearn supervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_european-soccer-top-points-contributors--kaggle
Independent Project - Kaggle Dataset-- I worked on the European Soccer Dataset, using SQL (SQLite) to read in the data and then data wrangling before running statistical analysis and hypothesis testing on questions of who helps earn the most points for their team.
data-wrangling hypothesis-testing numpy p-values pandas python scipy-stats statistics t-test
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_data-science-london-scikit-learn--kaggle
Independent Project - Kaggle Competition -- I worked on the Data Science London data set for the Data Science London + Scikit-learn competition.
classification cross-validation data-modeling data-reporting data-visualization dataframes eda grid-search matplotlib numpy pandas python sklearn statistics supervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_obesity-multi-class-classification--kaggle
Independent Project - Kaggle Competition -- I worked on the obesity classification data set as part of a Kaggle Competition of the same name, scoring (for accuracy) above 0.9
classification correlation-analysis cross-validation data-modeling data-visualization dataframes eda gridsearchcv matplotlib multiclass-classification numpy pandas python seaborn sklearn statistics supervised-ml
Last synced: 05 Nov 2024
https://github.com/dmarks84/professional_certifications
A full set of the certificates achieved my the work I completed as part of various Professional Certifications, Specializations, Courses, and Projects.
Last synced: 05 Nov 2024
https://github.com/dmarks84/coursework_capstone_full_data_engineering
Final Project for IBM Data Engineering & Python Professional Certificate -- Applied all skills and methods utilized in the series of courses for this certification
apache-airflow apache-hadoop apache-kafka apache-spark api beautifulsoup cassandra dags etl mongodb nosql pandas plotly postgresql python scipy seaborn sql
Last synced: 12 Oct 2024
https://github.com/dmarks84/ind_project_readme-generator
Independent (personal) project in which I automatically generate README files for each of my repositories from my coursework
dataframes etl numpy pandas programming python
Last synced: 05 Nov 2024
https://github.com/dmarks84/ind_project_docker-image-pnw-weather-app
Independent Project - I created a Docker image that stands up a website that live weather alerts on an interactive map.
api dash devops docker docker-images dockerfile folium geopandas json plotly python requests webapp websites
Last synced: 05 Nov 2024
https://github.com/dmarks84/ibm_ds
A temporary repository for the work I'm doing in the IBM Data Science course
Last synced: 05 Nov 2024
https://github.com/dmarks84/ibm-ds-capstone
Files for my capstone project for the IBM Data Science Professional Certificate
Last synced: 05 Nov 2024