Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-19 00:06:50 UTC
- JSON Representation
https://github.com/theultrabadduck/dinoanalysis
Dino who tries to understand their future
data-analysis data-science datascience jupyter jupyter-notebook jupyter-notebooks jupyterlab python python3
Last synced: 08 Nov 2024
https://github.com/alinenog/desenvolve_gb_2022
Formação Desenvolve 2022 do Grupo Boticário na área de dados
data-analysis data-science googlesheet machine-learning numpy pandas python
Last synced: 06 Nov 2024
https://github.com/nitins17/tableauvisualizations
Visualizations I created while learning to work with Tableau
data-analysis data-science data-visualization tableau visualization
Last synced: 28 Oct 2024
https://github.com/ireneli393/music-recommendation-system-with-listenbrainz-dataset
Recommendation System
alternate-least-squares baseline-model data-analysis data-sceince lightfm-library python recommendation-system
Last synced: 12 Nov 2024
https://github.com/abidshafee/google.colaboratory_projects
This repository contains the collections of interactive python notebooks (ipynb) that are some of my projects on Data Science, Machine Learning (ML), and Natural Language Processing (NLP).
colaboratory data-analysis data-science lstm machine-learning nlp statistics time-series
Last synced: 06 Nov 2024
https://github.com/the-pinbo/dimensionalityredux-pca-vs-autoencoders
Comparative study of PCA and Autoencoders for effective dimensionality reduction, assessed through PSNR and SSIM metrics.
autoencoder-mnist autoencoders data-analysis dimensionality-reduction image-compression mnist neural-networks pca psnr ssim
Last synced: 06 Nov 2024
https://github.com/anindyadas2001/resume
My Resume
cloud computer-science-engineering data-analysis freshers resume-template
Last synced: 15 Nov 2024
https://github.com/faezeh-gholamrezaie/visual-google-scholar-search
A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.
academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud
Last synced: 07 Nov 2024
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
introduction to data science in python, part of Applied Data Science using Python Specialization from University of Michigan offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 08 Nov 2024
https://github.com/ksharma67/eda-on-ipl
In this python notebook, analysis of IPL matches from 2008 to 2020 is done using python packages like pandas, matplotlib and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 06 Nov 2024
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 05 Nov 2024
https://github.com/zeshanfareed/spam_mail_detection_ml_django_project
Machine learning projects through Django
css data-analysis data-visualization django-framework frontend html machine-learning mailcsvfile matplotlib pandas project-repository projects python
Last synced: 10 Nov 2024
https://github.com/ljadhav25/django-data-analyzer
Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.
data-analysis data-visualization django-application matplotlib numpy pandas python seaborn
Last synced: 10 Nov 2024
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 31 Oct 2024
https://github.com/manish506/loan-approval-prediction
Explore predictive modeling in this project by applying classification techniques to a loan approval dataset. Analyze and preprocess the data, then use models like K-Nearest Neighbors, Random Forest, SVC, and Logistic Regression to predict loan outcomes. Gain insights into approval factors and enhance prediction accuracy.
classification classification-models data-analysis data-science jupyter-notebook loan-approval-prediction machine-learning predictive-analytics predictive-modeling project python
Last synced: 31 Oct 2024
https://github.com/madeiradata/microsoft-data-analysts-club
Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts
data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report
Last synced: 06 Nov 2024
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 06 Nov 2024
https://github.com/marcogdepinto/olympichistoryanalysis
Python visual analysis of the Olympic Games history. Kaggle gold medal with 15000+ views, 200+ upvotes and 100+ comments.
data-analysis data-science jupyter-notebook olympic-games python seaborn
Last synced: 10 Nov 2024
https://github.com/farizsidki/proyek-analisis-data
Submission Dicoding Indonesia "Proyek Analisis Data"
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python streamlit
Last synced: 31 Oct 2024
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 31 Oct 2024
https://github.com/spaghettifunk/gvb
Analysis of GVB in Amsterdam
data-analysis public-transportation
Last synced: 08 Nov 2024
https://github.com/thesfinox/mltools
A collection of simple tools for data science and machine learning projects.
ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox
Last synced: 06 Nov 2024
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 06 Nov 2024
https://github.com/darkdk123/house-valuation-model
A Challenge Project in a Boot-Camp to create a ML Model to predict the prices of houses in Boston Massachusetts from multiple parameters Using Multivariable Regression.
data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics
Last synced: 07 Nov 2024
https://github.com/darkdk123/handwashing-discovery-analysis
A Guided Project in a Boot camp to Analyse the Original Data used in the Discovery of Viruses & Hand Washing By Dr. Ignaz Semmelweis in Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 07 Nov 2024
https://github.com/mxagar/eda_fe_summary
An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.
data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas
Last synced: 05 Nov 2024
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 06 Nov 2024
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 06 Nov 2024
https://github.com/andryadsm/asset-analyzer
📈 Project Asset Analyzer (Python)
commodities data-analysis data-visualization economics financial-markets investing matplotlib numpy pandas python seaborn stock-market strategy trading
Last synced: 06 Nov 2024
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability using pandas and seaborn to mold, transform and plot data.
data-analysis pandas python seaborn
Last synced: 10 Nov 2024
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 07 Nov 2024
https://github.com/chrispsang/healthcare-dataanalysis
Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.
data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python
Last synced: 06 Nov 2024
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks done testing out JetBrains DataLore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 07 Nov 2024
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 08 Nov 2024
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/covid-19-data-visualization
covid-19-data-visulaization
covid-19 covid-data data-analysis data-analytics data-science machine-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the json files scrapped by the scrapper medium_scrapper_post.py. To extract information from json files scrapped by medium_scrapper_tag_archive.py (scrapping from tags archive) then use Data_Extraction_Archive_Tags.py
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 07 Nov 2024
https://github.com/cecoeco/sas_certificate
my code from Coursera's SAS programming specialization
Last synced: 07 Nov 2024
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 05 Nov 2024
https://github.com/abdoomohamedd/python-data-analysis-projects
Welcome to the Python Data Analysis Projects repository! This repository contains a collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 06 Nov 2024
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 10 Nov 2024
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MatLab scripts over multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 06 Nov 2024
https://github.com/mysftz/statistics-analysis
A python statistical analysis of a dataset and probability.
data-analysis matplotlib python python3 statistical-analysis
Last synced: 06 Nov 2024
https://github.com/mysftz/statistical-analysis
A in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 06 Nov 2024
https://github.com/lulloooo/bizdata-nexus
Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊
business-analysis data-analysis economics etl excel finance mysql python r risk-analysis
Last synced: 28 Oct 2024
https://github.com/leeway64/lwtaiwancurrencyconverter
Converts US dollars (USD) to New Taiwan Dollars (TWD) and vice versa
cfg currency currency-converter data-analysis data-visualization docker exchange-rates federal-reserve julia taiwan twd united-states usd
Last synced: 08 Nov 2024
https://github.com/leeway64/lwwordcounter
C++ application that analyzes the frequency of words in a text file
bson cmake conan cpp data-analysis json json-schema text-analysis ubjson
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/california-crime-data-analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
2022 california crime-data dashboard data-analysis data-analytics data-manipulation data-modeling data-visualisation data-visualization database fbi mysql mysql-workbench nibrs sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset
In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report
data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset
Last synced: 08 Nov 2024
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 16 Nov 2024
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 06 Nov 2024
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 28 Oct 2024
https://github.com/rijul007/market-basket-analysis-using-r
Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.
Last synced: 28 Oct 2024
https://github.com/datastalker/survival-cox
This repository contains an R script for performing survival analysis on breast cancer surgery data from the University of Chicago's Billings Hospital. The analysis includes Kaplan-Meier estimation and Cox Proportional Hazards modeling to assess patient survival.
breast-cancer-prediction cox-model data-analysis data-science data-visualization epidemiology kaplan-meier r survival-analysis
Last synced: 28 Oct 2024
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 06 Nov 2024
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 06 Nov 2024
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement..
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 06 Nov 2024
https://github.com/lunarwhite/lake-george-viz
Simple data analysis and visualization of Lake Geroge with Python.
Last synced: 06 Nov 2024
https://github.com/luciocolonna/cyclistic-bikesharing-2023
Case study on public data from Chicago's Divvy bikeshare, using R
bikesharing capstone-project cyclistic cyclistic-bikshare data-analysis data-visualization geojson ggplot2 google-data-analytics google-data-analytics-capstone-project google-data-analytics-professional leaflet r sf tidyverse
Last synced: 28 Oct 2024
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 12 Nov 2024
https://github.com/quantumudit/groceries-basket-analysis
This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.
data-analysis data-visualization pandas powerbi python
Last synced: 06 Nov 2024
https://github.com/ahmetzamanis/projectcatalog
Catalog of my data science projects.
classification clustering data-analysis data-science data-science-portfolio data-visualization machine-learning python quarto r regression rmarkdown statistics time-series time-series-analysis
Last synced: 07 Nov 2024
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnb's
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 07 Nov 2024
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 28 Oct 2024
https://github.com/saiteja-talluri/data-analytics-assignement
Report on World Happiness Data (Data Analysis and Visualisation of the data)
data-analysis data-visualization ipynb-jupyter-notebook
Last synced: 05 Nov 2024
https://github.com/pseudomanifold/us-inauguration-speeches
Data & feature extraction for U.S. inauguration speeches
data-analysis data-science inauguration politics speech speeches
Last synced: 06 Nov 2024
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 06 Nov 2024
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 06 Nov 2024
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 06 Nov 2024
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 06 Nov 2024
https://github.com/harkishen-singh/agriculture-ds
An Agricultural based Mtech project, on Data Science, which predicts the growth of crops based on previous year records.
Last synced: 07 Nov 2024
https://github.com/abhirajp595/python2
Capstone Project using python(Real-Estate)
data-analysis data-science data-visualization jupyter-notebook machine-learning numpy pandas python statistics
Last synced: 15 Nov 2024
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 07 Nov 2024
https://github.com/navp7/hr_analysis_excel
This project utilizes Microsoft Excel to conduct a comprehensive analysis of HR data, focusing on identifying the various reasons for employee attrition and evaluating job satisfaction
dashboards data-analysis excel visualization
Last synced: 07 Nov 2024
https://github.com/sisolieri/prova_ds_saloocupacio2024
Admission challenge to Hackató Saló Ocupació by Barcelona activa
arima barcelona catboost data-analysis data-visualizations forecasting machine-learning pandas public-funding python scikit-learn time-series xgboost
Last synced: 31 Oct 2024
https://github.com/kheriberto/linear_regression_ecommerce
Simple project showcasing crafting a linear regression model with SciKit Learn
data-analysis jupyter-notebook linear-regression pandas python scikit-learn seaborn
Last synced: 31 Oct 2024
https://github.com/kheriberto/logistic_regression_project
A project that analyses dummie data from an advertising company using logistic regression
data-analysis logistic-regression pandas python scikit-learn seaborn
Last synced: 31 Oct 2024
https://github.com/jakobzmrzlikar/trg-dela
Data analysis of student job offers.
data-analysis ipython-notebook web-scraping
Last synced: 07 Nov 2024
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 07 Nov 2024
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 12 Nov 2024
https://github.com/ngangawairimu/linear-regression-
This project builds a linear regression model in Python to predict outcomes and derive insights from feature data. It covers data cleaning, feature analysis, and model evaluation, showcasing predictive modeling techniques using scikit-learn, pandas, and visualization libraries.
data-analysis linear-regression machine-learning predictive-modeling python scikit-learn
Last synced: 31 Oct 2024
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 04 Nov 2024
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 07 Nov 2024
https://github.com/albamerdani/iot_air_quality_ml
IoT Project for Air Quality and Data Analysis with Machine Learning
air-quality aqi data-analysis data-science decision-tree iot machine-learning-algorithms prediction random-forest raspberry-pi-3 sensors
Last synced: 07 Nov 2024
https://github.com/kahleryasla/car-sales-data-analysis
This project retrieves and cleans car sales data from a Google Sheets document, and performs various analyses on the data, including calculating the average price of cars for each year and the standard deviation of the prices of each brand. The cleaned and modified data is then saved to an Excel file.
data data-analysis data-cleaning data-manipulation excel google-sheets pandas pandas-python
Last synced: 13 Nov 2024
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 10 Nov 2024
https://github.com/linguini1/edueval
The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.
ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis
Last synced: 07 Nov 2024
https://github.com/linguini1/tangerineanalyzer
Command line tool for analyzing transactions in CSV format provided by Tangerine Banking. Transactions can be downloaded in CSV format on your Tangerine account.
analysis analytics argparse banking cli command-line command-line-tool csv data-analysis data-analytics finance pandas python tangerine transactions
Last synced: 07 Nov 2024
https://github.com/linguini1/coopscraper
Scrapes the co-op job board provided by Carleton for jobs on my shortlist, then saves the jobs to a CSV file so that I can manipulate them with Excel.
csv data-analysis python selenium webscraper webscraping
Last synced: 07 Nov 2024
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 15 Nov 2024