Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-07 00:07:24 UTC
- JSON Representation
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 19 Jan 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 05 Feb 2025
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 24 Jan 2025
https://github.com/salman-khan-mohammed/predicting-the-intent-of-online-shoppers
This project aims to predict online shoppers' purchase intentions using browsing history and user data from e-commerce sites. By analyzing clickstream and session information, the goal is to create a machine learning model that accurately forecasts customers' likelihood of making a purchase.
cluster-analysis data-analysis data-pre eda outliers prediction
Last synced: 31 Jan 2025
https://github.com/vasishth/lecturesintrobayes
Please go to the website for these online lectures:
bayesian-inference brms data-analysis stan
Last synced: 15 Dec 2024
https://github.com/datawithbaraa/sql-modern-warehouse-and-analytics
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-science data-warehouse data-warehousing database datalake datascience datawarehouse datawarehousing etl medallion-architecture pipeline sql sql-query sql-server
Last synced: 23 Dec 2024
https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity
In this project a brief analysis has been done by our team in the export economy of Bangldesh for the past three decades.
data-analysis data-science data-visualization inequalipy matplotlib pandas plotly
Last synced: 19 Jan 2025
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 16 Jan 2025
https://github.com/maskedsyntax/taskit
A simple web based Task Tracker for better focus
charts data-analysis python3 streamlit task-tracker-app todo-list
Last synced: 05 Feb 2025
https://github.com/olgapavlova/agile-health-hackathon
Визуализируем здоровье спринтов разработки по сырым данным
data-analysis data-visualization figma google-sheets matplotlib pandas python sql
Last synced: 19 Jan 2025
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 29 Jan 2025
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 17 Jan 2025
https://github.com/evardnk/dataanalyticsportfolio
Собрание моих проектов по аналитике данных
api automation data-analysis etl-pipeline jupyter-notebook jupyterlab kpis numpy pandas pipeline postgresql powerbi python sql visualization
Last synced: 19 Dec 2024
https://github.com/christianrcanlas/christianrcanlas.github.io
e-Portfolio showcasing my personal projects.
arima classification-algorithims crostons-method data-analysis data-visualization data-warehousing etl-pipelines hierarchical-forecasting holt-winters long-short-term-memory machine-learrning ms-sql-server predictive-analytics python r-markdown support-vector-regression t-sql tableau time-series-decomposition time-series-forecasting
Last synced: 18 Jan 2025
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 09 Jan 2025
https://github.com/sumitkumargiri/machine-learning-project
This repository contain all the best practices for managing Github repository.
data-analysis github machine-learning opensource project python
Last synced: 19 Jan 2025
https://github.com/nomadsdev/financial-trend-analyzer
FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.
business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis
Last synced: 19 Dec 2024
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 18 Jan 2025
https://github.com/titanscouting/tra-analysis
Titan Robotics 2022 Strategy Team Analysis Repository
data-analysis frc frc-scouting hacktoberfest python
Last synced: 17 Dec 2024
https://github.com/windjammer6/8.-star-wars-data-analysis-python
A personal project to analyse data from a Star Wars survey. Python libraries used: Pandas, Matplotlib
Last synced: 29 Jan 2025
https://github.com/edikedik/lxtractor
Library for analysing protein structures and sequences
bioinfomatics computational-biology data-analysis data-mining feature-extraction python structural-biology
Last synced: 16 Nov 2024
https://github.com/mynenik/xyplot-win32
XYPLOT Plotting and Data Analysis Program for 32-bit Windows
cpp data-analysis data-manipulation data-visualization forth mfc windows-app
Last synced: 24 Jan 2025
https://github.com/al-ghaly/power-bi-dashboard
A dashboard to analyze data specializations job market.
dashboard data-analysis powerbi
Last synced: 22 Jan 2025
https://github.com/phillbertnevinemmanuel/coviddeathvaceda
an exploratory data analysis based on dataset of covid statisics from 2020-2022
Last synced: 23 Dec 2024
https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix
This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst
dashboard data-analysis powerbi visualization
Last synced: 23 Dec 2024
https://github.com/kshitiz1302/credit_card_financial_weekly_status_dashboard
An interactive beginner friendly PowerBi dashboard with useful insights
data-analysis data-cleaning data-manipulation data-modeling data-storytelling data-visualization dax dax-expression dax-query financial-analysis mysql-database powerbi powerbi-custom-visuals powerbi-dashboards powerbi-desktop powerbi-embedded powerbi-report powerbi-visuals reporting
Last synced: 20 Jan 2025
https://github.com/gursv/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-cleaning data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas python3 random-forest-regressor scikit-learn seaborn
Last synced: 20 Jan 2025
https://github.com/jatin-mehra119/bike-rentals-dataset
This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques. Leveraging publicly available data from the UCI Machine Learning Repository, it includes scripts for data preprocessing, model training, and visualization, along with detailed observations and results.
data-analysis data-science ensemble-model pandas scikitlearn-machine-learning
Last synced: 17 Jan 2025
https://github.com/ituvtu/datamining-ab-testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 16 Jan 2025
https://github.com/shriram-vibhute/digit_classification
This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.
classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn
Last synced: 15 Jan 2025
https://github.com/juliusmarkwei/titanic-data-analysis
Data analysis, data visualization, feature scaling, feature transformation, model selection and model optimization.
data-analysis data-science data-visualization linear-regression model-selection regression
Last synced: 01 Jan 2025
https://github.com/juliusmarkwei/iris-dataset-analysis
Data analysis, data visualization and model training using the popular Iris Dataset
data-analysis data-visualisation linear-regression machine-learning
Last synced: 01 Jan 2025
https://github.com/phammings/sales-management-analysis
Sales management analysis and Power BI dashboard for sample business request and user stories
data-analysis excel powerbi sql
Last synced: 15 Jan 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 15 Jan 2025
https://github.com/aalkiyumi/senior-design-project
Web scraper for collecting product and review data from e-commerce sites using Scraping Bee, AWS, Selenium, and Pandas. Focuses on cost-effective solutions, user-friendly interfaces, and efficient data extraction and analysis.
aws cs5001 data-analysis data-extraction data-processing data-storage e-commerce-analytics e-commerce-data pandas product-reviews review-sentiment-analysis scraping-bee selenium senior-design-project uc uc2026 university-of-cincinnati web-crawlers web-scraping
Last synced: 01 Feb 2025
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 06 Feb 2025
https://github.com/maskedsyntax/budgetpie
Android app to manage monthly budgets
android dart data-analysis data-visualization finance-management firebase flutter
Last synced: 05 Feb 2025
https://github.com/gauranshgoel123/predictive-demand-analysis
Demand Forecasting Project A web application for predicting future demand for part numbers based on historical data. Built with React for the frontend and FastAPI with Python for the backend, this application visualizes demand trends and allows users to input additional data for improved accuracy. In render analyzer is frontend analysis is backend
chartjs data-analysis data-science data-visualization dataset deployment full-stack machine-learning numpy pandas predictive-analysis prophet-model python reactjs render
Last synced: 12 Jan 2025
https://github.com/faisal-khann/diwali-sales-analysis
The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future
csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 29 Jan 2025
https://github.com/garciparedes/castile-and-leon-crops
Data Analysis of Castile and Leon Crops Area over the last years
castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain
Last synced: 16 Jan 2025
https://github.com/whis99/userfunnelanalysis
An ecommerce user funnel conversion data analysis with matplotlib & python.
data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python
Last synced: 13 Jan 2025
https://github.com/montanaz0r/testing-if-mma-math-deduction-works-using-ufc-fighters-data
The probabilistic reasoning about phenomenon called MMA math using UFC fighters data and Python.
bayesian-inference data-analysis data-science graphviz jupyter-notebook pandas python scipy statistics
Last synced: 14 Dec 2024
https://github.com/nafisalawalidris/international-breweries
This GitHub readme provides an overview of data analysis using SQL on the International Breweries dataset, including dataset description, analysis questions, example SQL queries, and key insights derived from the analysis.
data-analysis insights international-breweries-dataset queries sql
Last synced: 23 Jan 2025
https://github.com/adagio/ivoox_episodes
iVoox Episodes: Scraping & Analysis
beautifulsoup4 data-analysis ivoox pandas python python3 scraping
Last synced: 27 Dec 2024
https://github.com/vhtua/group4_data_analysis
Hierarchical Cluster Analysis: Movie Genres Preferences
data-analysis hierarchical-clustering r unsupervised-learning
Last synced: 04 Feb 2025
https://github.com/nafisalawalidris/tools-for-data-science
It covers popular languages (Python, R, SQL) and libraries (NumPy, Pandas) used in the field. The author shares their objectives of teaching data analysis, web development, and critical thinking skills. The repository also includes code examples, explanations of arithmetic expressions, and contact information for the author.
arithmetic-expressions data-analysis data-science data-visualization languages libraries matplotlib numpy pandas programming python r sql tools web-development
Last synced: 23 Jan 2025
https://github.com/githubuseraccountamazing/the-amari-project
a project in which I attempted to push some of the limits of stable-diffusion while taking some data along the way
ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion
Last synced: 20 Jan 2025
https://github.com/manwithacap/by-the-metric-match
🎲🃏 A game data tracker for your board/card/video games!
data-analysis data-visualization games jupyter-notebook python utility
Last synced: 08 Jan 2025
https://github.com/stastnypremysl/lsql-csv
lsql-csv is a tool for small CSV file data querying from a shell with short queries. It makes it possible to work with small CSV files like with a read-only relational databases. The tool implements a new language LSQL similar to SQL, specifically designed for working with CSV files in shell.
csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell
Last synced: 02 Feb 2025
https://github.com/nafisalawalidris/investigating-netflix-movies-and-guest-stars-in-the-office
Dive into the world of Netflix and explore the average duration of movies. Netflix, being the largest entertainment company, offers a wide range of movies for its viewers. In this project, we analyse movie durations using pandas and create a DataFrame from a dictionary. By examining average durations from 2011 to 2020.
average-duration csv-files data-analysis data-visualization dataframe filtering movie-durations movie-length-distribution netflix pandas python trends
Last synced: 23 Jan 2025
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 07 Jan 2025
https://github.com/revan-alqahmi/summarize-talabat-company-reviews
Natural Language Processing Project, which is a program that analyzes Arabic comments at Talabat Company and classifies them into positive, negative, and neutral using machine learning algorithms and natural language processing techniques.
artificial-intelligence data-analysis machine-learning-algorithms natural-language-processing python
Last synced: 29 Dec 2024
https://github.com/asifdotexe/air-quality-analysis-aqa
AQA is a data-driven project focused on analyzing air quality data sourced from data.gov.in. The project encompasses data preprocessing, analysis, and visualization to gain insights into air pollution levels across various locations in India. By examining six key pollutants, the project aims to raise awareness about the environmental issues
aqi-analysis data-analysis data-preprocessing data-science data-visualization presentation
Last synced: 15 Jan 2025
https://github.com/asifdotexe/quickvu
Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.
automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit
Last synced: 15 Jan 2025
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 22 Jan 2025
https://github.com/asifdotexe/flipkart-electric-scooter-data-analysis
In this project, I have web scraped Electric Scooter data from Flipkart and turn it into a csv file for further analysis
beautifulsoup4 data-analysis data-science flipkart webscraping
Last synced: 15 Jan 2025
https://github.com/csoren66/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 13 Jan 2025
https://github.com/nafisalawalidris/building-a-clustering-model-for-customer-segmentation
Customer Segmentation Using Clustering: This repo applies clustering algorithms to a customer transaction dataset, grouping similar customers together based on their purchasing behavior. Targeted marketing strategies can be developed by analyzing distinct customer segments.
clustering customer-segmentation data-analysis data-visualization k-means machine-learning marketing-analytics unsupervised-learning
Last synced: 23 Jan 2025
https://github.com/yogeshnile/covid-19-india-data-analysis
In this project are used live covid-19 data
covid-19 data-analysis data-visualization india jupyter-notebook pandas plotly python
Last synced: 10 Jan 2025
https://github.com/nikhilash45/power-bi-vsualisation-of-joins
In This Power Bi Report User Can Visualis Join By Themselves , and it is easy to understand joins now.
business-analytics business-intelligence data data-analysis data-visualization joins powerbi sql visualization
Last synced: 11 Jan 2025
https://github.com/prernarohra/heart-disease-prediction
This project develops a machine learning model to predict heart disease risk based on symptoms and medical history. The model achieved the best accuracy with Logistic Regression, as it works well for binary classification problems.
artificial-intelligence data-analysis data-science dataset heartdisease-prediction machine-learning models
Last synced: 27 Dec 2024
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 16 Jan 2025
https://github.com/dina-hosny/explore-us-bike-share-data-project
Explore US Bike Share Data project - FWD Data Analysis Professional Track. In this project, I used Python to explore data related to bike share systems for three major cities in the United States and answer questions about it by computing descriptive statistics.
data-analysis data-science numpy pandas python
Last synced: 13 Jan 2025
https://github.com/virajbhutada/music-recommendation-system
This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.
data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience
Last synced: 10 Jan 2025
https://github.com/virajbhutada/google-stock-price-forecasting-lstm
Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.
data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting
Last synced: 10 Jan 2025
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 01 Feb 2025
https://github.com/shoyebmd424/design-and-analysis-algorithm
algorithm daa data-analysis data-structures
Last synced: 27 Jan 2025
https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel
Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.
business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database
Last synced: 10 Jan 2025
https://github.com/sunnybibyan/exploratory-data-analysis-eda
Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.
data-analysis data-visualization jupyter-notebook python titanic-dataset
Last synced: 19 Dec 2024
https://github.com/pinedah/escom_development-of-applications-for-data-analysis
This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.
data-analysis data-science data-visualization jupyter-notebook python python-data-analysis
Last synced: 19 Dec 2024
https://github.com/dina-hosny/telco-customer-churn-analysis-using-power-bi
An interactive dashboard to represent some analysis of "Telco customer churn" data and the reasons that made customers churn using Microsoft Power BI.
business-intelligence data-analysis data-modeling data-visualization power-bi powerbi
Last synced: 13 Jan 2025
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 21 Jan 2025
https://github.com/virajbhutada/walmart-retail-analyzer
Gain valuable insights into retail sales with the "Walmart Retail Performance Dashboard" in MS Excel. This user-friendly tool facilitates an in-depth analysis of key sales metrics, providing a comprehensive view of Walmart's performance. Make data-driven decisions for informed and strategic business outcomes.
analytics data-analysis data-science data-visualization excel insights interactive-visualizations performance-analysis retail-sales walmart
Last synced: 10 Jan 2025
https://github.com/mahdi-eth/covid-analysis
Covid-19 data analysis project using python, numpy, pandas, matplotlib
data-analysis data-science python
Last synced: 08 Jan 2025
https://github.com/unrndm/dataanalysis
artifacts and sollutions of homework for course "Data Analysis" in Magistrate of HSE during 2023-2024
Last synced: 01 Feb 2025
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 01 Feb 2025
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 17 Jan 2025
https://github.com/neemiasbsilva/datascience-portfolio
Hello guys, welcome to my Data Science Portfolio. I include some knowledges I earn in my journey. I included some case study, papers, and code. Please check the readme.
case-study churn-prediction code-challenges data-analysis data-science deep-learning forecasting fundamental-of-statistics health-care image-recognition machine-learnin machine-learning math mathematics pattern-recognition portfolio programming-skills speech-emotion-detection statistics voice-activity-detection
Last synced: 05 Jan 2025
https://github.com/ivanildobarauna-dev/currency-quote
Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 19 Dec 2024
https://github.com/busraozdemir0/data_analysis_apps
Data analysis and data visualization applications with python
data-analysis data-analysis-python data-visualization matplotlib matplotlib-figures matplotlib-pyplot numpy numpy-python pandas pandas-dataframe pandas-library
Last synced: 02 Feb 2025
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 08 Jan 2025
https://github.com/allanccwang/electronic_projects
implement the circuit with microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 17 Dec 2024
https://github.com/madhuresh2011/amazon-sales-report-analysis-using-python
This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends
charts cleaning-data data-analysis jupyter-notebook matplotlib numpy pandas python seaborn visualization
Last synced: 02 Feb 2025
https://github.com/netcodez/data-science-projects
Data Science Projects completed on DataCamp Data Scientist with Python Career Track
data data-analysis data-visualization datacleaning feature-engineering feature-extraction machine-learning predictive-analytics predictive-modeling python scikit-learn-python scikitlearn-machine-learning statistical-analysis statistical-models
Last synced: 15 Jan 2025
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 Jan 2025
https://github.com/eslamdyab21/imdb-data-analysis
This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue
data-analysis pandas python udacity-data-analyst-nanodegree
Last synced: 22 Jan 2025
https://github.com/hit07/data_science
Data [ Exploration, Cleaning, Manipulation, Visualisation ]
data-analysis data-cleaning data-exploration data-manipulation data-visualization eda jupyter-notebook matplotlib numpy pandas-dataframe scipy
Last synced: 01 Feb 2025
https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment
This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.
csv-file-handling data-analysis google-colab package-delivery python python-scripting
Last synced: 26 Jan 2025
https://github.com/simranjeet97/google-cloud-access-using-python
Google Drive Access using Python, Interact Programmatically and Manipulate accordingly
data-analysis data-science data-structures data-visualization gcp gcp-cloud-functions gcp-compute gcp-compute-engine gcp-projects gcp-storage google googlecloud googlecloudplatform python python3 visualization
Last synced: 14 Jan 2025
https://github.com/alphonsg/bimana
Package for performing automated bio-image analysis tasks.
bioimage-analysis bioinformatics data-analysis deep-learning edge-detection image-analysis image-processing
Last synced: 20 Dec 2024
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 22 Jan 2025
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 02 Feb 2025
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 02 Feb 2025
https://github.com/jpcadena/car-sales-etl
ETL process for a Car Sales project.
asyncpg car-sales data-analysis data-engineering data-visualization database etl etl-pipeline postgresql python sqlalchemy
Last synced: 15 Jan 2025
https://github.com/aryangupta-09/amrorbit
An antimicrobial resistance (AMR) dashboard.
amr antibiotic antibiotic-resistance antibiotics antimicrobial antimicrobial-data antimicrobial-resistance antimicrobial-susecptibility award-winning d3-visualization d3js dashboard data-analysis data-visualization material-ui nextjs react reactjs redux tailwind-css
Last synced: 05 Jan 2025
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 30 Dec 2024