Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-01-15 00:07:24 UTC
- JSON Representation
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 21 Nov 2024
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 19 Dec 2024
https://github.com/shriram-vibhute/digit_classification
This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.
classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn
Last synced: 15 Nov 2024
https://github.com/namratha2301/best-selling-books
Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance.This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.
data-analysis data-visualization matplotlib pandas sckiit-learn seaborn
Last synced: 30 Nov 2024
https://github.com/samuelsoaress/wkd-default-reduction
reduction of default from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 08 Jan 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 08 Jan 2025
https://github.com/prekshivyas/cis-595-big-data-analytics
Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.
data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping
Last synced: 05 Jan 2025
https://github.com/ayushsiloiya619/brain-stroke-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot python3 seaborn seaborn-python
Last synced: 30 Nov 2024
https://github.com/ginga1402/youtube_analysis
Exploratory Data Analysis on YouTube data
college-project data-analysis pandas-python
Last synced: 10 Dec 2024
https://github.com/ayushsiloiya619/spotify-song-analysis
Data Analytics with Python
data-analysis matplotlib-pyplot pandas-dataframe python3 seaborn
Last synced: 30 Nov 2024
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 08 Jan 2025
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 14 Jan 2025
https://github.com/kwonnayeon/medium-post-projects
Contains the code and projects from my Medium posts. I share what I've learned through trial and error to help others tackle similar work smoothly.
data-analysis data-science data-visualization medium-articles python r-language sql
Last synced: 13 Jan 2025
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 30 Dec 2024
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 30 Dec 2024
https://github.com/karencofre/riesgorelativo-lookerstudio
proyecto de análisis de datos y análisis perdicitvo en looker studio y google colab
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 21 Nov 2024
https://github.com/ayberkyavuz/body_type_estimator
This repository is a tutorial for all levels who want to learn how to develop end to end machine learning system.
backend classification css data-analysis dataset end-to-end flask flask-application frontend html javascript machine-learning machine-learning-application material-design materializecss pandas python tutorial webapp xgboost
Last synced: 06 Dec 2024
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 13 Jan 2025
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 29 Nov 2024
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 26 Dec 2024
https://github.com/edumoraes1/journey_active_users
Segmentação de base via SQL para jornada de vendedores ativos
bq data-analysis salesforce sql
Last synced: 21 Nov 2024
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard to visualizes data on current Twitter trends by country and globally. Collects data of over 60 countries using the python Tweepy library, processed it,and visualized it in the form of bar chart and pie chart using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 10 Jan 2025
https://github.com/kwonnayeon/urban-parks-childrens-happiness
A thesis project exploring the causal impact of urban parks on children's happiness, with data, results, and code.
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 20 Dec 2024
https://github.com/abhash-rai/analyzing-credit-card-eligibility
This work was performed as part of BCU undergraduate course.
data-analysis data-visualization ggplot ggplot2 latex r
Last synced: 20 Dec 2024
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 26 Dec 2024
https://github.com/albamerdani/iot_air_quality_ml
IoT Project for Air Quality and Data Analysis with Machine Learning
air-quality aqi data-analysis data-science decision-tree iot machine-learning-algorithms prediction random-forest raspberry-pi-3 sensors
Last synced: 29 Dec 2024
https://github.com/dataforgeopenaihub/steam-sales-analysis
This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven Cloud and visualized using Tableau dashboards for insightful analysis of gaming trends and sales performance.
data-analysis data-engineering data-pipepline data-warehousing games mysql-database python steam-api tableau typer-cli
Last synced: 10 Oct 2024
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 14 Jan 2025
https://github.com/itrauco/data-dirtying-tool
a simple command line tool to generate dirty data and do common data things in google cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 06 Jan 2025
https://github.com/linguini1/tangerineanalyzer
Command line tool for analyzing transactions in CSV format provided by Tangerine Banking. Transactions can be downloaded in CSV format on your Tangerine account.
analysis analytics argparse banking cli command-line command-line-tool csv data-analysis data-analytics finance pandas python tangerine transactions
Last synced: 29 Dec 2024
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 21 Nov 2024
https://github.com/virajbhutada/article-clustered-recommendation-system-ml
This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.
anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy
Last synced: 15 Oct 2024
https://github.com/antoniszks/music-category-identifier
A 'Data-Science & Machine Learning' project where we are training a neural network to identify what kind of music we give to it. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 08 Jan 2025
https://github.com/antoniszks/data-analysis-problems
A repository containing real world data analysis python notebooks
data-analysis data-analysis-python data-science jupyter-notebook python real-world-data statistics
Last synced: 08 Jan 2025
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Projeto em que percorro as etapas que compõem o ciclo de vida da ciência de dados a fim de prever o atrito de clientes do serviço de cartões de crédito de um banco.
data-analysis data-science data-visualization machine-learning python
Last synced: 16 Nov 2024
https://github.com/vaishnavipaithane/bellabeat-data-analysis-case-study
This capstone project was done as a part of Google Data Analytics Professional Certificate course.
bigquery data-analysis sql tableau
Last synced: 21 Nov 2024
https://github.com/marielachirinosr/pandas-weather-project
Pandas Weather Data. Explore straightforward Python scripts for weather information analysis.
Last synced: 29 Dec 2024
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 08 Jan 2025
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 08 Jan 2025
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 08 Jan 2025
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzed box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations like scatter plots, bar charts, and identifies top and bottom performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 08 Jan 2025
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 17 Nov 2024
https://github.com/nickenshidqia/sql-for-financial-data-analysis
Design SQL queries to generate accurate and timely financial reports including Profit and Loss statements, Balance Sheets, and Cash Flow statements
azure-data-studio data-analysis finance microsoft-sql-server sql
Last synced: 17 Nov 2024
https://github.com/itsmeyogesh22/people-s-analytics-case-study
Part of Danny Ma's virtual apprenticeship online program, "People's Analytics Case Study" aims to demonstrate practical use of various SQL Concepts like Materialized Views, Snapshot Data and Historical Data
danny-ma data-analysis dataanalysis datascience mssqlserver pgadmin4 postgresql snapshot-data sqlserver t-sql
Last synced: 17 Nov 2024
https://github.com/edjoukou/altip-sales-analysis
It is about Sales data analysis
data-analysis mysql-database sql tableau visualization
Last synced: 07 Dec 2024
https://github.com/pronzzz/diabetes-prediction
Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset
data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn
Last synced: 24 Dec 2024
https://github.com/silveirinhajuan/rotinapy
RotinaPy: Simplify your daily life and maximize productivity with an integrated app for task management, study tracking, flashcards, and more. Built with Streamlit and Python.
data-analysis flashcards llm-integration llm-ui machine-learning ollama productivity python streamlit study study-project study-tracker task-management task-manager
Last synced: 19 Dec 2024
https://github.com/loginchik/mid_contracts
Анализ контрактов государственных закупок МИДа РФ
data-analysis dataset pandas python
Last synced: 08 Nov 2024
https://github.com/ireneflorez/e_commerce_a_b_test_analysis
website A/B test data analysis
data-analysis jupyter-notebook matplotlib numpy pandas python statsmodels
Last synced: 11 Jan 2025
https://github.com/ayushsiloiya619/online-food-orders-analysis
Data Analytics with Python
data-analysis data-visualization matplotlib pandas-dataframe python3 seaborn-python
Last synced: 30 Nov 2024
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Jan 2025
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 06 Jan 2025
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 08 Jan 2025
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 07 Jan 2025
https://github.com/moenessgannouni/englandweather
A mini-project that analyzes weather data in England usingLinear Regression and Multiple Linear Regression. Ideal for learning and applying statistical analysis and predictive modeling.
data-analysis data-visualization linear-regression multiple-linear-regression rprogramming
Last synced: 29 Nov 2024
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 16 Nov 2024
https://github.com/shivshah19/movie-recommendation-system
This Movie Recommendation System is designed to provide personalized movie recommendations based on user preferences.
cosine-similarity data-analysis machine-learning pandas python streamlit
Last synced: 19 Dec 2024
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 29 Nov 2024
https://github.com/cecoeco/sas_certificate
my code from Coursera's SAS programming specialization
Last synced: 27 Dec 2024
https://github.com/srvcl/lung-cancer-survival-analysis
Data Cleaning of a dataset and Survival Analysis in R Language
data-analysis data-science data-visualization r survival-analysis
Last synced: 14 Jan 2025
https://github.com/neerajpokala/malasian-housing-data
This repository contains the code, visualizations, and documentation for a comprehensive analysis of Malaysian condominium prices. The project explores factors influencing property prices, such as property size, number of bedrooms, facilities, and proximity to key amenities. Using statistical techniques and data visualization
data-analysis hypothesis-testing r
Last synced: 14 Jan 2025
https://github.com/bala-1409/rafik-s-kitchen-data-analysis
The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.
business-analytics business-intelligence data-analysis data-analytics data-visualization eda exploratory-data-analysis ms-office powerbi-report powerpoint-presentations python sql-server
Last synced: 29 Nov 2024
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 30 Dec 2024
https://github.com/felinjob/ibm-applied-data-science-capstone
Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 23 Nov 2024
https://github.com/francois-lenne/eletric_vehicle_usa
the project is purely educational the main goal is to use fabric
data-analysis data-engineering delta-lake fabric jupyter-notebook pyspark python spark
Last synced: 13 Jan 2025
https://github.com/chardyb/prob-and-stats-bmi6106
A repository for Spring 2025 BMI 6106: Statistics and Probability. This repository contains coursework, code examples, and projects exploring statistical methods and probabilistic models in biomedical informatics.
biomedical-informatics data-analysis data-science probability r statistical-modeling
Last synced: 14 Jan 2025
https://github.com/12danielll/neurogenomics_project
This project focuses on analyzing sequencing data to understand molecular mechanisms of neurological diseases and predict the effectiveness of immunotherapy in breast cancer patients. It integrates Python and R scripts for data processing, statistical analysis, and visualization, alongside a comprehensive report detailing methods and findings.
bioinformatics biostatistics clustering clustering-algorithms data-analysis data-visualization deseq2 differential-gene-expression functional-analysis immune-therapy machine-learning neurological-disease neuroscience pca-analysis python r seurat single-cell-analysis
Last synced: 14 Jan 2025
https://github.com/prateek5525/retail-sales-analysis-project
This project involves analyzing retail sales data using SQL to uncover insights into sales patterns, customer behavior, and product performance. It serves as an exercise to develop foundational SQL skills in data exploration, cleaning, and analysis.
data-analysis data-cleaning retail-sales-data sql
Last synced: 22 Nov 2024
https://github.com/souraevshing/data-science-01
Data analysis using jupyter notebook.
data-analysis data-science data-visualization jupyter-notebook python
Last synced: 13 Jan 2025
https://github.com/mktechai-0786/data-analysis-on-dr-visits
Data Analysis On Dr. Visits dataset
data-analysis matplotlib-pyplot numpy pandas seaborn
Last synced: 31 Dec 2024
https://github.com/rnddave/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 11 Oct 2024
https://github.com/lucas54neves/financial-organizer
Financial organizer using Streamlit
data-analysis data-science financial-organizer plotly python streamlit
Last synced: 13 Jan 2025
https://github.com/martachesnova/big-data
Finding out whether reviews from Amazon's Vine program are trustworthy. Performed ETL process in the Cloud and uploaded a DataFrame to an RDS instance. Used PySpark and Spark SQL to perform a statistical analysis and uncover "hidden" insights.
big-data data-analysis dataset python spark sql
Last synced: 06 Jan 2025
https://github.com/multitagging/benchmarks
Provides benchmarks to test the MultiTagging framework
benchmarks data-analysis ethereum smart-contracts vulnerabilities
Last synced: 11 Oct 2024
https://github.com/martachesnova/python-apis
A weather analysis that randomly selects more than 500 cities across the globe, pulls data from the OpenWeatherMap API for each city. Analysis of the weather and perfect vacation spot is viewable on my Jupyter Notebook.
Last synced: 06 Jan 2025
https://github.com/v-octal/random_forest_from_scratch
My implementation of Random Forest regressor in python
data-analysis machine-learning random-forest
Last synced: 30 Dec 2024
https://github.com/aimin-nur/visualisasi_bikestore
Data Analyst - Dashboard Bike Store
data-analysis sql visualization
Last synced: 30 Dec 2024
https://github.com/danmadeira/algoritmos-estatistica-pl-sql
Demonstração de Algoritmos de Estatística em PL/SQL
algorithms data-analysis data-science database oracle oracle-database pl-sql statistics
Last synced: 08 Jan 2025
https://github.com/danmadeira/algoritmos-estatistica-python
Demonstração de Algoritmos de Estatística em Python
algorithms data-analysis data-science python statistics
Last synced: 08 Jan 2025
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 11 Jan 2025
https://github.com/monddavila/online-retail-data-analysis
Online Retail Exploratory Data Analysis with Python
data-analysis jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 14 Jan 2025
https://github.com/dogan-the-analyst/chair_sales_data_analysis
It is an Excel study for practice.
Last synced: 08 Jan 2025
https://github.com/dogan-the-analyst/analyze_data_in_a_model_car_database
This is a SQL project.
Last synced: 08 Jan 2025
https://github.com/martachesnova/python
Created a Python script to calculate and analyze financial records of a company. Created another Python script to do calculations and analysis of the voting process in a small town.
Last synced: 06 Jan 2025
https://github.com/martachesnova/sql
Performing data modeling (ERD) and data engineering. Then, writing series of SQL queries to analyze Employee Database of a company.
data-analysis data-engineering data-modeling erd postgresql sql
Last synced: 06 Jan 2025
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 19 Nov 2024
https://github.com/bala-1409/peerloankart-loan-fraud-detection-datascience-project
This project uses machine learning to predict whether a loan applicant will repay their loan. The project uses a dataset of historical loan data from PeerLoanKart, a peer-to-peer lending platform.
correlation data-analysis data-cleaning data-science data-visualization dimensional-analysis eda exploratory-data-analysis feature-engineering gradient-boosting-classifier hyperparameter-tuning juypter-notebook machine-learning machine-learning-algorithms numpy pandas predictive-modeling python3 scikitlearn-machine-learning supervised-learning
Last synced: 29 Nov 2024
https://github.com/rishisolanke/pdf_query_langchain
PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.
artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool
Last synced: 29 Nov 2024
https://github.com/gabrielagodek/webscraper
The project was developed during master's studies. It is based on the Python library Scrapy.
data-analysis python scraper scrapy
Last synced: 17 Nov 2024
https://github.com/marialuizaleitao/walmartsalesanalysis
This project explored data collection and preprocessing, advanced application of SQL queries, and feature engineering. Key calculations, such as COGS (Cost of Goods Sold) and VAT (Value Added Tax), were performed to assess the profitability and financial efficiency of the branches.
business-analytics data-analysis mysql-database sql
Last synced: 08 Jan 2025
https://github.com/gianninijs/dashboard_cury_company
Dashboard
data-analysis data-visualization python statistics streamlit
Last synced: 18 Dec 2024
https://github.com/bala-1409/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint power-bi powerbi powerbi-reports powerbi-visuals visualization
Last synced: 29 Nov 2024
https://github.com/george-gca/ai_papers_analysis
Do some analysis based on main AI conferences
conferences data-analysis fasttext fasttext-embeddings fasttext-python python scikit-learn top2vec
Last synced: 14 Jan 2025
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 14 Nov 2024
https://github.com/rayanwaked/wildfire-analysis
The project aims to visualize wildfire activity in Oregon, exploring related data to create visualizations and tables to analyze the historical patterns.
data data-analysis data-visualization jupyter jupyter-notebook oregon portland-state-university python wildfire-data-visualization
Last synced: 15 Nov 2024
https://github.com/noturlee/iris-dataanalyis
This project aims to classify Iris flowers into three species—setosa, versicolor, and virginica—based on their sepal and petal measurements using machine learning techniques. The dataset comprises 150 samples evenly distributed among these species
data-analysis data-modeling data-science data-structures-and-algorithms data-visualization
Last synced: 22 Dec 2024
https://github.com/nuccitheboss/jespipe-plugin
Your go to spot for creating and using Jespipe plugins.
adversarial-attacks data-analysis data-manipulation data-visualization machine-learning machine-learning-algorithms
Last synced: 29 Nov 2024
https://github.com/badranalyst/residential-unit-prices-data-analysis-application
Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.
data-analysis data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 08 Jan 2025
https://github.com/badranalyst/covid-deaths-dashboard-with-tableau
This project showcases an interactive dashboard developed in Tableau to visualize COVID-19 deaths data. It provides insights into trends, geographical distributions, and key metrics related to mortality during the pandemic. The dashboard aims to enhance understanding of the data, supporting public health analysis and decision-making.
covid-19 dashboard data data-analysis data-visualization dataset tableau tableau-dashboards visualization
Last synced: 08 Jan 2025
https://github.com/bhaveshbhakta/fish-weight-prediction-using-ml
Fish Weight Prediction
data-analysis data-visualization fish-weight-prediction gradient-boosting machine-learning
Last synced: 14 Jan 2025