Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-01-22 00:07:16 UTC
https://github.com/joanmartin/uib-masterinbigdata
Master's degree of Big Data Analysis in Economics and Business
big-data data-analysis data-science igraph machine-learning machine-learning-algorithms matplotlib pandas python r sklearn
Last synced: 14 Dec 2024
https://github.com/gcappon/py_agata
Official Python port of AGATA (Automated Glucose dATa Analysis), a toolbox to analyse glucose data.
continuous-glucose-monitoring data-analysis hacktoberfest python toolbox
Last synced: 12 Nov 2024
https://github.com/tushar2704/everyday_python
Welcome to Everyday Python Sheets – your go-to resource for everyday Python cheat sheets, pro tips, interview questions, Python one-liners, and Python data structures. Whether you're a beginner looking to learn Python or an experienced developer seeking quick reference materials, this Streamlit application has got you covered.
artificial-intelligence cheatsheet data data-analysis data-science data-structures data-visualization database protips python streamlit streamlit-tushar2704 tushar2704
Last synced: 27 Dec 2024
https://github.com/mykhode/python-sic-mini-project
SAMSUNG SIC Finish Project Course - Python
data-analysis python-analysis samsung-sic
Last synced: 11 Jan 2025
https://github.com/storopoli/r_scripts
A couple of handy R scripts that I use on a daily basis for scientific research
data-analysis data-science data-visualization r scientific
Last synced: 20 Nov 2024
https://github.com/paulseperformance/jupyter-notebooks
Where I keep all my jupyter notebooks
bitcoin blockchain data-analysis data-visualization python
Last synced: 30 Dec 2024
https://github.com/EmirhanServeren/NFT-CollaBot
NFT CollaBot is a data-oriented project designed around the requirements of the NFT ecosystem; it aims to strengthen the community.
data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet
Last synced: 08 Nov 2024
https://github.com/AivanF/Lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 07 Nov 2024
https://github.com/asifdotexe/sentimentscoringmodel
This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.
data-analysis data-visualization natural-language-processing sentiment-analysis
Last synced: 15 Nov 2024
https://github.com/quantumudit/analyzing-whiskyexchange-whisky
This project focuses on scraping data related to Japanese whisky from The Whisky Exchange website, performing the necessary transformations on the scraped data, and then analyzing and visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/heiderjeffer/misalignment-between-ownership-and-contribution-affects-system-reliability
Research Proposals RP
archtecture data-analysis data-collection nvivo-software python qualitative-analysis quantative-analysis reliability-engineering software-engineering
Last synced: 15 Dec 2024
https://github.com/iondv/metrics
IONDV. Framework application: Metrics collects and displays metrics data.
collecting data data-analysis iondv iondv-app metrics
Last synced: 08 Jan 2025
https://github.com/quantumudit/analyzing-cleanaway-services
This project focuses on scraping all the service locations across Australia and their associated attributes from the "Cleanaway" website, performing the necessary transformations on the scraped data, and then analyzing and visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 06 Nov 2024
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 15 Nov 2024
https://github.com/rdpeng/analyticdesigntheory
Web site for Analytic Design Theory
analytic-design-theory data-analysis exploratory-data-analysis
Last synced: 29 Oct 2024
https://github.com/sondosaabed/advanced-data-wrangling
An advanced course covering the three phases of data wrangling: gathering, assessing, and cleaning data.
data-analysis data-analyst-nanodegree data-wrangling numpy pandas python
Last synced: 06 Nov 2024
https://github.com/mrankitgupta/mrankitgupta
I'm Ankit Gupta; this repository contains a short and interesting introduction about me.
ai ankit ankit-gupta ankitgupta artificial-intelligence awesome-readme data-analysis data-science data-visualisation data-visualization github github-profile github-profile-readme machine-learning mrankitgupta profile python readme readme-profile social
Last synced: 17 Nov 2024
https://github.com/erictleung/erictleung.github.io
📝 Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Dec 2024
https://github.com/llnl/nddav
N-Dimensional Data Analysis and Visualization
data-analysis data-viz high-dimensional-data topological-data-analysis visual-analytics visualization
Last synced: 11 Nov 2024
https://github.com/amrrs/introduction-to-eda-with-python
Introduction to EDA with Python Session Files
Last synced: 15 Nov 2024
https://github.com/qathom/crawlx
crawlx lets you analyze product data on Amazon. It is a simple and lightweight tool to act on product issues.
data-analysis electron es6-javascript vue vuejs2 vuex2 webpack
Last synced: 21 Dec 2024
https://github.com/nafisalawalidris/bitcoin-price-analysis-api
Explore this comprehensive API for Bitcoin price analysis, featuring historical data and real-time updates from major exchanges. Built with FastAPI, Python and PostgreSQL, this open-source project is optimised for efficiency and scalability, catering to developers, traders and analysts. Contribute and collaborate to enhance functionality.
api-development bitcoin bitcoin-api cryptocurrency data-analysis fastapi financial-data machine-learning open-source postgresql python restful-api
Last synced: 10 Oct 2024
https://github.com/sondosaabed/data-visualization-with-matplotlib-and-seaborn
Learning to apply sound design and data visualization principles to the data analysis process. Also learning how to use analysis and visualizations to tell a story with data.
data-analysis data-analyst-nanodegree data-visualization matplotlib python seaborn seaborn-plots
Last synced: 06 Nov 2024
https://github.com/neutrinoceros/gpgi
A Generic Interface for Grid + Particle data
data-analysis grid particles performance
Last synced: 16 Nov 2024
https://github.com/fx2y/datanarrate
[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.
ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql
Last synced: 16 Jan 2025
https://github.com/msyriac/alhazen
Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.
analysis cmb data-analysis gravitational-lensing lensing
Last synced: 27 Oct 2024
https://github.com/michaelnabil230/laravel-analytics
A Laravel package to retrieve pageviews and other data from Database
data-analysis data-structures database laravel php
Last synced: 15 Nov 2024
https://github.com/aivanf/lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 11 Oct 2024
https://github.com/dcs-training/digital-method-of-the-month
This repository contains the documents we produced to support the discussion in our Digital Methods of the Month. These documents will help you orient yourself if you want to pick up the method in your research. Go to the readme file
3d-data data-analysis data-visualisation data-wrangling geographical-data gis good-practices-digital-research machine-learning network-analysis open-research preregistration statistics text-analysis
Last synced: 07 Jan 2025
https://github.com/surajv311/data_analysis-food_recipes_ds
Data preprocessing, cleaning, analysis & plotting 📊 of the Food Recipes Dataset (from Kaggle). 🐍 Libraries used: Pandas, Matplotlib, Seaborn, Plotly. 📈
data-analysis kaggle-dataset matplotlib numpy pandas plotly seaborn
Last synced: 21 Jan 2025
https://github.com/pmsipilot/intercom2dw
Intercom2DW is an attempt at loading all the data available in an Intercom application.
Last synced: 14 Jan 2025
https://github.com/k1rsn7/kaggle
A collection of Kaggle solutions.
computer-vision cv data-analysis data-science deeplearning deeplearning-ai english english-language jupyter jupyter-notebook jupyter-notebooks kaggle kaggle-challenge russian russian-language
Last synced: 29 Dec 2024
https://github.com/valeriopagliarino/tcf-2021-unito-public
Exam project for the course "Computing Techniques for Physics" - Università degli Studi di Torino - Physics department - 2021
cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics
Last synced: 06 Dec 2024
https://github.com/romac/adaproject
🔬 Project proposal for the Applied Data Analysis course at EPFL
Last synced: 24 Dec 2024
https://github.com/wtbates99/stock-indicators
A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.
backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series
Last synced: 07 Jan 2025
https://github.com/andreip/twitter-authorities
Find authorities for Twitter topics. [Licenta][Undergraduate thesis]
data-analysis mongodb python twitter-topics
Last synced: 19 Dec 2024
https://github.com/tsffarias/ibge-nomes-brasil
Data analysis of names in Brazil using the database made available by IBGE (2010)
Last synced: 05 Nov 2024
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 29 Nov 2024
https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged BigQuery and MySQL to analyze 100K records for sales optimization, trend identification, and customer-satisfaction improvement for a retail brand in South America, and to provide insights and recommendations for growing its user base and improving its services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 14 Jan 2025
https://github.com/neelshah18/neelshah18.github.io
Neel Shah's Website
article blog data-analysis data-science data-visualization deep-learning machine-learning personal-website
Last synced: 30 Nov 2024
https://github.com/manikantasanjay/loan_repayment_regression_project
Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.
data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow
Last synced: 05 Nov 2024
https://github.com/alanmenchaca/ml-books-compilation
Repository of books that I have been collecting about machine learning, deep learning, data science, and more ...
data-analysis data-science deep-learning machine-learning statistics
Last synced: 17 Dec 2024
https://github.com/dmytrovoytko/data-engineering-amazon-reviews
Data Engineering project for ZoomCamp '24: JSONL -> PostgreSQL/BigQuery + Metabase + Mage.AI
bash-script bigquery codespaces data-analysis data-visualization etl metabase pipeline python-script
Last synced: 14 Nov 2024
https://github.com/gxjansen/user-analysis-with-r-google-analytics
Analyzing user behavior of an E-commerce website with R and (mainly) Google Analytics Data
analytics analytics-api conversion-rate-optimization data-analysis ecommerce google google-analytics r
Last synced: 30 Oct 2024
https://github.com/lucs1590/strava-analysis
🏃📊 Using Strava for personal analyses and to practice data science skills.
data-analysis data-science github jupyter-notebook python3 strava strava-api
Last synced: 17 Dec 2024
https://github.com/nelson-gon/nelson-gon.github.io
Biologically Plausible Programming
bioinformatics blog blogdown computational-biology data-analysis data-exploration ghost ghostwriter-theme github github-pages hugo-site hugo-theme programming python3 r side-project
Last synced: 29 Oct 2024
https://github.com/petulla/readroper
Read single and multi-card ASCII polling datasets in R
ascii data-analysis polling-data r
Last synced: 13 Dec 2024
https://github.com/c0deta1ker/arpescape
ARPEScape is a MATLAB-based app that contains a set of tools and functions for analysing the electronic structure of materials using photoelectron spectroscopy (PES) techniques, such as X-ray photoelectron spectroscopy (XPS) and angle-resolved photoelectron spectroscopy (ARPES).
analysis analysis-package angle-resolved-photoemission angle-resolved-spectroscopy arpes condensed-matter-physics data-analysis lcn matlab photoelectron-spectra photoelectron-spectroscopy photoemission psi sls ucl xps
Last synced: 30 Nov 2024
https://github.com/arnavk-09/world-population
🌏 Taipy demo to explore world population data...
data-analysis python taipy world-population
Last synced: 13 Dec 2024
https://github.com/lisa-ho/three-investigators
Repository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (the three investigators).
data-analysis data-viz datawrapper python webscraping
Last synced: 18 Dec 2024
https://github.com/ndleah/school-donation
💰 Top school donors analysis
cufflinks data-analysis data-science data-visualization dataset exploratory-analysis python python-library python3
Last synced: 12 Jan 2025
https://github.com/pratishtha-abrol/astronomy-dataanalysis
A key technique in Data Driven Astronomy
astronomy astropy crossmatch data-analysis
Last synced: 12 Dec 2024
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 02 Jan 2025
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 30 Nov 2024
https://github.com/prajwalchapke055/cognizant-artificial-intelligence-job-simulation-forage
Advise one of Cognizant’s clients on a supply chain issue by applying knowledge of machine learning models.
artificial-intelligence cognizant communication data-analysis data-modeling data-visualization development evaluation forage job-simulation machine-learning machine-learning-algorithms machine-learning-engineering model-interpretation presentation problem-statement python quality-assurance skills virtual-internship
Last synced: 22 Jan 2025
https://github.com/andreaschandra/who-suicides-statistics
Exploratory Data Analysis for Suicides using Python
data-analysis data-science eda python
Last synced: 19 Dec 2024
https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis
What's the most positive or negative religion? Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R
bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis
Last synced: 07 Nov 2024
https://github.com/cosmoduende/r-marvel-vs-dc
DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R
comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros
Last synced: 07 Nov 2024
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 25 Nov 2024
https://github.com/adirthaborgohain/community-data-analysis
Data and visual analysis of several different communities generated using the Louvain algorithm in Neo4j on the dblp dataset.
Last synced: 11 Dec 2024
https://github.com/anushadatta/airbnb-in-seattle
🏨 Understanding the Airbnb rental landscape in Seattle using data science.
airbnb data-analysis data-exploration data-visualization datascience sentiment-analysis
Last synced: 11 Dec 2024
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 12 Dec 2024
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and LaTeX files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 11 Dec 2024
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 14 Jan 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 03 Oct 2024
https://github.com/thecoderpinar/hms-brainactivity-analysiss
Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.
critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing
Last synced: 16 Dec 2024
https://github.com/thecoderpinar/credit-card-fraud-detection-project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨
anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python
Last synced: 16 Dec 2024
https://github.com/thecoderpinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 16 Dec 2024
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in a competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 10 Jan 2025
https://github.com/thecoderpinar/gen-expression
Gene expression analysis is a fundamental component of genomics research, providing valuable insights into how genes are regulated and their impact on various biological processes. This project delves into the realm of gene expression data, aiming to uncover hidden patterns and relationships within complex datasets. 🚀
bioinformatics biotechnology data-analysis data-science data-visualization genomics kaggle machine-learning pca python
Last synced: 16 Dec 2024
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 16 Dec 2024
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 22 Jan 2025
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 04 Dec 2024
https://github.com/haritha1005/data-analysis-portfolio
This repository showcases my data analytics and data science skills through projects, fostering collaboration and community engagement
data-analysis data-visualization etl excel matplotlib numpy-library pandas powerbi-report python3 r scipy sql tableau
Last synced: 06 Dec 2024
https://github.com/phollemans/cwutils
CoastWatch Utilities software for working with satellite data files from NOAA CoastWatch and elsewhere
cdat coastwatch-utilities data-analysis data-visualization install4j java noaa-coastwatch remote-sensing satellite-imagery
Last synced: 12 Nov 2024
https://github.com/iamgmujtaba/scholar_search
This project provides a tool for extracting and analyzing the quantity and distribution of scholarly articles related to a particular topic or field over a desired time span, using Google Scholar search results and built-in data visualization functionality.
academia academic academic-papers data-analysis data-visualization google google-scholar scholarly-articles
Last synced: 16 Dec 2024
https://github.com/aad99bxp/whatsapp-chat-analyzer
A project intended for business owners and managers to analyze WhatsApp chats between their customer care executives and their customers.
data-analysis heroku-deployment python3
Last synced: 22 Jan 2025
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on YouTube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 16 Jan 2025
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 04 Dec 2024
https://github.com/narius2030/sakila-datawarehouse-ssis
Implement a simple data warehouse to store Sakila data; create data pipelines to extract, transform, and load data from source to warehouse; retrieve data from the warehouse for exploration and analysis
data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis
Last synced: 14 Dec 2024
https://github.com/app-generator/devtool-db-introspection
Database Introspection Tool - Open-Source | AppSeed
data-analysis database-schema database-tool db-scan db-tool developer-tools peewee peewee-orm python-database python-datatypes python-db python-tool
Last synced: 20 Dec 2024
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 04 Dec 2024
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 06 Jan 2025
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 18 Oct 2024
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 16 Jan 2025
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 28 Nov 2024
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and an NER model to classify and extract relevant entities from resumes such as person name, college name, academic information, relevant experience, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 16 Dec 2024
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 23 Dec 2024
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Server, Netezza, DB2, Postgres, Oracle, MySQL).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 04 Dec 2024
https://github.com/abeltavares/nps_performance_analysis
Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI
customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql
Last synced: 05 Jan 2025
https://github.com/gustavohnsv/teamwork_mqa
Repository dedicated to group work based on the study of data analysis methods for the course Quantitative Methods for Multivariate Analysis.
data-analysis group-project r team-repo
Last synced: 16 Dec 2024
https://github.com/0mppula/element-compare
A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.
compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand
Last synced: 11 Nov 2024
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn
Last synced: 09 Dec 2024
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 16 Dec 2024
https://github.com/invictusaman/socioeconomic-indicators-in-chicago-sql-python
This project shows how to create a database connection in a notebook, update the database using Python, and run Python code and SQL queries together. It uses SQLite and a Chicago dataset for analysis.
data-analysis jupyter-notebook python sql sql-queries sqlite
Last synced: 12 Oct 2024
https://github.com/jesussantana/data-science-with-python-it-academy
Learn how to extract value from data by ingesting, transforming, storing, analyzing, and visualizing data
classification-model clustering-methods dash data-analysis data-mining data-science database machine-learning matplotlib mongodb numpy pandas plotly python3 regression-models seaborn sklearn sql web-scraping
Last synced: 12 Nov 2024
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 15 Jan 2025
https://github.com/asifdotexe/covidporfolioproject
This is a SQL + Tableau project on a real-world COVID-19 dataset, from the first recorded case to 2 March 2022 (i.e., my birthday XD)
dashboard data-analysis data-exploration data-visualization sql sql-server tableau
Last synced: 15 Jan 2025
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 03 Dec 2024