Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-01-29 00:07:29 UTC
- JSON Representation
https://github.com/miroslav-reiter/sql_zoo
Kurzy SQL, MySQL, PostgreSQL, Microsoft SQL Server
analysis analytics data data-analysis database db2 ibm-db2 mysql mysql-database postgres postgresql postgresql-database sql sql-injection sql-query sql-server sqlserver
Last synced: 09 Apr 2025
https://github.com/agungbudiwirawan/socioeconomic_analysis
The objective of this project is to analyze the socio-economic in Chicago.
chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server
Last synced: 04 Jul 2025
https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets
Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account
data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping
Last synced: 27 Jul 2025
https://github.com/dra1ex/mind-net.js
Fast and simple to use neural network implementation in pure TypeScript with GPU support!
artificial-intelligence classification data-analysis deep-learning gan generative-adversarial-network gpu machine-learning ml neural-network neural-network-engine neural-networks regression sequential-network supervised-learning unsupervised-learning vae variational-autoencoder
Last synced: 09 Apr 2025
https://github.com/thecoderpinar/earthquake-explorer
🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.
data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data
Last synced: 23 Aug 2025
https://github.com/yashksaini-coder/meriskill
Data analytics extracts insights from raw data through statistical analysis and visualization. It empowers informed decision-making, guiding strategies and addressing challenges with a data-driven approach.
analytics data data-analysis data-science internship-task
Last synced: 20 Feb 2025
https://github.com/paulseperformance/jupyter-notebooks
Where I keep all my jupyter notebooks
bitcoin blockchain data-analysis data-visualization python
Last synced: 20 Feb 2025
https://github.com/neelsoumya/public_open_source_data_science
A repository of open source data science projects for social good
citizen-data-science citizen-science data-analysis data-science datascience datascience-social-good datascience-socialgood deep-learning machine-learning paper python social
Last synced: 25 Apr 2025
https://github.com/tsffarias/ibge-nomes-brasil
Análise de dados dos nomes no Brasil utilizando a base de dados disponibilizada pelo IBGE (2010)
Last synced: 05 Jul 2025
https://github.com/vandita2020/merra2_nasa_wind_speed_analysis
In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms
data-analysis data-science machine-learning-algorithms python
Last synced: 28 Feb 2025
https://github.com/bishopce16/stroke_prediction_analysis
The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.
data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau
Last synced: 26 Apr 2025
https://github.com/0mppula/element-compare
A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.
compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand
Last synced: 19 Jan 2026
https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python
Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.
data-analysis data-analysis-python data-analyst data-science excel python spreadsheet
Last synced: 27 Apr 2025
https://github.com/tillbiskup/labinform
Python components of the laboratory information system LabInform
data-analysis data-storage data-storage-infrastructure electronic-lab-notebook good-practices reproducible-research reproducible-science unique-identifier
Last synced: 06 Sep 2025
https://github.com/dataket/dataket
Propuesta de proyecto para el Datatón Anticorrupción 2021. Equipo Dataket
corruption data-analysis dataviz open-government
Last synced: 26 Dec 2025
https://github.com/emmanuel10701/matplotlib
Matplotlib
data-analysis data-science data-visualization matplotlib python
Last synced: 15 Oct 2025
https://github.com/tirendazacademy/data-sets
Data sets for Tirendaz Akademi Youtube
Last synced: 12 Sep 2025
https://github.com/reiniiriarios/squirrel-table
Desktop application to run MySQL queries over SSH and generate CSV and XLSX files. Useful for QA where queries need to be run repeatedly and files handed off.
csv csv-files data-analysis desktop-application electron excel mariadb mysql nodejs qa quality-assurance quality-control quality-control-assurance sql xlsx
Last synced: 11 Mar 2025
https://github.com/nirmalnishant645/python-programming
Basic Python Programs
algorithms algorithms-and-data-structures algorithms-datastructures big-data data-analysis data-cleaning data-mining data-mining-algorithms data-science data-structure data-structures datastructures-algorithms geeksforgeeks geeksforgeeks-python geeksforgeeks-solutions hackerearth hackerearth-python hackerearth-solutions python python3
Last synced: 05 Oct 2025
https://github.com/llnl/nddav
N-Dimensional Data Analysis and Visualization
data-analysis data-viz high-dimensional-data topological-data-analysis visual-analytics visualization
Last synced: 29 Apr 2025
https://github.com/bts-cm/airdrop_tool
Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.
airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc
Last synced: 17 Jan 2026
https://github.com/datalayer/desktop
Ξ 🖥️ Datalayer Destkop.
ai data data-analysis data-science datalayer desktop electron
Last synced: 25 Oct 2025
https://github.com/garrettj403/albertaenergysources
Get grid data from Alberta Electric System Operator (AESO)
alberta canada data-analysis energy-data
Last synced: 10 Oct 2025
https://github.com/joanmartin/uib-masterinbigdata
Master's degree of Big Data Analysis in Economics and Business
big-data data-analysis data-science igraph machine-learning machine-learning-algorithms matplotlib pandas python r sklearn
Last synced: 10 Oct 2025
https://github.com/mykhode/python-sic-mini-project
SAMSUNG SIC Finish Project Course - Python
data-analysis python-analysis samsung-sic
Last synced: 10 Oct 2025
https://github.com/roaldarbol/anibehavr
🪲 An R package for Analysis of Animal Behaviour
animal-behavior behavioural-states data-analysis r
Last synced: 11 Oct 2025
https://github.com/sayakpaul/datacamp-blogs
Jupyter notebooks of my DataCamp blogs
data-analysis data-science jupyter-notebooks machine-learning python sql
Last synced: 12 Oct 2025
https://github.com/wtbates99/stock-indicators
A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.
backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series
Last synced: 15 Oct 2025
https://github.com/nasdin/drone-strike-visualization
Connecting to Drone Strike API and performing an analysis on frequency of drone strikes, trend and patterns
analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization
Last synced: 16 Oct 2025
https://github.com/ihabbendidi/diamond-analysis
Exploratory statistical analysis of a Diamond dataset
data-analysis data-visualization exploratory-data-analysis machine-learning r
Last synced: 17 Oct 2025
https://github.com/vidhi1290/robust-yield-prediction-
"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."
artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing
Last synced: 19 Oct 2025
https://github.com/sherrisherry/cleandata
R Package "cleandata"
cran data-analysis data-mining machine-learning r r-package wrangling
Last synced: 22 Oct 2025
https://github.com/linuxto5re/gateiodatafilteration
Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.
arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies
Last synced: 23 Oct 2025
https://github.com/tushar2704/instagram-user-analytics
This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.
artificial-intelligence data-analysis data-science instagram project tushar2704
Last synced: 23 Jan 2026
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 25 Oct 2025
https://github.com/heiderjeffer/misalignment-between-ownership-and-contribution-affects-system-reliability
Research Proposals RP
archtecture data-analysis data-collection nvivo-software python qualitative-analysis quantative-analysis reliability-engineering software-engineering
Last synced: 27 Oct 2025
https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021
This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.
data data-analysis data-visualization powerbi sql-server
Last synced: 07 May 2025
https://github.com/louis-heraut/AEAG_toolbox
🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics
Last synced: 30 Oct 2025
https://github.com/artdgn/pages
Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages
covid-19 data-analysis modeling
Last synced: 17 Jan 2026
https://github.com/tixfeniks/batch-iterator
Usefull python implementation of batch iterator.
batch data-analysis data-analytics data-science iterate machine-learning mini-batch minibatch python
Last synced: 23 Oct 2025
https://github.com/tideland/go-cells
Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors
cep data-analysis data-stream event-processing events golang
Last synced: 05 Apr 2025
https://github.com/pnnl/archive_walker
Archive Walker Software to read and examine PMU data to detect events and conditions for further analysis.
data-analysis pmu synchrophasor
Last synced: 20 Mar 2025
https://github.com/AivanF/Lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 11 Apr 2025
https://github.com/depressioncenter/mden
Mobile technologies code from the University of Michigan's Mobile Data Experts Network (MDEN), featuring data cleaning automations, REDCap project templates, and links to useful external modules. [DOI: 10.6084/m9.figshare.25438714]
automation data-analysis data-cleaning fitness-tracker heart-rate-data mobile-data mobile-development mquery powerautomate powerbi powerquery python r sleep-data smartwatch-data tableau
Last synced: 08 Oct 2025
https://github.com/storopoli/r_scripts
Couple of handy R Scripts that I use in a daily basis for Scientific Research
data-analysis data-science data-visualization r scientific
Last synced: 08 Jul 2025
https://github.com/olgaele/playing-with-julia
Playing with data!
data data-analysis data-science julia statistics
Last synced: 16 Jul 2025
https://github.com/codeperfectplus/hands-on-machine-learning-with-scikit-learn-tensorflow-and-keras
Implementing different aspects of Machine learning in this Repository. Contributions are welcome
data-analysis feature-engineering feature-selection hacktoberfest machine-learning missing-value-treatment python3 scikit-learn tensorflow
Last synced: 13 May 2025
https://github.com/nizarassad/digits-recognition
This project is a machine learning classification task on MNIST using SVM and CNN algorithms
classification colab-notebook convolutional-neural-networks data-analysis data-science deep-learning jupyter-notebook machine-learning neural-network python support-vector-machines
Last synced: 07 May 2025
https://github.com/michaelnabil230/laravel-analytics
A Laravel package to retrieve pageviews and other data from Database
data-analysis data-structures database laravel php
Last synced: 25 Jan 2026
https://github.com/khuyentran1401/suicide-rates
data-analysis data-science kaggle machine-learning python
Last synced: 21 Mar 2025
https://github.com/elkronos/anovatoolbox
This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.
anova anova-model data-analysis r statistics
Last synced: 17 Mar 2025
https://github.com/aivanf/lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 28 Oct 2025
https://github.com/valeriopagliarino/tcf-2021-unito-public
Exam project of the course "Computing Tecniques for Physics" - Università degli Studi di Torino - Physics department - 2021
cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics
Last synced: 27 Mar 2025
https://github.com/crafterkolyan/applied-statistical-data-analysis
Курс прикладного статистического анализа данных. ВМК МГУ. Весна 2020
autoexec-scripts autotest data-analysis github-actions statistics university
Last synced: 17 Jun 2025
https://github.com/zackakil/hot-shot-basketball-tracker
Mini web app for displaying basketball practice metrics.
basketball chartjs data-analysis html sport visualization
Last synced: 28 Jan 2026
https://github.com/abdelmajidlh/eportfolio
ePortfolio Abdelmajid EL HOU
bioinformatics data data-analysis data-science data-visualization database datascience genetics
Last synced: 22 Mar 2025
https://github.com/bdslab-upv/dashi
A flexible and powerful Python toolkit for dataset shift analysis and characterization, providing supervised and unsupervised evaluation of temporal and multi-source data shifts, visualization tools, and statistical insights for data integrity and model performance monitoring
data-analysis data-science dataset-shift python temporal-analysis
Last synced: 13 Dec 2025
https://github.com/bradleyboehmke/uc-bana-6043
Additional resources for the UC BANA 6043 Statistical Computing course
data-analysis data-science data-visualization python
Last synced: 10 Jul 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/super-lou/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 13 Apr 2025
https://github.com/andreantonacci/eu2019
Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections
data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis
Last synced: 17 Jul 2025
https://github.com/lussierc/foodborneillnessdataanalysis
A data analysis of foodborne illnesses using R Scripting methods.
data-analysis database foodborne-disease-outbreaks foodborne-illnesses rstudio
Last synced: 27 Mar 2025
https://github.com/app-generator/devtool-db-introspection
Database Introspection Tool - Open-Source | AppSeed
data-analysis database-schema database-tool db-scan db-tool developer-tools peewee peewee-orm python-database python-datatypes python-db python-tool
Last synced: 17 Jul 2025
https://github.com/gallillio/webscraping-datacleaning-imdb_videogame_webscrapper
IMDb Web Scrapper using Python. The scraped data is automatically data cleaned to address missing or wrong data, it dynamically searches Steam and Google to fill in the gaps or correct the data, This tool streamlines the process of aggregating video game data, facilitating analysis and insights for enthusiasts, researchers, and developers alike.
automation data-analysis data-cleaning data-wrangling jupyter-notebook python web-scraping webscraping
Last synced: 27 Mar 2025
https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 13 Aug 2025
https://github.com/matiasdahl/osm-extract-amenities-r
data-analysis data-mining openstreetmap openstreetmap-data r
Last synced: 24 Dec 2025
https://github.com/romac/adaproject
🔬 Project proposal for the Applied Data Analysis course at EPFL
Last synced: 31 Dec 2025
https://github.com/quantumudit/analyzing-cleanaway-services
This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 25 Jan 2026
https://github.com/prabhupavitra/data-visualization-with-python
This repository houses data visualization with Python.
barplot data-analysis data-visualization datavisualization dotplot grouped-bar-chart heatmap matplotlib matplotlib-pyplot pandas python3 seaborn stacked-bar-chart
Last synced: 09 Sep 2025
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 13 Mar 2025
https://github.com/varunbanka/data-insights
Data Insights is a user-friendly tool for analyzing large CSV files. Its advanced analytics helps uncover hidden patterns and trends, making it perfect for data scientists and analysts.
artificial-intelligence automation data-analysis data-science dataanalysis datahive numpy pandas python
Last synced: 22 Jun 2025
https://github.com/magnaopus1/synthron-cfd-trader-pro
SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.
ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading
Last synced: 30 Apr 2025
https://github.com/mohammadkarbalaee/python-for-data-analysis-book
All the practice and code that I am doing while I read the book called, Python for data analysis
data-analysis data-science python
Last synced: 27 Mar 2025
https://github.com/ziaeemehr/cpp_workshop
Scientific programming toolbox with C++
cpp data-analysis data-science learning-by-doing programming scientific-computing telegram-channel youtube
Last synced: 14 Jul 2025
https://github.com/jaybird1291/anki-llm-review-stats-exporter
Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.
anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics
Last synced: 24 Dec 2025
https://github.com/jpvt/data_science
Portfolio with my Data Science Projects.
data-analysis data-science deep-learning machine-learning portfolio xgboost
Last synced: 19 Jun 2025
https://github.com/tanaylab/naryn
Native Access medical record Retriever for high Yield aNalytics
Last synced: 20 Jul 2025
https://github.com/juangesino/behaviouraleconomics
All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam
behavioral-economics behavioural-economics data-analysis economics game-theory statistics
Last synced: 28 Oct 2025
https://github.com/mutasim77/dbt-analytics
🍉 Repo for analytics engineering with dbt, transforming raw data into actionable insights.
big-query data data-analysis dbt warehouse
Last synced: 29 Oct 2025
https://github.com/grand-27-master/data-science-course
One-stop repo for learning data science along with roadmap!
data-analysis data-science machine-learning python statistics
Last synced: 24 Feb 2025
https://github.com/thecoderpinar/credit-card-fraud-detection-project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨
anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python
Last synced: 30 Apr 2025
https://github.com/qathom/crawlx
crawlx allows to analyze product data on Amazon. It is a simple and lightweight tool to act on product issues.
data-analysis electron es6-javascript vue vuejs2 vuex2 webpack
Last synced: 13 Sep 2025
https://github.com/omarelgabry/insights.py
A Python package for reading, storing, & analyzing data from Public Data APIs
Last synced: 14 Jul 2025
https://github.com/benjamindpb/wikidata-preprocessing
Wikidata dump preprocessing & analysis of georreferencial entities
data-analysis preprocessing wikidata wikidata-dump
Last synced: 15 Jul 2025
https://github.com/dicook/tutorial_make_better_data_plots
Materials for a workshop in June 2025
data data-analysis data-science data-visualization r statistical-graphics statistics
Last synced: 25 Jun 2025
https://github.com/brews/riverpca
Companion repository to Malevich S.B., and C.A. Woodhouse (2017), Pacific SSTs, mid-latitude atmospheric circulation, and widespread interannual anomalies in Western US streamflow, Geophys. Res. Lett., 44, doi:10.1002/2017GL073536.
analysis data-analysis paper pca python river streamflow visualization
Last synced: 29 Mar 2025
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 04 Mar 2025
https://github.com/louis-heraut/aeag_toolbox
🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics
Last synced: 16 Jun 2025
https://github.com/rikulauttia/ai-commercial-decisionmaking
AI-Driven Large Dataset Analysis & Commercial Decision-Making: Research on predictive analytics, machine learning strategies, and real-world business applications [Python, TensorFlow, PyTorch] 🤖📊
artificial-intelligence big-data business-intelligence business-strategy commercial-decision-making data-analysis data-science decision-making deep-learning machine-learning neural-networks predictive-analytics python research thesis
Last synced: 08 Sep 2025
https://github.com/imranr98/brokeetl
Parse transactions from bank statement PDFs into a JSON array.
automation bank-statement banking data-analysis data-mining data-ownership etl finance json lifestyle pdf pdf-converter tracking
Last synced: 30 Dec 2025
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 03 Apr 2025
https://github.com/ndleah/school-donation
💰 Top school donors analysis
cufflinks data-analysis data-science data-visualization dataset exploratory-analysis python python-library python3
Last synced: 02 Mar 2025
https://github.com/rickyxume/data_statistic_analysis
2021数据统计与分析大赛全国一等奖方案
data-analysis data-mining data-science
Last synced: 23 Feb 2025
https://github.com/pualien/py-gcloud-connectors
Utilities to simplify connection with Google APIs
bigquery data-analysis google-analytics google-analytics-4 google-cloud google-cloud-platform google-cloud-storage pandas python
Last synced: 14 Dec 2025
https://github.com/ac-gomes/data-engineering-with-databricks
A simple boilerplate for data engineering and data analysis training in Databricks.
data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing
Last synced: 30 Apr 2025
https://github.com/dwhitena/daal-go
Use Intel's DAAL from Go
artificial-intelligence big-data data-analysis data-science machine-learning
Last synced: 13 Sep 2025
https://github.com/maximtrp/scikit-na
Missing Data Analysis in Python
analysis data-analysis data-science data-visualization missing-data missing-values pandas python statistics visualization
Last synced: 19 Jan 2026
https://github.com/surajv311/data_analysis-food_recipes_ds
Data preprocessing, cleaning, <Analysis> & plotting 📊 of Food Recipies Dataset (from Kaggle). 🐍 Libraries used: Pandas, Matplotlib, Seaborn, Plotly.📈
data-analysis kaggle-dataset matplotlib numpy pandas plotly seaborn
Last synced: 28 Dec 2025
https://github.com/alhankeser/citibike-analysis
Extracting and Transforming Citi Bike Data for Analysis
citibike data-analysis data-science data-visualization etl sql
Last synced: 25 Jan 2026
https://github.com/louis-heraut/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with parametrisation file.
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 16 Jun 2025