Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-05 00:07:22 UTC
- JSON Representation
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 19 Dec 2024
https://github.com/fbecerra/fbecerra.github.io
Source code for my website www.fernandobecerra.com
data-analysis data-science data-visualization dataviz interactive-visualizations
Last synced: 27 Oct 2024
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 22 Jan 2025
https://github.com/shipyardapp/postgresql-blueprints
Simplified blueprints for building data pipelines with PostgreSQL.
cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql
Last synced: 04 Dec 2024
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 15 Jan 2025
https://github.com/muzammil-13/mimlrepo
Data Analysis using Python Machine Learning Libraries
data-analysis data-science machine-learning numpy pandas python python-library
Last synced: 16 Jan 2025
https://github.com/BigBangData/TimesheetAnalysis
R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.
bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management
Last synced: 04 Dec 2024
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 18 Oct 2024
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 20 Dec 2024
https://github.com/analyticalnahid/numpy-tutorial
A complete Notebook on Numpy for Data Science
data-analysis numpy numpy-tutorial
Last synced: 02 Feb 2025
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 06 Jan 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 14 Jan 2025
https://github.com/ikanurfitriani/project-data-analysis-python
This repository contains the results of data analysis learning using the Python.
data-analysis data-analysis-project data-analysis-python python
Last synced: 26 Jan 2025
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 06 Jan 2025
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 12 Dec 2024
https://github.com/arv-anshul/campusx-project-notebooks
Capstone project by Campusx in DSMP course.
campusx campusx-dsmp data-analysis data-science eda jupyter-notebook machine-learning ml-project nlp project python3 recommender-system regression streamlit
Last synced: 25 Dec 2024
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs
Last synced: 24 Dec 2024
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 04 Dec 2024
https://github.com/leewannacott/datacamp-projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 30 Jan 2025
https://github.com/jpcadena/onemetric-plus
OneMetric+ project for analytical tool on demand forecast and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 15 Jan 2025
https://github.com/winter000boy/dsa-practice
This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.
data-analysis data-science leetcode leetcode-python pandas-python python3
Last synced: 30 Jan 2025
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 02 Jan 2025
https://github.com/atxtechbro/flightradar24
Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.
api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping
Last synced: 25 Jan 2025
https://github.com/sn2606/global-temperature-time-series
Time series analysis is performed on the Berkeley Earth Surface Temperature dataset.
arima arima-forecasting arima-model climate-change data-analysis data-visualization forecasting-model global-temperature series-analysis singular-spectrum-analysis time-series time-series-analysis time-series-forecasting
Last synced: 25 Jan 2025
https://github.com/viper373/baidutieba
爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)
baidutieba-crawler bert data-analysis deep-learning python spider
Last synced: 05 Feb 2025
https://github.com/mwoss/mlflow-stock-market-example
Stock market prediction - machine learning pipeline using MLFlow.
anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial
Last synced: 24 Jan 2025
https://github.com/redeboer/nikhefproject2018
beam beam-analysis data-analysis detector nikhef proton proton-therapy
Last synced: 04 Jan 2025
https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena
This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.
artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning
Last synced: 11 Jan 2025
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/worst001/note_machine_learning
整理了机器学习相关资料与手册,包括数学基础、机器学习模型实现示例、神经网络。
ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook
Last synced: 12 Jan 2025
https://github.com/quantumudit/analyzing-gamerevolution-games
This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-quotes
This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/walidalsafadi/house-prices
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.
cross-validation data-analysis data-science data-visualization decision-trees eda house-price-prediction house-prices jupyter linear-regression machine-learning machine-learning-algorithms mlp-regressor plot python random-forest-regression regression svr xgboost-regression
Last synced: 22 Jan 2025
https://github.com/jamesquinlan/intro-stats-mat150
Introduction to Statistics
data-analysis statistics university-course
Last synced: 18 Dec 2024
https://github.com/enricocid/monitoraggio-vaccini-italia
Sito web statico per github.com/apalladi/covid_vaccini_monitoraggio
covid-19 covid-19-data covid-19-data-analysis data data-analysis data-visualization dataset python python3 python37 sars-cov-2
Last synced: 17 Jan 2025
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 23 Jan 2025
https://github.com/rapidsurveys/oldr
An Implementation of the Rapid Assessment Method for Older People (RAM-OP)
assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment
Last synced: 24 Dec 2024
https://github.com/pratishtha-abrol/astronomy-dataanalysis
A key technique in Data Driven Astronomy
astronomy astropy crossmatch data-analysis
Last synced: 05 Feb 2025
https://github.com/gjbex/python-dashboards
Repository that contains material for training sessions on creating dashboards using Python.
dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization
Last synced: 22 Nov 2024
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 09 Jan 2025
https://github.com/wittline/data-analytics-with-r
Repository for data analytics course using R
cassandra-database cql data-analysis genetic-algorithm pentaho-data-integration r
Last synced: 29 Jan 2025
https://github.com/andr3w03/bike-sharing-dashboard
Bike Sharing Data Analysis Streamlit Dashboard
dashboard data-analysis data-visualization python streamlit
Last synced: 29 Jan 2025
https://github.com/reddyprasade/pandas-practice
Pandas
daat data-analysis data-science flexible labeling missing-data missing-values pandas pandas-profiling
Last synced: 19 Jan 2025
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 01 Jan 2025
https://github.com/yahia3200/become-an-independent-data-scientist
My final project for the Applied Plotting, Charting & Data Representation in Python Course
data-analysis data-science data-visualization matplotlib
Last synced: 22 Jan 2025
https://github.com/adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
apache-parquet csv csv-export data-analysis data-science database datavisualization dataviz duckdb duckdb-database end-of-life endoflife eol jupyter-notebook kaggle kaggle-notebook olap python release-policy release-schedule
Last synced: 12 Dec 2024
https://github.com/gustavohnsv/teamwork_mqa
Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.
data-analysis group-project r team-repo
Last synced: 16 Dec 2024
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn
Last synced: 09 Dec 2024
https://github.com/antononcube/wl-outlieridentifiers-paclet
Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.
data-analysis hampel outlier-detection outliers
Last synced: 15 Dec 2024
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 23 Dec 2024
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 05 Feb 2025
https://github.com/iraikov/chicken-dataframe
Tabular data structure for data analysis in Scheme
chicken-scheme chicken-scheme-eggs data-analysis dataframe linear-regression scheme scheme-programming-language
Last synced: 30 Jan 2025
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 14 Jan 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 30 Jan 2025
https://github.com/quantumudit/analyzing-goodreads-famous-quotes
This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/cego669/datathonengopevi
Equipe: Embrapeiros. Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Obs: FOMOS CAMPEÕES!!!!!!!!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 08 Dec 2024
https://github.com/shivamswarnkar/tesla-stock-prediction
Making prediction of close prices of Tesla Stocks using different regression methods.
data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction
Last synced: 26 Jan 2025
https://github.com/olow304/goboard
Python Data Analysis Dashboard using Public Dataset, Django
dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn
Last synced: 04 Jan 2025
https://github.com/al-ghaly/airline-company-data-warehouse
Data Warehouse modeling, design, implementation, and analysis for an Airline Company.
data-analysis data-warehousing database-modeling sql-server
Last synced: 22 Jan 2025
https://github.com/stoverc/slots
A collection of slots-related code (initially in Python3, but perhaps more later)
data-analysis data-science monte-carlo-simulation probabilistic-programming probability probability-theory python3 slot-machine slots statistical-analysis statistics
Last synced: 11 Jan 2025
https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics
Last synced: 01 Feb 2025
https://github.com/yusufcinarci/covid-19-data-analysis-visualization
The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.
covid-19-data-visualization data-analysis data-science data-visualization
Last synced: 26 Dec 2024
https://github.com/goggle/dataisbeautiful
Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.
data-analysis data-visualisation jupyter-notebook notebook reddit
Last synced: 26 Dec 2024
https://github.com/vandita2020/merra2_nasa_wind_speed_analysis
In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms
data-analysis data-science machine-learning-algorithms python
Last synced: 11 Jan 2025
https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql
This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.
data-analysis python retail sql sql-server sqlalchemy
Last synced: 05 Feb 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 24 Jan 2025
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 26 Dec 2024
https://github.com/joanacmbarros/ardm-website
Website to support the R in Pharma 2023 workshop on the ARDM
analysis-results automation clinical-data data-analysis data-model r-in-pharma
Last synced: 16 Dec 2024
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 04 Dec 2024
https://github.com/grburgess/gbm_kitty
Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.
3ml catalogue data-analysis fermi-science grbs pipelines
Last synced: 23 Jan 2025
https://github.com/thecoderpinar/credit-card-fraud-detection-project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨
anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python
Last synced: 16 Dec 2024
https://github.com/thecoderpinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 16 Dec 2024
https://github.com/thecoderpinar/gen-expression
Gene expression analysis is a fundamental component of genomics research, providing valuable insights into how genes are regulated and their impact on various biological processes. This project delves into the realm of gene expression data, aiming to uncover hidden patterns and relationships within complex datasets. 🚀
bioinformatics biotechnology data-analysis data-science data-visualization genomics kaggle machine-learning pca python
Last synced: 16 Dec 2024
https://github.com/adirthaborgohain/community-data-analysis
Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.
Last synced: 05 Feb 2025
https://github.com/thecoderpinar/hms-brainactivity-analysiss
Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.
critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing
Last synced: 16 Dec 2024
https://github.com/verbasik/yandex.practicum.datascience
Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 10 Jan 2025
https://github.com/supertetelman/kaggle-public
A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.
cnn data-analysis deep-learning machine-learning matlab python
Last synced: 28 Jan 2025
https://github.com/hyperspy/holospy-demos
HoloSpy Jupyter Notebook demos
data-analysis data-visualization electron-holography hyperspy materials-science multi-dimensional physical-sciences tutorial
Last synced: 19 Jan 2025
https://github.com/timzatko/fifa-19-dataset-machine-learning
Player's value prediction and game position classification on FIFA 19 dataset.
data-analysis fifa19 machine-learning scikit-learn
Last synced: 03 Jan 2025
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 24 Jan 2025
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 04 Dec 2024
https://github.com/manmolecular/http-response-clustering
:chart_with_downwards_trend: Clustering of HTTP responses using k-means++ and the elbow method
data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3
Last synced: 16 Jan 2025
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 03 Feb 2025
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 12 Jan 2025
https://github.com/chrdek/linqdatacalc
📈 🎲 Linq based data statistics set of extensions.
calculations calculator data-analysis data-analytics data-science data-statictics extension-methods extensions linq linq-extensions set-theory statistical-analysis statistics
Last synced: 29 Jan 2025
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 16 Dec 2024
https://github.com/uts58/international-student-job-insights-usa
Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.
career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa
Last synced: 25 Dec 2024
https://github.com/bkataru/physics-ia
Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the hipparcos catalogue.
astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting
Last synced: 22 Dec 2024
https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring
A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.
data-analysis patient-monitoring
Last synced: 17 Dec 2024
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 01 Jan 2025
https://github.com/deep-diver/enron-data-analysis
Data Analysis and Machine Learning on Enron Data
data-analysis enron-data exploratory-data-analysis machine-learning
Last synced: 05 Feb 2025
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 16 Dec 2024
https://github.com/virajbhutada/tableau-data-vizzes
Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.
business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights
Last synced: 10 Jan 2025
https://github.com/quantumudit/demographic-data-analysis
This project focuses on analyzing and finding correlations between the three important metrics by 195 countries,i.e., birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 26 Dec 2024
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 04 Dec 2024
https://github.com/thennen/py-ivtools
A package for flexible and reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 24 Jan 2025