Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with data-exploration

A curated list of projects in awesome lists tagged with data-exploration .

https://github.com/kanaries/pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

data-analysis data-exploration dataframe matplotlib pandas plotly tableau tableau-alternative visualization

Last synced: 16 Dec 2024

https://github.com/Kanaries/pygwalker

PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis

data-analysis data-exploration dataframe matplotlib pandas plotly tableau tableau-alternative visualization

Last synced: 29 Oct 2024

https://github.com/sfu-db/dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.

apis apiwrapper cleaning connector data-exploration data-science datacleaning dataconnector dataprep datapreparation eda exploratory-data-analysis webconnector

Last synced: 17 Dec 2024

https://github.com/comet-ml/kangas

🦘 Explore multimedia datasets at scale

data-analysis data-exploration dataframe datagrid machine-learning

Last synced: 18 Dec 2024

https://github.com/keen/explorer

Data Explorer by Keen - point-and-click interface for analyzing and visualizing event data.

analysis analytics analytics-api charts data-exploration data-visualization dataviz keen-io native-analytics web-analytics

Last synced: 21 Dec 2024

https://github.com/tkrabel/edaviz

edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab

altair data-analysis data-exploration data-sciene data-visualization eda edaviz exploratory-data interactive jupyter-notebook matplotlib pandas plotly project-jupyter pyhon qgrid seaborn

Last synced: 18 Dec 2024

https://github.com/rolkra/explore

R package that makes basic data exploration radically simple (interactive data exploration, reproducible data science)

data-exploration data-visualisation decision-trees eda r rmarkdown shiny tidy

Last synced: 21 Dec 2024

https://github.com/facultyai/lens

Summarise and explore Pandas DataFrames

dask data-exploration data-science data-visualisation dataframe pandas

Last synced: 08 Nov 2024

https://github.com/Desbordante/desbordante-core

Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.

anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data

Last synced: 04 Nov 2024

https://github.com/renumics/sliceguard

A library for detecting problematic data segments in structured and unstructured data with few lines of code.

data-analysis data-cleaning data-curation data-exploration data-science data-visualization deep-learning eda exploratory-data-analysis machine-learning python visualization

Last synced: 27 Oct 2024

https://github.com/DistrictDataLabs/cultivar

Multidimensional data explorer and visualization tool.

data-analysis data-exploration data-management visualization

Last synced: 08 Nov 2024

https://github.com/districtdatalabs/cultivar

Multidimensional data explorer and visualization tool.

data-analysis data-exploration data-management visualization

Last synced: 11 Nov 2024

https://github.com/virajbhutada/powerbi-projects-collection

Discover a curated collection of dynamic Power BI dashboards covering financial analytics, HR metrics, streaming service trends, real estate dynamics, and more. Meticulously designed for comprehensive data exploration, this repository continues to expand with new and impactful visualizations.

analytical-insights data-analytics data-exploration data-visualization dynamic-dashboards healthcare-analysis hr-management powerbi trends-visualization visual-reporting

Last synced: 11 Nov 2024

https://github.com/DECODEproject/bcnnow

Light, personalized, interactive dashboards for urban data exploration.

data-exploration data-visualization urban-dashboards

Last synced: 03 Nov 2024

https://github.com/ndleah/health-analysis

This case study is contained within the Serious SQL course by Danny Ma

data-analysis data-exploration data-with-danny database serious-sql sql

Last synced: 13 Nov 2024

https://github.com/darenasc/aeda

Build a data catalog by running a single line of code

data-catalog data-exploration database eda metadata metadata-extraction

Last synced: 27 Oct 2024

https://github.com/aiguofer/sql_connectors

A simple wrapper for SQL connections using SQLAlchemy and Pandas read_sql to standardize SQL workflow with multiple data sources.

data-analysis data-analytics data-exploration data-science pandas relational-databases sql sqlalchemy standardized-api

Last synced: 26 Oct 2024

https://github.com/jgphilpott/polyplot

A data exploration application inspired by Ola Rosling's Trendalyzer software.

d3js data-exploration data-science ola-rosling threejs trendalyzer

Last synced: 21 Nov 2024

https://github.com/darenasc/auto-fes

Automated exploration of files in a folder structure to extract metadata and potential usage of information.

data-exploration data-profiling data-science plain-text python

Last synced: 27 Oct 2024

https://github.com/virajbhutada/us-healthcare-analysis-powerbi

Unlock insights into the U.S. healthcare landscape from 2019 to 2020. Our PowerBI-driven analysis delves into hospital performance, patient outcomes, and payer-provider dynamics. Dive into detailed reports and visualizations for informed decision-making, empowering healthcare stakeholders, and shaping the industry's future.

data-analytics data-exploration data-modeling data-visualization datascience dax-expression decision-making healthcare-analysis healthcare-datasets insights interactive-visualizations microsoftpowerbi power-query powerbi powerbi-dashboards powerbi-desktop strategic-planning

Last synced: 11 Nov 2024

https://github.com/revogati/ecommerce_consumer_behaviour

This is a Full Data Analytics project From data cleaning, preparation, exploration, Interpretation of insights up to Presentation of findings and recommendations..

data-analysis data-exploration ecommerce jupyter-notebook python sql tableau-public visualization

Last synced: 10 Nov 2024

https://github.com/cnag-biomedical-informatics/pheno-ranker

Pheno-Ranker is a tool designed for performing semantic similarity analysis on phenotypic data structured in JSON format, such as Beacon v2 Models or Phenopackets v2.

beacon-v2 bff csv data-exploration json phenopackets-v2 pxf semantic-similarity semantic-similarity-measures

Last synced: 05 Nov 2024

https://github.com/cnag-biomedical-informatics/pheno-ranker-ui

The web ui (R-Shiny application) for Pheno-Ranker, a tool designed for performing semantic similarity analysis on phenotypic data structured in JSON format, such as Beacon v2 Models or Phenopackets v2

beacon-v2 clinical-data data-exploration json phenopackets-v2 r semantic-similarity semantic-similarity-measures shiny

Last synced: 17 Dec 2024

https://github.com/noeyislearning/coffee-chain-sales

Explore valuable insights into the performance of a coffee chain across various locations, including key attributes such as Area Code, COGS, Profit, Sales, and more. Dive into sales trends, financial performance, and market dynamics.

data-exploration data-science data-visualization exploratory-data-analysis jupyter-notebook market-segmentation-analysis matplotlib product-performance profitability-analysis python3 seaborn

Last synced: 06 Dec 2024

https://github.com/phillipdupuis/mbta-api-playground

Learn about the MBTA V3 API by building queries and exploring the results

data-exploration django django-rest-framework mbta-api pandas pandas-dataframe pandas-profiling python

Last synced: 21 Dec 2024

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 22 Nov 2024

https://github.com/anushadatta/airbnb-in-seattle

🏨 Understanding the Airbnb rental landscape in Seattle using data science.

airbnb data-analysis data-exploration data-visualization datascience sentiment-analysis

Last synced: 11 Dec 2024

https://github.com/noeyislearning/global-stock-price-archive

A comprehensive dataset that provides a historical record of stock prices from a wide range of stock markets across the globe. This dataset is a valuable resource for researchers, investors, and analysts seeking to analyze trends, perform financial research, or develop trading strategies.

correlation-analysis currency-exchange data-cleaning data-exploration data-science data-visualization jupyter-notebook matplotlib python3 seaborn

Last synced: 06 Dec 2024

https://github.com/vidhi1290/deep-learning-for-eeg-emotion-classification

This repository contains a Python code script for performing emotion classification using EEG (Electroencephalogram) data. Emotion classification from EEG signals is an important application in neuroscience and human-computer interaction. The code leverages deep learning techniques to analyze EEG data and predict emotional states.

coorelation data-exploration data-preprocessing data-science data-visualization deep-learning deep-learning-algorithms eeg-emotion-recognition egg-signals emotion-distribution emotion-prediction feature-analysis heatmap human-emotions machine-learning machine-learning-algorithms pie-chart spectral-analysis time-series-visualization

Last synced: 08 Dec 2024

https://github.com/abdoomohamedd/cifake-real-and-ai-generated-differentiation

The CIFAKE project is a comprehensive effort to develop and implement techniques for distinguishing between AI-generated images and real images. This project leverages a combination of preprocessing techniques, feature extraction, machine learning, and deep learning models to accurately classify images.

cnn data-exploration deep-learning feature-extraction image-preprocessing machine-learning preprocessing python

Last synced: 06 Nov 2024

https://github.com/mpolinowski/hotel-booking-dataset

Python Pandas Dataset Exploration with Hotel Demand Data.

data-exploration hotel-booking pandas python

Last synced: 30 Nov 2024

https://github.com/cloaky233/dataexploration

Data Exploration with Pandas: A hands-on guide to mastering data manipulation techniques. Covers column selection, table joins, and more. Ideal for beginners and intermediate users looking to enhance their pandas skills. Includes practical examples and clear explanations. More features coming soon!

data-exploration pandas python

Last synced: 19 Nov 2024

https://github.com/sayamalt/cyberbullying-classification-using-fine-tuned-distilbert

Successfully fine-tuned a pretrained DistilBERT transformer model that can classify social media text data into one of 4 cyberbullying labels i.e. ethnicity/race, gender/sexual, religion and not cyberbullying with a remarkable accuracy of 99%.

cyberbullying-detection data-exploration distilbert-model exploratory-data-analysis fine-tune-bert-tensorflow llm model-inference model-training-and-evaluation multiclass-classification natural-language-processing text-classification text-preprocessing text-tokenization

Last synced: 07 Nov 2024

https://github.com/jdacoello/data-camp

Data Camp projects. Solutions to practical exercises (www.datacamp.com)

data-exploration jupyter-notebook

Last synced: 16 Nov 2024

https://github.com/chahelgupta/dep-videogames-dataset

The data extraction and processing involved thorough exploration, preprocessing, and visualization of the "Video Game Sales with Ratings" dataset.

data-analysis data-exploration data-extraction data-preparation data-preprocessing data-processing data-science data-visualization

Last synced: 18 Nov 2024

https://github.com/chahelgupta/fitness-data-analysis-r-project

This project focuses on analyzing fitness data collected from various tracking devices to gain insights into users' activity levels, sleep patterns, calorie expenditure, and heart rate. The dataset used in this project consists of multiple CSV files, each containing different aspects of fitness-related data.

data-analysis data-cleaning data-exploration data-science data-visualization r r-language r-programming r-studio

Last synced: 18 Nov 2024

https://github.com/uhpoler/waterqualityprediction-coursework

This is my coursework for the fourth semester of the first year at KPI, in which I implemented a system for predicting water quality using various machine learning models.

data-exploration data-preprocessing dtc justification knn machine-learning quality-prediction rfc

Last synced: 07 Dec 2024

https://github.com/muneeb706/human_activity_recognition

This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.

data-analysis data-cleaning data-exploration r-programming

Last synced: 04 Dec 2024

https://github.com/controldata23/automobiles-data-exploration

An Exploratory Data Analysis done on an Automobiles dataset from kaggle

data-exploration data-visualization eda jupyter-notebook matplotlib python-data-analysis

Last synced: 19 Dec 2024

https://github.com/vidhi1290/hr_employee_prediction

"Welcome to the HR Employee Promotion Prediction project! This repository contains the code and resources for a machine learning project that focuses on predicting employee promotions. By analyzing various employee attributes, this project aims to provide valuable insights for HR decision-making and talent recognition within organizations.

data-exploration data-science data-visualization docker hr-employee-prediction hyperparameter-tuning machine-learning matplot model-building numpy pandas scikit-learn seaborn streamlit streamlit-webapp

Last synced: 08 Dec 2024

https://github.com/sunsided/itis

ITIS database exploration

data-exploration graphs itis

Last synced: 20 Dec 2024

https://github.com/lijesh010/netflix_dataset_exploratory_data_analysis_python_project

This repository contains an Exploratory Data Analysis (EDA) Python project on the Netflix dataset. The purpose of this project is to gain insights and better understand the characteristics of the content available on Netflix, including movies and TV shows.

data-analysis data-exploration data-visualization exploratory-data-analysis jupyter-notebook python

Last synced: 09 Dec 2024

https://github.com/lijesh010/roadaccidentanalysisproject

This data analysis project was completed using MS Excel, and includes the creation of a dashboard.

data data-analytics data-exploration data-visualization msexcel

Last synced: 09 Dec 2024

https://github.com/samuelsoaress/wkd-default-reduction

reduction of default from 35% to 25% or less with machine learning techniques

data-analysis data-exploration data-science machine-learning-algorithms

Last synced: 10 Nov 2024

https://github.com/controldata23/shopping-data-from-istanbul

This analysis is an EDA done on Istanbul Shopping dataset from kaggle.

data-analysis-python data-cleaning data-exploration descriptive-statistics eda jupyter-notebook

Last synced: 19 Dec 2024

https://github.com/controldata23/population-of-countries

An Exploratory Data Analysis done on a Countries dataset from kaggle

data-analysis-python data-cleaning data-exploration eda jupyter-notebook pandas

Last synced: 19 Dec 2024

https://github.com/nabilshadman/spark-sql-stock-prices-exploration

An exploration of stock prices of Amazon, Google, and Tesla using Spark SQL

data-exploration finance pyspark

Last synced: 17 Dec 2024

https://github.com/juzershakir/predicting_boston_housing_prices

Builded a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price that a client can sell their house utilizing machine learning.

bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree

Last synced: 09 Oct 2024

https://github.com/giatraskon/sandbox.bio-solutions

Bash scripts replicating the commands from sandbox.bio's interactive bioinformatics tutorials, organized by categories such as Data Exploration, File Formats, Quality Control, and Data Analysis.

bam-files bash bed-files bioinformatics bioinformatics-workflows command-line-tools computational-biology data-analysis data-exploration data-wrangling fasta-files fastq-files file-formats genomic-data quality-control sandbox-bio sandbox-bio-tutorials sequence-alignment unix-shell variant-calling

Last synced: 13 Dec 2024

https://github.com/johannaschmidle/ufo-project

Exploring the Relationship Between UFOs, Location, Time, and Human Emotion [SQL, Python]

cluster-analysis data-exploration eda k-means-clustering location-analysis nltk-python sentiment-analysis time-analysis ufo-sightings wordcloud

Last synced: 12 Nov 2024