An open API service indexing awesome lists of open source software.

Data visualization

Data visualization is the visual depiction of data through the use of graphs, plots, and informational graphics. Its practitioners use statistics and data science to convey the meaning behind data in ethical and accurate ways.

https://github.com/nicovandenhooff/wids-datathon-2022

This repository contains my solution for the 2022 Women in Data Science Kaggle competition, which earned a top 10% leaderboard standing.

catboost data-visualization datascience energy-consumption ensemble-learning exploratory-data-analysis kaggle lightgbm machine-learning scikit-learn women-in-data-science xgboost

Last synced: 07 May 2026

https://github.com/citizenlabsgr/openbudgetkw

Promoting a deeper understanding of the Kentwood, MI city budget process and city priorities.

csv data-visualization jade javascript react

Last synced: 07 May 2026

https://github.com/leandrocollares/employment-insurance-beneficiaries

A responsive line chart that shows regular Employment Insurance beneficiaries in Canada between 2019 and 2021

d3 data-visualization svelte

Last synced: 07 May 2026

https://github.com/danyal-faheem/project-logs-analyzer

This repo contains scripts to analyze project logs and display some charts related to the data

data data-visualization matplotlib pandas python streamlit

Last synced: 07 May 2026

https://github.com/amarlearning/exploring-67-years-of-lego

In this project, I explore a database of every LEGO set ever built.

data-manipulation data-visualization importing-and-cleaning-data jupyter-notebook pandas python

Last synced: 07 May 2026

https://github.com/cnoret/hexa-watts

Interactive data visualization and machine learning app for energy consumption analysis and prediction in France, built with Streamlit. (Text in French)

data-visualization electricity-forecasting energy-analysis france machine-learning scikit-learn streamlit

Last synced: 07 May 2026

https://github.com/muthukumar0908/-singapore-resale-flat-prices-predicting

This project develops a machine learning model and deploys it as a user-friendly web application that predicts the resale prices of flats in Singapore.

data-analysis data-visualization mechine-learing plotly python streamlit

Last synced: 07 May 2026

https://github.com/sammdu/global-warming-hurricane-typhoon

The Effect of Global Warming on Hurricane and Typhoon Occurrence

data-science data-visualization global-warming hurricane-data

Last synced: 07 May 2026

https://github.com/satyam4229/identify-employee-attrition

This model predicts employee attrition at a company from employee records. The dataset includes features such as salary, environment, age, gender, and experience.

data-analysis data-science data-visualization jupyter-notebook kaggle python

Last synced: 08 May 2026

https://github.com/nishumehta/sales-analysis-project

This project aims to analyze sales performance using Excel, SQL, Python, Tableau, and Power BI. The goal is to extract insights from sales data, identify trends, and visualize key performance indicators (KPIs).

data-cleaning data-visualization eda excel matplotlib-pyplot pandas python3 tableau-dashboards

Last synced: 08 May 2026

https://github.com/ropaxyz/octobot-octopus-energy-discord-bot

A Discord bot for Octopus Energy users to track and visualize their energy consumption. Integrates with Octopus Energy's API to fetch and display personalized energy data, costs, and usage charts.

asyncio data-visualization discord-bot energy-monitoring graphql matplotlib octopus-energy octopus-energy-api python rest-api sqlite

Last synced: 08 May 2026

https://github.com/femincan/d3-scatterplot-graph

My solution for the Visualize Data with a Scatterplot Graph project on FCC.

css3 d3js data-visualization html5 javascript

Last synced: 08 May 2026

https://github.com/akansharajput280799/data-exploration-and-visualization-for-streaming-platform

Data analysis and visualization for a streaming platform, providing insights and recommendations to grow its user base.

data-visualization excel exploratory-data-analysis matplotlib numpy python seaborn

Last synced: 08 May 2026

https://github.com/dsaikiran01/seismomap

An interactive React + Leaflet web app that visualizes real-time global earthquakes from the USGS API with live map markers, magnitude filtering, dark mode, and responsive UI.

data-visualization earthquakes geoscience material-ui react react-leaflet tailwindcss usgs-api vite

Last synced: 08 May 2026

https://github.com/abhash-rai/regression-car-price-prediction

This repository contains my first complete data science project, from web scraping for data through data preprocessing, cleaning, exploratory data analysis, model training, and deployment.

data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression

Last synced: 08 May 2026

https://github.com/js-konda/naturaldisasterseda

The project repository for the exploratory data analysis of natural disasters, done as part of the ECE143 course at UCSD

data-science data-visualization pandas python visualization

Last synced: 08 May 2026

https://github.com/samjoesilvano/password_strength_prediction_using_nlp

Developed a predictive model to categorize passwords as Strong, Good, or Weak, enhancing security and reducing breach risks. The project involves cleaning and analyzing data from an SQL database, using the TF-IDF technique for transformation, and implementing a Logistic Regression model to achieve accurate classifications.

data-analysis data-classification data-cleaning data-visualization logistic-regression machine-learning natural-language-processing pandas password-security password-strength python scikit-learn sql tf-idf

Last synced: 08 May 2026

https://github.com/tsear/reddit-discourse-project

Mapping emotional and conceptual discourse across Reddit philosophy communities.

data-visualization emotion-detection network-analysis nlp pandas reddit-api sentiment-analysis spacy text-mining tf-idf topic-modeling

Last synced: 08 May 2026

https://github.com/vanshuchaudhary/flightpriceanalysis-

The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.

business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis

Last synced: 08 May 2026

https://github.com/hemangsharma/hotel-revenue-booking-analysis

This project provides a comprehensive revenue and reservation analysis for Highfield Hotel using historical data exported from booking systems and internal revenue reports. The goal is to derive actionable insights to improve room profitability, understand booking patterns, and support data-driven decision-making.

analysis data-analysis data-visualization hotel

Last synced: 10 Aug 2025

https://github.com/hermann-web/projet-rythmes-urbains-heetch

Heetch's "Rythmes Urbains" project analyzes driver data to understand traffic patterns in Casablanca. It identifies peak hours, main driver-concentration areas, and daily traffic variations. The project involves geospatial mapping, time series analysis, and R programming

casablanca data-mapping data-visualization geospatial-analysis heetch hur open-data r-programming time-series-analysis traffic-analysis transportation urban-mobility

Last synced: 19 Mar 2026

https://github.com/asuquoaa/ann_arbor_weather_analysis_2005-2015

This project analyzes historical weather data from Ann Arbor, Michigan, collected by the National Centers for Environmental Information (NCEI) Global Historical Climatology Network daily (GHCNd).

data-cleaning-and-preprocessing data-visualization

Last synced: 03 Apr 2025

https://github.com/sayamalt/quora-duplicate-question-pairs-identification

Successfully developed a machine learning model that can accurately detect whether any given pair of Quora questions is a duplicate.

data-visualization machine-learning natural-language-processing nltk paraphrase-detection text-preprocessing

Last synced: 09 Nov 2025

https://github.com/angchekar28/valorant-gameplay-analysis

This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.

data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python

Last synced: 12 Apr 2026

https://github.com/theanujsinha01/mcdonalds-customer-analysis

This project analyzes customer feedback data to understand what drives people to like or dislike McDonald’s. Using Python and data visualization tools in a Jupyter Notebook, we explore how different factors—such as taste, price, health, and visit frequency—affect customer satisfaction.

case-study data data-visualization dataanalysis

Last synced: 05 Sep 2025

https://github.com/sayamalt/superstore-sales-prediction

Successfully established a machine learning model that can accurately predict the sales of a superstore based on various features such as quantity, profit, discount, postal code, etc. The features are mainly associated with order details and customer demographics.

azure-machine-learning azure-web-app-service cicd-deployment cross-validation data-cleaning-and-preprocessing data-visualization exploratory-data-analysis feature-engineering github-actions-ci-cd hyperparameter-tuning machine-learning model-deployment model-retraining model-testing model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/sayamalt/concrete-strength-prediction

Successfully developed a machine learning model which can accurately predict the strength of cement based on various features such as blast furnace slag, water, coarse aggregate, etc.

cross-validation data-visualization exploratory-data-analysis feature-engineering hyperparameter-optimization machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/vincent-tran-94/Dataviz_Tweets_ChatGPT

A Streamlit application for analyzing and visualizing data and tweets about the release of ChatGPT. The project covers data management, sentiment analysis, emerging trends, and potential applications of ChatGPT.

data-management data-visualization sentiment-analysis streamlit text-mining twitter

Last synced: 10 Aug 2025

https://github.com/camara94/data_analyse_series_temporelles

In this tutorial, we work through the following: 1. Read the Microsoft data using the Pandas Data reader package 2. Get the maximum price of the stock from 2017 to 2022 3. What is the date of the stock's highest price? 4. What is the date of the stock's lowest price?

data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting

Last synced: 09 Apr 2025

https://github.com/saikiran76/titanicdata-analysis-eda

In this notebook, we're going to analyse the famous Titanic dataset from Kaggle. The dataset is meant for supervised machine learning, but we're only going to do some exploratory analysis at this stage. We'll try to answer some questions using metrics and EDA.

analysis data-science data-visualization eda python

Last synced: 19 May 2026

https://github.com/kariemseiam/geoegy

An innovative and responsive dashboard to discover, filter, and analyze places across Egypt. Featuring advanced search, interactive maps with Leaflet.js, real-time analytics, dark mode, and seamless data export—all wrapped in a sleek, modern design with RTL support.

accessibility data-analysis data-visualization es6-modules geojson javascript leaflet mapping openstreetmap places-data responsive-design web-development

Last synced: 13 Feb 2026

https://github.com/dcostachar/telco-customer-churn-dashboard

An interactive Tableau dashboard using the Telco Customer Churn dataset to analyze key drivers of customer churn and develop data-driven retention strategies for the telecommunications industry.

business-intelligence customer-churn-analysis data-analysis data-visualization marketing-analytics tableau

Last synced: 09 Mar 2026

https://github.com/itskshitija/analyzing-the-nyc-airbnb-market

The aim of this project is to use Python to understand the factors that influence Airbnb prices in New York City and to identify patterns across all variables. Our analysis provides useful information for travelers and hosts in the city and some of the best insights for the Airbnb business.

data-science data-visualization dataanalysis dataanalysisusingpython

Last synced: 22 Jul 2025

https://github.com/busesimsek/dataanalysisportfolio

A compilation of my data analysis projects using SQL, Python, and Tableau.

data-analysis data-visualization python sql tableau

Last synced: 12 Jun 2025

https://github.com/sravyatogarla/movie-recommendation-system

A complete Movie Recommendation System project implementing Popularity-Based, Content-Based, and Collaborative Filtering models using the MovieLens dataset. Built with Python, Pandas, and Plotly, featuring interactive inputs and visualizations.

capstone-project collaborative-filtering content-based-filtering data-science data-visualization edureka jupyter-notebook machine-learning movie-recomendation-system movielens pandas popularity-based-filtering python recommender-system scikit-learn sql

Last synced: 13 Apr 2026

https://github.com/carcesar/salariogovernadores2023

Visualization of governors' salaries in 2023

data-science data-visualization politics

Last synced: 24 Apr 2025

https://github.com/sanjana-bongale/cta_ridership_data_visualization_using_tableau

Tableau-based analysis of Chicago Transit Authority (CTA) ridership trends (2015-2024). It includes interactive dashboards, heatmaps, and comparative visualizations to explore bus and rail boarding data, COVID-19 impact, and long-term trends.

customer-analysis dashbaord data-visualization tableau

Last synced: 16 Feb 2026

https://github.com/archanakokate/ml_cardiovascular-disease-prediction-

EDA and model building to predict the risk of a heart attack using Logistic Regression and a Random Forest classifier

data-engineering data-visualization exploratory-data-analysis machine-learning-algorithms

Last synced: 17 Mar 2025

https://github.com/gabrieladados/tableau_dashboards

Dashboards developed in Tableau

dashboards data-visualization figma tableau

Last synced: 09 Apr 2025

https://github.com/itskshitija/lego-set-explorer

As a part of the Maven Analytics Lego challenge, I developed an interactive Power BI dashboard exploring the evolution of LEGO sets from 1970 to 2022.

data-analysis data-science data-visualization dataanalysis dataset powerbi powerbi-desktop powerbi-report

Last synced: 12 Jun 2025

https://github.com/d-k-deng/word_cloud_vis

For more detailed info, please see https://github.com/D-K-Deng/Ancient_Chinese_Bibliography_Vis

css data-visualization html javascript react

Last synced: 09 Apr 2026

https://github.com/shahiakhilesh1304/fitbitcasestudy

This is a case study based on data retrieved from a Fitbit band, and we are making predictions about human behavior based on their mood.

case-study data-visualization fitbit jupyter-notebook numpy python3

Last synced: 13 Apr 2026

https://github.com/martingit2/aiportal-frontend

Frontend for Aracanix. An interactive dashboard built with React and TypeScript for visualizing data and administering an analytics platform. See the README for links to the backend and ML service.

clerk dashboard data-visualization fullstack react recharts typescript vite

Last synced: 13 Apr 2026

https://github.com/afifahhadie/wine-clasificassion

This project focuses on classifying different types of wine using machine learning techniques. The dataset contains various chemical properties of wines, which are used as features to predict the wine class.

classification data-science data-visualization jupyter-notebook machine-learning machine-learning-algorithms pandas scikit-learn wine-dataset

Last synced: 13 Apr 2026

https://github.com/petarran/gun-violence-usa

Data science project comparing USA gun violence cases to their causes.

data-science data-visualization r

Last synced: 05 Sep 2025

https://github.com/nullthefirst/py-notebooks

Jupyter Notebooks holding Data Science projects

data-analysis data-science data-visualization datasets jupyter-notebooks python

Last synced: 26 Apr 2026

https://github.com/abhigyan126/prompt2query

A Python desktop application for streamlined data analysis, enabling users to generate and execute Pandas and SQL queries with ease. It focuses on reducing analysis time through an intuitive interface and efficient workflows.

data-analysis data-science data-visualization database gemini generative-ai ide llm pandas pandas-interface python sql-interface

Last synced: 13 Feb 2026

https://github.com/yash22222/data-analysis-on-real-time-social-media-comments

EngageInsight analyzes user interactions in comment data. It provides insights through visualizations created using Python libraries like Pandas and Matplotlib. The project aims to uncover patterns and trends in user engagement. The visualizations provide an overview of comment lengths and the frequency of different types of replies.

data-analysis data-cleaning-and-preprocessing data-visualization matplotlib pandas pattern-recognition real-time-social-media-data seaborn trend-analysis

Last synced: 14 May 2026

https://github.com/darrenjolson/pba-analysis-app

Data analysis and visualization tool for professional bowling tournaments, predicting performance across different oil patterns and venues.

bowling data-analysis data-visualization flask pba predictive-analytics python reactjs sports-analytics

Last synced: 13 Apr 2026

https://github.com/nurulashraf/polynomial-regression-manufacturing

A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.

data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn

Last synced: 16 Apr 2026

https://github.com/sayamalt/flight-price-prediction

Successfully established a machine learning model to accurately predict the price of a flight in India based on several features such as duration, days left, arrival time, departure time and so on.

data-visualization exploratory-data-analysis feature-engineering hyperparameter-tuning machine-learning model-deployment model-training-and-evaluation regression-models

Last synced: 09 Nov 2025

https://github.com/analysisbyvivek/Road-Accident

Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.

data-analysis data-visualization eda jupyter-notebook kaggle tableau-public

Last synced: 29 Jan 2026

https://github.com/jawwad-fida/stroke-prediction

Predict whether a patient is likely to have a stroke using machine learning classification algorithms, with a performance comparison of the algorithms.

classification data-science data-visualization machine-learning python stroke-prediction

Last synced: 19 Feb 2026

https://github.com/oncs21/expenai

An open source modern and simple expense tracking app with numerous integrations

bootstrap css data-visualization database-management ejs express mindee mongodb mongoose node-js server-security

Last synced: 09 Apr 2026

https://github.com/suryakaranraja/listview-application-for-zoho-interview

This repository contains the files for a basic application designed to retrieve details about the top 50 happiest countries in the world for 2022.

data-visualization desktop desktop-app happiness-score interview-with-zoho list-view surface-tablet winui3

Last synced: 09 Apr 2026

https://github.com/amoghkori/working-with-apache-spark-mllib

Used Apache Spark MLlib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.

amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark

Last synced: 13 Apr 2026

https://github.com/ccolpasm/bar-chart-task

This repository shows an interactive visualization of US GDP using D3.js. The data comes from the public FreeCodeCamp API (https://raw.githubusercontent.com/freeCodeCamp/ProjectReferenceData/master/GDP-data.json). The project uses HTML, CSS, and D3.js to analyze and chart the country's economic growth.

d3-visualization d3js data-science data-visualization javascript

Last synced: 08 Aug 2025

https://github.com/nature40/casestudies

Case studies for testing the functionality of database systems, sensors, etc

casestudies data-analysis data-visualization database

Last synced: 02 May 2026

https://github.com/ireneflorez/exploringweathertrends

Exploring Weather Trends using SQL, moving averages, and data visualization

data-visualization excel sql

Last synced: 10 Feb 2026

https://github.com/hassanislam463/data-cleaning-and-modelling-top-5-categories-analysis-forage

This project involves cleaning, merging, and analyzing datasets to identify the top 5 performing categories based on aggregate popularity scores. It includes cleaned datasets, a final merged dataset, visualizations, and a presentation summarizing the tasks and results. Tools used: Microsoft Excel, Python, and PowerPoint.

data-analysis data-visualization microsoft-excel

Last synced: 07 Jan 2026

https://github.com/salman-khan-mohammed/music-album-popularity-prediction

"Music Album Popularity Prediction" is a project focused on building a model to forecast the success of music albums. By analyzing streaming data, social media engagement, and other relevant factors, the project aims to predict the popularity of albums across various genres and artists.

data-visualization eda machine-learning outlier outlier-detection outlier-treatment

Last synced: 08 Aug 2025

https://github.com/naninsv/yulu-bike-rental-analysis

Assisting Yulu, India's micro-mobility provider, in understanding factors influencing shared electric cycle demand. Employing statistical tests and analysis on a dataset to identify significant predictors and gauge their impact on cycle demand.

data-visualization eda hypothesis-testing matplotlib-pyplot numpy-library pandas-library python scipy-stats seaborn-python statistical-analysis

Last synced: 13 May 2026

https://github.com/athul64/tmdb-dataset-analysis

This dataset contains information about 10,000 movies extracted from TMDB, covering 1960 to 2015 and including user ratings and revenue. Original data from Kaggle.

data-visualization dataframe eda numpy pandas python

Last synced: 14 Apr 2026

https://github.com/syncfusionexamples/how-to-collapse-the-visibility-of-specific-data-label-in-.net-maui-cartesian-chart

This article in the Syncfusion Knowledge Base explains how to collapse the visibility of a specific data label in a .NET MAUI Cartesian chart

charting-library charts column-chart data-label data-label-customization data-label-visibility data-visualization hide-data-label maui-charts

Last synced: 03 Oct 2025

https://github.com/jagoda11/elastic-vision

This repository contains a full-stack application designed to explore data from Elasticsearch indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered by React.

backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables

Last synced: 09 Apr 2026

https://github.com/albertofaraujo/pbi_dashboard_anatel

Analyzes the performance of carriers in handling complaints filed by telecommunications users in the Anatel Consumidor system

data-visualization dax-studio power-query powerbi

Last synced: 19 Feb 2026

https://github.com/gitasong/firemap

Epicodus JavaScript Group Project: Live, interactive map of Oregon/Pacific Northwest fires; practice using Angular2.js with TypeScript, Angular-CLI, Google Maps API, data scraping, data visualization using D3js

angular-2 angular-cli angular-components angular-directives angular2 angularjs apis d3js data-visualization epicodus epicodus-students google-maps google-maps-api javascript

Last synced: 01 Oct 2025

https://github.com/andrewobwocha/titanicsurvival

🚢 End-to-end Python pipeline for Titanic survival classification. Demonstrates EDA, preprocessing, feature engineering, and Logistic Regression evaluation using Scikit-learn.

classification data-preprocessing data-visualization exploratory-data-analysis feature-engineering machine-learning pandas python scikit-learn titanic

Last synced: 13 Jun 2025

https://github.com/gmasson/datadash

DataDash is a JavaScript and CSS library for creating interactive dashboards that visualize dynamic data on web pages.

dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript

Last synced: 08 Aug 2025

https://github.com/quangandrei1003/france_air_pollution_pipeline

End-to-end air pollution data pipeline for French metropolitan cities using Airflow, Python, dbt, BigQuery.

airflow bigquery data data-analytics data-engineering data-modeling data-visualization dbt docker etl pandas python terraform

Last synced: 13 Apr 2026

https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling

All code for the data pipeline processing, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018-2021

data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics

Last synced: 13 Apr 2026

https://github.com/cagandemirmr/flo_sql_server_to_power_bi

In this project, I connect SQL Server to Power BI to visualize my project.

data-visualization dataanalysis dataanalyst directquery powerbi queries sqlserver

Last synced: 08 Aug 2025

https://github.com/v41bh4vr4jput/data-analysis-with-python

This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.

api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn

Last synced: 09 Apr 2026

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills from Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 11 Apr 2026

https://github.com/nandahkrishna/mpas

Movie Data Management and Analysis System developed using Java and Python

analysis data-visualization flask java java-application python

Last synced: 20 Apr 2026

https://github.com/shubham200137/expense-tracker-dashboard

The task is to create a Power BI dashboard from expense data (October–December) stored on SharePoint/OneDrive. It should include dropdowns for file and sheet selection with auto-refresh.

dashboard data-visualization powerautomate powerbi

Last synced: 04 Feb 2026

https://github.com/dmarks84/coursework_project_text-mining-topic-modeling

Project for University of Michigan Applied Data Science Specialization -- Developed functions to score similarity between text passages.

data-modeling data-reporting data-visualization databases eda nlp numpy pandas python statistics text-mining

Last synced: 12 Apr 2026