NumPy
NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays, and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- GitHub: https://github.com/topics/numpy
- Wikipedia: https://en.wikipedia.org/wiki/NumPy
- Repo: https://github.com/numpy/numpy
- Created by: Travis Oliphant
- Related Topics: python, scipy,
- Last updated: 2026-06-26 00:18:09 UTC
- JSON Representation
https://github.com/sralter/potential_talents
Using NLP techniques (word and sentence embedding tools like SBERT and Learning-to-Rank systems like RankNet and LambdaRank) to rank candidates.
lambdarank learning-to-rank lightgbm matplotlib nlp numpy pandas python pytorch ranknet
Last synced: 09 Apr 2026
https://github.com/epsoft/embedding
Embedding
adam dense embedding flatten itertools numpy onehotencoder optimizers pandas sequential tensorflow
Last synced: 12 Apr 2026
https://github.com/coder5omkar/lendingclubcasestudy
Performed exploratory data analysis (EDA) on the loan dataset from the Lending Club Case Study to identify the key factors influencing loan defaults.
bivariate-analysis data-science exploratory-data-analysis matplotlib numpy pandas seaborn univariate-analysis
Last synced: 06 May 2026
https://github.com/caterinatasinato/python-projects
Projects I worked on as Trainee in Data Analytics at ProfessionAI
matplotlib numpy pandas python3
Last synced: 13 Apr 2026
https://github.com/mosalem149/data-mining-assignments
That repo for data mining quizzes and tasks.
anaconda data-mining dataframe jupyter-notebook matplotlib mlxtend numpy pandas python quizzes scikit-learn seaborn tasks
Last synced: 13 Apr 2026
https://github.com/al-ghaly/stock-market-simulation
Simulate and visualize stock market behavior
matplotlib numpy python python-visualization
Last synced: 16 May 2026
https://github.com/eesunmoon/aml_project
[Project] Analyzing Fitness Patterns & Building Predictive Models
matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/abideen-olawuwo/frauddetection
A credit card Fraud Detection Model
logistic-regression matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/alansteinbarth/irysy
🪻 Irysy 🌼 eksploracyjna analiza danych – wizualizacje, ML, transparentny kod i autorskie grafiki. Projekt łączący AI, estetykę i praktyczne umiejętności!
ai classification clean-code data-science exploratory-data-analysis feature-engineering github-pages iris-dataset jupyter-notebook machine-learning matplotlib modern-ui numpy open-source pandas portfolio-project python reproducible-research seaborn visualization
Last synced: 17 Oct 2025
https://github.com/cosmoquester/numpy-neural-net
This is neural network samples implemented with numpy without machine learning libraries.
implementation neural-network numpy
Last synced: 22 Jan 2026
https://github.com/mbarbetti/hello-snakemake
Simple implementation of snakemake to sample and plot data
hello-world matplotlib numpy python snakemake
Last synced: 14 Apr 2026
https://github.com/pratyush905/farecast-nyc-taxifare-predictor
Machine learning models to predict nyc taxi fare based on given dataset
jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn
Last synced: 13 Apr 2026
https://github.com/deepcloudlabs/dcl702-2021-jul-12
DCL-702: Data Analytics using Python
data-analytics numpy pandas python3
Last synced: 05 May 2026
https://github.com/anarya22/accenture-north-america-data-analytics-and-visualization-job-simulation-on-forage
Completed a simulation focused on advising a hypothetical social media client as a Data Analyst at Accenture. Cleaned, modelled and analyzed 7 datasets to uncover insights into content trends to inform strategic decisions. Prepared a PowerPoint deck and video presentation to communicate key insights for the client and internal stakeholders.
analyzing-visualization data-cleaning data-visualization numpy pandas powerbi powerpoint-presentations
Last synced: 09 May 2026
https://github.com/sunilvarma-l/liverdiseaseprediction
"Streamlit app to predict liver disease risk using a machine learning model based on patient input data."
machine-learning matplotlib numpy pandas pickle python scikit-learn seaborn streamlit
Last synced: 13 Apr 2026
https://github.com/antodata/hate_crimes_spain_2014_2017
Analysis of hate crimes in Spain between 2014 and 2017 using official data
chi-square chi-square-test data-analysis data-visualization datascience folium hatecrime json lgtbiq linear-regression maps matplotlib numpy pandas python python3 scipy selenium selenium-webdriver sklearn
Last synced: 14 Apr 2026
https://github.com/neelays/xor-xnor_neural_network
NumPy neural network to approximate XOR/XNOR
Last synced: 15 May 2026
https://github.com/chandkund/predicting-diabetes-onset
The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.
deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization
Last synced: 13 Apr 2026
https://github.com/farhad-here/predict_student_performance
Predict Student Performance, is a data analysis and machine learning project aimed at predicting students' final performance (g3) based on demographic, family, and academic features. The project supports both Regression (predicting exact grades) and classification (Pass/Fail categories).
classification data-analysis data-visualization linear-regression machine-learning numpy pandas postgresql powerbi scikit-learn streamlit
Last synced: 14 Apr 2026
https://github.com/fanisgl/cnn_machine_learning
A Machine Learning exercise that trains Convolutional Neural Network (CNN) using the tensorflow 2 and Keras libraries to predict images from the CIFAR-10 dataset.
data-science keras machine-learning neural-networks numpy pandas pooling-layers relu softmax tensorflow2
Last synced: 24 Feb 2026
https://github.com/navindafernando/lane-detection
Lane Detection with Image Feature Extraction Techniques :aerial_tramway:
image-feature-extraction kernel-filters machine-learning neighborhood numpy opencv sobel-filter
Last synced: 13 Apr 2026
https://github.com/danhenriquex/data_science_and_machine_learning
A.I, Data Science & Machine Learning and Deep Learning with Tensorflow
deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow
Last synced: 13 Apr 2026
https://github.com/icepanorama/internship-visualizations-and-demonstrations
A collection of some of the programs that I've written over the course of my internship.
artificial-intelligence machine-learning matplotlib numpy pandas python3 pytorch scikit-learn
Last synced: 14 Apr 2026
https://github.com/ksprugevics/magic-square
Some simple algorithm implementations for solving (ordinary) magic squares using NumPy
algorithms mathematics numpy python
Last synced: 21 Apr 2026
https://github.com/oleksiilatypov/titanic_ml
Titanic Survival Prediction using ML and Flask Deployment.
data-science docker flask machine-learning numpy pandas skit-learn
Last synced: 13 Apr 2026
https://github.com/waghraj1699/sms-spam-detection
A web app to check wheather the messages are spam or not.
classification-algorithm data-preprocessing data-visualization machine-learning nlp numpy pandas
Last synced: 18 Apr 2026
https://github.com/patilsukanya/assignment-05.-multiple-linear-regression-q2
Used libraries and functions as follows:
collinearity-diagnostics cooks-distance correlation-analysis eda heteroscedasticity homoscedasticity leverage-score mlr multiple-linear-regression numpy ols-regression pairplot python regress-exog residual-analysis rsquare-values smf statsmodels vif
Last synced: 06 May 2026
https://github.com/juzershakir/predicting_boston_housing_prices
Builded a model to predict the value of a given house in the Boston real estate market using various statistical analysis tools. Identified the best price that a client can sell their house utilizing machine learning.
bias-variance boston-housing-price-prediction data-exploration decision-tree-regression gridsearchcv k-fold machine-learning matplotlib mlfnd model-evaluation model-validation numpy pandas python3 r2-score sklearn supervised-learning udacity-nanodegree
Last synced: 22 Oct 2025
https://github.com/juzershakir/train_a_smartcab_to_drive
From just cab, to Smart-Cab
cogsci decay-rate epsilon machine-learning machine-learning-algorithms machine-learning-tutorials matplotlib numpy pandas pygame python27 q-learning reinforcement-learning smartcab training udacity-nanodegree visualization
Last synced: 22 Oct 2025
https://github.com/bhuvantenguria/adobe-gensolve
Adobe-Gensolve
cv2 dotenv firebase-admin flask numpy pandas python react svgpathtools svgwrite
Last synced: 23 Jan 2026
https://github.com/jsnevt/regressao_logistica
Cálculos de probabilidades sobre uma eleição
matplotlib numpy pandas-dataframe python
Last synced: 11 May 2026
https://github.com/deanlogan/data-analysis-course
Code created when completing the Data Analysis with Python Course on freecodecamp.org
course data-analysis numpy pandas python python3
Last synced: 06 May 2026
https://github.com/tomdewildt/interactive-and-explainable-ai-design
Code for The Interactive And Explainable AI Design course of my master's degree
jupyter lime numpy pandas python scikit-learn shap
Last synced: 18 Jun 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web Scrapping and Python Data visualization on Top 500 movies IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/cezlul/analyse-ventes-immobilier
Solution ML d'analyse immobilière parisienne : classification automatique appartements vs commerces (K-means, 91%) et prédiction prix (régression, R²=0.98) sur 26K transactions. Valorise portefeuille 169M€ avec recommandations stratégiques data-driven.
data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python sklearn
Last synced: 13 Apr 2026
https://github.com/ljadhav25/data-engineering-poc
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
data-analysis etl matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/egorumaev/2024-ab-tests-metrics
Анализ соответствия размера выборки и плановых значений метрик A/B-теста
latex monte-carlo-method numpy pandas statmodels
Last synced: 04 May 2026
https://github.com/dr-rompecabezas/medical-data-visualizer
[freeCodeCamp] Medical Data Viz Jupyter Notebook built with Matplotlib and Seaborn
jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 04 May 2026
https://github.com/gnikit/fileio
A template allowing to read multiple columns of tab or space delimited files. Similar functionality to numpy.loadtxt()
Last synced: 04 May 2026
https://github.com/ortanav2/parabola-visualizer
Simple parabola-visualizer made with matplot+numpy [Formula: a(x+d)²+e]
mathematics matplotlib numpy parabola visualization
Last synced: 04 May 2026
https://github.com/dmdlgg/calculadora_de_indicadores_estatisticos
Uma calculadora que retorna a média, mediana, desvio padrão, quartis e variância dos dados inseridos pelo usuário. Tecnologias utilizadas: Flet, Python e Numpy. (Ainda em andamento)
Last synced: 04 May 2026
https://github.com/mchenryspagg/investigate_a_dataset
This is a data analysis project that demonstrates the student's ability to use python data analysis libraries such as pandas, numpy and pyplot in matplotlib to investigate a dataset and answer specific questions from the dataset, thus demonstrating skills in data cleaning, data wrangling, and exploratory data analysis.
data-analysis datetime descriptive-analysis descriptive-statistics exploratory-data-analysis numpy pandas pyplot python visualization
Last synced: 04 May 2026
https://github.com/miozilla/fescarefine
fescarefine :skier::mount_fuji::guide_dog: : Refine & Test ML Models # Features Scaling # Normalization # Datasets
gradient-descent linear-regression matplotlib model-testing mse norm numpy ols overfit pandas scatter seaborn sklearn standardization statsmodels underfit
Last synced: 04 May 2026
https://github.com/omogbolahan94/handwashing-history-analysis
Data Wrangling and Analytics
matplotlib numpy pandas plotly scipy seaborn statistical-analysis
Last synced: 04 May 2026
https://github.com/charlenry/machine_learning_estimer_prix_voiture
Projet de Machine Learning pour estimer le prix d'une voiture d'occasion via une application
joblib jupyter-notebook matplotlib-pyplot numpy oop-python pandas plotly-express python sklearn-linear-model sklearn-linear-regression
Last synced: 04 May 2026
https://github.com/abhijeet-bhambere/exploratory_data_analysis
A collection of several projects for conducting EDA on datasets from various sources
exploratory-data-analysis matplotlib numpy pandas python
Last synced: 04 May 2026
https://github.com/0nom4d/308reedpipes
Mathematical EPITECH Project - This project computes a representation of a reedpipe.
coverage epitech epitech-project matplotlib-pyplot numpy pytest python scipy
Last synced: 04 May 2026
https://github.com/jakubfr4czek/apartment-prices-analysis
This repository contains an analysis of apartments prices in Krakow. It was made for "Rachunek prawdopodobieństwa i statystyka" (probability and statistics) course i took at AGH University of Science.
agh agh-wi analysis csv jupyter jupyter-notebook krakow numpy pandas price-analysis price-prediction probability probability-statistics python statistics statistics-learning
Last synced: 04 May 2026
https://github.com/hanzopgp/randomkagglesolving
Trying to build models to resolve some interesting Kaggle datasets thanks to deep learning.
artificial-intelligence deep-learning jupyter-notebook machine-learning matplotlib numpy personal tensorflow
Last synced: 04 May 2026
https://github.com/chathumiamarasinghe/nn-training-model
A comprehensive project for training neural networks to solve real-world problems. This repository includes customizable code for building, training, and evaluating neural network architectures using popular deep learning frameworks.
jupyter-notebook matplotlib numpy phyton scikit-learn
Last synced: 04 May 2026
https://github.com/aqueeqazam/machine-learning-using-scikit
This repository contains all of the algorithms used to train the machine learning models using the Scikit library.
Last synced: 04 May 2026
https://github.com/lin826/nanogpt-demo
Training and finetuning local GPTs.
gpt nanogpt numpy pytorch tqdm transformers
Last synced: 05 May 2026
https://github.com/thekartikeyamishra/object-detection-for-retailer
The AI Customer Feedback Summarizer is a Python-based tool that uses extractive summarization techniques to provide concise summaries of customer feedback.
ai computer-vision cv2 numpy pillow python torch torchvision yolov5
Last synced: 05 May 2026
https://github.com/meicloudie/demo-python-data-analytics
Python Data Analytics
data-analyst jupyter matplotlib numpy pandas python work-project
Last synced: 05 May 2026
https://github.com/csgn/numpy-tr-docs
docs documentation np numpy numpy-tr-docs tr-docs turkish
Last synced: 05 May 2026
https://github.com/chandrasankarsaha-code/customer-churn-prediction
A machine learning project to predict customer churn.
churn-prediction classfication customer machine-learning numpy pandas
Last synced: 05 May 2026
https://github.com/badranalyst/residential-unit-prices-data-analysis-application
Python-based analysis of residential unit prices, focusing on data cleaning, visualization, and exploratory data analysis (EDA). Key features include price distribution, and correlation analysis between factors like size, location, and pricing.
data-analysis data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/elijahondiek/data-analysis-udacity
Udacity Data Analysis Nano Degree course resources
jupyter-notebook matplotlib numpy pandas python
Last synced: 05 May 2026
https://github.com/lruizap/tomography-with-algebraic-topology
This project aims to analyze images using algebraic topology. The goal is to extract structural information from images through mathematical concepts such as connected components and loops.
gudhi jupyter-notebook matplotlib numpy opencv python tomography
Last synced: 05 May 2026
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 05 May 2026
https://github.com/tapas-gope/diwali-sales-analysis
Analyze Diwali Sales Data to improve customer experience and sales.
exploratory-data-analysis matplotlib-pyplot numpy pandas python seaborn
Last synced: 05 May 2026
https://github.com/jeslipriya/expense-tracker
This is a smart expense tracking web app built with Flask that helps users manage and visualize their spending. It features Pandas and Numpy analysis to suggest ways to cut down on unnecessary expenses, with clean UI, graph visualizations, and secure login.
css flask-application full-stack html matplotlib numpy pandas
Last synced: 05 May 2026
https://github.com/chdl17/marvelvsdc
This GitHub repository contains Python and Tableau for analyzing Marvel Vs DC superheros.
excel jupyter-notebook numpy pandas python tableau-public
Last synced: 05 May 2026
https://github.com/theadeelahmed/mall-customers-segmention
This project segments mall customers based on income, age, and shopping score. Using clustering techniques, it identifies key customer groups for targeted marketing campaigns. Tools used: Pandas, Matplotlib, Seaborn, and Scikit-learn
matplotlib-pyplot numpy pandas python3 seaborn sklearn-clustering
Last synced: 05 May 2026
https://github.com/pyladiesams/python-oss-sprints-mar2024
Sprints to contribute to the most popular data and AI Python OSS libraries
conda dask geopandas jupyter matplotlib numpy pandas pymc sklearn
Last synced: 05 May 2026
https://github.com/rohra-mehak/sciencesync
System for Personalized Google Scholar Alerts Processing and Data Management, and provision of ML based clustering analysis
agglomerative-clustering clustering crossref-api customtkinter google-api google-scholar graph-api machine-learning numpy pandas python3 scientific-article-analysis scikit-learn sqlite3
Last synced: 05 May 2026
https://github.com/jonad/quora_data_analysis
Exploratory data analysis and feature engineering on Quora dataset.
exploratory-data-analysis feature-extraction numpy pandas python3 quora-dataset
Last synced: 05 May 2026
https://github.com/sahaninilesh/future_ds_01_social-media-sentiments-analysis
The Social Media Sentiments Analysis Dataset captures a vibrant tapestry of emotions, trends, and interactions across various social media platforms. This dataset provides a snapshot of user-generated content, encompassing text, timestamps, hashtags, countries, likes, and retweets. Each entry unveils unique stories—moments of surprise, excitement,
jupyter matplotlib numpy pandas python
Last synced: 05 May 2026
https://github.com/tsungtsetu122/patternrecognition-gnn-node-classification
This project implements a Mixed Graph Neural Network (GNN) for semi-supervised multi-class node classification using the Facebook Large Page-Page Network Dataset. The goal of the project is to predict the class of each node (page) based on 128-dimensional feature vectors and the graph structure connecting the nodes.
evaluation-metrics gnn matplotlib networkx numpy optimization pandas python pytorch-geometric
Last synced: 05 May 2026
https://github.com/supernovasatsangi23/modifying-biomarker-gene-identification-for-effective-cancer-categorization
A project that focuses on implementing a hybrid approach that modifies the identification of biomarker genes for better categorization of cancer. The methodology is a fusion of MRMR filter method for feature selection, steady state genetic algorithm and a MLP classifier.
dataset deep-learning deep-neural-networks feature-selection genetic-algorithm machine-learning machine-learning-algorithms mlp-classifier mrmr neural-network numpy pandas-dataframe python python3 scikit-learn scikit-learn-python tkinter-gui tkinter-python
Last synced: 05 May 2026
https://github.com/kalwalt/pybind-test
A simple example with pybind and numpy with C++ code
cplusplus numpy pybind11 python
Last synced: 05 May 2026
https://github.com/romanectatyana/fuel_norm
Practical data analytics case: calculating and optimizing fuel consumption norms for vehicles and heavy equipment. (Аналітичний проєкт із визначення норм витрат палива для автотранспорту та спецтехніки.)
matplotlib numpy pandas plotly power-bi
Last synced: 06 May 2026
https://github.com/shwetapardhi/assignment-1-q9_b--basic-statistics-level-1
Q9) Calculate Skewness, Kurtosis & draw inferences on the following data SP and Weight(WT) Use Q9_b.csv
kurtosis matplotlib-pyplot numpy pandas python seaborn skewness
Last synced: 06 May 2026
https://github.com/kashinathbiradar/bangalore-housing-price-prediction
The objective of the project is to create a machine learning model. We are doing a supervised learning and our aim is to do predictive analysis to predict housing price.
css deta doc html machine-learning matplotlib numpy pandas preprocessing seaborn sklearn sql
Last synced: 06 May 2026
https://github.com/rishabhmathur06/fetching-jobs-data-using-beautifulsoup
beautifulsoup numpy pandas python webscraping
Last synced: 06 May 2026
https://github.com/billgewrgoulas/recommendation-systems
Algorithms for joke rating prediction using the joke data-set from Kaggle.
algorithm clustering collaborative-filtering machine-learning numpy pandas recommender-system scikit-learn scypi
Last synced: 06 May 2026
https://github.com/sankaran-s2001/us-traffic-accidents-analysis-python-eda
Exploratory data analysis of US traffic accidents from 2016-2023, analyzing patterns by time, location, weather, and severity using Python data science libraries.
data-analysis data-science data-visualization eda matplolib numpy pandas python
Last synced: 06 May 2026
https://github.com/andrewsy1004/logistic-regression-spam-classifier
This project implements a spam email classifier using Logistic Regression.
Last synced: 06 May 2026
https://github.com/5hraddha/optimize-oil-well-locations
In the quest for harnessing valuable energy resources, the OilyGiant mining company wants to expand its operations by discovering new oil well locations. To achieve this, a data-driven approach is adopted, leveraging geological exploration data from three distinct regions and employing techniques in data analysis and modeling.
linear-regression numpy pandas scikit-learn supervised-learning
Last synced: 06 May 2026
https://github.com/sopheakchan/dull-razor-algorithm
DullRazor hair removal for dermoscopic skin images.
Last synced: 06 May 2026
https://github.com/paulo-santos-ds/llm
Um aplicativo web simples de tradução que converte texto do português para o inglês, construído com Streamlit e Google Translator
deeptranslate googletranslate numpy pyngrok python streamlit
Last synced: 06 May 2026
https://github.com/cycle-sync-ai/student-score-analysis
A data-driven student performance analysis project using UCI dataset (396 students, 33 features). Implements machine learning models (K-means, PCA, Decision Tree, Random Forest, Linear Regression) to analyze academic patterns and predict student scores based on lifestyle, health, and study habits.
clustering clustering-algorithm decision-trees feature-engineering learning-management-system linear-regression machine-learning machine-learning-algorithms matplotlib numpy pandas pca pickle prediction prediction-algorithm scikit-learn score seaborn student
Last synced: 06 May 2026
https://github.com/kavayk29/text-summarization-fine-tuning-pegasus-model
This project uses the Pegasus model to generate concise, coherent text summaries. Fine-tuned on large datasets, it excels in abstractive summarization, ideal for automating tasks like summarizing articles and research papers.
datasets huggingface-transformers matplotlib nltk numpy pandas teansform tqdm
Last synced: 06 May 2026
https://github.com/hansalemaos/cv2_collage
Create a collage from images with OpenCV
Last synced: 06 May 2026
https://github.com/ananyachibber21/automation-python-projects
Automation with Python (Second Year Projects)
automation chromedriver numpy openpyxl pandas-dataframes python selenium
Last synced: 06 May 2026
https://github.com/fatihilhan42/lane-line-detection-with-python
In this project you will detect lane lines in images using Python and OpenCV. OpenCV means "Open-Source Computer Vision", which is a package that has many useful tools for analyzing images.
image-processing line-detection numpy opencv python
Last synced: 06 May 2026
https://github.com/jayanthpotluri5513/heart-disease-prediction-using-ml
This project aims to develop a machine learning model for heart disease prediction, leveraging the potential of advanced algorithms and a comprehensive dataset.
machine-learning matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/harmanveer-2546/student-performance-in-exam
Student performance analysis and prediction using datasets has become an essential component of modern education systems. With the increasing availability of data on student, schools and universities are using advanced analytics and machine learning algorithms to gain insights into student performance and predict future outcomes.
classification-report cluster dbscan dimensionality-reduction k-means-clustering labelling linear-regression logistic-regression numpy pandas pre-processing random-forest-classifier test-train-split unsupervised-machine-learning visualization
Last synced: 06 May 2026
https://github.com/abdulrahmanaymann/data-storage-and-information-retrieval
This is a Python project that performs tokenization, stop word removal, positional indexing, phrase query searching, term frequency-inverse document frequency (TF-IDF) calculation, cosine similarity computation, and document ranking.
cosine-similarity information-retrieval nltk numpy pandas postional-index python ranking stop-word-removal tf-idf tokenization
Last synced: 06 May 2026
https://github.com/kianaabrisham/svm-from-scratch
Linear SVM from scratch with hinge loss + decision boundaries
classification from-scratch fundamentals hinge-loss numpy optimization scikit-learn svm
Last synced: 07 May 2026
https://github.com/siddhantprateek/machine-learning-resources
Machine Learning Resources
best-practices clustering-algorithm data-analysis deep-learning in-progress journey linear-regression machine-learning machine-learning-algorithms neural-language-modelling neural-language-processing neural-network numpy python3 read reinforcement-learning-algorithms tensorflow visualisation
Last synced: 07 May 2026
https://github.com/dulajkavinda/breast-cancer-detect
🧬 breast cancer classification using machine learning
added machinelearning matplotlib numpy pandas python
Last synced: 07 May 2026
https://github.com/aicorsair/python-case-study-ab-testing-for-lunartech-homepage-cta-button
This repository contains a detailed case study on an A/B test of LunarTech's homepage CTA button, using proxy data structured similarly to the company's real data.
ab-testing click-through-rate confidence-intervals data-analysis data-analytics data-exploration data-science data-visualization hypothesis-testing matplotlib normal-distribution numpy pandas practical-significance python statistical-analysis statistical-significance z-critical z-statistic z-test
Last synced: 07 May 2026