NumPy
NumPy is an open source library for the Python programming language, adding support for large, multidimensional arrays and matrices, along with a large collection of high-level mathematical functions to operate on these arrays.
- GitHub: https://github.com/topics/numpy
- Wikipedia: https://en.wikipedia.org/wiki/NumPy
- Repo: https://github.com/numpy/numpy
- Created by: Travis Oliphant
- Related Topics: python, scipy
- Last updated: 2026-06-25 00:18:02 UTC
https://github.com/alainlebret/python-et-ia-2
Personal resources from my course "Python & IA"
artificial-intelligence deep-learning image-processing machine-learning matplotlib neural-network numpy python scikit-image scikit-learn
Last synced: 12 Apr 2026
https://github.com/lexxai/goit_python_ds_hw_03
Module 3. Classical machine learning. Overfitting. Linear regression. LaTeX formulas.
latex linear-regression matplotlib numpy pandas python scikit-learn
Last synced: 09 Apr 2026
https://github.com/linsanity03/algorithmic_trading
Trading strategy based on the intersection of short-term and long-term moving averages.
jupyter-notebook matplotlib-pyplot numpy pandas
Last synced: 04 May 2026
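As a rough illustration of the moving-average crossover idea described in this entry, here is a minimal sketch in plain Python. It is not taken from the repository: the function names `moving_average` and `crossover_signals`, the window sizes, and the sample prices are all hypothetical, and a simple (unweighted) moving average is assumed.

```python
def moving_average(prices, window):
    """Simple moving average; entry i averages prices[i : i + window]."""
    return [
        sum(prices[i:i + window]) / window
        for i in range(len(prices) - window + 1)
    ]


def crossover_signals(prices, short_window, long_window):
    """Return (price_index, "buy" | "sell") wherever the short SMA crosses the long SMA.

    Hypothetical helper: a "buy" is emitted when the short average moves
    above the long average, a "sell" when it moves below.
    """
    short = moving_average(prices, short_window)
    long_ = moving_average(prices, long_window)
    # Trim the short series so both averages end on the same price index.
    short = short[long_window - short_window:]

    signals = []
    for i in range(1, len(long_)):
        prev_diff = short[i - 1] - long_[i - 1]
        diff = short[i] - long_[i]
        price_index = i + long_window - 1  # last price covered by both windows
        if prev_diff <= 0 < diff:
            signals.append((price_index, "buy"))
        elif prev_diff >= 0 > diff:
            signals.append((price_index, "sell"))
    return signals


# Made-up prices: a decline followed by a recovery.
sample_prices = [10, 9, 8, 7, 6, 7, 8, 9, 10, 11]
print(crossover_signals(sample_prices, short_window=2, long_window=4))
```

A real strategy would also need position sizing, transaction costs, and out-of-sample testing, none of which this sketch attempts.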
https://github.com/secary/maths7027
Mathematical Foundations of Data Science
latex mathematics numpy pandas
Last synced: 04 May 2026
https://github.com/asuquoaa/cdc_2017_immunization_data_analysis
This project analyzes data from the CDC's 2017 National Immunization Survey Public Use File (NISPUF17) to answer research questions about immunization, child health, and vaccine-related behaviors.
Last synced: 12 Apr 2026
https://github.com/ewdlop/computational-physcis-notes
Computational Physics written in Python
maple matlab numpy physics-simulation python qsharp scipy wolfram-mathematica
Last synced: 20 Apr 2026
https://github.com/humankernel/hands-on-ml
Practice of Machine Learning
machine-learning matplotlib numpy pandas sklearn
Last synced: 08 Jun 2026
https://github.com/testetson22/abelian_sandpiles
Simple App that implements, builds and tests an approximation of the Abelian sandpile model
Last synced: 12 Apr 2026
https://github.com/bilgeswe/datascience
Statistics and data science project in Python using Google Colab tools, CSV, and XLSX
box-plot colab-notebook cvs data-science data-scraping data-visualization heatmaps numpy python statistical-analysis statistics xlsx
Last synced: 29 Apr 2026
https://github.com/manoj-kumar-c/data_analyst_projects
Live Data Analyst Projects.
Last synced: 07 Feb 2026
https://github.com/chandkund/iris-classification
Iris Classification project using machine learning to classify iris flowers into Setosa, Versicolour, and Virginica species. Includes data exploration, model training with SVM, Logistic Regression, and Decision Tree, and performance evaluation. Contributions welcome! Python, Pandas, Scikit-learn, Seaborn, Matplotlib.
machine-learning matplotlib-pyplot numpy pandas python sklearn
Last synced: 21 Jan 2026
https://github.com/elmezianech/email-spam-ham-classifier-nb-hard
Naive Bayes Email Classifier: An implementation of a 'hard' Naive Bayes classifier in Python to categorize emails as spam or ham. This code performs extensive data preprocessing, probability calculations, and model training for email classification using the raw Naive Bayes algorithm.
ai emails jupyter-notebook machine-learning ml naive-bayes naive-bayes-classifier numpy pandas python spam-detection spam-filtering
Last synced: 11 Apr 2026
https://github.com/nambukeerthi/youtube_dataproject
Build a Streamlit App – A user-friendly interface. Use Google API – Fetch YouTube channel data. Store in SQL Database – Save channel details for easy access. Search & Join Tables – Allow users to query and view data
api numpy pandas python sql streamlit
Last synced: 09 Apr 2026
https://github.com/richardbmk/datascience_machinelearning
Projects related to data science and machine learning.
data-science machine-learning matplotlib numpy pandas scikit-learn scipy seaborn
Last synced: 11 Apr 2026
https://github.com/0nom4d/306radiator
Mathematical EPITECH Project - This project reproduces the behaviour of a radiator placed at a position in a room of a given size.
epitech epitech-project numpy pytest python3
Last synced: 29 Apr 2026
https://github.com/djdhairya/electronic-search-engine
bytesio countervector nltk numpy pandas pickel pillow sklearn streamlit
Last synced: 13 Apr 2026
https://github.com/zhanziyuan/pixelpuzzle-web
Encode/decode images using Base64 or shuffle/recover the pixels of images.
anime encryption image image-processing numpy python
Last synced: 11 Apr 2026
https://github.com/yosrak5/predictive_maintenance
End-to-end predictive analytics to predict hardware maintenance before it occurs, using data preprocessing and machine learning classification models (XGBoost, Random Forest)
data-factory data-perspective machine-learning-workbench matplotlib numpy pandas python random-forest seaborn xgboost
Last synced: 25 Feb 2026
https://github.com/shwetapardhi/assignment-1-q11--basic-statistics-level-1
Q11) Suppose we want to estimate the average weight of an adult male in Mexico. We draw a random sample of 2,000 men from a population of 3,000,000 men and weigh them. We find that the average person in our sample weighs 200 pounds, and the standard deviation of the sample is 30 pounds. Calculate the 94%, 98%, and 96% confidence intervals.
confidence-interval normalization numpy pandas python scipy stats
Last synced: 13 Apr 2026
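The question in this entry can be worked through with a short calculation. The sketch below is not the repository's solution; it assumes the normal approximation is acceptable for a sample of 2,000 and ignores the finite-population correction. The function name `mean_confidence_interval` is hypothetical.

```python
import math
from statistics import NormalDist


def mean_confidence_interval(sample_mean, sample_sd, n, level):
    """Two-sided confidence interval for a mean (normal approximation, assumed)."""
    # Critical value z such that the central `level` mass lies within +/- z.
    z = NormalDist().inv_cdf(0.5 + level / 2)
    margin = z * sample_sd / math.sqrt(n)  # z times the standard error
    return sample_mean - margin, sample_mean + margin


# Figures from the question: mean 200 lb, SD 30 lb, n = 2,000.
for level in (0.94, 0.98, 0.96):
    low, high = mean_confidence_interval(200, 30, 2000, level)
    print(f"{level:.0%} CI: ({low:.2f}, {high:.2f})")
```

With a sample this large, a t-based interval would give nearly identical bounds, so the choice of distribution matters little here.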
https://github.com/murugavl/loan_approval_prediction
This project is a Loan Approval Prediction System that uses Machine Learning to determine whether a loan application should be approved or rejected based on various factors. It is deployed using Streamlit for an interactive user experience.
jupyter-notebook machine-learning numpy pandas python random-forest-classifier scikit-learn stremlit
Last synced: 13 Apr 2026
https://github.com/pedasoft-consult/house-price-prediction
This project is a machine learning-powered web application for predicting house prices based on various features such as area, bedrooms, bathrooms, and more. It uses Flask for the backend and HTML/CSS/JavaScript for the frontend.
flask joblib matplotlib numpy pandas r2 random-forest regression seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/aminkhavari78/text-generation-with-lstm-recurrent-neural-networks-in-python-with-keras
Uses an LSTM model for text generation
dropout io lstm-neural-networks numpy pandas sequence-models sys text-processing
Last synced: 20 Apr 2026
https://github.com/svastikkka/data-science-machine-learning
Data Science and Machine Learning
classsification linear-regression machine-learning matplotlib mysql numpy pandas python
Last synced: 26 Jul 2025
https://github.com/akansha2004pandey/chsw_projects
jupyter-notebook nlp numpy pandas python tensorflow
Last synced: 11 Apr 2026
https://github.com/bachboy0/montecarlo
Python-based Monte Carlo simulations for statistical analysis and modeling.
monte-carlo-methods numpy python
Last synced: 21 Apr 2026
https://github.com/kzon94/torn-market-analyzer
Streamlit app that parses Torn Add Listing text, matches items with a custom dictionary, fetches market data via the public API, and generates KPIs and price recommendations using a modular Python analytics pipeline.
data-analysis data-engineering fuzzy-matching market-analytics numpy pandas python streamlit torn-city torn-city-api
Last synced: 11 Apr 2026
https://github.com/dineshdhamodharan24/industrial-copper-modeling
A project that models industrial copper data using Python and libraries such as pandas, numpy, and scikit-learn, which are widely used for data manipulation, analysis, and machine learning tasks.
bussiness-solution numpy pandas pickle pickle-file seaborn sklearn-library streamlit
Last synced: 13 Apr 2026
https://github.com/alphacrypto246/stock-price-movement-prediction
A project leveraging Polynomial Regression to predict stock price movements based on historical data. Includes data preprocessing, feature engineering, visualization, and model evaluation to provide insights for informed trading decisions.
machine-learning numpy pandas polynomial-regression scikit-learn yfinance
Last synced: 13 Apr 2026
https://github.com/jim-by/tweet-sentiment-analysis
Sentiment analysis of tweets using TextBlob for labeling and RandomForest for classification.
nltk nltk-tokenizer numpy pandas python random-forest-classifier sklearn textblob-sentiment-analysis
Last synced: 13 Apr 2026
https://github.com/nabilshadman/python-uk-weather-analytics
An end-to-end data science workflow of UK weather data
data-engineering data-science data-visualization machine-learning matplotlib numpy pandas scikit-learn
Last synced: 13 Apr 2026
https://github.com/ashraf-khabar/bank-marketing-data-analysis
This project is focused on analyzing bank marketing data using PyTorch, pandas, numpy, and scikit-learn. The goal is to build a predictive model that can help identify potential customers who are more likely to subscribe to a bank's term deposit.
data-cleaning data-science data-visualization dataset deep-learning deep-neural-networks feedforward-neural-network learning neural-networks numpy pandas python pytorch sklearn
Last synced: 09 Apr 2026
https://github.com/williamfridh/umu_5tf078_paper_keyboard_tracker
Project exploring how a CNN alongside computer vision can be used to detect drawn keys and track finger input.
cnn-classification computer-vision keras-tensorflow numpy python tensorflow
Last synced: 07 Jan 2026
https://github.com/karanyeole/movie-rating-analysis
Analyzing movie ratings from various sources to gain insights into audience preferences, trends, and patterns in movie ratings. The project utilizes data analysis and visualization techniques to explore and interpret movie rating data, providing valuable information for movie enthusiasts, filmmakers, and industry professionals.
matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/rkarahul/ok.win-big-small-predictor
Predict the next “Big” or “Small” outcome on the OK.Win lottery-style game using OCR + time-series features + ML.
joblib numpy opencv-python paddleocr paddlepaddle paddlepaddle-gpu pandas python sckit-learn
Last synced: 13 Apr 2026
https://github.com/preethiphilip/prosper-loan-data-analysis
This project explores the Prosper loan dataset to uncover trends, borrower behaviors, and factors influencing loan performance. Using Python and data visualization techniques, the analysis examines key variables such as credit scores, loan status, income ranges, and debt-to-income ratios to gain insights into credit risk and lending patterns.
jupyter-notebook matplotlib numpy pandas python3 seaborn
Last synced: 13 Apr 2026
https://github.com/miss-mhv/udacityproject_dataviz
Data analysis of a bicycle rental system. The goal is to understand how trips are made according to age, time of year, day of the week, time of day
jupyter-notebook matplotlib numpy pandas python3 seaborn-plots
Last synced: 13 Apr 2026
https://github.com/prashhhant213/cardioflex-treadmill-analysis-using-descriptive-statistics-probability
Descriptive analysis and visualization of CardioFlex Treadmill data to provide insights and recommendations for growing the user base.
colab-notebook numpy pandas probability python stats
Last synced: 11 Apr 2026
https://github.com/ananyajain2004/fake-news-detection
A Python project for detecting fake news using machine learning algorithms.
decision-tree-classifier gradient-boosting-classifier logistic-regression machine-learning numpy pandas python3 random-forest-classifier testing
Last synced: 13 Apr 2026
https://github.com/yashlad27/machine_learning_data_science
Zero To Hero
anaconda3 deep-learning jupyter-notebook ml neural-network numpy pandas pycharm python sklearn-library
Last synced: 11 Apr 2026
https://github.com/alinenog/desenvolve_gb_2022
Grupo Boticário's Desenvolve 2022 training program in the data field
data-analysis data-science googlesheet machine-learning numpy pandas python
Last synced: 13 Apr 2026
https://github.com/vvipjain/iphone-sales-analysis
Iphone Sales Analysis
jupyter-notebook numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library pandas-python plotly plotly-express plotly-python python python3
Last synced: 13 Apr 2026
https://github.com/akibahmed229/development-of-intelligent
development of intelligent
data-science jupyter-notebook machine-learning metaploit natural-language-processing numpy pandas python skit-learn terraform
Last synced: 11 Apr 2026
https://github.com/imdeepmind/knn_from_scratch
KNN classifier in pure NumPy and Python
knn knn-classification machine-learning numpy python
Last synced: 11 Apr 2026
https://github.com/dulajkavinda/document-similarity-ml
📑 Check similarity between documents using tf-idf and cosine similarity.
machinelearning numpy python scikit
Last synced: 29 Apr 2026
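To make the tf-idf plus cosine similarity approach named in this entry concrete, here is a minimal standard-library sketch. It is not the repository's code: the helper names `tfidf_vectors` and `cosine_similarity`, the whitespace tokenizer, and the smoothed IDF formula are assumptions chosen for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Build sparse tf-idf vectors (dicts of term -> weight) for a list of strings."""
    tokenized = [doc.lower().split() for doc in docs]  # naive whitespace tokenizer
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))

    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))  # term frequency
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)  # smoothed IDF (assumed variant)
            for term, count in counts.items()
        })
    return vectors


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors; 0.0 if either is empty."""
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up documents for illustration.
docs = [
    "the cat sat on the mat",
    "the cat lay on the mat",
    "stock prices fell sharply",
]
vecs = tfidf_vectors(docs)
print(cosine_similarity(vecs[0], vecs[1]))  # shares most terms with docs[0]
print(cosine_similarity(vecs[0], vecs[2]))  # shares no terms with docs[0]
```

Sparse dictionaries keep the sketch short; a library implementation would typically use a shared vocabulary and matrix operations instead.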
https://github.com/costopoulos/ntua-telecommunications
NTUA ECE Telecommunications Course Code & Report
numpy nyquist python sampling scipy telecommunications
Last synced: 20 Apr 2026
https://github.com/kucera-lukas/imagesecrets
RESTful API for encoding/decoding messages into/from images.
api asyncio docker fastapi heroku mypy numpy pillow poetry postgresql pydantic pytest pytest-cov pytest-mock python python-typing restful-api steganography swagger tortoise-orm
Last synced: 28 Feb 2025
https://github.com/trainingbypackt/hands-on-machine-learning-with-python
Structure, build, and deploy machine learning projects
machine-learning numpy pandas python scikit-learn scikitlearn-machine-learning
Last synced: 10 Apr 2026
https://github.com/vyjayanthipolapragada/fraud_detection_creditcard
Detecting the fraudulent credit card transactions by training Decision Tree model using Scikit-learn and SnapML
classification-model data-preprocessing decision-tree-classifier kaggle-dataset machine-learning numpy pandas python scikit-learn snapml time tree-model
Last synced: 11 Apr 2026
https://github.com/ebadshabbir/logistic_regression-binomial-
Logistic Regression on the Social Network Ads dataset. This project applies Logistic Regression to predict whether a user will purchase a product based on their age and estimated salary, using the Social Network Ads dataset. The data is split into training and test sets, with feature scaling applied for normalization.
classification jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python sklearn
Last synced: 11 Apr 2026
https://github.com/vasu7052/recognizing-handwritten-digits
This is a machine learning project created in Python using Neural Networks and Supervised Learning Algorithms.
machine-learning machine-learning-algorithms numpy python scikit-learn
Last synced: 13 Apr 2026
https://github.com/guptamudit/whatsapp-chat-analyser
An online WhatsApp chat analysis tool designed to extract valuable insights from group or one-on-one conversations. By analyzing your chat data, WhatsChat provides detailed statistics and visualizations.
matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/codewithmayank-py/covid19-data-analysis-using-python
COVID-19 and Happiness Analysis
data-analysis data-analysis-python data-visualization dataset jupyter-notebooks numpy pandas python3 seaborn
Last synced: 11 Apr 2026
https://github.com/mahdibehoftadeh/polynomial-regression-co2-emissions
A simple polynomial regression model trained on a large dataset to predict a car's CO2 emissions from its build features, such as engine size and cylinders
machine-learning matplotlib numpy nural-network pandas polynomial-regression python scikit-learn
Last synced: 22 Feb 2026
https://github.com/samuele-lolli/data-analytics-techniques
A practical approach to data analytics pipeline.
numpy pandas pytorch scikit-learn
Last synced: 11 Apr 2026
https://github.com/amiriiw/text_classification
Welcome to the Text Classification Project! This project is designed to train a model for classifying texts based on their emotional content and then using it to categorize new texts into corresponding emotional categories.
keras numpy pandas pickle scikit-learn tensorflow text-classification
Last synced: 20 Jan 2026
https://github.com/debjyotisaha/web-application-projects
Web applications developed with the help of Python libraries and ML algorithms
algorithms cufflinks machine-learning matplotlib numpy pandas prediction python scikit-learn seaborn sklearn streamlit
Last synced: 13 Apr 2026
https://github.com/athul64/tmdb-dataset-analysis
This dataset contains information about 10,000 movies extracted from TMDB, released from 1960 to 2015, including user ratings and revenue. Original data from Kaggle.
data-visualization dataframe eda numpy pandas python
Last synced: 14 Apr 2026
https://github.com/djdhairya/car-parking-detection
cnn cv2 deep deep-learning epoch flask keras numpy numpy-arrays opencv pickle tensorflow
Last synced: 13 Apr 2026
https://github.com/pranavsuriya-sr/e-hub_energysystems
Python based Web Application for analysis and visualization of concepts in Energy Systems.
css energy-systems-analysis html javascript matplotlib numpy python schemdraw streamlit
Last synced: 11 Apr 2026
https://github.com/romerorodriguezd/homography-with-opencv
Homography operations applied to insert images inside others regardless of the perspective.
computer-vision homography matrices-transformation numpy opencv opencv-python
Last synced: 29 Apr 2026
https://github.com/sayansomya/eda_on_covid-19
Exploratory Data Analysis on COVID-19.
analytics eda matplotlib-pyplot numpy pandas-dataframe python3 seaborn
Last synced: 29 Apr 2026
https://github.com/ljadhav25/data-engineering-poc
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
data-analysis etl matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/shubhamahobia/lstm-hamlet
The LSTM Next Word Prediction project trains a model on Hamlet to predict the next word in a sequence. After preprocessing the text, an LSTM network learns patterns in the language. The trained model can then generate text by predicting the next word in a given sequence, with its accuracy evaluated to ensure it effectively models the language.
deep-learning github hamlet lstm lstm-neural-network lstm-neural-networks machine-learning neural-network nltk numpy pandas pickle python
Last synced: 11 Apr 2026
https://github.com/amnydv17/landmark-detection
This project aims to leverage the power of deep learning models to automatically detect and pinpoint landmarks such as famous monuments, buildings, natural landmarks, and other recognizable structures within images.
machine-learning matplotlib numpy pandas python3 scikit-learn seaborn tensorflow
Last synced: 11 Apr 2026
https://github.com/nordic96/lanedetection
Lane detection module using OpenCV and Python, for educational purposes.
computer-vision lane-detection matplotlib-pyplot numpy opencv python3
Last synced: 04 May 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web scraping and Python data visualization on the Top 500 movies on IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/anshpg/vectorwave
The Jupyter Notebook (ipynb file) contained within this repository serves as the primary codebase for this project. It demonstrates how to preprocess the input data, train the logistic regression classifier, evaluate its performance, and save the trained model for future use.
collab jupyter-notebook matplotlib-pyplot numpy pandas python sklearn
Last synced: 11 Apr 2026
https://github.com/georgehanymilad/diabetes-detection
Machine Learning Project
anaconda classification classification-algorithm data-science dataanalysis kaggle machine-learning machine-learning-algorithms matplotlib numpy pandas python python3 seaborn
Last synced: 13 Apr 2026
https://github.com/adamouization/python-machine-learning-data-science-notes
Jupyter notebooks containing useful Python code and notes for general Machine Learning and Data Science projects.
data data-science data-visualization guide jupyter jupyter-notebook machine-learning matplotlib notes numpy pandas pandas-dataframe python seaborn
Last synced: 11 Apr 2026
https://github.com/eigenein/coursera
My Coursera assignments
course course-work coursera coursera-assignment coursera-course coursera-data-science coursera-machine-learning coursera-specialization courses jupyter-notebook kotlin machine-learning numpy pandas python scipy sklearn
Last synced: 13 Apr 2026
https://github.com/jminjares4/selected-areas-in-networks
Assignments for Selected Areas in Networks (EE 5390)
matplotlib network-analysis numpy pandas python
Last synced: 08 May 2026
https://github.com/jbalooshie/movies-etl
Exercise working with movie datasets from Kaggle and Wikipedia. Python is used to extract, clean, and combine the data, and then it is loaded into a postgreSQL database.
data-analysis data-science jupyter-notebook numpy pandas postgresql postgresql-database python sqlalchemy
Last synced: 11 Apr 2026
https://github.com/danhenriquex/data_science_and_machine_learning
AI, data science, machine learning, and deep learning with TensorFlow
deep-learning deep-neural-networks machine-learning matplotlib numpy pandas tensorflow
Last synced: 13 Apr 2026
https://github.com/giordanodaloisio/demv
Debiaser for Multiple Variables, a model- and data- agnostic method to improve fairness in binary and multi-class classification tasks
bias-mitigation numpy pandas python
Last synced: 29 Apr 2026
https://github.com/unknownman/101-numpy
Learning NumPy through 101 short videos and Jupyter notebooks
Last synced: 29 Apr 2026
https://github.com/ayushtiwari134/machine_learning_models
A repo where I upload all the models I train during my journey of learning machine learning from scratch
linear-regression logistic-regression machinelearning matplotlib numpy pandas python random-forest scikit-learn
Last synced: 11 Apr 2026
https://github.com/chandkund/predicting-diabetes-onset
The "Predicting Diabetes Onset" project aims to build a machine learning model that predicts whether an individual has diabetes based on various health-related features. The dataset used for this project includes attributes related to medical history and physical measurements.
deep-learning numpy pandas python seaborn sklearn-library sklearn-metrics visualization
Last synced: 13 Apr 2026
https://github.com/armahdavi/data_analytics_statistics_plotting_pm_airborne_sampling
All code for the data processing pipelines, statistical modelling, descriptive statistics, and plot visualizations from the airborne phase of Mahdavi et al. (2021) (Environmental Pollution). Project milestone: 2018 - 2021
data-science data-visualization machine-learning matplotlib-pyplot numpy pandas python scikit-learn scipy-stats statistics
Last synced: 13 Apr 2026
https://github.com/yanglr/mlnotes-python
Machine learning Notes for Python 3 (NumPy, TensorFlow etc.), with interactive Jupyter notebook.
jupyter jupyter-notebook numpy numpy-notes
Last synced: 21 May 2026
https://github.com/hilarionengarejr/movie-recommender-app
Sentiment analysis on user reviews for movie recommendations using Content Based Filtering.
docker flask nltk numpy pandas python3 scikit-learn selenium
Last synced: 10 Apr 2026
https://github.com/sohan181204/airline-ticket-price-analysis
The analysis of airline ticket prices explores the dynamic and competitive nature of the aviation industry in India. With rapid economic growth and increasing air travel demand, understanding ticket price trends is crucial for airlines, passengers, and policymakers.
Last synced: 11 Apr 2026
https://github.com/pratyush905/farecast-nyc-taxifare-predictor
Machine learning models to predict NYC taxi fares based on a given dataset
jupiter-notebook kaggle machine-learning matplotlib numpy python regression-models scikit-learn
Last synced: 13 Apr 2026
https://github.com/kmkurn/text2array
Python library to convert text datasets into arrays.
deep-learning natural-language-processing neural-networks numpy
Last synced: 14 May 2026
https://github.com/sanjaiveeran44/shopping-intent-analasys
This project predicts whether a visitor to an e-commerce website will make a purchase based on their browsing behavior. It includes data analysis, preprocessing, and machine learning modeling to understand online shopping patterns.
matplotlib matplotlib-pyplot numpy pandas python3 sklearn tensorflow
Last synced: 13 Apr 2026
https://github.com/eesunmoon/aml_project
[Project] Analyzing Fitness Patterns & Building Predictive Models
matplotlib numpy pandas python3 recommender-system scikit-learn seaborn tensorflow
Last synced: 13 Apr 2026
https://github.com/aciluca/credit-approval-system
A machine learning application to predict credit card approval using Python, TensorFlow, and Docker.
data-science keras machine-learning numpy pandas python sklearn tensorflow
Last synced: 11 Apr 2026
https://github.com/mosalem149/data-mining-assignments
A repo for data mining quizzes and tasks.
anaconda data-mining dataframe jupyter-notebook matplotlib mlxtend numpy pandas python quizzes scikit-learn seaborn tasks
Last synced: 13 Apr 2026
https://github.com/antonio-f/naive_convnet
Naive Convolutional Neural Network with TensorFlow 2
101 convolutional-neural-networks deep-learning easy matplotlib numpy simple tensorflow2 tutorial
Last synced: 29 Apr 2026
https://github.com/hamishivi/denserflow
A basic numpy neural net library. This achieved high marks in a uni assignment!
deep-learning neural-network numpy
Last synced: 18 May 2026
https://github.com/johanntf/direccion_maximo_crecimiento
Direction of maximum increase of a multivariable function given a starting point
customtkinter matplotlib numpy python
Last synced: 18 May 2026
https://github.com/sanjanahombal/study-on-sentiment-analysis
This project explores the optimal combination of Bag-of-Words and TF-IDF vectorization with Naive Bayes and SVM for sentiment analysis. It evaluates performance using accuracy, precision, recall, and F1-score, addressing ethical concerns like data privacy and bias to improve sentiment classification in real-world applications.
bag-of-words confusionmatrix googlecollab gridsearch-crossvalidation matplotlib-pyplot naive-bayes-classifier numpy pandas seaborn sklearn svm-classifier tfidf-vectorizer
Last synced: 07 Jan 2026