Projects in Awesome Lists tagged with data-preprocessing-and-cleaning
A curated list of projects in awesome lists tagged with data-preprocessing-and-cleaning .
https://github.com/e-panourgia/data-science-projects
Data Science Projects
annotations augmentation data data-preprocessing-and-cleaning hyperparameter-tuning llm logistic-regression nlp random-forest-classifier xboost-classifier
Last synced: 09 Apr 2025
https://github.com/usk2003/weather-data-analysis-in-szeged
This repository contains an in-depth analysis of historical weather data from Szeged, Hungary. The project uses Python to clean and process data, generate insightful visualizations, and identify patterns and correlations in weather parameters such as temperature, humidity, and precipitation.
analysis data-preprocessing-and-cleaning jupyter-notebook prediction python szeged weather-analysis
Last synced: 25 Feb 2025
https://github.com/asifdotexe/natural-langwiz
Natural LangWiz is a repository for exploring Natural Language Processing (NLP) techniques through Jupyter notebooks. It covers everything from text preprocessing and sentiment analysis to advanced transformer models. Dive in to see how we turn raw text into actionable insights with a touch of NLP wizardry!
api-integration data-preprocessing-and-cleaning data-visualization emojification grammar-checker machine-learning named-entity-recognition natural-language-preprocessing python sentiment-analysis spam-detection text-analysis text-generation text-summarization topic-modeling transformer-models translation vectorization web-scraping wordcloud
Last synced: 05 Mar 2025
https://github.com/imnotamr/ai
A collection of machine learning and AI projects implemented in Jupyter notebooks, covering regression, classification, and neural networks
ai classification colab-notebook data-analysis data-preprocessing data-preprocessing-and-cleaning data-visualization deep-learning deep-neural-networks jupyter-notebook machine-learning model-evaluation predictive-modeling project-based-learning python supervised-learning supervised-learning-algorithms supervised-learning-classifiers unsupervised-learning unsupervised-learning-algorithms
Last synced: 22 Feb 2025
https://github.com/jdonepud/nlp-sentimentclassification
data-preprocessing-and-augmentation data-preprocessing-and-cleaning deep-learning gpt2 natural-language-processing-nlp pytorch-implementation scikit-learn-python sentiment-analysis text-classification-python transformers-models wordcloud-visualization
Last synced: 03 Mar 2025
https://github.com/darshhv/fraud-detection-system
A machine learning project for detecting fraudulent transactions using Random Forest and XGBoost models, with data preprocessing and model evaluation.
data-preprocessing-and-cleaning fraud-detection-using-machine-learning model-evaluation pandas random-forest scikit-learn xgboost
Last synced: 20 Mar 2025
https://github.com/madhurimarawat/big-data-analytics
This repository demonstrates big data processing, visualization, and machine learning using tools such as Hadoop, Spark, Kafka, and Python.
apache-kafka apache-spark big-data big-data-analytics big-data-analytics-techniques data-preprocessing-and-cleaning data-stratification data-visualization hadoop-hdfs hadoop-hive hadoop-installation hadoop-mapreduce hiveql python spark-graphx spark-mllib spark-mllib-library spark-rdd spark-streaming
Last synced: 04 Mar 2025
https://github.com/amanovishnu/anamoly-detection-using-decision-classifier
the kdd 99 anomaly detection application is a flask web app that predicts anomalies in the kdd 99 dataset using a decision tree classifier. it allows users to input features for prediction and offers a user-friendly interface with real-time predictions and low latency.
anamoly-detection data-preprocessing-and-cleaning decision-tree-classifier flask-application kdd-99-dataset machine-learning machine-learning-algorithms
Last synced: 03 Jan 2025