Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists by Gandhasiri-Rahul-Mohan

A curated list of projects in awesome lists by Gandhasiri-Rahul-Mohan.

https://github.com/gandhasiri-rahul-mohan/random-forests-q2-company_data

About the data: Let’s consider a Company dataset with around 10 variables and 400 records.

data-science machine-learning numpy pandas python random-forest seaborn-plots

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/random-forests-q1-fraud_check

Use Random Forest to prepare a model on fraud data, treating those who have taxable_income <= 30000 as "Risky" and all others as "Good".

data-science machine-learning numpy pandas python random-forest

Last synced: 14 Nov 2024
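
The labeling rule in the fraud-check description above can be sketched in a few lines of Python. This is a minimal illustration, not the repository's code: the function name `label_taxable_income` and the sample incomes are hypothetical, and only the 30000 threshold and the two class names come from the description.

```python
# Threshold and class names come from the project description;
# the function name and sample values are hypothetical.
RISKY_THRESHOLD = 30000


def label_taxable_income(taxable_income: float) -> str:
    """Return "Risky" at or below the threshold, otherwise "Good"."""
    return "Risky" if taxable_income <= RISKY_THRESHOLD else "Good"


incomes = [25000, 30000, 30001, 98000]  # made-up values for illustration
labels = [label_taxable_income(x) for x in incomes]
print(labels)
```

The resulting labels would then serve as the target column for a classifier such as a random forest or a decision tree.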

https://github.com/gandhasiri-rahul-mohan/multi-linear-regression-q2-50_startups

Prepare a prediction model for the profit of the 50_startups data. Apply transformations to improve the profit predictions, and make a table containing the R^2 value for each prepared model.

data-science machine-learning multilinear-regression python

Last synced: 14 Nov 2024
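
The R^2 table mentioned in the 50_startups description could be built along these lines. This is a sketch under stated assumptions: the profit values, the model names, and the predictions are all made up, and `r_squared` is a hypothetical helper rather than anything taken from the repository.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Made-up profits and hypothetical predictions from two candidate models.
actual = [10.0, 20.0, 30.0, 40.0]
predictions = {
    "raw features": [12.0, 18.0, 33.0, 37.0],
    "log-transformed": [10.0, 21.0, 29.0, 40.0],
}

# One row per model, as the description asks for.
r2_table = {name: r_squared(actual, pred) for name, pred in predictions.items()}
for name, score in r2_table.items():
    print(f"{name:<16} R^2 = {score:.3f}")
```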

https://github.com/gandhasiri-rahul-mohan/clustering-q2-eastwestairline

Perform clustering (hierarchical, K-means, and DBSCAN) on the airlines data to obtain the optimum number of clusters. Draw inferences from the clusters obtained.

clustering data-science dbscan-clustering hierarchical-clustering kmeans-clustering machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/text-mining-q2-text-data

1) Extract reviews of any product from an e-commerce website such as Amazon 2) Perform emotion mining

data-science machine-learning python spacy-models text-mining

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/linear-regression-1-delivery_time

Delivery_time -> Predict delivery time using sorting time. Perform EDA, apply the necessary transformations, build simple linear regression models, and select the best model using R or Python.

data-science machine-learning python simplelinearregression

Last synced: 14 Nov 2024
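
A simple linear regression of delivery time on sorting time, as described above, reduces to an ordinary least-squares fit with one predictor. The sketch below uses only the standard library; the data points are made up (they follow y = 2x + 1 exactly) and the function name is hypothetical.

```python
def fit_simple_linear_regression(x, y):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up data for illustration only.
sorting_time = [2.0, 4.0, 6.0, 8.0]
delivery_time = [5.0, 9.0, 13.0, 17.0]

slope, intercept = fit_simple_linear_regression(sorting_time, delivery_time)
print(f"delivery_time ~ {slope:.2f} * sorting_time + {intercept:.2f}")
```

Transformations such as a log or square root of either variable can be tried by transforming the lists before calling the same function.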

https://github.com/gandhasiri-rahul-mohan/recommendation-system

Problem statement: Build a recommender system using cosine similarity scores.

Last synced: 14 Nov 2024
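
The cosine similarity score named in the recommender description can be sketched as follows. The rating vectors, item names, and the `most_similar` helper are hypothetical and only illustrate the idea of recommending the item whose rating vector points in the most similar direction.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical item-by-user rating vectors (one entry per user).
ratings = {
    "item_a": [5, 0, 3],
    "item_b": [10, 0, 6],
    "item_c": [0, 4, 0],
}


def most_similar(target, ratings):
    """Return the name of the item most similar to `target`, excluding itself."""
    scores = {
        name: cosine_similarity(ratings[target], vec)
        for name, vec in ratings.items()
        if name != target
    }
    return max(scores, key=scores.get)


best_match = most_similar("item_a", ratings)
print(best_match)
```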

https://github.com/gandhasiri-rahul-mohan/text-mining-q1-elan_mosk

ONE: 1) Perform sentiment analysis on the Elon Musk tweets (Exlon-musk.csv)

data-science machine-learning python spacy-models text-analysis text-mining

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/hypothesis-testing-3-labtat

Analyze the data and determine whether there is any difference in average TAT among the different laboratories at 5% significance level.

data-science hypothesis-testing machine-learning python

Last synced: 14 Nov 2024
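
The description above does not name a test. One common choice for comparing means across several groups is one-way ANOVA, and the sketch below computes its F statistic with the standard library only. The three groups are made-up numbers, not the laboratory data, and the function name is hypothetical.

```python
def one_way_anova_f(groups):
    """Return (F statistic, df_between, df_within) for a one-way ANOVA."""
    all_values = [v for g in groups for v in g]
    grand_mean = sum(all_values) / len(all_values)
    group_means = [sum(g) / len(g) for g in groups]

    # Variation of group means around the grand mean, weighted by group size.
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, group_means))
    # Variation of observations around their own group mean.
    ss_within = sum((v - m) ** 2 for g, m in zip(groups, group_means) for v in g)

    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Made-up turnaround times for three hypothetical laboratories.
groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [3.0, 4.0, 5.0]]
f_stat, df_between, df_within = one_way_anova_f(groups)
print(f_stat, df_between, df_within)
```

To decide at the 5% level, the F statistic would be compared with the critical value of the F distribution for these degrees of freedom, or converted to a p-value with a statistics library.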

https://github.com/gandhasiri-rahul-mohan/linear-regression-2-salary_data

Salary_hike -> Build a prediction model for Salary_hike. Perform EDA, apply the necessary transformations, build simple linear regression models, and select the best model using R or Python.

data-science linear-regression machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/association-rules-q1-books

Prepare rules for all the data sets 1) Try different values of support and confidence. Observe the change in the number of rules for different support and confidence values 2) Change the minimum length in the apriori algorithm 3) Visualize the obtained rules using different plots

apriori-algorithm association-rules data-science machine-learning numpy pandas python unsupervised-machine-learning

Last synced: 14 Nov 2024
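
Support and confidence, the two thresholds varied in the association-rules descriptions, can be computed directly from a list of transactions. This is a minimal sketch with made-up transactions and hypothetical helper names; it is not the apriori algorithm itself, only the two measures that apriori filters on.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    hits = sum(1 for t in transactions if itemset <= set(t))
    return hits / len(transactions)


def confidence(transactions, antecedent, consequent):
    """support(antecedent and consequent) / support(antecedent).

    Assumes the antecedent occurs in at least one transaction.
    """
    both = set(antecedent) | set(consequent)
    return support(transactions, both) / support(transactions, antecedent)


# Made-up transactions for illustration only.
transactions = [{"A", "B"}, {"A", "C"}, {"A", "B", "C"}, {"B"}]

support_ab = support(transactions, {"A", "B"})
confidence_a_to_b = confidence(transactions, {"A"}, {"B"})
print(support_ab, confidence_a_to_b)
```

Raising the minimum support or confidence discards rules whose values fall below the new threshold, which is why the number of rules shrinks as the thresholds increase.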

https://github.com/gandhasiri-rahul-mohan/association-rules-2-my-movies

Prepare rules for all the data sets 1) Try different values of support and confidence. Observe the change in the number of rules for different support and confidence values 2) Change the minimum length in the apriori algorithm 3) Visualize the obtained rules using different plots

apriori-algorithm association-rules data-science machine-learning numpy pandas python unsupervised-learning

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/decision-tree-q2-fraud_data

Use decision trees to prepare a model on fraud data, treating those who have taxable_income <= 30000 as "Risky" and all others as "Good".

data-science decision-trees machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/hypothesis-testing-4-cutlets

A randomly selected sample of cutlets was collected from both units and measured. Analyze the data and draw inferences at the 5% significance level. Please state the assumptions and the tests that you carried out to check the validity of the assumptions.

data-science hypothesis-testing machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/forecasting-q2-airlines

Forecast the Airlines Passengers data set. Prepare a document for each model explaining.

data-science forecasting machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/hypothesis-testing-2-customerorderform

They audit a certain % of the customer order forms. Any error in an order form renders it defective, and it has to be reworked before processing. The manager wants to check whether the defective % varies by centre. Please analyze the data at the 5% significance level and help the manager draw appropriate inferences.

data-science hypothesis-testing machine-learning python

Last synced: 14 Nov 2024
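
The description above does not say which test is used. Whether a defective rate varies by centre is often checked with a chi-square test of independence on a contingency table, and the sketch below computes that statistic with the standard library. The counts are made up and the function name is hypothetical.

```python
def chi_square_statistic(table):
    """Return (chi-square statistic, degrees of freedom) for a contingency table."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if rows and columns were independent.
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return stat, dof


# Made-up counts: rows are outcomes, columns are two hypothetical centres.
table = [
    [10, 20],  # defective forms
    [20, 10],  # error-free forms
]
chi2, dof = chi_square_statistic(table)
print(chi2, dof)
```

The statistic would then be compared with the chi-square critical value at the 5% level for the computed degrees of freedom.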

https://github.com/gandhasiri-rahul-mohan/neural-networks-q1-forestfires

Predict the burned area of forest fires with neural networks.

data-science machine-learning neural-networks nlp pandas python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/multi-linear-regression-q1-toyotacorolla

Consider only the below columns and prepare a prediction model for predicting Price.

data-science machine-learning multilinear-regression python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/decesion-tree-q1-company_data

Problem Statement: A cloth manufacturing company wants to know which segments or attributes cause high sales.

data-science decision-trees machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/clustering-q1-crime_data

Perform clustering (hierarchical, K-means, and DBSCAN) on the crime data, identify the number of clusters formed, and draw inferences.

clustering data-science dbscan-clustering hierarchical-clustering kmeans-clustering machine-learning python

Last synced: 14 Nov 2024

https://github.com/gandhasiri-rahul-mohan/neural-networks-q2-gas_turbines

The dataset contains 36733 instances of 11 sensor measures aggregated over one hour (by means of average or sum) from a gas turbine.

data-science machine-learning neural-network nlp python-3-6

Last synced: 12 Oct 2024