{"id":24838807,"url":"https://github.com/abdullahkhurshid/ecommerce-marketing-analytics","last_synced_at":"2026-04-12T23:53:38.124Z","repository":{"id":274892755,"uuid":"924131281","full_name":"AbdullahKhurshid/ecommerce-marketing-analytics","owner":"AbdullahKhurshid","description":"Using Apache Spark for marketing analytics","archived":false,"fork":false,"pushed_at":"2025-01-30T00:53:56.000Z","size":18,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-30T01:19:52.483Z","etag":null,"topics":["apache-spark","big-data-analytics","cloud-computing","marketing-analytics","r","supervised-learning","unsupervised-learning"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/AbdullahKhurshid.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-01-29T13:25:52.000Z","updated_at":"2025-01-30T01:08:19.000Z","dependencies_parsed_at":"2025-01-30T01:19:54.268Z","dependency_job_id":"2eefc1cc-87cf-4606-8150-aa5c3a0e38e8","html_url":"https://github.com/AbdullahKhurshid/ecommerce-marketing-analytics","commit_stats":null,"previous_names":["abdullahkhurshid/ecommerce-marketing-analytics"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AbdullahKhurshid%2Fecommerce-marketing-analytics","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AbdullahKhurshid%2Fecommerce-marketing-analytics/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AbdullahKhurshid%2Fecommerce-marketing-analytics/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/AbdullahKhurshid%2Fecommerce-marketing-analytics/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/AbdullahKhurshid","download_url":"https://codeload.github.com/AbdullahKhurshid/ecommerce-marketing-analytics/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245589067,"owners_count":20640218,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache-spark","big-data-analytics","cloud-computing","marketing-analytics","r","supervised-learning","unsupervised-learning"],"created_at":"2025-01-31T06:35:39.183Z","updated_at":"2026-04-12T23:53:38.018Z","avatar_url":"https://github.com/AbdullahKhurshid.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# E-Commerce Marketing Analytics\n\n## Overview\nThis project explores the use of machine learning in the Apache Spark environment. Though the dataset used in this project is small, it is used to simulate cloud computing for big data analytics. As such, we would conduct this project in a manner that is most appropriate for big data analytics.\n\n## Problem Statement\nIn the B2C e-commerce sector, businesses face the challenge of optimizing their operations and enhancing customer understanding to drive revenue growth.\n\nThe dataset is from Kaggle, [transactions.csv](https://www.kaggle.com/datasets/gabrielramos87/an-online-shop-business), contains a one-year record of e-commerce sales transactions comprising 500,000 rows and 8 columns.\n\n| Column Name | Description |\n| --- | --- |\n| CustomerNo | An identification number for each unique customer |\n| TransactionNo | An identification number for each unique transaction |\n| Date | The date on which the transaction was made |\n| ProductNo | An (alpha)numeric code for each unique product |\n| ProductName | Name of Product |\n| Price | Unit Price of the specific product |\n| Quantity | Quantity purchased for a single product within the transaction\n| Country | Country where the customer is based in\n\n## Objectives\nThere are two objectives to this project:\n\n1) Understand the contributing factors to customer loyalty to gain actionable insights for nurturing loyal customer relationships for sustained revenue growth.\n   \n2) Understand customer behaviour through effective\nsegmentation to recommend tailored customer targeting strategies.\n\n## Methodology\n\nFor this project, we adopted the following process:\n\n1) Data Cleaning \u0026 Preparation\n2) Exploratory Data Analysis\n3) Feature Engineering\n4) Machine Learning Modelling\n5) Model Evaluation\n6) Model Deployment\n\nThe steps taken in the above steps can be found in the code documentation file and supporting it would be a codebook for all the variables in the code.\n\n\n\n\n\n\n\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fabdullahkhurshid%2Fecommerce-marketing-analytics","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fabdullahkhurshid%2Fecommerce-marketing-analytics","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fabdullahkhurshid%2Fecommerce-marketing-analytics/lists"}