Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/winniecirino/customers-data-preparation-and-enrichment
This repository contains SQL scripts designed to prepare and enrich a customer dataset. The goal is to generate additional insights from the raw data by creating new columns that provide a more detailed understanding of customer demographics, behaviors, and segments.
sql sql-server
Last synced: 24 days ago
- Host: GitHub
- URL: https://github.com/winniecirino/customers-data-preparation-and-enrichment
- Owner: WinnieCIRINO
- Created: 2024-08-20T05:32:00.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2024-08-20T05:57:40.000Z (6 months ago)
- Last Synced: 2025-01-18T12:15:58.292Z (24 days ago)
- Topics: sql, sql-server
- Language: TSQL
- Homepage:
- Size: 2.93 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# Customers Data Preparation and Enrichment
## Overview
This repository contains SQL scripts designed to prepare and enrich a customer dataset. The goal is to generate additional insights from the raw data by creating new columns that provide a more detailed understanding of customer demographics, behaviors, and segments.
## Data Source
The dataset used in these scripts is stored in a SQL Server database, within the customerss table of the Solas Summer Learncsmp database.
## Scripts Overview
### Basic Data Selection
-- SELECT TOP (1000) [...]
Selects the first 1000 rows from the customerss table, extracting key demographic information including CustomerKey, Prefix, FirstName, LastName, BirthDate, MaritalStatus, Gender, EmailAddress, AnnualIncome, TotalChildren, EducationLevel, Occupation, and HomeOwner.
### Data Enrichment
A series of SQL scripts are used to create new derived columns:
-- FullNames: Combines Prefix, FirstName, and LastName to form a full name.
-- Year: Extracts the year of birth from the BirthDate.
-- Age: Calculates the current age of the customer based on the year of birth.
-- MaritalStatusFull: Converts marital status codes (S, M) into full descriptions (Single, Married).
-- GenderFull: Converts gender codes (F, M) into full descriptions (Female, Male).
-- Parent Status: Categorizes customers based on the number of children into Parent or Non-Parent.
-- Wealth Category: Segments customers into financial categories (Wealthy, Rich, Average, Poor) based on AnnualIncome.
-- HomeOwnerFull: Converts the homeowner status into a full description (Yes, No).
-- UserName: Extracts the username portion of the EmailAddress.
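The derived columns listed above could be produced in a single SELECT along the following lines. This is a minimal sketch, not the repository's actual script: the temp table #customerss_demo, its data types, and its two sample rows are hypothetical stand-ins for customerss; the income thresholds for the wealth categories and the Y/N codes for HomeOwner are assumptions, since the README does not state them.

```sql
-- Hypothetical stand-in for the customerss table. Column names follow the
-- README; data types and sample rows are assumptions made for this sketch.
CREATE TABLE #customerss_demo (
    CustomerKey   INT,
    Prefix        VARCHAR(10),
    FirstName     VARCHAR(50),
    LastName      VARCHAR(50),
    BirthDate     DATE,
    MaritalStatus CHAR(1),
    Gender        CHAR(1),
    EmailAddress  VARCHAR(100),
    AnnualIncome  INT,
    TotalChildren INT,
    HomeOwner     CHAR(1)
);

INSERT INTO #customerss_demo VALUES
    (1, 'MR.', 'Jon', 'Doe', '1980-04-08', 'M', 'M', 'jon.doe@example.com', 90000, 2, 'Y'),
    (2, 'MS.', 'Ann', 'Roe', '1975-11-02', 'S', 'F', 'ann.roe@example.com', 30000, 0, 'N');

SELECT
    CustomerKey,
    -- CONCAT treats NULL as an empty string, so a missing Prefix does not null the name
    CONCAT(Prefix, ' ', FirstName, ' ', LastName)      AS FullNames,
    YEAR(BirthDate)                                    AS [Year],
    -- Year-based age as described in the README; ignores month and day of birth
    YEAR(GETDATE()) - YEAR(BirthDate)                  AS Age,
    CASE MaritalStatus WHEN 'S' THEN 'Single'
                       WHEN 'M' THEN 'Married' END     AS MaritalStatusFull,
    CASE Gender WHEN 'F' THEN 'Female'
                WHEN 'M' THEN 'Male' END               AS GenderFull,
    -- A NULL TotalChildren falls through to 'Non-Parent' here
    CASE WHEN TotalChildren > 0 THEN 'Parent'
         ELSE 'Non-Parent' END                         AS [Parent Status],
    -- Thresholds are invented for illustration; the README does not state them
    CASE WHEN AnnualIncome >= 150000 THEN 'Wealthy'
         WHEN AnnualIncome >= 100000 THEN 'Rich'
         WHEN AnnualIncome >= 50000  THEN 'Average'
         ELSE 'Poor' END                               AS [Wealth Category],
    -- Assumes HomeOwner is stored as 'Y' / 'N'
    CASE HomeOwner WHEN 'Y' THEN 'Yes'
                   WHEN 'N' THEN 'No' END              AS HomeOwnerFull,
    -- Appending '@' guarantees CHARINDEX finds a match, so LEFT never gets a negative length
    LEFT(EmailAddress, CHARINDEX('@', EmailAddress + '@') - 1) AS UserName
FROM #customerss_demo;

DROP TABLE #customerss_demo;
```

Unmapped codes in the simple CASE expressions return NULL rather than raising an error, which makes unexpected values easy to spot with a WHERE ... IS NULL check.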
## Final Data Table Creation
Script: CREATE TABLE Customerss_Prep [...]
A comprehensive SQL script combines all of the above transformations and stores the enriched data in a new table, Customerss_Prep.
## To use these scripts:
Data Selection: Use the initial selection script to understand the structure of the raw data.
Run Enrichment Scripts: Apply each script in sequence to enrich the data with new insights.
Final Table Creation: Use the final script to generate a fully-prepared dataset stored in the Customerss_Prep table.
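The final step above, materializing the enriched rows into a new table, might look like the sketch below. The README elides the body of the real script, so this is only one plausible shape (an explicit CREATE TABLE followed by INSERT ... SELECT). The temp tables #customerss_demo and #Customerss_Prep_demo, their data types, and the sample row are hypothetical, and only two of the derived columns are shown to keep it short.

```sql
-- Hypothetical raw rows standing in for customerss (types and values are assumptions).
CREATE TABLE #customerss_demo (
    CustomerKey  INT,
    Prefix       VARCHAR(10),
    FirstName    VARCHAR(50),
    LastName     VARCHAR(50),
    EmailAddress VARCHAR(100)
);

INSERT INTO #customerss_demo VALUES
    (1, 'MR.', 'Jon', 'Doe', 'jon.doe@example.com');

-- Temp-table stand-in for Customerss_Prep, reduced to two derived columns.
CREATE TABLE #Customerss_Prep_demo (
    CustomerKey INT,
    FullNames   VARCHAR(120),
    UserName    VARCHAR(100)
);

-- Apply the transformations while copying rows into the prepared table.
INSERT INTO #Customerss_Prep_demo (CustomerKey, FullNames, UserName)
SELECT
    CustomerKey,
    CONCAT(Prefix, ' ', FirstName, ' ', LastName),
    LEFT(EmailAddress, CHARINDEX('@', EmailAddress + '@') - 1)
FROM #customerss_demo;

-- Inspect the result, then clean up the demo objects.
SELECT * FROM #Customerss_Prep_demo;

DROP TABLE #Customerss_Prep_demo;
DROP TABLE #customerss_demo;
```

Declaring the target table explicitly keeps control over column types; SELECT ... INTO would be a shorter alternative that infers the types from the query.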
## Prerequisites
SQL Server or a compatible database system.
Access to the Solas Summer Learncsmp database.
## How to Run
Open SQL Server Management Studio (SSMS) or your preferred SQL client.
Connect to your database.
Execute the scripts in the order provided.
## Contributions
Contributions to this repository are welcome. Please open an issue or submit a pull request with your improvements or suggestions.