{"id":26720210,"url":"https://github.com/patrickdocs/insurance-data-pipeline-etl-visualization","last_synced_at":"2025-10-14T22:10:24.197Z","repository":{"id":283455365,"uuid":"951824792","full_name":"patrickdocs/Insurance-Data-Pipeline-ETL-Visualization","owner":"patrickdocs","description":null,"archived":false,"fork":false,"pushed_at":"2025-03-20T10:03:38.000Z","size":0,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-03-20T10:37:59.638Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/patrickdocs.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-03-20T09:54:37.000Z","updated_at":"2025-03-20T10:03:42.000Z","dependencies_parsed_at":"2025-03-20T10:48:08.799Z","dependency_job_id":null,"html_url":"https://github.com/patrickdocs/Insurance-Data-Pipeline-ETL-Visualization","commit_stats":null,"previous_names":["patrickdocs/insurance-data-pipeline-etl-visualization"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickdocs%2FInsurance-Data-Pipeline-ETL-Visualization","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickdocs%2FInsurance-Data-Pipeline-ETL-Visualization/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickdocs%2FInsurance-Data-Pipeline-ETL-Visualization/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/patrickdocs%2FInsurance-Data-Pipeline-ETL-Visualization/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/patrickdocs","download_url":"https://codeload.github.com/patrickdocs/Insurance-Data-Pipeline-ETL-Visualization/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245902821,"owners_count":20691286,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-03-27T18:32:51.251Z","updated_at":"2025-10-14T22:10:19.159Z","avatar_url":"https://github.com/patrickdocs.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Insurance Data Pipeline - ETL \u0026 Visualization\n\n## 📌 Project Overview\nThis project focuses on building an ETL (Extract, Transform, Load) pipeline for processing insurance data and performing data visualization to gain insights. The dataset includes various factors affecting insurance premiums, claims, and policy adjustments.\n\n## 📂 Project Structure\n```\nInsurance-Data-Pipeline-ETL-Visualization/\n│-- insurance.csv            # Raw dataset used for processing\n│-- Insurance_ETL.ipynb      # Jupyter Notebook containing ETL and visualization\n│-- README.md                # Project documentation\n```\n\n## 🔧 Features\n- **ETL Pipeline:** Extracts, transforms, and loads insurance data for analysis.\n- **Data Cleaning \u0026 Validation:** Handles missing values and ensures data integrity.\n- **Data Visualization:** Generates insights using Seaborn and Matplotlib.\n- **Key Insights:** Analyzes claim severity, regional claims distribution, and premium adjustments.\n\n## 📊 Visualizations\n1. **Claims Severity Distribution** - Bar chart visualizing different levels of claim severity.\n2. **Claims Frequency by Region** - Total number of claims across various regions.\n3. **Premium Adjustments** - Examining how premium amounts change based on multiple factors.\n\n## 🚀 How to Run\n1. Clone the repository:\n   ```bash\n   git clone https://github.com/PatrickDoCS/Insurance-Data-Pipeline-ETL-Visualization.git\n   ```\n2. Install dependencies:\n   ```bash\n   pip install pandas numpy matplotlib seaborn\n   ```\n3. Open and run `Insurance_ETL.ipynb` in Jupyter Notebook.\n\n## 📌 Dependencies\n- Python 3.x\n- Pandas\n- NumPy\n- Matplotlib\n- Seaborn\n\n## 📬 Contact\nFor any questions, feel free to reach out:\n- **GitHub:** [PatrickDoCS](https://github.com/PatrickDoCS)\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatrickdocs%2Finsurance-data-pipeline-etl-visualization","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpatrickdocs%2Finsurance-data-pipeline-etl-visualization","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpatrickdocs%2Finsurance-data-pipeline-etl-visualization/lists"}