{"id":28555056,"url":"https://github.com/floressek/data_analysis_and_visualization","last_synced_at":"2025-10-05T00:40:37.661Z","repository":{"id":297916525,"uuid":"973739857","full_name":"Floressek/Data_Analysis_and_Visualization","owner":"Floressek","description":"This repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.","archived":false,"fork":false,"pushed_at":"2025-06-18T07:51:13.000Z","size":229,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-07-06T09:40:28.993Z","etag":null,"topics":["data-analysis","data-visualization"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/Floressek.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-04-27T16:31:27.000Z","updated_at":"2025-06-09T17:40:11.000Z","dependencies_parsed_at":"2025-06-08T10:36:00.035Z","dependency_job_id":null,"html_url":"https://github.com/Floressek/Data_Analysis_and_Visualization","commit_stats":null,"previous_names":["floressek/data_analysis_and_visualization"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/Floressek/Data_Analysis_and_Visualization","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Floressek%2FData_Analysis_and_Visualization","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Floressek%2FData_Analysis_and_Visualization/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Floressek%2FData_Analysis_and_Visualization/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Floressek%2FData_Analysis_and_Visualization/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/Floressek","download_url":"https://codeload.github.com/Floressek/Data_Analysis_and_Visualization/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/Floressek%2FData_Analysis_and_Visualization/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278395893,"owners_count":25979685,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-04T02:00:05.491Z","response_time":63,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","data-visualization"],"created_at":"2025-06-10T05:30:35.603Z","updated_at":"2025-10-05T00:40:37.655Z","avatar_url":"https://github.com/Floressek.png","language":"R","readme":" # R Statistical Data Analysis Project\n\nThis repository contains a collection of statistical data analysis laboratories using R. Each lab focuses on different aspects of data exploration, visualization, and analysis techniques.\n\n## Project Structure\n\n```\n.\n├── .gitignore                 # Git ignore configuration\n├── Lab1/                      # Laboratory 1-2 - Univariate \u0026 Bivariate Data Analysis\n│   ├── Excel/                 # Directory for Excel files\n│   └── Univariate-Bivariate-Data/\n│       ├── main.R             # R script for univariate/bivariate analysis\n│       └── data/              # Output directory for generated charts\n├── Lab3/                      # Laboratory 3 - Correlation Analysis\n│   └── main.R                 # R script for correlation analysis\n├── Lab4/                      # Laboratory 4 - Visualization \u0026 Statistical Plots\n│   └── main.R                 # R script for advanced data visualization\n└── data/                      # Data directory\n    └── LifeCycleSavings_with_additional_columns.csv  # Extended dataset\n```\n\n## Lab Descriptions\n\n### Lab 1-2: Univariate and Bivariate Data Analysis\nThis lab focuses on exploring and visualizing basic relationships in the LifeCycleSavings dataset. The analysis includes:\n- Pair plots to show dependencies between variables\n- Histograms for univariate analysis\n- Bar plots comparing countries on various metrics\n- Age distribution analysis\n- Scatter plots to explore relationships between variables\n\n### Lab 3: Correlation Analysis\nThis lab analyzes the Eggs dataset, focusing on:\n- Correlation between sales (Cases) and various price variables\n- Matrix scatter plots for relationship visualization\n- Advanced correlation visualization techniques\n- Time-series analysis of egg sales and prices\n- Factor analysis (Easter, month, etc.) affecting egg sales\n\n### Lab 4: Statistical Visualization\nThis lab demonstrates various visualization techniques in R:\n- Histogram comparison with different settings\n- Density plots and probability distributions\n- Advanced group comparison plots\n- Box plots for distribution comparison\n- Dual axis plots\n- Pie charts for categorical data\n\n## Key Datasets\n\n### LifeCycleSavings Dataset\nThis dataset contains savings and related economic data for various countries, including:\n- Savings rates (sr)\n- Population demographics (pop15, pop75)\n- Average income (dpi)\n- Income growth (ddpi)\n\n### Eggs Dataset\nAnalyzes egg sales data with related variables:\n- Cases (egg sales)\n- Various price indicators (Egg.Pr, Beef.Pr, Chicken.Pr, etc.)\n- Seasonal factors (Month, Easter)\n\n## Getting Started\n\n### Prerequisites\n- R (recommended version 4.0.0 or higher)\n- RStudio (recommended for easier workflow)\n- Required R packages:\n  - corrplot\n  - car\n  - plotly\n\n### Running the Analysis\n1. Clone this repository\n2. Open the desired lab's main.R file in RStudio\n3. Install any required packages:\n   ```R\n   install.packages(c(\"corrplot\", \"car\", \"plotly\"))\n   ```\n4. Execute the script to see the analysis results\n\n## Output\nThe scripts generate various visualization files in PNG format that are saved to the respective data directories.\n\n## Notes\n- All R scripts use UTF-8 encoding\n- For Lab1, the charts are automatically saved in the 'data' folder\n- Interactive plots in Lab3 require the plotly package\n\n## License\nThis project is provided for educational purposes.\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffloressek%2Fdata_analysis_and_visualization","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ffloressek%2Fdata_analysis_and_visualization","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ffloressek%2Fdata_analysis_and_visualization/lists"}