{"id":29918116,"url":"https://github.com/4techsadiq/nobel-price-analysis","last_synced_at":"2025-10-10T07:39:53.978Z","repository":{"id":305788526,"uuid":"1023957981","full_name":"4TechSadiq/Nobel-Price-Analysis","owner":"4TechSadiq","description":null,"archived":false,"fork":false,"pushed_at":"2025-07-22T01:26:16.000Z","size":1726,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-08-02T06:38:16.057Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/4TechSadiq.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2025-07-22T01:19:41.000Z","updated_at":"2025-07-22T01:26:20.000Z","dependencies_parsed_at":null,"dependency_job_id":"5edb7cd8-7dbe-4be0-93fc-2edc7c3e0f23","html_url":"https://github.com/4TechSadiq/Nobel-Price-Analysis","commit_stats":null,"previous_names":["4techsadiq/nobel-price-analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/4TechSadiq/Nobel-Price-Analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/4TechSadiq%2FNobel-Price-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/4TechSadiq%2FNobel-Price-Analysis/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/4TechSadiq%2FNobel-Price-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/4TechSadiq%2FNobel-Price-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/4TechSadiq","download_url":"https://codeload.github.com/4TechSadiq/Nobel-Price-Analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/4TechSadiq%2FNobel-Price-Analysis/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":279003186,"owners_count":26083533,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-10T02:00:06.843Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-08-02T05:47:43.138Z","updated_at":"2025-10-10T07:39:53.961Z","avatar_url":"https://github.com/4TechSadiq.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Nobel Prize Data Analysis\n\n## Overview\n\nThis project analyzes historical Nobel Prize data from 1901 to 2020 to uncover patterns and trends in Nobel Prize awards. The analysis explores various aspects including gender distribution, country representation, prize sharing trends, and temporal patterns in Nobel Prize awards.\n\n## Dataset\n\nThe dataset contains **962 rows** and **16 columns** covering Nobel Prize winners from 1901 to 2020.\n\n### Dataset Features\n\n| Column | Description | Data Type |\n|--------|-------------|-----------|\n| `year` | Year the prize was awarded | int64 |\n| `category` | Prize category (Chemistry, Literature, Physics, Medicine, Peace, Economics) | object |\n| `prize` | Full prize name | object |\n| `motivation` | Reason for the prize award | object |\n| `prize_share` | Fraction of prize awarded (e.g., \"1/2\", \"1/3\") | object |\n| `laureate_type` | Individual or Organization | object |\n| `full_name` | Winner's full name | object |\n| `birth_date` | Date of birth | datetime64[ns] |\n| `birth_city` | Birth city | object |\n| `birth_country` | Birth country (historical) | object |\n| `birth_country_current` | Birth country (current borders) | object |\n| `sex` | Gender (Male/Female) | object |\n| `organization_name` | Affiliated organization | object |\n| `organization_city` | Organization's city | object |\n| `organization_country` | Organization's country | object |\n| `ISO` | ISO country code | object |\n\n## Data Quality Notes\n\n- **Missing Values**: Organizations (particularly Peace Prize winners) have missing birth information\n- **Date Estimates**: Some birth dates are estimated (e.g., Michael Houghton, Venkatraman Ramakrishnan, Nadia Murad use July 2nd as mid-year estimate)\n- **No Duplicates**: Dataset contains no duplicate entries\n\n### Missing Data Distribution\n- `motivation`: 88 missing values\n- `birth_date`: 28 missing values (primarily organizations)\n- `organization_name`: 255 missing values (individuals without organizational affiliation)\n\n## Installation \u0026 Setup\n\n### Requirements\n\n```python\npandas \u003e= 1.3.0\nnumpy \u003e= 1.21.0\nplotly \u003e= 5.0.0\nseaborn \u003e= 0.11.0\nmatplotlib \u003e= 3.4.0\n```\n\n### Installation\n\n```bash\npip install pandas numpy plotly seaborn matplotlib\n```\n\nFor Google Colab users:\n```bash\n%pip install --upgrade plotly\n```\n\n## Usage\n\n### Basic Setup\n\n```python\nimport pandas as pd\nimport numpy as np\nimport plotly.express as px\nimport plotly.graph_objects as go\nimport seaborn as sns\nimport matplotlib.pyplot as plt\n\n# Load the data\ndf_data = pd.read_csv('nobel_prize_data.csv')\n\n# Convert birth_date to datetime\ndf_data.birth_date = pd.to_datetime(df_data.birth_date)\n\n# Create prize share percentage column\nseparator_values = df_data.prize_share.str.split(\"/\", expand=True)\nnumerator = pd.to_numeric(separator_values[0])\ndenominator = pd.to_numeric(separator_values[1])\ndf_data[\"share_pct\"] = numerator / denominator\n```\n\n## Key Analysis Areas\n\n### 1. Gender Distribution Analysis\n- Overall male vs. female laureate distribution\n- Gender trends by category\n- Historical progression of female representation\n\n### 2. Geographic Analysis\n- Country-wise prize distribution\n- Choropleth mapping of global Nobel Prize distribution\n- Analysis by birth country vs. organization country\n\n### 3. Temporal Analysis\n- Prize frequency over time\n- Impact of world wars on prize distribution\n- 5-year rolling averages for trend identification\n\n### 4. Prize Sharing Trends\n- Evolution of shared vs. individual prizes\n- Category-specific sharing patterns\n- Correlation between prize sharing and total prizes awarded\n\n### 5. Category Analysis\n- Prize distribution across six categories\n- Country specialization in specific categories\n- Historical development of prize categories (Economics added in 1969)\n\n## Key Visualizations\n\n### Plotly Visualizations\n- **Donut Charts**: Gender distribution\n- **Bar Charts**: Prizes by category and country\n- **Choropleth Maps**: Global prize distribution\n- **Stacked Bar Charts**: Category breakdown by country\n- **Line Charts**: Cumulative prizes over time\n\n### Matplotlib Visualizations\n- **Scatter Plots**: Annual prize counts with rolling averages\n- **Dual-Axis Charts**: Prize frequency vs. sharing trends\n\n## Notable Findings\n\n### Historical Milestones\n- **First Nobel Prize**: 1901\n- **Latest Data**: 2020\n- **Economics Prize**: First awarded in 1969\n- **First Female Winners**: Marie Curie (Physics 1903), Bertha von Suttner (Peace 1905), Marie Curie (Chemistry 1911)\n\n### Trends\n- Increasing number of shared prizes over time\n- Growth in total prizes awarded annually\n- Significant impact of world wars on prize distribution\n- United States emergence as leading prize-winning nation\n\n## File Structure\n\n```\nnobel-prize-analysis/\n│\n├── nobel_prize_data.csv          # Main dataset\n├── analysis_notebook.ipynb       # Jupyter notebook with complete analysis\n├── README.md                      # This file\n└── requirements.txt               # Python dependencies\n```\n\n## Data Sources\n\nThe Nobel Prize data spans from Alfred Nobel's 1895 will establishment through 2020, covering all six prize categories:\n- Chemistry\n- Literature  \n- Physics\n- Physiology or Medicine\n- Peace\n- Economic Sciences (added 1969)\n\n## Contributing\n\nWhen contributing to this analysis:\n1. Ensure data integrity when adding new years\n2. Maintain consistent date formatting\n3. Handle missing values appropriately for organizations\n4. Update visualizations to reflect new data ranges\n\n## License\n\nThis project is for educational and research purposes. Nobel Prize data is publicly available information.\n\n## Contact\n\nFor questions about this analysis, please refer to the original data sources and Nobel Prize official documentation.\n\n---\n\n*\"Prizes to those who, during the preceding year, have conferred the greatest benefit to humankind\"* - Alfred Nobel, 1895\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F4techsadiq%2Fnobel-price-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2F4techsadiq%2Fnobel-price-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2F4techsadiq%2Fnobel-price-analysis/lists"}