{"id":28391451,"url":"https://github.com/rohithay/titanic-data-analysis","last_synced_at":"2026-05-15T18:31:24.568Z","repository":{"id":193848080,"uuid":"501193488","full_name":"rohithay/Titanic-Data-Analysis","owner":"rohithay","description":"Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.","archived":false,"fork":false,"pushed_at":"2022-08-07T17:59:52.000Z","size":119,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-31T20:34:40.295Z","etag":null,"topics":["data-analysis","machine-learning","matplotlib","pandas","scipy-stats","statistical-models"],"latest_commit_sha":null,"homepage":"","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/rohithay.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2022-06-08T09:59:21.000Z","updated_at":"2023-05-01T13:07:04.000Z","dependencies_parsed_at":null,"dependency_job_id":"e36c2475-cd05-4d3c-bd56-59bb23965714","html_url":"https://github.com/rohithay/Titanic-Data-Analysis","commit_stats":null,"previous_names":["yrohitha/titanic-data-analysis","rohithay/titanic-data-analysis"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/rohithay/Titanic-Data-Analysis","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rohithay%2FTitanic-Data-Analysis","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rohithay%2FTitanic-Data-Analysis/tags","releases_url":"https://repos.eco
syste.ms/api/v1/hosts/GitHub/repositories/rohithay%2FTitanic-Data-Analysis/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rohithay%2FTitanic-Data-Analysis/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/rohithay","download_url":"https://codeload.github.com/rohithay/Titanic-Data-Analysis/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/rohithay%2FTitanic-Data-Analysis/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260868041,"owners_count":23074918,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-analysis","machine-learning","matplotlib","pandas","scipy-stats","statistical-models"],"created_at":"2025-05-31T09:33:44.033Z","updated_at":"2025-10-19T21:02:34.071Z","avatar_url":"https://github.com/rohithay.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"## Titanic-Data-Analysis \n\n#### Predict Survival Outcomes from the 1912 Titanic disaster based on each passenger's features, such as sex and age.\n\n\n### Description\n\n#### Goal and Aim of the Project: \nWe take steps to maximize insight into the data set, uncover its underlying structure, detect outliers and anomalies, and test underlying assumptions by answering the following questions:\n\n* What is the overall survival rate of this disaster?\n* Are females more likely to survive than males?\n* Is there a certain age group more likely to survive than other age groups?\n* Are children and mothers more likely to survive 
than others?\n* Did people who paid a higher fare tend to survive?\n* Are there differences in survival rate between different ticket classes?\n* Who is more likely to survive: families or singletons?\n\n#### Future Scope: Build a Machine Learning Model to make better predictions. \n\n- [ ] Understand the impact of the bias-variance tradeoff. \n- [ ] Understand which metrics are important: accuracy, precision, recall. Calculate the F1 score.\n- [ ] Choose an ML model and improve on the selection.\n\n### Install Setup\n\nThis project requires **Python 2.7** and the following Python libraries installed:\n\n- [numpy](http://www.numpy.org/)\n- [pandas](http://pandas.pydata.org)\n- [matplotlib](http://matplotlib.org/)\n- [scikit-learn](http://scikit-learn.org/stable/)\n\nYou will also need software installed to run and execute a [Jupyter Notebook](http://ipython.org/notebook.html).\n\nIf you do not have Python installed yet, it is highly recommended that you install the [Anaconda](http://continuum.io/downloads) distribution of Python, which already includes the above packages and more. Make sure that you select the Python 2.7 installer and not the Python 3.x installer.\n\n\n### Code\n\nTemplate code is provided in the notebook [`titanic-data-analysis.ipynb`](https://github.com/YRohitha/Titanic-Data-Analysis/tree/main/src/titanic-data-analysis.ipynb). \n\n\n### Run\n\nIn a terminal or command window, navigate to the top-level project directory (that contains this README) and run one of the following commands:\n\n```bash\njupyter notebook titanic-data-analysis.ipynb\n```\nor\n```bash\nipython notebook titanic-data-analysis.ipynb\n```\nThis will open the Jupyter Notebook software and project file in your web browser.\n\n\n### Data\n\nThe dataset used in this project is included as `titanic_data.csv`. 
This dataset is sourced from the public domain and contains the following attributes:\n\n**Description**\n\nThe sinking of the Titanic is one of the most infamous shipwrecks in history.\n\nOn April 15, 1912, during her maiden voyage, the widely considered “unsinkable” RMS Titanic sank after colliding with an iceberg. Unfortunately, there weren’t enough lifeboats for everyone onboard, resulting in the death of 1502 out of 2224 passengers and crew.\n\nWhile there was some element of luck involved in surviving, it seems some groups of people were more likely to survive than others.\n\nHere, we try to answer the question “what sorts of people were more likely to survive?” using passenger data (i.e. name, age, gender, socio-economic class, etc.).\n\n**Features**\n- `pclass` : Passenger Class (1 = 1st; 2 = 2nd; 3 = 3rd)\n- `name` : Name\n- `sex` : Sex\n- `age` : Age\n- `sibsp` : Number of Siblings/Spouses Aboard\n- `parch` : Number of Parents/Children Aboard\n- `ticket` : Ticket Number\n- `fare` : Passenger Fare\n- `cabin` : Cabin\n- `embarked` : Port of Embarkation (C = Cherbourg; Q = Queenstown; S = Southampton)\n\n**Target Variable**\n- `survival` : Survival (0 = No; 1 = Yes)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frohithay%2Ftitanic-data-analysis","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Frohithay%2Ftitanic-data-analysis","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Frohithay%2Ftitanic-data-analysis/lists"}