{"id":18308464,"url":"https://github.com/jcardonamde/datasets_ml","last_synced_at":"2026-04-09T02:31:21.434Z","repository":{"id":64182801,"uuid":"564133661","full_name":"jcardonamde/datasets_ml","owner":"jcardonamde","description":"This project analyzes cab and limousine travel data in New York City.  This with the goal of predicting the total duration of trips within the city. Machine learning models were used.","archived":false,"fork":false,"pushed_at":"2022-12-11T17:10:46.000Z","size":2571,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-09T11:43:15.259Z","etag":null,"topics":["data-science","machine-learning","machine-learning-algorithms","matplotlib","numpy","pandas","pipelines","python","seaborn","sklearn"],"latest_commit_sha":null,"homepage":"https://www.loom.com/share/8e2b86e8eb1f40a2b67c20f5ab0cf1e9","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jcardonamde.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2022-11-10T03:54:29.000Z","updated_at":"2022-12-11T17:17:21.000Z","dependencies_parsed_at":"2023-01-15T03:15:42.905Z","dependency_job_id":null,"html_url":"https://github.com/jcardonamde/datasets_ml","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/jcardonamde/datasets_ml","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jcardonamde%2Fdatasets_ml","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jcardonamde%2Fdatasets_ml/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repos
itories/jcardonamde%2Fdatasets_ml/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jcardonamde%2Fdatasets_ml/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jcardonamde","download_url":"https://codeload.github.com/jcardonamde/datasets_ml/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jcardonamde%2Fdatasets_ml/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31582585,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-04-08T14:31:17.711Z","status":"online","status_checked_at":"2026-04-09T02:00:06.848Z","response_time":112,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","machine-learning","machine-learning-algorithms","matplotlib","numpy","pandas","pipelines","python","seaborn","sklearn"],"created_at":"2024-11-05T16:08:04.076Z","updated_at":"2026-04-09T02:31:21.410Z","avatar_url":"https://github.com/jcardonamde.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# New York City Taxi Trip Duration\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vRLrhh818nyaxd16zQGBnHCV325Gl2JGgCJFUQqJ9GIi-EQ3BtpeE0qz-4DaasifP3tAgW4Kztxt2tQ/pub?w=687\u0026h=386)\n\nAt one time or another, almost all of us have used an Uber or other transportation service in this digital age to take a ride. 
Ridesharing services use online platforms to connect passengers with local drivers who use their personal vehicles.\n\nIn most cases they are a convenient method for door-to-door transportation, and they are generally cheaper than licensed cabs. Examples include Uber, Cabify, Beat, and Didi.\n\nTo improve the efficiency of cab dispatch systems for such services, it is important to predict how long a driver will have their cab occupied. If a dispatcher knew approximately when a driver would finish the current trip, they could better decide which driver to assign to each pickup request.\n\nThis project works with a dataset published by the New York City Taxi and Limousine Commission, which includes pickup time, geographic coordinates, and number of passengers, among other variables. The goal is to predict the total duration of cab trips in New York City.\n\n👉 The dataset used for this analysis was downloaded from [Kaggle](https://www.kaggle.com/c/nyc-taxi-trip-duration).\n\n💻📚 Libraries used: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn.\n\n:microscope::dart: Applied models: Linear Regression, Regression Tree, XGBoost Regression, and KNN Regression.
\n\n\n\n👀:bar_chart: Previews:\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vT71-ztcKxRuR5k8vL7Xwj_4Rwyech9vlwYkH5cG8h9Ihf6RhPj1fCw1-uIE_O4O-OtNfX8AQ3s-47l/pub?w=745\u0026h=562)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vRDyW_PQpwmmpEDO0putBjbiIP3QepLFXcazg6Z4lrgDOZrcka6oc77IMY2jvYdFotfQORX8ZJ3eUxW/pub?w=959\u0026h=537)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vRMJxGVooqZOS-61DMQ1thq8Nhxb62SArATlxy23qcx6G-tOwmvN5WGvEqtdX_RZTzBVIZH2689dmgJ/pub?w=914\u0026h=518)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vQynD4knXrhNVvKRB8tc-3GuFSEkF-S8ajHCNzdJe6385Z8brsgTS0cXOYRPmsM9G6pWB73r1ic_Z-W/pub?w=915\u0026h=354)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vRmvGaZqj53ac1losjZ4f0PJvh2-TsLBG2FDaYog5gRRYywZAHdz0Qn1iZxwm7EsYTWDWCQg6z5QLUz/pub?w=925\u0026h=348)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vTGVwU_nrYQVfe1qTKFRBB87PQwWBCBV0F70veX4N41YmesYy4a5QDqxESX9M5zydxWMzfMXwNmJFXN/pub?w=922\u0026h=347)\n\n![](https://docs.google.com/drawings/d/e/2PACX-1vR6G_M6QKq7bezu7bgCjA69reLA2C5irNGUFYWhKz6UI5bLfKAKp59ZbJWA87ockeVxNKsHjPI8B9DZ/pub?w=916\u0026h=342)\n\n\n\n\n\n\n\n\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjcardonamde%2Fdatasets_ml","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjcardonamde%2Fdatasets_ml","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjcardonamde%2Fdatasets_ml/lists"}