{"id":20550278,"url":"https://github.com/dnmfarrell/streaker","last_synced_at":"2025-06-16T02:07:47.886Z","repository":{"id":149704075,"uuid":"194184298","full_name":"dnmfarrell/Streaker","owner":"dnmfarrell","description":"Build ML data to Beat the Streak!","archived":false,"fork":false,"pushed_at":"2019-06-28T01:17:32.000Z","size":9,"stargazers_count":0,"open_issues_count":1,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-03-06T05:43:54.583Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Perl","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/dnmfarrell.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-06-28T01:15:31.000Z","updated_at":"2019-06-28T01:17:34.000Z","dependencies_parsed_at":null,"dependency_job_id":"78a463d4-5d3e-4643-a1c6-94b1946bc04b","html_url":"https://github.com/dnmfarrell/Streaker","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/dnmfarrell/Streaker","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dnmfarrell%2FStreaker","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dnmfarrell%2FStreaker/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dnmfarrell%2FStreaker/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dnmfarrell%2FStreaker/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/dnmfarrell","download_url":"https://codeload.github.com/dnm
farrell/Streaker/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/dnmfarrell%2FStreaker/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":260083862,"owners_count":22956409,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-16T02:24:14.622Z","updated_at":"2025-06-16T02:07:47.869Z","avatar_url":"https://github.com/dnmfarrell.png","language":"Perl","funding_links":[],"categories":[],"sub_categories":[],"readme":"Streaker\n--------\nBeat the Streak™ is a free MLB [game](https://www.mlb.com/apps/beat-the-streak)\nin which the goal is to pick a batter to get a hit in their next game,\ncorrectly 57 times in a row. In the 18 years it has been running, no one has\nbeaten the streak.\n\nThis project aggregates MLB data into a MySQL database for input into machine\nlearning tools.\n\nN.B. The aggregated data is a work in progress; this is an early version.\n\nDependencies\n------------\nThis project uses Retrosheet play-by-play\n[data](https://www.retrosheet.org/game.htm).\n\nThe Chadwick\n[tools](https://github.com/chadwickbureau/chadwick/releases) are used to parse\nthe data.\n\nThe data is aggregated in a local MySQL server database. You'll need it\nlistening on localhost with a ~/.my.cnf file configured with a user and\npassword and permission to create databases.\n\nInstallation\n------------\nThis setup is a bit janky; maybe I'll create a makefile at some point.\n\n1. Clone this repo\n2. Set the `STREAKER_USER` and `STREAKER_PASS` env variables with your MySQL creds\n3. 
Download and unzip the Retrosheet play-by-play data you want to include into the\n   project root\n4. Run `install.sh`; this will create the streaker database on the MySQL server\n5. The streaker.streaker table contains the aggregated data; you can unload it\n   into training, validation and testing CSVs with `sql/unload-data.sql`:\n\n    $ mysql \u003c sql/unload-data.sql\n\n6. Feed the CSVs created under /tmp into your favorite ML tool!\n\n\nCopyright (c) 2019 David Farrell\n\nPermission is hereby granted, free of charge, to any person obtaining a copy\nof this software and associated documentation files (the \"Software\"), to deal\nin the Software without restriction, including without limitation the rights\nto use, copy, modify, merge, publish, distribute, sublicense, and/or sell\ncopies of the Software, and to permit persons to whom the Software is\nfurnished to do so, subject to the following conditions:\n\nThe above copyright notice and this permission notice shall be included in all\ncopies or substantial portions of the Software.\n\nTHE SOFTWARE IS PROVIDED \"AS IS\", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR\nIMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,\nFITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE\nAUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER\nLIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,\nOUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE\nSOFTWARE.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdnmfarrell%2Fstreaker","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdnmfarrell%2Fstreaker","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdnmfarrell%2Fstreaker/lists"}