{"id":31649272,"url":"https://github.com/aliyahscoding/weather-insights","last_synced_at":"2026-05-02T17:34:02.019Z","repository":{"id":318106577,"uuid":"1068552102","full_name":"aliyahscoding/weather-insights","owner":"aliyahscoding","description":"End-to-end weather mini-suite: fetch, clean, feature, backtest (naive vs SARIMA vs ML), and Streamlit dashboard. Reproducible, fast, no API keys.","archived":false,"fork":false,"pushed_at":"2025-10-05T04:59:47.000Z","size":100,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"main","last_synced_at":"2025-10-05T06:18:10.250Z","etag":null,"topics":["data-science","forecasting","pandas","python","reproducible-research","sarima","sklearn","streamlit","time-series"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/aliyahscoding.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-10-02T14:55:06.000Z","updated_at":"2025-10-05T05:01:42.000Z","dependencies_parsed_at":"2025-10-05T06:18:13.408Z","dependency_job_id":"1e73d240-7c30-4689-81b3-b92353a05364","html_url":"https://github.com/aliyahscoding/weather-insights","commit_stats":null,"previous_names":["aliyahscoding/weather-insights"],"tags_count":null,"template":false,"template_full_name":null,"purl":"pkg:github/aliyahscoding/weather-insights","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aliyahscoding%2Fweather-insights","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aliyahscoding%2Fweather-insights/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aliyahscoding%2Fweather-insights/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aliyahscoding%2Fweather-insights/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/aliyahscoding","download_url":"https://codeload.github.com/aliyahscoding/weather-insights/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aliyahscoding%2Fweather-insights/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278740831,"owners_count":26037480,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-07T02:00:06.786Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","forecasting","pandas","python","reproducible-research","sarima","sklearn","streamlit","time-series"],"created_at":"2025-10-07T07:41:52.437Z","updated_at":"2025-10-07T07:41:55.462Z","avatar_url":"https://github.com/aliyahscoding.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Weather Insights\n\nA tidy, reproducible pipeline for daily weather: ingest → preprocess → backtest baselines/SARIMA/ML → Streamlit dashboard.\n\n## TL;DR\n- **SARIMA** beats seasonal naive by ~X% RMSE on Austin test set (last 365 days).\n- **Ridge ML** with lag/rolling/calendar features matches SARIMA within error bars and trains in seconds.\n\n## How to run\npython -m venv .venv\n.venv\\Scripts\\activate\npip install -r requirements.txt\n\npython -m src.ingest --location \"Austin,US\" --start 2018-01-01 --end 2024-12-31 --outfile data/raw/austin_2018_2024.csv\npython -m src.preprocess --in data/raw/austin_2018_2024.csv --out data/processed/austin_proc.csv\npython -m src.modeling.backtest --dataset data/processed/austin_proc.csv --model sarima --horizon 7\n\nstreamlit run app/streamlit_app.py\n\n\n## Repo layout\nweather-insights/\ndata/{raw,processed,external}\nsrc/{ingest.py,preprocess.py,modeling/,viz.py,utils.py}\napp/streamlit_app.py\nreports/{figures,metrics}\ntests/\n.github/workflows/ci.yml\n\n\n## Results\n| model           | MAE | RMSE |\n|-----------------|-----|------|\n| seasonal_naive  | ... | ...  |\n| sarima          | ... | ...  |\n| ridge_ml        | ... | ...  |\n\n![Test window](reports/figures/test_pred.png)\n\n## Notes\n- Weekly seasonality is strong; SARIMA captures it.\n- Simple ML with lags/rolls/calendar nearly ties SARIMA.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faliyahscoding%2Fweather-insights","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Faliyahscoding%2Fweather-insights","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faliyahscoding%2Fweather-insights/lists"}