{"id":51043680,"url":"https://github.com/hackyourfuture/data-assignment-week-1","last_synced_at":"2026-06-22T12:02:10.200Z","repository":{"id":356339460,"uuid":"1223588986","full_name":"HackYourFuture/data-assignment-week-1","owner":"HackYourFuture","description":"HackYourFuture data track week 1 assignment files","archived":false,"fork":false,"pushed_at":"2026-05-22T12:07:54.000Z","size":25,"stargazers_count":0,"open_issues_count":1,"forks_count":2,"subscribers_count":0,"default_branch":"main","last_synced_at":"2026-05-22T17:43:42.646Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Shell","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/HackYourFuture.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-04-28T13:17:00.000Z","updated_at":"2026-05-07T15:52:42.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/HackYourFuture/data-assignment-week-1","commit_stats":null,"previous_names":["hackyourfuture/data-assignment-week-1"],"tags_count":0,"template":true,"template_full_name":"HackYourFuture/assignment-template","purl":"pkg:github/HackYourFuture/data-assignment-week-1","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HackYourFuture%2Fdata-assignment-week-1","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HackYourFuture%2Fdata-assignment-week-1/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HackYourFuture%2Fdata-assignment-week-1/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HackYourFuture%2Fdata-assignment-week-1/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/HackYourFuture","download_url":"https://codeload.github.com/HackYourFuture/data-assignment-week-1/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/HackYourFuture%2Fdata-assignment-week-1/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34647750,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-22T02:00:06.391Z","response_time":106,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2026-06-22T12:02:07.783Z","updated_at":"2026-06-22T12:02:10.185Z","avatar_url":"https://github.com/HackYourFuture.png","language":"Shell","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Data Track — Week 1 Assignment (Template)\n\nThe HackYourFuture Data Track Week 1 assignment: **The Data Cleaning Pipeline**.\n\n\u003e 👩‍🎓 **Students:** you are in the wrong place. Do **not** fork or use this template.\n\u003e Go to your cohort's assignment repo under\n\u003e [`HackYourAssignment`](https://github.com/HackYourAssignment) (e.g. `c55-data-week1`,\n\u003e `c56-data-week1`, …). Your teacher posts the exact link in your cohort channel.\n\u003e Fork the cohort repo, branch, and open a PR back to it. Full instructions live in the\n\u003e [Week 1 Assignment on Notion](https://www.notion.so/hackyourfuture/Week-1-Assignment-The-Data-Cleaning-Pipeline-3cc37d4bf482470cbc6667bd1d1bb605).\n\n## For instructors / track maintainers\n\nThis repo is the **upstream template** for the Week 1 assignment. At the start of each\ncohort, generate a cohort-specific repo under the `HackYourAssignment` org from this\ntemplate (GitHub: **Use this template → Create a new repository**, owner =\n`HackYourAssignment`, name = `c\u003cNN\u003e-data-week1`). Students then fork *that* cohort repo\nand open PRs back to it; the auto-grader runs on every push.\n\nEdits to the assignment, dataset, or grader belong here on the template, not on the\ncohort copies.\n\n## Tasks at a glance\n\n| Task | Folder | Points | What you build |\n|---|---|---|---|\n| **Task 1** — Cleaner Pipeline | `task-1/` | 60 | A modular Python pipeline that reads `data/messy_users.csv`, cleans each field via helpers in `src/utils.py`, validates, and writes JSON to `output/clean_users.json`. |\n| **Task 2** — AI Debug Report | `task-2/` | 20 | Document one debugging session where you used an LLM to fix a bug. Fill in the four sections of `AI_DEBUG.md`. |\n| **Task 3** — HYF Azure proof | `task-3/` | 20 | Accept the HYF Azure tenant invite, switch to that directory, screenshot proof at `task-3/azure_proof.png`. |\n\nTotal: 100 · Passing: 60.\n\n## Repository layout\n\n```text\n.\n├── task-1/\n│   ├── data/\n│   │   └── messy_users.csv      # the dataset (committed; do not edit)\n│   ├── src/\n│   │   ├── cleaner.py           # entry point — fill in TODOs\n│   │   └── utils.py             # field-cleaning helpers — fill in TODOs\n│   └── output/\n│       └── clean_users.json     # your cleaner writes here\n├── task-2/\n│   └── AI_DEBUG.md              # fill in the four sections\n├── task-3/\n│   └── azure_proof.png          # add your screenshot here\n├── .hyf/\n│   └── test.sh                  # auto-grader (read it to see exactly what it checks)\n└── .github/workflows/\n    └── grade-assignment.yml     # runs .hyf/test.sh on every PR\n```\n\n## Run the grader locally\n\nBefore opening a PR, run the same checks the auto-grader runs:\n\n```bash\nbash .hyf/test.sh\ncat .hyf/score.json\n```\n\nThis prints a per-task breakdown and writes `score.json`. Iterate until\n`pass: true` (or until you've given it your best attempt), then push.\n\n## Submission\n\nStudents submit on the cohort repo (`HackYourAssignment/c\u003cNN\u003e-data-week1`), not here:\nopen a PR from the student's fork against the cohort repo's `main`, then share the PR\nURL with the teacher.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhackyourfuture%2Fdata-assignment-week-1","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fhackyourfuture%2Fdata-assignment-week-1","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fhackyourfuture%2Fdata-assignment-week-1/lists"}