{"id":13467773,"url":"https://github.com/DAGWorks-Inc/hamilton","last_synced_at":"2025-03-26T03:31:05.408Z","repository":{"id":78294711,"uuid":"605682921","full_name":"DAGWorks-Inc/hamilton","owner":"DAGWorks-Inc","description":"Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.","archived":false,"fork":false,"pushed_at":"2024-11-27T05:12:15.000Z","size":79218,"stargazers_count":1885,"open_issues_count":127,"forks_count":125,"subscribers_count":17,"default_branch":"main","last_synced_at":"2024-12-02T10:14:07.398Z","etag":null,"topics":["dag","data-analysis","data-engineering","data-science","dataframe","etl","etl-framework","etl-pipeline","feature-engineering","hacktoberfest","lineage","llmops","machine-learning","mlops","orchestration","pandas","python","rag","software-engineering"],"latest_commit_sha":null,"homepage":"https://hamilton.dagworks.io/en/latest/","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause-clear","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/DAGWorks-Inc.png","metadata":{"files":{"readme":"README-DOCS.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-02-23T17:16:48.000Z","updated_at":"2024-11-30T16:59:04.000Z","dependencies_parsed_at":null,"dependency_job_id":"ac15bcaa-ee99-4baa-927f-5faafd99940d","html_url":"https://github.com/DAGWorks-Inc/hamilton","commit_stats":{"total_commits":1615,"total_committers":79,"mean_commits":"20.443037974683545","dds":0.7133126934984519,"last_synced_commit":"3ed61dd27f93a83601f941330579df746a0ddbd6"},"previous_names":[],"tags_count":137,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DAGWorks-Inc%2Fhamilton","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DAGWorks-Inc%2Fhamilton/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DAGWorks-Inc%2Fhamilton/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/DAGWorks-Inc%2Fhamilton/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/DAGWorks-Inc","download_url":"https://codeload.github.com/DAGWorks-Inc/hamilton/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":245543421,"owners_count":20632648,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["dag","data-analysis","data-engineering","data-science","dataframe","etl","etl-framework","etl-pipeline","feature-engineering","hacktoberfest","lineage","llmops","machine-learning","mlops","orchestration","pandas","python","rag","software-engineering"],"created_at":"2024-07-31T15:01:00.355Z","updated_at":"2025-03-26T03:31:05.400Z","avatar_url":"https://github.com/DAGWorks-Inc.png","language":"Jupyter Notebook","readme":"# Documentation\n\nInstructions for managing documentation on read the docs.\n\n# Build locally\n\nTo build locally, you need to run the following -- make sure you're in the root of the repo:\n\n```bash\npip install .[docs]\n```\nand then one of the following to build and view the documents:\n```bash\nsphinx-build -b dirhtml -W -E -T -a docs /tmp/mydocs\npython -m http.server --directory /tmp/mydocs\n```\nor for auto rebuilding do:\n```bash\nsphinx-autobuild -b dirhtml -W -E -T  --watch hamilton/ -a docs /tmp/mydocs\n```\nThen it'll be running on port 8000.\n\nNote: readthedocs builds will fail if there are ANY WARNINGs in the build.\nSo make sure to check the build log for any warnings, and fix them, else you'll waste time debugging readthedocs\nbuild failures.\n\n# SimplePDF\nTo create a PDF, you can run the following:\n```bash\nsphinx-build -b simplepdf  -W -E -T  -a docs /tmp/mydocs\n# or if you want to auto-rebuild:\nsphinx-autobuild -b simplepdf  -W -E -T  --watch hamilton/ -a docs /tmp/mydocs\n```\nThe PDF will be in `/tmp/mydocs` in a few minutes.\n\n# reST vs myST\nWe use both! The general breakdown of when to use which is:\n1. For documentation that we want to be viewable in github, use myST.\n2. Otherwise default to using reST.\n","funding_links":[],"categories":["Jupyter Notebook","Data Manipulation","By Language","Python"],"sub_categories":["Pipelines","Data Science","Libraries"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FDAGWorks-Inc%2Fhamilton","html_url":"https://awesome.ecosyste.ms/projects/github.com%2FDAGWorks-Inc%2Fhamilton","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2FDAGWorks-Inc%2Fhamilton/lists"}