{"id":15157724,"url":"https://github.com/mdsrqbl/omnihuman","last_synced_at":"2025-04-15T06:50:32.230Z","repository":{"id":254573194,"uuid":"846941126","full_name":"mdsrqbl/omnihuman","owner":"mdsrqbl","description":"AI model that understands text \u0026 humanoids.","archived":false,"fork":false,"pushed_at":"2025-04-05T14:55:41.000Z","size":36,"stargazers_count":92,"open_issues_count":0,"forks_count":28,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-05T15:32:34.806Z","etag":null,"topics":["computer-vision","generative-ai","nlp","pose-synthesis","pytorch","transformers"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mdsrqbl.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null},"funding":{"custom":["https://www.paypal.com/donate/?hosted_button_id=7SNGNSKUQXQW2"]}},"created_at":"2024-08-24T11:47:43.000Z","updated_at":"2025-04-05T14:55:44.000Z","dependencies_parsed_at":"2024-09-22T06:01:23.829Z","dependency_job_id":null,"html_url":"https://github.com/mdsrqbl/omnihuman","commit_stats":{"total_commits":5,"total_committers":1,"mean_commits":5.0,"dds":0.0,"last_synced_commit":"e5dba1fa6455f4bae6e111c20ba7afe0b2d68279"},"previous_names":["mdsrqbl/omnihuman"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mdsrqbl%2Fomnihuman","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mdsrqbl%2Fomnihuman/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mdsrqbl%2Fomnihuman/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mdsrqbl%2Fomnihuman/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mdsrqbl","download_url":"https://codeload.github.com/mdsrqbl/omnihuman/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":249023708,"owners_count":21199958,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["computer-vision","generative-ai","nlp","pose-synthesis","pytorch","transformers"],"created_at":"2024-09-26T20:02:03.819Z","updated_at":"2025-04-15T06:50:32.224Z","avatar_url":"https://github.com/mdsrqbl.png","language":"Python","funding_links":["https://www.paypal.com/donate/?hosted_button_id=7SNGNSKUQXQW2"],"categories":[],"sub_categories":[],"readme":"# OmniHuman\n\n\u003e [!IMPORTANT]\n\u003e \u003cdiv align=\"center\"\u003e\u003ch3\u003eThis is the Original one\u003cbr\u003ethis isn't bytedance's plagiarized mock article 😒\u003cbr\u003e#protect_open_source\u003c/h3\u003e\u003c/div\u003e\n\n\u003cdiv align=\"center\"\u003e\n\n\u003cimg src=\"https://github.com/user-attachments/assets/69a319dc-9880-464e-a895-d04640824a12\" height=400/\u003e\n\n**AI model that understands humanoids and text.**\n\n\u003cbr/\u003e\n\n[![PyPi](https://img.shields.io/pypi/v/omnihuman?logo=pypi)](https://pypi.org/project/omnihuman/)\n[![Documentation Status](https://readthedocs.org/projects/omnihuman/badge/?version=latest)](https://omnihuman.readthedocs.io/en/latest/?badge=latest)\n[![python](https://img.shields.io/pypi/pyversions/omnihuman?logo=python)](https://pypi.org/project/omnihuman/)\n\n[![GitHub Repo stars](https://img.shields.io/github/stars/mdsrqbl/omnihuman?logo=github)](https://github.com/mdsrqbl/omnihuman/stargazers)\n[![Downloads](https://img.shields.io/pepy/dt/omnihuman?color=purple\u0026logoColor=white\u0026logo=data:image/svg+xml;base64,PHN2ZyB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciIHdpZHRoPSI4MDAiIGhlaWdodD0iODAwIiBmaWxsPSJub25lIiB2aWV3Qm94PSIwIDAgMjQgMjQiPjxwYXRoIGZpbGw9IiMwMDAiIGZpbGwtcnVsZT0iZXZlbm9kZCIgZD0iTTkuMiAyLjhjLS4yLjMtLjIuOC0uMiAxLjZWMTFINy44Yy0uOSAwLTEuMyAwLTEuNS4yYS44LjggMCAwIDAtLjMuNmMwIC4zLjMuNiAxIDEuMmw0LjEgNC40LjcuNmEuNy43IDAgMCAwIC40IDBsLjctLjZMMTcgMTNjLjYtLjYuOS0xIC45LTEuMmEuOC44IDAgMCAwLS4zLS42Yy0uMi0uMi0uNi0uMi0xLjUtLjJIMTVWNC40YzAtLjggMC0xLjMtLjItMS42YTEuNSAxLjUgMCAwIDAtLjYtLjZjLS4zLS4yLS44LS4yLTEuNi0uMmgtMS4yYy0uOCAwLTEuMyAwLTEuNi4yYTEuNSAxLjUgMCAwIDAtLjYuNnpNNSAyMWExIDEgMCAwIDAgMSAxaDEyYTEgMSAwIDEgMCAwLTJINmExIDEgMCAwIDAtMSAxeiIgY2xpcC1ydWxlPSJldmVub2RkIi8+PC9zdmc+)](https://pepy.tech/projects/omnihuman/)\u003cbr\u003e\n\n| **Support Us** ❤️ | [![PayPal](https://img.shields.io/badge/PayPal-00457C?logo=paypal\u0026logoColor=white)](https://www.paypal.com/donate/?hosted_button_id=7SNGNSKUQXQW2) |\n| ---------------- | -------------------------------------------------------------------------------------------------------------------------------------------------- |\n\n\u003c/div\u003e\n\n---\n\n1. [OmniHuman](#omnihuman)\n   1. [Installation](#installation)\n   2. [Usage](#usage)\n   3. [Citation \\\u0026 License](#citation--license)\n\n## Installation\n\n```bash\npip install omnihuman\n```\n\nor install editable from source\n\n```bash\ngit clone https://github.com/mdsrqbl/omnihuman.git\ncd omnihuman\npip install -e .\n```\n\n## Usage\n\n```python\nimport omnihuman\nimport PIL.Image\n\ntext = \"Raise both hands and clap overhead.\"\nframes = omnihuman.read_frames(\"path/to/image.jpg\")  # (1, channels, height, width)\n# frames = omnihuman.read_frames(\"path/to/video.mp4\")  # (n_frames, channels, height, width)\n\n# model = omnihuman.OmniHuman()\n# frames = model.generate_video(text, frames)\n\nPIL.Image.fromarray(frames[-1].permute(1,2,0).numpy()).show()\n```\n\nFull documentation is available at [omnihuman.readTheDocs.io](https://omnihuman.readthedocs.io/en/latest/).\n\n## Citation \u0026 License\n\n```bibtex\n@misc{mdsr2024omnihuman,\n  author = {Mudassar Iqbal},\n  title = {OmniHuman: AI model that understands text and humanoids.},\n  year = {2024},\n  publisher = {GitHub},\n  howpublished = {\\url{https://github.com/mdsrqbl/omnihuman}}\n}\n```\n\nThis project is licensed under Apache License 2.0 - see the [LICENSE](https://github.com/mdsrqbl/omnihuman/blob/main/LICENSE) file for details.\n\nYou are permitted to use the library \u0026 models, create modified versions, or incorporate pieces of the code into your own work. Your product or research, whether commercial or non-commercial, must provide appropriate credit to the original author(s) by citing this repository \u0026 research papers. And although it follows common sense, you can not steal namespace and must put in the effort to give your work an original name. \n\nStay tuned for research papers!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmdsrqbl%2Fomnihuman","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmdsrqbl%2Fomnihuman","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmdsrqbl%2Fomnihuman/lists"}