{"id":47296497,"url":"https://github.com/wgsxm/PartCrafter","last_synced_at":"2026-03-30T21:00:41.186Z","repository":{"id":298746895,"uuid":"998715039","full_name":"wgsxm/PartCrafter","owner":"wgsxm","description":"[NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers","archived":false,"fork":false,"pushed_at":"2025-09-19T07:23:51.000Z","size":43862,"stargazers_count":2156,"open_issues_count":3,"forks_count":126,"subscribers_count":178,"default_branch":"main","last_synced_at":"2025-09-19T09:34:44.095Z","etag":null,"topics":["3d","3d-generation","3d-object-generation","3d-object-reconstruction","3d-reconstruction","3d-scene-generation","3d-scene-reconstruction","image-to-3d"],"latest_commit_sha":null,"homepage":"https://wgsxm.github.io/projects/partcrafter/","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/wgsxm.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2025-06-09T06:13:50.000Z","updated_at":"2025-09-19T07:25:03.000Z","dependencies_parsed_at":"2025-09-19T09:15:37.818Z","dependency_job_id":"07998c9c-cd9a-4919-a8ec-826a7de2c9d3","html_url":"https://github.com/wgsxm/PartCrafter","commit_stats":null,"previous_names":["wgsxm/partcrafter"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/wgsxm/PartCrafter","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wgsxm%2FPartCrafter","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wgsxm%2FPartCrafter/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wgsxm%2FPartCrafter/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wgsxm%2FPartCrafter/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/wgsxm","download_url":"https://codeload.github.com/wgsxm/PartCrafter/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/wgsxm%2FPartCrafter/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":31209945,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-30T15:24:02.938Z","status":"ssl_error","status_checked_at":"2026-03-30T15:23:44.804Z","response_time":138,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["3d","3d-generation","3d-object-generation","3d-object-reconstruction","3d-reconstruction","3d-scene-generation","3d-scene-reconstruction","image-to-3d"],"created_at":"2026-03-16T17:00:19.174Z","updated_at":"2026-03-30T21:00:41.181Z","avatar_url":"https://github.com/wgsxm.png","language":"Python","funding_links":[],"categories":["3D视觉生成重建"],"sub_categories":["资源传输下载"],"readme":"# [NeurIPS 2025] PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers\n\n\u003ch4 align=\"center\"\u003e\n\n[Yuchen Lin\u003csup\u003e*\u003c/sup\u003e](https://wgsxm.github.io), [Chenguo Lin\u003csup\u003e*\u003c/sup\u003e](https://chenguolin.github.io), [Panwang Pan\u003csup\u003e†\u003c/sup\u003e](https://paulpanwang.github.io), [Honglei Yan](https://openreview.net/profile?id=~Honglei_Yan1), [Yiqiang Feng](https://openreview.net/profile?id=~Feng_Yiqiang1), [Yadong Mu](http://www.muyadong.com), [Katerina Fragkiadaki](https://www.cs.cmu.edu/~katef/)\n\n[![arXiv](https://img.shields.io/badge/arXiv-2506.05573-b31b1b.svg?logo=arXiv)](https://arxiv.org/abs/2506.05573)\n[![Project Page](https://img.shields.io/badge/🏠-Project%20Page-blue.svg)](https://wgsxm.github.io/projects/partcrafter)\n[\u003cimg src=\"https://img.shields.io/badge/YouTube-Video-red\" alt=\"YouTube\"\u003e](https://www.youtube.com/watch?v=ZaZHbkkPtXY)\n[![Model](https://img.shields.io/badge/🤗%20Model-PartCrafter-yellow.svg)](https://huggingface.co/wgsxm/PartCrafter)\n[![Model-Scene](https://img.shields.io/badge/🤗%20Model-PartCrafter--Scene-yellow.svg)](https://huggingface.co/wgsxm/PartCrafter-Scene)\n[![Demo](https://img.shields.io/badge/🤗%20Demo-PartCrafter-green.svg)](https://huggingface.co/spaces/alexnasa/PartCrafter)\n\n\u003cp align=\"center\"\u003e\n    \u003cimg width=\"90%\" alt=\"pipeline\", src=\"./assets/teaser.png\"\u003e\n\u003c/p\u003e\n\n\u003c/h4\u003e\n\nThis repository contains the official implementation of the paper: [PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers](https://wgsxm.github.io/projects/partcrafter/). \nPartCrafter is a structured 3D generative model that jointly generates multiple parts and objects from a single RGB image in one shot. \nHere is our [Project Page](https://wgsxm.github.io/projects/partcrafter).\n\nFeel free to contact me (linyuchen@stu.pku.edu.cn) or open an issue if you have any questions or suggestions.\n\n## 📢 News\n- **2025-09-18**: PartCrafter is accepted to NeurIPS 2025. \n- **2025-08-15**: PartCrafter HuggingFace🤗 demo is available [here](https://huggingface.co/spaces/alexnasa/PartCrafter). Thanks to [alexnasa](https://huggingface.co/alexnasa). \n- **2025-07-23**: The 3D scene version of PartCrafter is released, which is trained on [3D-Front](https://huggingface.co/datasets/huanngzh/3D-Front). \n- **2025-07-20**: A guide for installing PartCrafter on Windows is available in [this fork](https://github.com/JackDainzh/PartCrafter-Windows/tree/windows-main). Thanks to [JackDainzh](https://github.com/JackDainzh)!\n- **2025-07-13**: PartCrafter is fully open-sourced 🚀.\n- **2025-06-09**: PartCrafter is on arXiv. \n\n## 📋 TODO\n- [x] Release inference scripts. \n- [x] Release training code and data preprocessing scripts. \n- [x] Release pretrained checkpoints on both object and scene level. \n- [x] Provide a HuggingFace🤗 demo.\n- [ ] Release preprocessed dataset. \n\n## 🔧 Installation\nWe use `torch-2.5.1+cu124` and `python-3.11`. But it should also work with other versions. Create a conda environment with the following command (optional):\n```\nconda create -n partcrafter python=3.11.13\nconda activate partcrafter\npip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu124\n```\nThen, install other dependencies with the following command:\n```\ngit clone https://github.com/wgsxm/PartCrafter.git\ncd PartCrafter\nbash settings/setup.sh\n```\nIf you do not have root access and use conda environment, you can install required graphics libraries with the following command:\n```\nconda install -c conda-forge libegl libglu pyopengl\n```\nWe test the above installation on Debian 12 with NVIDIA H20 GPUs. For Windows users, you can try to set up the environment according to [this pull request](https://github.com/wgsxm/PartCrafter/pull/24) and [this fork](https://github.com/JackDainzh/PartCrafter-Windows/tree/windows-main). We sincerely thank [JackDainzh](https://github.com/JackDainzh) for contributing to the Windows support! \n\n## 💡 Quick Start\n### 3D Part-Level Object Generation\n\u003cp align=\"center\"\u003e\n    \u003cimg width=\"90%\" alt=\"pipeline\", src=\"./assets/robot.gif\"\u003e\n\u003c/p\u003e\n\nGenerate a 3D part-level object from an image:\n```\npython scripts/inference_partcrafter.py \\\n  --image_path assets/images/np3_2f6ab901c5a84ed6bbdf85a67b22a2ee.png \\\n  --num_parts 3 --tag robot --render\n```\nThe required model weights will be automatically downloaded:\n- PartCrafter model from [wgsxm/PartCrafter](https://huggingface.co/wgsxm/PartCrafter) → pretrained_weights/PartCrafter\n- RMBG model from [briaai/RMBG-1.4](http://huggingface.co/briaai/RMBG-1.4) → pretrained_weights/RMBG-1.4\n\nThe generated results will be saved to `./results/robot`. We provide several example images from Objaverse and ABO in `./assets/images`. Their filenames start with recommended number of parts, e.g., `np3` which means 3 parts. You can also try other part count for the same input images. \n\nSpecify `--rmbg` if you use custom images. **This will remove the background of the input image and resize it appropriately.**\n\n### 3D Scene Generation\n\u003cp align=\"center\"\u003e\n    \u003cimg width=\"90%\" alt=\"pipeline\", src=\"./assets/dining_room.gif\"\u003e\n\u003c/p\u003e\n\nGenerate a 3D scene from an image:\n```\npython scripts/inference_partcrafter_scene.py \\\n  --image_path assets/images_scene/np6_0192a842-531c-419a-923e-28db4add8656_DiningRoom-31158.png \\\n  --num_parts 6 --tag dining_room --render\n```\nThe required model weights will be automatically downloaded:\n- PartCrafter-Scene model from [wgsxm/PartCrafter-Scene](https://huggingface.co/wgsxm/PartCrafter-Scene) → pretrained_weights/PartCrafter-Scene\n\nThe generated results will be saved to `./results/dining_room`. We provide several example images from 3D-Front in `./assets/images_scene`. Their filenames start with recommended number of parts, e.g., `np3` which means 3 parts. You can also try other part count for the same input images. \n\n## 💻 System Requirements\nA CUDA-enabled GPU with at least 8GB VRAM. You can reduce number of parts or number of tokens to save GPU memory. We set the number of tokens per part to `1024` on object level and `2048` on scene level by default for better quality. \n\n## 📊 Dataset\nPlease refer to [Dataset README](./datasets/README.md) to download and preprocess the dataset. To generate a minimal dataset, you can run:\n```\npython datasets/preprocess/preprocess.py --input assets/objects --output preprocessed_data\n```\nThis script preprocesses GLB files in `./assets/objects` and saves the preprocessed data to `./preprocessed_data`. We provide a pseudo data configuration [here](./datasets/object_part_configs.json), which makes use of the minimal preprocessed data and is compatible with the training settings.\n\n## 🦾 Training\nTo train PartCrafter from scratch, you first need to download TripoSG from [VAST-AI/TripoSG](https://huggingface.co/VAST-AI/TripoSG) and store the weights in `./pretrained_models/TripoSG`. \n```\nhuggingface-cli download VAST-AI/TripoSG --local-dir pretrained_weights/TripoSG\n```\n\nOur training scripts are suitable for training with 8 H20 GPUs (96G VRAM each). Currently, we only finetune the DiT of TripoSG and keep the VAE fixed. But you can also finetune the VAE of TripoSG, which should improve the quality of the generated 3D parts. PartCrafter is compatible with all 3D object generative models based on vector sets such as [Hunyuan3D-2.1](https://github.com/Tencent-Hunyuan/Hunyuan3D-2.1). We warmly welcome pull requests from the community. \n\nWe provide several training configurations [here](./configs). You should modify the path of dataset configs in the training config files, which is currently set to `./datasets/object_part_configs.json`. \n\nIf you use `wandb`, you should also modify the `WANDB_API_KEY` in the training script. If you have trouble connecting to `wandb`, try `export WANDB_BASE_URL=https://api.bandw.top`. \n\nTrain PartCrafter from TripoSG:\n```\nbash scripts/train_partcrafter.sh --config configs/mp8_nt512.yaml --use_ema \\\n  --gradient_accumulation_steps 4 \\\n  --output_dir output_partcrafter \\\n  --tag scaleup_mp8_nt512\n```\n\nFinetune PartCrafter with larger number of parts:\n```\nbash scripts/train_partcrafter.sh --config configs/mp16_nt512.yaml --use_ema \\\n  --gradient_accumulation_steps 4 \\\n  --output_dir output_partcrafter \\\n  --load_pretrained_model scaleup_mp8_nt512 \\\n  --load_pretrained_model_ckpt 10 \\\n  --tag scaleup_mp16_nt512\n```\n\nFinetune PartCrafter with more tokens:\n```\nbash scripts/train_partcrafter.sh --config configs/mp16_nt1024.yaml --use_ema \\\n  --gradient_accumulation_steps 4 \\\n  --output_dir output_partcrafter \\\n  --load_pretrained_model scaleup_mp16_nt512 \\\n  --load_pretrained_model_ckpt 10 \\\n  --tag scaleup_mp16_nt1024\n```\n\n## 😊 Acknowledgement\nWe would like to thank the authors of [DiffSplat](https://chenguolin.github.io/projects/DiffSplat/), [TripoSG](https://yg256li.github.io/TripoSG-Page/), [HoloPart](https://vast-ai-research.github.io/HoloPart/), and [MIDI-3D](https://huanngzh.github.io/MIDI-Page/) \nfor their great work and generously providing source codes, which inspired our work and helped us a lot in the implementation. \n\n\n## 📚 Citation\nIf you find our work helpful, please consider citing:\n```bibtex\n@misc{lin2025partcrafter,\n  title={PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers}, \n  author={Yuchen Lin and Chenguo Lin and Panwang Pan and Honglei Yan and Yiqiang Feng and Yadong Mu and Katerina Fragkiadaki},\n  year={2025},\n  eprint={2506.05573},\n  url={https://arxiv.org/abs/2506.05573}\n}\n```\n\n## 🌟 Star History\n[![Star History Chart](https://api.star-history.com/svg?repos=wgsxm/PartCrafter\u0026type=Date)](https://www.star-history.com/#wgsxm/PartCrafter\u0026Date)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwgsxm%2FPartCrafter","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwgsxm%2FPartCrafter","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwgsxm%2FPartCrafter/lists"}