{"id":18096664,"url":"https://github.com/andreaschandra/codegen-extended","last_synced_at":"2025-08-09T23:09:09.870Z","repository":{"id":237422064,"uuid":"668079086","full_name":"andreaschandra/codegen-extended","owner":"andreaschandra","description":"CODEGEN: AN OPEN LARGE LANGUAGE MODEL FOR CODE WITH MULTI-TURN PROGRAM SYNTHESIS","archived":false,"fork":false,"pushed_at":"2023-07-19T01:35:22.000Z","size":1421,"stargazers_count":0,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-04-06T03:27:38.996Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/andreaschandra.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":"CODEOWNERS","security":"SECURITY.md","support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2023-07-19T01:34:24.000Z","updated_at":"2023-07-19T01:35:30.000Z","dependencies_parsed_at":null,"dependency_job_id":"bb76a2aa-2a10-49c3-8d73-e4251b7118d5","html_url":"https://github.com/andreaschandra/codegen-extended","commit_stats":null,"previous_names":["andreaschandra/codegen-extended"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/andreaschandra/codegen-extended","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andreaschandra%2Fcodegen-extended","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andreaschandra%2Fcodegen-extended/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andreaschandra%2Fcodegen-extended/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andreaschandra%2Fcodegen-extended/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/andreaschandra","download_url":"https://codeload.github.com/andreaschandra/codegen-extended/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/andreaschandra%2Fcodegen-extended/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":269649848,"owners_count":24453541,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-08-09T02:00:10.424Z","response_time":111,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-31T19:15:01.200Z","updated_at":"2025-08-09T23:09:09.809Z","avatar_url":"https://github.com/andreaschandra.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"\u003cp align=\"center\"\u003e\n  \u003cimg src=\"assets/codegen_logo.png\" width=\"25%\"\u003e\n\u003c/p\u003e\n\n# CodeGen\nOfficial release for the **CodeGen1** and **CodeGen2** models (`350M`, `1B`, `3B`, `7B` `16B`) for **Program Synthesis** by [Salesforce AI Research](https://www.salesforceairesearch.com/).\n\n\u003cp align=\"center\"\u003e\n  \u003cimg src=\"assets/two.gif\" width=\"60%\"\u003e\n\u003c/p\u003e\n\n## News\n\n**July 2023**\n\n[**CodeGen2.5**](https://github.com/salesforce/CodeGen/tree/main/codegen25) released outperforming 16B parameter models with only 7B.\n\n**May 2023**\n\n**CodeGen2.0** released with strong infill sampling capability.\n\n**March 2022**\n\n**CodeGen1.0** released on par with OpenAI Codex at the time.\n\n## Publications\n\n[CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis](https://arxiv.org/abs/2203.13474)  \n[Erik Nijkamp](https://enijkamp.github.io/)\\*, [Bo Pang](https://scholar.google.com/citations?user=s9fNEVEAAAAJ\u0026hl=en)\\*, [Hiroaki Hayashi](https://hiroakih.me/)\\*, [Lifu Tu](https://home.ttic.edu/~lifu/), [Huan Wang](https://scholar.google.com/citations?user=7NpTttkAAAAJ\u0026hl=en), [Yingbo Zhou](https://scholar.google.com/citations?user=H_6RQ7oAAAAJ\u0026hl=en), [Silvio Savarese](https://scholar.google.com/citations?user=ImpbxLsAAAAJ\u0026hl=en), and [Caiming Xiong](https://scholar.google.com/citations?user=vaSdahkAAAAJ\u0026hl=en)   \nICLR, 2023\n\n[CodeGen2: Lessons for Training LLMs on Programming and Natural Languages](https://arxiv.org/abs/2305.02309)   \n[Erik Nijkamp](https://enijkamp.github.io/)\\*, [Hiroaki Hayashi](https://hiroakih.me/)\\*, [Caiming Xiong](https://scholar.google.com/citations?user=vaSdahkAAAAJ\u0026hl=en), [Silvio Savarese](https://scholar.google.com/citations?user=ImpbxLsAAAAJ\u0026hl=en), and [Yingbo Zhou](https://scholar.google.com/citations?user=H_6RQ7oAAAAJ\u0026hl=en)  \nICLR, 2023\n\n## Usage\n\nThe models are available on the [HuggingFace Hub](https://huggingface.co/models?search=salesforce+codegen).\n\n**CodeGen1.0**\n\n```python\nimport torch\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\ntokenizer = AutoTokenizer.from_pretrained(\"Salesforce/codegen-2B-mono\")\nmodel = AutoModelForCausalLM.from_pretrained(\"Salesforce/codegen-2B-mono\")\ninputs = tokenizer(\"# this function prints hello world\", return_tensors=\"pt\")\nsample = model.generate(**inputs, max_length=128)\nprint(tokenizer.decode(sample[0], truncate_before_pattern=[r\"\\n\\n^#\", \"^'''\", \"\\n\\n\\n\"]))\n```\n\n**CodeGen2.0**\n\n```python\nimport torch\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\ntokenizer = AutoTokenizer.from_pretrained(\"Salesforce/codegen2-7B\")\nmodel = AutoModelForCausalLM.from_pretrained(\"Salesforce/codegen2-7B\", trust_remote_code=True, revision=\"main\")\ninputs = tokenizer(\"# this function prints hello world\", return_tensors=\"pt\")\nsample = model.generate(**inputs, max_length=128)\nprint(tokenizer.decode(sample[0], truncate_before_pattern=[r\"\\n\\n^#\", \"^'''\", \"\\n\\n\\n\"]))\n```\n\n**CodeGen2.5**\n\n```python\nimport torch\nfrom transformers import AutoTokenizer, AutoModelForCausalLM\n\ntokenizer = AutoTokenizer.from_pretrained(\"Salesforce/codegen25-7b-mono\", trust_remote_code=True)\nmodel = AutoModelForCausalLM.from_pretrained(\"Salesforce/codegen25-7b-mono\")\ninputs = tokenizer(\"# this function prints hello world\", return_tensors=\"pt\")\nsample = model.generate(**inputs, max_length=128)\nprint(tokenizer.decode(sample[0]))\n```\n\n## Training\n\nThe Jaxformer library for data pre-processing, training and fine-tuning the CodeGen models can be found here:\n\nhttps://github.com/salesforce/jaxformer\n\n## Citation\nIf you find our code or paper useful, please cite the paper:\n```bibtex\n@article{nijkamp2022codegen,\n  title={CodeGen: An Open Large Language Model for Code with Multi-Turn Program Synthesis},\n  author={Nijkamp, Erik and Pang, Bo and Hayashi, Hiroaki and Tu, Lifu and Wang, Huan and Zhou, Yingbo and Savarese, Silvio and Xiong, Caiming},\n  journal={ICLR},\n  year={2023}\n}\n\n@article{nijkamp2023codegen2,\n  title={CodeGen2: Lessons for Training LLMs on Programming and Natural Languages},\n  author={Nijkamp, Erik and Hayashi, Hiroaki and Xiong, Caiming and Savarese, Silvio and Zhou, Yingbo},\n  journal={ICLR},\n  year={2023}\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandreaschandra%2Fcodegen-extended","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fandreaschandra%2Fcodegen-extended","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fandreaschandra%2Fcodegen-extended/lists"}