{"id":31211497,"url":"https://github.com/cyberagentailab/camera3","last_synced_at":"2025-09-21T05:27:40.319Z","repository":{"id":239886731,"uuid":"776950450","full_name":"CyberAgentAILab/camera3","owner":"CyberAgentAILab","description":"CAMERA3: An Evaluation Dataset for Controllable Ad Text Generation in Japanese","archived":false,"fork":false,"pushed_at":"2024-05-15T02:49:30.000Z","size":5282,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":2,"default_branch":"main","last_synced_at":"2025-09-10T07:42:51.725Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/CyberAgentAILab.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-03-24T21:59:40.000Z","updated_at":"2025-05-18T15:41:11.000Z","dependencies_parsed_at":"2024-05-15T23:10:45.283Z","dependency_job_id":"23523e71-48ed-44a6-9866-f375faabb9c8","html_url":"https://github.com/CyberAgentAILab/camera3","commit_stats":null,"previous_names":["cyberagentailab/camera3"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/CyberAgentAILab/camera3","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CyberAgentAILab%2Fcamera3","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CyberAgentAILab%2Fcamera3/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CyberAgentAILab%2Fcamera3/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CyberAgentAILab%2Fcamera3/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/CyberAgentAILab","download_url":"https://codeload.github.com/CyberAgentAILab/camera3/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/CyberAgentAILab%2Fcamera3/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":274496646,"owners_count":25296423,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-09-10T02:00:12.551Z","response_time":83,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2025-09-21T05:27:34.836Z","updated_at":"2025-09-21T05:27:38.583Z","avatar_url":"https://github.com/CyberAgentAILab.png","language":null,"funding_links":[],"categories":[],"sub_categories":[],"readme":"# CAMERA\u003csup\u003e3\u003c/sup\u003e dataset\n\nCAMERA\u003csup\u003e3\u003c/sup\u003e is an evaluation dataset for controllable text generation in the advertising domain in Japanese.\nCAMERA\u003csup\u003e3\u003c/sup\u003e  includes 3,980 ad texts written by expert annotators, taking into account various aspects of ad appeals in LP (Landing Page).\n\n- The annotated data is available at [`data/`](data/) directory in this repository in `json` and `jsonl`.\n- The LP images are available [here](https://storage.googleapis.com/camera-cubed-public/camera-cubed-v1-lp-screenshot-sliced.tar.gz) (2.3GB).\n\n## Dataset\n- json: [`data/camera3-v1.json`](data/camera3-v1.json)\n- jsonl: [`data/camera3-v1.jsonl`](data/camera3-v1.jsonl)\n- lp_images: `camera3-v1-lp-screenshot-sliced/screen-1200-{lp_image_sliced}.png`\n\n| Name | Description |\n| --- | ---- |\n| instance_id | unique id|\n| lp_image_sliced | id associated with LP image |\n| annotator_id | annotator id |\n| kw | search keyword |\n| lp_meta_description | meta description extracted from LP |\n| lp_image_sliced_ocr_text | OCR results for LP imagge |\n| ad_appeal_type | ad appeal type |\n| ad_text | ad text |\n\n\n- Example `json` entry:\n    ```json\n    {\n        \"instance_id\":0,\n        \"lp_image_sliced\":\"screen-1200-100303_00.png\",\n        \"annotator_id\":5,\n        \"kw\":\"マイカー 共済\",\n        \"lp_meta_description\":\"2022年最新の自動車保険のランキングを発表！...\",\n        \"lp_image_sliced_ocr_text\":\"コのほけん!\\n保険比較のコのほけん!\u003e...\",\n        \"ad_appeal_type\":\"価格\",\n        \"ad_text\":\"ネットからの契約だと割引あり\"\n    }\n    ```\n\n## Citation\n```\n@inproceedings{inoue-etal-2024-camera3,\n    title = \"CAMERA³: An Evaluation Dataset for Controllable Ad Text Generation in Japanese\",\n    author = \"Inoue, Go and\n    Kato, Akihiko and\n    Mita, Masato and\n    Honda, Ukyo and\n    Zhang, Peinan\",\n    booktitle = \"Proceedings of the Fourteenth Language Resources and Evaluation Conference\",\n    month = may,\n    year = \"2024\",\n    address = \"Turin, Italy\",\n    publisher = \"European Language Resources Association\"\n}\n```\n\n## License\nThe dataset is licensed under [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](https://creativecommons.org/licenses/by-nc-sa/4.0/).\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyberagentailab%2Fcamera3","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fcyberagentailab%2Fcamera3","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fcyberagentailab%2Fcamera3/lists"}