{"id":18664656,"url":"https://github.com/scionoftech/image_caption_generation","last_synced_at":"2025-11-06T10:30:25.910Z","repository":{"id":101410346,"uuid":"229797814","full_name":"scionoftech/image_caption_generation","owner":"scionoftech","description":"Image caption generation using Deep Learning-LSTM","archived":false,"fork":false,"pushed_at":"2019-12-23T18:10:38.000Z","size":1385,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2024-12-27T17:23:42.400Z","etag":null,"topics":["caption-generation","cnn","rnn"],"latest_commit_sha":null,"homepage":null,"language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/scionoftech.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-12-23T17:42:31.000Z","updated_at":"2021-03-11T11:38:39.000Z","dependencies_parsed_at":null,"dependency_job_id":"07d2234a-07a7-4eb8-a080-485c73f3cafb","html_url":"https://github.com/scionoftech/image_caption_generation","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scionoftech%2Fimage_caption_generation","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scionoftech%2Fimage_caption_generation/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scionoftech%2Fimage_caption_generation/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/scionoftech%2Fimage_caption_generation/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/scionoftech","download_url":"https://codeload.github.com/scionoftech/image_caption_generation/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239491352,"owners_count":19647811,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["caption-generation","cnn","rnn"],"created_at":"2024-11-07T08:24:24.347Z","updated_at":"2025-11-06T10:30:25.823Z","avatar_url":"https://github.com/scionoftech.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Image Caption Generation\n\nCaption generation is a challenging artificial intelligence problem where a textual description must be generated for a given photograph.\n\nIt requires both methods from computer vision to understand the content of the image and a language model from the field of natural language processing to turn the understanding of the image into words in the right order. Recently, deep learning methods have achieved state-of-the-art results on examples of this problem.\n\nDeep learning methods have demonstrated state-of-the-art results on caption generation problems. What is most impressive about these methods is a single end-to-end model can be defined to predict a caption, given a photo, instead of requiring sophisticated data preparation or a pipeline of specifically designed models.\n\n### Steps:\n\n* Photo and Caption Dataset\n* Prepare Photo Data\n* Prepare Text Data\n* Build Deep Learning Model\n* Train Model\n* Evaluate Model\n* Generate New Captions\n\n### Model \n\n![image_cap.png](image_cap.png)\n\nThe Flickr 8k dataset is used for image caption generation, which you can download by filling [this](https://forms.illinois.edu/sec/1713398) form provided by the University of Illinois at Urbana-Champaign. \n\nThis dataset contains 8000 images each with 5 captions\n\nOther Datasets,\n\n* [COCO 2015 Image Captioning](http://cocodataset.org/#captions-2015)\n* [Google Conceptual Captions](https://ai.google.com/research/ConceptualCaptions/download)\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscionoftech%2Fimage_caption_generation","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fscionoftech%2Fimage_caption_generation","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fscionoftech%2Fimage_caption_generation/lists"}