https://github.com/genmon/gds-image-describer

Describes images using GPT-4 Vision, using the GDS blog as a testbed
https://github.com/genmon/gds-image-describer

Last synced: about 1 year ago
JSON representation

Describes images using GPT-4 Vision, using the GDS blog as a testbed

Host: GitHub
URL: https://github.com/genmon/gds-image-describer
Owner: genmon
Created: 2024-01-16T10:14:44.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-01-18T11:37:49.000Z (over 2 years ago)
Last Synced: 2025-02-04T13:49:09.660Z (over 1 year ago)
Language: TypeScript
Homepage:
Size: 140 KB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# gds-image-describer

This demo uses the GPT-4 Vision API to generate alt text for images. It helps us understand where this can be useful and what the limitations are.

The demo is scoped to the GDS blog. It works only with URLs that start with `https://gds.blog.gov.uk/...`.

## Initial observations

- The model is good at generating alt text-like descriptions...
- ...but you wouldn't put them on a website without checking and potentially editing/rewriting first.

Therefore something like this may fit into a "auto-suggest" type of workflow.

## Development

To run this locally:

- clone the repo
- install dependencies: `npm install`
- [create an OpenAI API key](https://platform.openai.com/api-keys)
- copy the file `.env.example` to `.env` and add your OpenAI API key
- run the app: `npm run dev`
- visit [127.0.0.1:1999](http://127.0.0.1:1999) in your browser

If you make changes, you can deploy with `npm run deploy`. You will be prompted to sign in to PartyKit, and will be given the public URL of your deployed app at the end of the process. (Note it will take a few minutes for the URL to become active after the first deployment.)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/genmon/gds-image-describer

Awesome Lists containing this project

README