https://github.com/nuhu-ibrahim/website-to-speech

This python code accepts a valid url, coverts it to an image, uses computer vision to predict what is on the website and then convert the prediction to speech that can be listened to.
https://github.com/nuhu-ibrahim/website-to-speech

computer-vision deep-learning helping-virtually-impaired machine-learning website-to-speech

Last synced: about 2 months ago
JSON representation

This python code accepts a valid url, coverts it to an image, uses computer vision to predict what is on the website and then convert the prediction to speech that can be listened to.

Host: GitHub
URL: https://github.com/nuhu-ibrahim/website-to-speech
Owner: nuhu-ibrahim
Created: 2021-02-28T18:49:28.000Z (about 4 years ago)
Default Branch: main
Last Pushed: 2021-03-07T06:03:57.000Z (about 4 years ago)
Last Synced: 2025-02-04T20:30:35.509Z (4 months ago)
Topics: computer-vision, deep-learning, helping-virtually-impaired, machine-learning, website-to-speech
Language: Jupyter Notebook
Homepage:
Size: 438 KB
Stars: 2
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Website to Speech
Created by [Nuhu Ibrahim](https://nuhuibrahim.com)

## Project Goals
The goal of this project is to develop a python script that:
1. Converts a valid web URL into an image,
2. Automatically Describe the image using a caption,
3. and coverts the caption into audio so that it could be read aloud and listened to.

This project aims to assist visually impaired people in gaining a general overview of a website so that they can infer if the website is worth spending time on or not. It is generally known that it takes a long time for people with visual impairment to go through a webpage completely.

Appreciation to the [Public APIs Github Repository](https://github.com/public-apis/public-apis.git) as all the APIs used in this project was first found there.

## Setting up the API Keys
### Part 1: Converting a website to an image
You will need to create an account with restpack [here](https://restpack.io/console/register) and then assign your access token to the "restpack_key" variable in the code.

### Part 2: Automatically describing the image using a simple caption
You will need to register with Cloudmersive website [here](https://account.cloudmersive.com/signup), create an api key and assign it to the "cloudmersive_key" variable in the code.

### Part 3: Converting the caption into an audio so that it could be read aloud
You need to first create an acount on the IBM text to speech website [here](https://cloud.ibm.com/registration?target=%2Fdocs%2Ftext-to-speech%2Fgetting-started.html), create an api key and then assign the key to the "ibm_text_to_speech_key" variable in the code below. You may also need to get the service url and assign it to the ibm_service_url variable

## Installing all required packages
For the python file to work effectively, you need to install the packages below from your terminal.

```bash
# Run the following commands on your terminal.
pip install cloudmersive-validate-api-client
pip install cloudmersive-image-api-client
pip install --upgrade "ibm-watson>=5.0.0"
pip install playsound
pip install -U PyObjC
pip install pathlib
pip install validators
```

## Running the program
```bash
# Run the following commands on your terminal from the project root directory.
python3 website_to_speech.py

#URL is the web address of the page whose prediction you want to listen to aloud.

# ...and that is all.
```
## Thank you
If you find this code interesting or useful, kindly click on the like button and follow me for more.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/nuhu-ibrahim/website-to-speech

Awesome Lists containing this project

README