Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/nuhu-ibrahim/website-to-speech
This python code accepts a valid url, coverts it to an image, uses computer vision to predict what is on the website and then convert the prediction to speech that can be listened to.
https://github.com/nuhu-ibrahim/website-to-speech
computer-vision deep-learning helping-virtually-impaired machine-learning website-to-speech
Last synced: 24 days ago
JSON representation
This python code accepts a valid url, coverts it to an image, uses computer vision to predict what is on the website and then convert the prediction to speech that can be listened to.
- Host: GitHub
- URL: https://github.com/nuhu-ibrahim/website-to-speech
- Owner: nuhu-ibrahim
- Created: 2021-02-28T18:49:28.000Z (almost 4 years ago)
- Default Branch: main
- Last Pushed: 2021-03-07T06:03:57.000Z (almost 4 years ago)
- Last Synced: 2024-10-23T04:17:42.734Z (2 months ago)
- Topics: computer-vision, deep-learning, helping-virtually-impaired, machine-learning, website-to-speech
- Language: Jupyter Notebook
- Homepage:
- Size: 438 KB
- Stars: 2
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Website to Speech
Created by [Nuhu Ibrahim](https://nuhuibrahim.com)## Project Goals
The goal of this project is to develop a python script that:
1. Converts a valid web URL into an image,
2. Automatically Describe the image using a caption,
3. and coverts the caption into audio so that it could be read aloud and listened to.This project aims to assist visually impaired people in gaining a general overview of a website so that they can infer if the website is worth spending time on or not. It is generally known that it takes a long time for people with visual impairment to go through a webpage completely.
Appreciation to the [Public APIs Github Repository](https://github.com/public-apis/public-apis.git) as all the APIs used in this project was first found there.
## Setting up the API Keys
### Part 1: Converting a website to an image
You will need to create an account with restpack [here](https://restpack.io/console/register) and then assign your access token to the "restpack_key" variable in the code.### Part 2: Automatically describing the image using a simple caption
You will need to register with Cloudmersive website [here](https://account.cloudmersive.com/signup), create an api key and assign it to the "cloudmersive_key" variable in the code.### Part 3: Converting the caption into an audio so that it could be read aloud
You need to first create an acount on the IBM text to speech website [here](https://cloud.ibm.com/registration?target=%2Fdocs%2Ftext-to-speech%2Fgetting-started.html), create an api key and then assign the key to the "ibm_text_to_speech_key" variable in the code below. You may also need to get the service url and assign it to the ibm_service_url variable## Installing all required packages
For the python file to work effectively, you need to install the packages below from your terminal.```bash
# Run the following commands on your terminal.
pip install cloudmersive-validate-api-client
pip install cloudmersive-image-api-client
pip install --upgrade "ibm-watson>=5.0.0"
pip install playsound
pip install -U PyObjC
pip install pathlib
pip install validators
```## Running the program
```bash
# Run the following commands on your terminal from the project root directory.
python3 website_to_speech.py#URL is the web address of the page whose prediction you want to listen to aloud.
# ...and that is all.
```
## Thank you
If you find this code interesting or useful, kindly click on the like button and follow me for more.