https://github.com/simalexan/s3-lambda-transcribe-audio-to-text-s3
Transcribe your audio to text with this serverless component
audio lambda s3 serverless speech-to-text transcribe transcribe-audio-files
- Host: GitHub
- URL: https://github.com/simalexan/s3-lambda-transcribe-audio-to-text-s3
- Owner: simalexan
- Created: 2019-05-07T23:55:15.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2020-01-19T21:58:07.000Z (over 5 years ago)
- Last Synced: 2025-02-27T10:09:22.094Z (3 months ago)
- Topics: audio, lambda, s3, serverless, speech-to-text, transcribe, transcribe-audio-files
- Language: JavaScript
- Homepage:
- Size: 5.86 KB
- Stars: 94
- Watchers: 5
- Forks: 21
- Open Issues: 2
Metadata Files:
- Readme: README.md
README
# S3 Bucket -> Lambda (AWS Transcribe Audio to Text) -> S3 Bucket
## Description
This serverless component takes MP3, MP4, WAV, or FLAC audio files uploaded to one S3 Bucket, transcribes them to text using Lambda and AWS Transcribe, and uploads the result to another S3 Bucket as JSON. It contains:
- an Input S3 Bucket that accepts MP3, MP4, WAV, and FLAC audio files.
- a Lambda that takes the audio file from the Input S3 Bucket, transcribes it to text, and uploads the transcript to the Output S3 Bucket.
- an Output S3 Bucket that receives the transcript as JSON files.
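
To make the flow above more concrete, here is a minimal sketch of how a Lambda could turn an S3 upload event record into a request for the Amazon Transcribe `StartTranscriptionJob` operation. It is not the repository's actual code: the function name `buildTranscriptionParams`, its arguments, and the job-naming scheme are hypothetical and chosen only for illustration. The request fields (`TranscriptionJobName`, `LanguageCode`, `MediaFormat`, `Media.MediaFileUri`, `OutputBucketName`) are those of the Transcribe API.

```javascript
// Illustrative sketch only; helper name and structure are assumptions,
// not taken from this repository.
const path = require('path');

// The four audio formats this component accepts.
const SUPPORTED_FORMATS = ['mp3', 'mp4', 'wav', 'flac'];

// Builds StartTranscriptionJob parameters from a single S3 event record.
function buildTranscriptionParams(record, languageCode, outputBucket) {
  const bucket = record.s3.bucket.name;
  // S3 event notifications URL-encode object keys and use "+" for spaces.
  const key = decodeURIComponent(record.s3.object.key.replace(/\+/g, ' '));

  const extension = path.extname(key);
  const mediaFormat = extension.slice(1).toLowerCase();
  if (!SUPPORTED_FORMATS.includes(mediaFormat)) {
    throw new Error(`Unsupported media format: "${mediaFormat}"`);
  }

  // Job names may only contain letters, digits, ".", "_" and "-",
  // so other characters are replaced; a timestamp keeps names distinct.
  const baseName = path.basename(key, extension)
    .replace(/[^0-9a-zA-Z._-]/g, '_')
    .slice(0, 150);

  return {
    TranscriptionJobName: `${baseName}-${Date.now()}`,
    LanguageCode: languageCode,
    MediaFormat: mediaFormat,
    Media: { MediaFileUri: `s3://${bucket}/${key}` },
    OutputBucketName: outputBucket,
  };
}

module.exports = { buildTranscriptionParams };
```

In a handler, the returned object could be passed to the AWS SDK's Transcribe client (for example `startTranscriptionJob` in the SDK for JavaScript v2), with the language code and Output bucket name supplied through environment variables. How the actual component wires these values together is not shown here.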
## Deployment Parameters
This component has one CloudFormation deployment parameter:
- `LanguageCode`, a required parameter, specifies the language spoken in the audio file that AWS Transcribe should transcribe. Possible values are:
  - 'en-US'
  - 'es-US'
  - 'en-AU'
  - 'fr-CA'
  - 'en-GB'
  - 'de-DE'
  - 'pt-BR'
  - 'fr-FR'
  - 'it-IT'
  - 'ko-KR'
  - 'es-ES'

## Latest Release - 1.1.0
- Upgraded to Node.js 12.x LTS
## Roadmap - Upcoming changes
Here are the upcoming changes that I'll add to this serverless component:
- ESLint
- Tests