Proof of concept for transcoding podcasts into text using the GCP Speech2Text service, following its Node.js tutorial.

Installation

Download this repo:

git clone https://github.com/emibcn/Podcast2Text.git

Change directory into it:

cd Podcast2Text

Create local directories:

mkdir flac credentials

Create GCP credentials for consuming the Speech2Text service in GCP IAM, with at least the Service Usage Consumer permission.
Copy the credentials file to the ./credentials directory.
Create a .env file containing GOOGLE_APPLICATION_CREDENTIALS=[CREDENTIALS FILENAME] (the filename only, without the directory).
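
The last two steps can be sketched as follows. This is a minimal example, assuming a hypothetical key filename `my-service-account.json`; substitute the name of the file you downloaded from GCP IAM. The copy command is left commented out because the source path depends on where you saved the key.

```shell
# Hypothetical credentials filename (an assumption, not from the repo).
CREDENTIALS_FILENAME="my-service-account.json"

# Make sure the local credentials directory exists.
mkdir -p credentials

# Copy the downloaded key into ./credentials (adjust the source path first).
# cp /path/to/downloaded/key.json "credentials/${CREDENTIALS_FILENAME}"

# Write the .env file with the filename only, without the directory.
printf 'GOOGLE_APPLICATION_CREDENTIALS=%s\n' "$CREDENTIALS_FILENAME" > .env

# Show the result for a quick visual check.
cat .env
```

Note that the value stored in .env is the bare filename, as the step above requires, not the path `credentials/my-service-account.json`.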

Usage

There is a helper script to transcode any audio file into text. Its syntax is:

./transcode.sh <FILEPATH> [START]

FILEPATH: Path (relative or absolute) to podcast audio file
START: Optional start position (transcoding begins at this position). Same syntax as the ffmpeg -ss option.

The script encodes the supplied file to FLAC format in the ./flac directory, sends the encoded file to the GCP Speech2Text service, and prints the resulting transcription on screen.
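
The encoding step might look roughly like the sketch below. It is an illustration under stated assumptions, not the contents of transcode.sh: the input path, the output naming scheme, and the `-ac 1` mono downmix are all hypothetical. The command is printed with `echo` rather than executed, so the snippet runs even where ffmpeg is not installed.

```shell
# Hypothetical inputs (assumptions for illustration only).
FILEPATH="episodes/my-podcast.mp3"
START="00:01:30"   # same syntax as the ffmpeg -ss option

# Derive an output path inside ./flac from the input file name
# (assumed naming scheme; the real script may differ).
OUT="flac/$(basename "${FILEPATH%.*}").flac"

# Print the ffmpeg command that would seek to START and encode to FLAC.
# -ac 1 downmixes to a single channel (an assumption about the script).
echo ffmpeg -ss "$START" -i "$FILEPATH" -ac 1 "$OUT"
```

With the real script, the equivalent invocation would be `./transcode.sh episodes/my-podcast.mp3 00:01:30`, or `./transcode.sh episodes/my-podcast.mp3` to start from the beginning. START also accepts a plain number of seconds, as ffmpeg -ss does.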
