podscript is a tool to generate transcripts for podcasts (and other similar audio files), using LLMs and other Speech-to-Text (STT) APIs. Currently, C

Search code, repositories, users, issues, pull requests...

submited by

Style Pass

2024-07-05 15:00:06

podscript is a tool to generate transcripts for podcasts (and other similar audio files), using LLMs and other Speech-to-Text (STT) APIs. Currently, ChatGPT and Deepgram are supported.

Alternatively, you can set keys in environment variable prefixed with PODSCRIPT_, for e.g. PODSCRIPT_OPENAI_API_KEY and PODSCRIPT_DEEPGRAM_API_KEY.

For podcasts on YouTube with autogenerated captions (e.g. Andrew Huberman and Cal Newport), use the ytt subcommand to download the captions from the YouTube video and feed it to ChatGPT to generate a clean transcript.

Use the deepgram subcommand to generate transcripts that are of a higher quality than YouTube autogenerated captions. Deepgram provides a great API (with $200 free signup credit!) and excellent, fast models for transcribing audio files.

Alternatively, you can pass a local audio file to the command by setting --from-file instead of --from-url. You can also customise the path and add a recognizable suffix with --path and --suffix options.