How to build a text-to-speech Discord voice bot in Python

In this post, you'll learn to use the Uberduck API to build a Discord bot that can send text-to-speech messages in voice channels.

Zach Ocean

Head of Marketing

If you want to share or embed audio online, the easiest option is usually a free video hosting site like YouTube. (As a voice AI site, we have to do this all the time!)

Unfortunately, sites like YouTube don't support uploading audio files directly. Here's how you can create a video with your audio and a static background image so that you can upload it and embed it for free.

1. You need an image to use as the background for your video. You can generate one with DALL-E through ChatGPT, or make one with a tool like Canva.

2. Save the image in the same directory as your audio file.

3. Combine the audio and image into a video using ffmpeg. Run this command from the directory that contains both files, replacing background-image.png and audio.wav with your own file names:

ffmpeg -loop 1 -i background-image.png -i audio.wav -c:v libx264 -tune stillimage -pix_fmt yuv420p -shortest output.mp4

Now you can upload output.mp4 to YouTube or any other video hosting site.
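
If you do this often, the command above can be wrapped in a small shell function. The following is only a sketch under a few assumptions: the function name audio_to_video and the DRY_RUN variable are made up for this example and are not part of ffmpeg, and the output name defaults to the audio file's name with an .mp4 extension. The ffmpeg flags are exactly the ones from step 3, with a comment on what each one does.

```shell
#!/bin/sh
# Hypothetical helper: turn one audio file plus one still image into a video.
# Usage: audio_to_video IMAGE AUDIO [OUTPUT]
# OUTPUT defaults to the audio file name with its extension replaced by .mp4.
audio_to_video() {
  image=$1
  audio=$2
  output=${3:-${audio%.*}.mp4}

  # -loop 1            repeat the single image as a continuous video stream
  # -c:v libx264       encode the video as H.264
  # -tune stillimage   x264 tuning intended for still-image content
  # -pix_fmt yuv420p   pixel format that most players and sites accept
  # -shortest          stop when the shortest input ends (the audio, since
  #                    the looped image never ends on its own)
  set -- ffmpeg -loop 1 -i "$image" -i "$audio" \
    -c:v libx264 -tune stillimage -pix_fmt yuv420p -shortest "$output"

  if [ -n "${DRY_RUN:-}" ]; then
    # Assumed convenience switch: print the command instead of running it.
    echo "$@"
  else
    "$@"
  fi
}

# Example usage (uncomment to run; requires ffmpeg on your PATH):
# for f in *.wav; do
#   audio_to_video background-image.png "$f"
# done
```

Setting DRY_RUN=1 before calling the function prints the ffmpeg command without running it, which is a quick way to check the file names before encoding.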