How do I transcribe files stored in AWS S3 or Google Cloud Storage?
Often, you have your audio files stored in a private bucket in AWS S3 or Google Cloud Storage. AssemblyAI needs to be able to access these files in order to download and transcribe them. Both AWS and Google Cloud have the concept of a "Pre-Signed URL". You can create a Pre-Signed URL for an object (i.e. audio file) in your private bucket that makes the object temporarily available for AssemblyAI to read.
With a Pre-Signed URL, we have temporary access to the file. When you create a Pre-Signed URL, you can explicitly set the expiration window for the URL. We recommend having the URLs expire in 30 minutes.
Check out this post from our blog to learn more about submitting files from an S3 bucket.
Check out these resources to learn more about Pre-Signed URLs in AWS and Google Cloud: