Audio Enhance

Sieve's audio enhance app is a one-stop-shop for developer-focused audio filtering. It's a perfect alternative to applications like Adobe Podcast Enhance for those looking for a simple, developer-focused solution. It offers a variety of filters as well as backends to multiple audio processing models to make it easy to pick the right filter and provider for your specific use case.

Key Features

  • Multiple Backends: We offer multiple backends to choose from, including cleanvoice, elevenlabs, auphonic, and sieve-resemble-enhance. Each backend has its own strengths, so you can choose the one that best fits your needs.
  • Task Options: Each backend supports a different set of tasks. Select from all, enhance, or denoise. enhance is used when you want the speech to sound clearer and more natural. denoise is used when you want to remove background noise. all is used when you want both enhancements and denoising.
  • Enhancement Steps: For sieve-resemble-enhance, you can specify the number of enhancement steps to take. Higher values yield better results but can make the process slow.

Note: Functionality to enter your own API keys for these third-party backends is coming soon.

Pick the Right Backend

Below are overall rankings Sieve's team has done through a combination of internal benchmarking and customer anecdotal feedback. Please note that this isn't a comprehensive benchmark and is only to provide a general sense of the relative quality and cost of each backend.

BackendOverallDenoiseEnhanceProsConsData Retention
auphonic1st4th1stBest overall quality, true to original audioMinimum 30 seconds billing7 days
cleanvoice2nd1st3rdExcellent denoising, good overallCan sometimes sound underwater when enhancing7 days
sieve-resemble-enhance3rd2nd2ndCost-effective, very flexibleNot as refined as top options, distortative effects at times48 hours
elevenlabs4th3rdN/AIncredible quality (when stable)Distortative effects at times, no explicit enhance option, expensive3 years

Note: Denoise quality differences are subtle among all options.

Pricing

Third-Party Backends

BackendPrice per Minute
cleanvoice$0.025
elevenlabs$0.11
auphonic$0.025 (minimum 30 seconds billed per request)

Sieve Backend (sieve-resemble-enhance)

Unlike the third-party backends, you are billed for the compute time of the model with no minimum billing. This tends to be the cheapest option by an order of magnitude.

As a compute-based example, this 1-minute long audio with the noise backend option cost us $0.0014.

As another compute-based example, this 1-minute-long audio with the enhance backend cost us $0.0092.

Supported File Formats

Audio File Formats

  • MP3
  • AAC
  • OGG
  • WAV
  • M4A
  • FLAC

Video File Formats

  • MP4
  • AVI
  • MKV
  • WEBM
  • MOV