Heaptalk, Jakarta — Indonesia-based startup that focuses on developing AI and Natural Language Processing (NLP) such as speech-to-text, meeting transcription, anti-hoax, voice biometrics, technology regulatory, Prosa.ai, has advanced its cloud-based text-to-speech (TTS) product by allowing users to create about 40 voice variation shortly.
This startup previously provided only three voices in Indonesian, with one male voice and two female voices in 2021. Prosa.ai has upgraded its newest TTS technology by granting ten new voices, augmenting an English version, and embedding the pause and custom voice features, demonstrating its outstanding progress. The features allow them to download audio formats supported by several platforms, such as YouTube, TikTok, and other social media apps.
“We believe the space for developing TTS technology is still promising, specifically to improve voice quality and make it more natural. The need for this technology continues to increase, as indicated by the number of search volumes for TTS technology in Google search results (SERP) that we observe through web traffic analysis tools.” Teguh Eko Budiarto, Co-founder and CEO of Prosa.ai said.
Referring to the recent team survey, the data indicated that audiobook demands occupied the highest percentage, up to 24% of the total audio generation. To accommodate these needs, Prosa.ai established a voice designed primarily for an Indonesian-language audiobook called the character Dini.
In its latest version, Prosa’s text-to-speech product is equipped with several features, spanning:
- Speech Synthesizer to convert written text into speech
- Human-sounding voices enable users to choose voice characters and diction naturally
- Voice tuning to adjust the pitch and adjust the speaking speed
- Flexible Audio File to generate audio files with diverse formats (WAV, MP3, and OPUS), enabling users to play, save, and edit the file.
Existing since 2021, Prosa.ai posted significant enhancements by obtaining the total users to reach 300,000 people by early 2024. This user amount has risen tenfold in the last three years, with an average monthly active user of 20,000. In terms of transactions, the startup has also noted that the total transactions for Prosa’s TTS package have reached more than 25,000 transactions since its launch.
This year, Prosa.ai will enhance its other features, such as voice emotions, and develop a marketplace platform devoted to the voice of talent who want to uplift their income by contributing their voice to the Prosa TTS. Through this move, the CEO of Prosa.ai aims to boost the number of users around two-fold YoY, with a total transaction growth of up to 1,5x compared to the previous year.