DeepL Voice can translate real-time calls

deepL

DeepL introduces a new AI tool: DeepL Voice. This tool can translate and convert real-time conversations into text.

The online text translator DeepL, which claims to be more accurate than Google, is introducing a new AI tool: DeepL Voice. It allows users to listen to someone speak in one language and then automatically translate it into another language. The AI tool speaks a dozen languages. DeepL Voice is not yet available to the wider public.

In mirror image

DeepL Voice focuses on real-time calls and video conferencing. The spoken input is converted to text only, not audio. It works as follows: you place your smartphone between the two conversationalists on the conference table. On the screen, the translations appear in mirror image. Each participant gets to see the translation of the conversation partner along his side of the screen.

read also

DeepL Voice can translate real-time calls

Audio output is not currently possible, but CEO Jarek Kutylowski let Tech Crunch know that this is coming. “This is DeepL’s first product for voice, but it’s unlikely to be the last,” he said. “DeepL Voice is where translation is going to play out over the next year.” No API is currently available for the voice product.

DeepL Voice is multilingual and can understand the following languages today: English, German, Japanese, Korean, Swedish, Dutch, French, Turkish, Polish, Portuguese, Russian, Spanish and Italian.

It will be some time before DeepL’s new voice product is available to DeepL users. The company is now focusing primarily on B2B partners.