OpenAI launches new voice intelligence features in its API - BERITAJA

By: Albert Michael - Friday, 08 May 2026 05:24:50 • 3 min read

The OpenAI ChatGPT website displayed on a laptop screen is seen in this illustration photo. Image Credits: Jakub Porzycki/NurPhoto / Getty Images

3:24 PM PDT · May 7, 2026

OpenAI said Thursday that its API will now see a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and interpret conversations with users.

The company’s new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5), this one is built with GPT‑5‑class reasoning that OpenAI says was created to deal with more complex requests from users.

The company is also launching GPT‑Realtime‑Translate which, just as it sounds, is designed to provide real-time translation services that “keep pace” with the user, conversationally. The feature supports more than 70 input languages (that is, the languages that it can comprehend) and 13 output languages (the languages it relays to the speaker).

Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text capabilities that are captured as interactions occur.

“Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,” the company said.

Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features will help with a wide array of areas, including education, media, events, and creator platforms, among others.

As useful as these tools look from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that “conversations can be halted if they are detected as violating our harmful content guidelines,” OpenAI said.

All of the new voice models are included in OpenAI’s Realtime API. Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.
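
To make the split billing model concrete, here is a minimal sketch of how a developer might estimate the cost of a session that mixes the three models. The per-minute and per-token rates are placeholders invented for illustration, not OpenAI's published prices, and the lowercase model identifiers, the `Usage` class, and the `estimate_cost` function are likewise hypothetical.

```python
from dataclasses import dataclass

# Hypothetical rates for illustration only; these are NOT OpenAI's prices.
# The model identifiers are also assumed, not confirmed API names.
PER_MINUTE_RATE_USD = {
    "gpt-realtime-translate": 0.03,
    "gpt-realtime-whisper": 0.01,
}
PER_MILLION_TOKENS_USD = {
    "gpt-realtime-2": 30.0,
}


@dataclass
class Usage:
    """Usage for one model: minutes for per-minute models, tokens otherwise."""
    model: str
    minutes: float = 0.0
    tokens: int = 0


def estimate_cost(usage: Usage) -> float:
    """Return an estimated cost in USD using the billing scheme for the model."""
    if usage.model in PER_MINUTE_RATE_USD:
        # Translate and Whisper: billed by elapsed minutes.
        return usage.minutes * PER_MINUTE_RATE_USD[usage.model]
    if usage.model in PER_MILLION_TOKENS_USD:
        # GPT-Realtime-2: billed by tokens consumed.
        return usage.tokens / 1_000_000 * PER_MILLION_TOKENS_USD[usage.model]
    raise ValueError(f"unknown model: {usage.model}")


if __name__ == "__main__":
    session = [
        Usage("gpt-realtime-translate", minutes=10),
        Usage("gpt-realtime-whisper", minutes=10),
        Usage("gpt-realtime-2", tokens=500_000),
    ]
    total = sum(estimate_cost(u) for u in session)
    print(f"Estimated session cost: ${total:.2f}")
```

The practical difference is that costs for Translate and Whisper scale with how long a session runs, while costs for GPT-Realtime-2 scale with how much is said and generated, so the two need to be tracked separately.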

Lucas is a senior writer at TechCrunch, where he covers artificial intelligence, consumer tech, and startups. He previously covered AI and cybersecurity at Gizmodo. You can contact Lucas by emailing lucas.ropek@beritaja.com.
