NTT Docomo Business, a subsidiary of Japan’s largest telecom operator, and ElevenLabs Japan have announced a collaboration to develop AI‑powered voice services for contact centres. The partnership, disclosed on 28 June 2026, will combine ElevenLabs’ speech‑generation technology with NTT Docomo Business’s expertise in enterprise communications and AI deployment. The goal is to improve service quality and operational efficiency by enabling more natural voice interactions in customer‑service environments.

ElevenLabs, founded in 2022, specialises in deep‑learning text‑to‑speech (TTS) systems that produce highly realistic audio. According to the company’s website, its models support more than 70 languages, deliver sub‑100‑millisecond latency, and offer a wide range of voice options. The firm has positioned itself as a leading provider of natural‑sounding speech for business applications.

NTT Docomo Business has long focused on delivering communication solutions to enterprises, including cloud‑based contact‑centre platforms. The company’s portfolio includes voice‑over‑IP services, unified communications, and AI‑enhanced customer‑experience tools. By integrating ElevenLabs’ TTS engine, the partnership aims to replace or augment human agents with synthetic voices that can handle routine inquiries, provide multilingual support, and maintain consistent tone and quality.

The collaboration arrives amid a broader trend of telecom operators partnering with AI startups to modernise contact‑centre operations. Japan’s labour market has faced a shortage of skilled call‑centre staff, and firms are increasingly turning to automation to reduce costs and improve response times. AI‑powered voice agents can also mitigate issues such as agent fatigue and inconsistent customer interactions.

While the announcement did not disclose financial terms, both companies highlighted the strategic fit. The partnership will likely involve embedding ElevenLabs’ API into NTT Docomo Business’s existing contact‑centre infrastructure, allowing enterprises to deploy synthetic voices without building their own TTS models. The integration is expected to support real‑time voice generation, adaptive prosody, and emotion‑recognition features that can tailor responses to caller sentiment.

Industry analysts note that the move could accelerate the adoption of AI in Japanese contact centres, where regulatory scrutiny around data privacy and voice‑recording is high. NTT Docomo Business has a track record of complying with Japan’s stringent telecommunications regulations, and the partnership will need to address any data‑handling requirements for voice data generated by ElevenLabs’ models.

The partnership also aligns with ElevenLabs’ recent expansion into enterprise markets. The firm’s business plan, announced in August 2024, introduced a new tier for large‑scale deployments and a credit‑rollover system to support continuous usage. By partnering with a major telecom operator, ElevenLabs can reach a wider customer base and benefit from NTT Docomo Business’s global network infrastructure.

As the collaboration moves from announcement to implementation, both companies are expected to release technical specifications and integration guides in the coming months. The first pilot projects may involve small‑to‑medium enterprises that rely on NTT Docomo Business’s cloud‑based contact‑centre solutions. If successful, the partnership could pave the way for broader rollouts across Japan’s corporate sector.

In summary, the NTT Docomo Business–ElevenLabs partnership marks a significant step toward AI‑enhanced customer service in Japan. By combining advanced speech synthesis with robust enterprise communications, the collaboration aims to deliver natural‑sounding voice interactions that improve efficiency and customer satisfaction. The partnership remains in the early integration phase, with detailed product releases and deployment timelines expected later in 2026.