Translation quality, in the open

AI Translation Quality Benchmark

How well InterMIND actually translates — measured by our automated test suite over a fixed FLORES-200 benchmark set, per language pair, every month. Synthetic and reproducible, not cherry-picked averages.

Chat translation

99/100

+1 vs prev median · 21 pairs · 126 runs · Jul 2026

typed messages, translated inline

Voice translation

77/100

+9 vs prev median · 22 pairs · 43 runs · Jul 2026

one score for speech recognition + translation, end-to-end


43 language pairs · 169 scored runs · 23 languages · updated monthly · Jul 2026

Score = semantic fidelity to a reference translation (0–100). Higher is better.

Excellent 90+ · Good 70–89 · Fair 50–69 · Weak <50
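As a minimal sketch of how the bands above could be applied to a score, assuming fractional scores are allowed and that each band starts at its lower bound (the page only lists integer ranges), a hypothetical helper `quality_band` might look like this:

```python
def quality_band(score: float) -> str:
    """Map a 0-100 score to the band names listed on the page.

    Assumption: a fractional score such as 89.5 stays in the lower band
    ("Good") until it reaches the next threshold.
    """
    if not 0 <= score <= 100:
        raise ValueError(f"score must be between 0 and 100, got {score}")
    if score >= 90:
        return "Excellent"
    if score >= 70:
        return "Good"
    if score >= 50:
        return "Fair"
    return "Weak"


# The two headline scores quoted at the top of the page.
chat_band = quality_band(99)
voice_band = quality_band(77)
```

With the headline figures, the chat score of 99 falls in the Excellent band and the voice score of 77 in the Good band.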


Text chat translation

Typed messages translated inline as they're sent.

21 of 24 languages average Good or better (70+) this quarter · 19 improved vs Q2 2026

Average quality per language · Q3 2026 vs Q2 2026

Each bar pools every run translating to or from that language this quarter. The dashed notch marks the previous quarter.
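The pooling described above can be sketched as follows. This is an illustration under stated assumptions, not the benchmark's actual code: the function name `pool_by_language`, the tuple layout, and the example scores are all hypothetical. The one idea taken from the page is that a run counts toward both its source and its target language.

```python
from collections import defaultdict
from statistics import mean


def pool_by_language(runs):
    """Pool run scores per language.

    runs: iterable of (source_lang, target_lang, score) tuples.
    Returns {lang: (average_score, run_count)}.
    """
    pooled = defaultdict(list)
    for src, tgt, score in runs:
        # A run translating en -> de counts toward both "en" and "de".
        pooled[src].append(score)
        if tgt != src:
            pooled[tgt].append(score)
    return {lang: (mean(scores), len(scores)) for lang, scores in pooled.items()}


# Hypothetical scores, for illustration only.
example_runs = [("en", "de", 98), ("de", "en", 96), ("en", "ja", 90)]
language_stats = pool_by_language(example_runs)
```

In this toy input, "en" pools three runs, "de" two, and "ja" one, so a language that appears in many pairs gets a larger sample than one that appears in few.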

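Legend: Q3 2026 (avg) · Q2 2026 · median 99 · Jul 2026

Change vs Q2 2026 by language (▲ up, ▼ down): de ▲16 · it ▲16 · pt ▲20 · es ▲16 · ja ▲23 · da ▼1 · cs ▲22 · ro ▲1 · uk ▲32 · sv ▼3 · fr ▲16 · nl ▲9 · no ▲3 · pl ▲15 · fi ▼2 · hu ▲14 · en ▲2 · is ▲6 · ru ▲1 · ko ▼10 · zh ▲7 · tr ▲58 · ar ▲21 · hi •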

Faded bars have a thin sample (<3 runs) — read them as provisional.
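A minimal sketch of the thin-sample rule, assuming only the threshold quoted above (fewer than 3 runs). The names `MIN_RUNS` and `is_provisional` and the run counts are hypothetical:

```python
MIN_RUNS = 3  # threshold quoted on the page: fewer than 3 runs is a thin sample


def is_provisional(run_count: int) -> bool:
    """Return True when a language's pooled sample is too thin to trust."""
    return run_count < MIN_RUNS


# Hypothetical run counts per language, for illustration only.
run_counts = {"en": 40, "hi": 2, "is": 3}
provisional = sorted(lang for lang, n in run_counts.items() if is_provisional(n))
```

Here only "hi" would be drawn as a faded bar, since a language with exactly 3 runs already meets the threshold.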

Strongest pairs · Jul 2026

Voice translation

Live speech, transcribed and translated in real time — the harder end-to-end problem.

17 of 24 languages average Good or better (70+) this quarter · 19 improved vs Q2 2026

Average quality per language · Q3 2026 vs Q2 2026

Each bar pools every run translating to or from that language this quarter. The dashed notch marks the previous quarter.

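Legend: Q3 2026 (avg) · Q2 2026 · median 77 · Jul 2026

Change vs Q2 2026 by language (▲ up, ▼ down): ro ▲31 · zh ▲19 · hi ▼1 · no ▲32 · ko ▲6 · nl ▲16 · ja ▲14 · es ▲14 · uk ▲19 · ru ▲18 · is ▲8 · en ▲9 · fi ▲24 · sv ▲11 · de ▲11 · fr ▲7 · pl ▲4 · it ▲5 · hu ▲2 · pt ▲2 · cs ▼5 · tr ▼13 · da ▼3 · ar ▼7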

Faded bars have a thin sample (<3 runs) — read them as provisional.

Strongest pairs · Jul 2026

Quality over time

Monthly median per modality, since we started publishing.
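One way to compute a monthly median per modality is sketched below. The record layout, the function name `monthly_medians`, and the example scores are assumptions made for illustration; the page does not describe how its own pipeline stores runs.

```python
from collections import defaultdict
from statistics import median


def monthly_medians(runs):
    """Group scores by (modality, month) and take the median of each group.

    runs: iterable of (modality, month, score), with month as "YYYY-MM".
    """
    grouped = defaultdict(list)
    for modality, month, score in runs:
        grouped[(modality, month)].append(score)
    return {key: median(scores) for key, scores in sorted(grouped.items())}


# Hypothetical scores, for illustration only.
example_runs = [
    ("chat", "2026-07", 95),
    ("chat", "2026-07", 91),
    ("chat", "2026-07", 98),
    ("voice", "2026-07", 60),
    ("voice", "2026-07", 80),
]
medians = monthly_medians(example_runs)
```

A median is less sensitive than a mean to a single very low run, which matters when a month has few runs for a modality.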

Chart: monthly median score (0–100) for Chat and Voice. Months shown: Mar 2026, Apr 2026, May 2026, Jul 2026.

These numbers come from real sessions

Every run on the live demo is scored automatically and folded into next month's benchmark. A session looks like this:

Live translated meeting → EN
