AI translation in WordPress: why it breaks multilingual SEO
The headline number is correct. The missing 1 percent is exactly where the cost lands. Leonardo Losoviz on WP Tavern Jukebox, asked where AI translation now sits compared to human work, said:
“It is so easy, everyone will do it. And when everybody does it, you are not moving forward, you are just running to stay on the same spot.”
He is right at the sentence level. He is also describing a market where the actual translation quality no longer separates one site from another. What separates them is what happens to the structural fields the AI translator should not touch but, in production, usually does.
This is a polemic with reading “99 percent accurate” as a success. The number is true and largely irrelevant. Running a multilingual WordPress site in 2026 has moved from “is the prose any good” to “does the structure survive translation.”
AI translation in multilingual WordPress 2026: TL;DR in 4 points
- On WP Tavern Jukebox, Losoviz says AI handles 99 percent of WordPress translation correctly at a fraction of human cost. The number is real for prose.
- The 1 percent that AI translation pipelines routinely break is not in the sentences. It is in the fields that decide which URL the article lives at and how Google indexes it.
- That 1 percent is exactly what Google indexes. The 99 percent of prose is what readers see after they arrive on a correctly indexed page. If the indexing layer fragments, nobody reads the prose.
- The cure is not a smarter sentence-level translator. It is separating the technical fields from the fields the AI translator may edit, plus a one-screen diacritic audit on every multi-file translation pass.
Glossary: frontmatter, slug, hreflang and canonical in WordPress
The rest of the article rests on seven concepts. If any of them is unfamiliar, that is the same field the AI translator most often mangles. For anyone who has never opened a content file:
- Frontmatter - the metadata block at the very top of an article file, fenced by
---. It holds the title, description, categories, canonical URL, links to other locales, the slug, and dozens of other fields. The AI translator receives the whole file - frontmatter and body together. - Slug - the tail of the article URL, for example
nis2-dora-wordpress-compliance-2026. It is a routing identifier (the address the page actually opens at), not a heading to read. force_slug: true- a frontmatter flag that tells the system “use this exact slug as the URL, even if the file name differs.”- Canonical URL (
canonicalUrl) - a frontmatter field telling Google which URL is the authoritative version to index when several variants exist. - Hreflang - the set of links connecting each language version of the same article so Google understands “this is the German translation of the English source.”
- Taxonomy terms - the values of
categoriesandtags. Each one generates its own URL, for example/de/tag/compliance/. - Redirect map - the list of
(old URL, new URL)pairs that prevents previously indexed links from returning 404 when a slug changes.
These fields are invisible to a reader. For Google and the internal link graph, they are decisive.
How AI translation breaks a German WordPress slug: a concrete case
A pattern easy to reproduce with any full-file AI translation tool against a typical Astro or WordPress content tree:
- The German version of a compliance article lives in a file named
something-compliance-2026.de.mdand ships withslug: nis2-dora-wordpress-Konformität-2026in the frontmatter. The translator picked the German loanword because the field looked like a sentence stem and the prose target language is German. It swapped “compliance” for “Konformität”. - The rest of the cluster still links to the ASCII version, the way the English, Polish, Norwegian, Spanish, and Portuguese siblings do. Every such link returns 404, because the router serves the page at the umlaut URL - the one the translator wrote, given priority by
force_slug: trueover the file name. - Two German pillar pages now live at a URL with
äthat no other locale references. The cluster-internal link graph fragments at the indexing layer until somebody runs a structural audit, which on most production sites is never.
If the same translator had inserted a slightly awkward sentence into the body, the cost would be one reader sighing. The same class of mistake inside the slug field cost months of cluster-internal link signal across a dozen pages.
This is the gap between “99 percent accurate” and “0 percent broken at the layer that matters.”
What “99 percent accurate” AI translation does not measure
When Losoviz says 99 percent, he is talking about sentence-level fidelity. Does the German paragraph mean what the English paragraph meant. Does the terminology stay consistent across the post. Does the register match the audience. Modern AI translators against a published style guide land in the 95-99 percent range on native-speaker review pass rates. The number is real.
What the number does not measure:
- Whether the slug field in frontmatter matches the file name and the routing convention.
- Whether the
force_slugflag was switched on by an authoring decision or by a translator hallucination. - Whether
canonicalUrlstill matches the slug after the slug was changed. - Whether the values in
categoriesandtagsagree with the category and tag URLs the rest of the site uses. A German blog can drift into/de/tag/Konformität/while every other locale routes to the ASCII term URL - producing a tag page no other locale links to. - Whether the hreflang sibling URLs in the layout actually resolve.
- Whether the redirect map has a 301 entry from the prior URL after the slug change.
- Whether 301 redirects from a published-and-indexed earlier URL exist when the translator changes the slug pattern in a fresh pass.
None of these seven things is sentence-level. All of them are structural. The AI translator gets to influence each one because each one is exposed in the frontmatter, which the translator reads and rewrites alongside the body.
Why an AI slug mistake costs more than a sentence mistake
A bad sentence in a translated paragraph is read by a user, possibly ranked low by an AI Overview’s quality assessment, and at worst contributes to a slow erosion of trust. A bad slug field is a routing change that lasts until someone runs a link audit. The cost is paid out in Google Search Console over months, not in one reading session.
For a cluster of cross-linked pillars in non-English locales, the consequences scale with the density of internal links. A single mis-translated pillar slug means every other article in the cluster that pointed at the canonical URL now returns 404. Google sees a cracked graph: a pillar page indexed at one URL, dozens of inbound internal links targeting a sibling URL that nobody serves. PageRank fragments. Topical authority dilutes between the real and the phantom URL. The user-facing prose is still fine, which is the whole problem: the symptom is invisible from the rendered page.
This is the gap between “99 percent accurate” and “0 percent broken at the layer that matters.”
How the AI translation pipeline works in WordPress and where it goes wrong
The mechanism is mundane and well-known to anyone who has ever written a translation prompt that touches frontmatter:
- The translator is shown the full file: frontmatter plus body.
- The system prompt tells it “translate every user-visible string into the target language.”
- The translator is told (correctly) to translate
title,description,seo.title, FAQ questions and answers, and body text. - The translator is told (correctly) not to translate
wpId,pubDate,heroImage. - The
slugfield falls into a grey zone. The translator sees the slug looks German-shaped, because the file uses sentence-case slugs that read like sentence stems. It decides that “compliance” in a German slug is an English loanword and should be “Konformität” - the target language being German. It does the apparently correct thing. Nobody told it the slug field is a routing identifier, not a string for the reader.
A smarter sentence-level translator cannot fix this. The fix is to pull the field out of the translator’s input. In technical terms: the frontmatter should be split, inside the tool, into fields the AI translator may write and fields that are read-only. slug, force_slug, canonicalUrl, redirect_from, and taxonomy term fields belong in the read-only set.
In a system with a proper schema definition this is a one-time engineering investment. In a typical tool that pastes the whole file into the prompt - which is what most production AI translation tools currently are - it is structurally impossible.
How to protect multilingual WordPress from AI translation slug drift
Once the structural fields are protected, the residual risk is that diacritics leak elsewhere: into body content links, into structured-data sources, into hreflang references. The defence is a one-screen pre-publish audit script, run per locale. The rule is straightforward: a Latin Extended diacritic appearing in a URL field after a translation pass is almost always a regression. Each locale gets its own ruleset - the Norwegian variant (ø, æ, å) accepts more diacritic surface than German, the Spanish variant (á, é, í, ó, ú, ñ) is different again, the Portuguese ç, ã, õ class different still.
This is not engineering glamour. It is one regex per locale and a build gate that fails the deploy when the diacritic count in URL fields rises above a known-good baseline. The reason most production multilingual WordPress sites do not have this gate is prosaic: the symptom is invisible from the editorial dashboard.
WPML AI, TranslatePress AI, Weglot, Gato AI Translations: the same structural problem
The Gato AI Translations product page describes the value as “translate any post type, taxonomy, custom field, and string with AI.” Correct and useful. It also implies that custom fields are in scope, which means slug-equivalent fields in custom post types and metadata fields containing URLs are exposed to the same class of mistake. The same shape applies to WPML AI Translation, TranslatePress AI, and Weglot AI: production pipelines are full-file input by default. None of them ships with a structural-integrity audit as part of the product.
The competitive logic Losoviz describes (“when everybody does it, you are not moving forward”) understates the risk. When everybody does it, and nobody runs the structural audit, the average multilingual WordPress site quietly drifts away from index-layer integrity over years. The user-visible prose stays good. The graph rots.
4 steps to stop AI translation from breaking multilingual WordPress SEO
The minimum-viable shift for any team running more than one locale on AI translation:
- Protect the structural fields. Slug, the force-slug flag, canonical URLs, redirect-source URLs, taxonomy terms, and hreflang references should be written by the engineering pipeline, not by the translator. If the translation tool does not support a read-only field set, treat the frontmatter as separate from the body in the workflow: translate the body in the AI tool, leave the frontmatter alone.
- Add a diacritic audit to the build gate. A single regex per locale, run on every pull request that touches multilingual files, catches the entire class of mistakes before deploy.
- Treat every slug change as a redirect event. Any change to a slug field, deliberate or accidental, requires a corresponding 301 in the redirect map. If the build does not enforce this with a fail on missing redirect entries, slug changes from AI translation will sooner or later 404 indexed URLs.
- Measure structural mistakes, not just prose accuracy. A pipeline that ships 99 percent sentence accuracy and 5 percent slug drift across locales is worse on the only metric that matters than a pipeline that ships 95 percent sentence accuracy and zero slug drift.
AI translation in WordPress 2026: where the cost actually lives
Losoviz is right that AI translation has reduced sentence-level accuracy to a commodity, and that “running to stay on the same spot” is the new competitive baseline. The polemic is that the 99 percent rate is being read as a quality ceiling, when the actual ceiling is structural integrity at the routing and indexing layer. The whole cost lives in that 1 percent. And that 1 percent is operational hygiene, not better AI.
For the agency or in-house team running a multi-locale WordPress site, this is the moment to invest in the structural layer, because the AI translation cost has fallen far enough that the relative cost of structural QA is now the largest line item in the multilingual operations budget. The next vendor pitch slot is “AI translation plus structural-integrity audit,” not “AI translation, 99 percent accurate.”
Sources
- Leonardo Losoviz on WP Tavern Jukebox (transcript via WP Tavern)
- Gato AI Translations product page
- WPML AI Translation, TranslatePress AI, Weglot AI: production tooling pages
- W3C internationalisation guidance on URL design across locales



