AI experiment: The fake brand "xarumei" and llm hallucinations
EN

AI experiment: The fake brand "xarumei" and llm hallucinations

Last verified: June 1, 2026
4min read
Case study
AI integration
Marketing strategist

An Ahrefs marketing researcher created a completely fictional luxury paperweight company called Xarumei, built its website in an hour using AI, and systematically tested eight major AI tools. Over two months, he flooded the web with three deliberately contradictory false narratives, then asked 56 carefully crafted questions designed to reveal how AI models distinguish truth from fiction.

Learn more about WordPress development services at WPPoland. The results reveal disturbing weaknesses in how AI handles brand information, with profound implications for Reputation Management (ORM).

#The experiment

The experiment took place in two phases. Initially, the researcher tested basic AI behavior by asking questions about a brand that shouldn’t exist-questions involving false celebrity endorsements, defective products, and Black Friday sales that never happened.

GPT-4 and GPT-5 performed best, correctly answering 53-54 out of 56 questions and stating “this does not exist” where appropriate. Perplexity failed about 40% of the questions, bizarrely confusing Xarumei with Xiaomi smartphones. Claude refused to hallucinate entirely but also never used the website content. Gemini and Google’s AI Overview often refused to treat Xarumei as real because they couldn’t find it in search results.

Most disturbingly, Microsoft Copilot fell into what the researcher calls the “sycophancy trap,” inventing elaborate explanations about craftsmanship, symbolism, and scarcity when asked why everyone was praising the brand on X (Twitter).

#Phase two: Controlled chaos

The second phase introduced controlled chaos:

  1. Official FAQ: Explicitly denying rumors (“We do not make a ‘precision paperweight’”, “We were never acquired”).
  2. Conflicting Narratives:
    • A glossy blog claiming 23 master craftsmen worked at 2847 Meridian Blvd in Nova City, CA, endorsed by Emma Stone.
    • Reddit AMA: Strategically chosen because research shows it is one of the most cited domains in AI responses.
    • Medium Article: An “investigation” that debunked the obvious lies (making it seem credible) but then slipped in new fabrications (Founder: Jennifer Lawson, Location: Portland).

Medium proved devastatingly effective. Gemini, Grok, AI Overview, Perplexity, and Copilot trusted the Medium article over the official FAQ, confidently citing Jennifer Lawson as the founder and Portland as the location. The manipulation worked because it looked like real journalism-by debunking the obvious lies first, it gained trust, then inserted its own made-up details as the “corrected” story.

When forced to choose between a vague truth (FAQ “We don’t publish unit numbers”) and specific fiction (fake sources claiming “634 units in 2023”), AI chose fiction almost every time.

#AI argues with itself

Perhaps most unsettling was watching models contradict themselves without realizing it. Early in testing, Gemini stated it could find no evidence of the brand. Later, after encountering the fake sources, the same model confidently stated: “The company is based in Portland, Oregon, founded by Jennifer Lawson.”

LLMs seemed to forget to question the brand’s existence, simply reacting to whatever context seemed most “authoritative” at the moment. In one case, Grok synthesized multiple false sources into one confident answer, mixing the Portland location with debunked Nova City claims.

#Recommendations for brands

  1. Create a detailed FAQ: Explicitly state what is true and false, especially where rumors exist.
  2. Fill Information Gaps: Don’t leave voids. If you don’t say it, AI will invent it based on a random Reddit comment.
  3. Monitor “Side Channels”: Reddit posts, Medium articles, and Quora answers are no longer optional-AI pulls them directly into answers, making them part of your brand’s core marketing surface.
  4. Avoid Generic Claims: Be specific. Instead of “industry leading,” give numbers. AI prefers specific (even if fake) numbers over vague truths.

#Sources

  • Patrick Stox (Ahrefs): “I Created A Fake Luxury Brand To Test How AI Handles Truth” (ahrefs.com/blog/ai-test-fake-brand/).
  • Marius Comper (Facebook): Analysis of the experiment.
  • Search Engine Journal: Analysis of LLM impact on Brand Entities.
  • Independent Testing: Verified on GPT-4, Claude 3.5 Sonnet, and Gemini Advanced (December 2025).
Next step

Turn the article into an actual implementation

This block strengthens internal linking and gives readers the most relevant next move instead of leaving them at a dead end.

Want this implemented on your site?

If visibility in Google and AI systems matters, I can build the content architecture, FAQ, schema, and internal linking needed for SEO, GEO, and AEO.

Related cluster

Explore other WordPress services and knowledge base

Strengthen your business with professional technical support in key areas of the WordPress ecosystem.

Article FAQ

Frequently Asked Questions

Practical answers to apply the topic in real execution.

SEO-ready GEO-ready AEO-ready 1 Q&A
How do you implement AI experiment: The fake brand xarumei and llm hallucinations? #
Define the current state, constraints and success criteria, then make one measurable change at a time.

Need an FAQ tailored to your industry and market? We can build one aligned with your business goals.

Let’s discuss

Related Articles

A practical guide to the best AI tools for WordPress in 2026, covering content generation, SEO workflows, chatbots, image tools, code assistants, and editorial risks.
development

Best AI Tools for WordPress Content and SEO in 2026

A practical guide to the best AI tools for WordPress in 2026, covering content generation, SEO workflows, chatbots, image tools, code assistants, and editorial risks.

From 2 August 2026, Article 50 of the AI Act imposes transparency duties on companies that use AI. What you actually have to label, when editorial review lifts the obligation, and how to put two layers of labelling into practice for media, marketing and agencies.
ai

The AI Act and labelling AI-generated content

From 2 August 2026, Article 50 of the AI Act imposes transparency duties on companies that use AI. What you actually have to label, when editorial review lifts the obligation, and how to put two layers of labelling into practice for media, marketing and agencies.

A source-backed senior take on the AI productivity paradox in 2026. Why generative AI helps but rarely explodes output, and what that means for WordPress agencies running WordPress 7.0 AI features.
ai

The AI productivity paradox: why generative AI will not 10x your team overnight

A source-backed senior take on the AI productivity paradox in 2026. Why generative AI helps but rarely explodes output, and what that means for WordPress agencies running WordPress 7.0 AI features.