Are you really so sure?

Fabricated quotes, false AI accusations, and fake doctors — a week of people trusting and distrusting AI for all the wrong reasons.

Mignon Fogarty
May 30, 2026

Issue 110

On today’s quest:

— Anthropic releases Claude Opus 4.8
— Are you really so sure?
— AI-generated podcasts claim to be hosted by doctors
— What is WRONG with people?
— How to check for changes in a document
— More people are discovering AI fact checking

Anthropic releases Claude Opus 4.8

A new Claude model just dropped, and people who had early access say it is noticeably better. Simon Willison called it “a modest but tangible improvement,” and Ethan Mollick said he was “impressed with it.” Mollick posted a video it made and a “sophisticated, if minor, academic paper” it ideated and wrote.

The Every blog said it is “legitimately great” — enough so that Anthropic could have called it Opus 5.0 — and that it’s “the best model we’ve tested for writing and knowledge work.”

Anthropic says Opus 4.8 is better at “honesty,” being “more likely to flag uncertainties about its work and less likely to make unsupported claims.”

The new model is only available on paid plans.

Are you really so sure?

I’m becoming increasingly worried about unprovable accusations of people using AI.

A Trinidadian writer named Jamir Nazir won the prestigious Commonwealth Short Story Prize and had his story published online in Granata.

But some people who read the story thought it sounded like it had been written by AI and put it into the Pangram AI detector, which flagged it as 100% written by AI.

HOWEVER, AI detectors are known to work less well on writing that isn’t standard English. When they fail, they often fail on writing from people for whom English isn’t their first language. English is the official language of Trinidad, but Trinidadian English is different from standard American and British English.

Linguist Jack Grieve, who does work on AI systems, wrote on LinkedIn:

“My basic worry is just because a tool performs very well on a test dataset, we shouldn’t assume the results port over to every other text.

In this case, the results being cited to justify the certainty with which the guilt of this author is being proclaimed are grounded in the performance of a tool tested on a single prompting stategy, on a different register (first 1,000 words of novels), and most notably presumably not on Trinidadian English.

The baseline style of human authored Trinadian short stories is not obvious to me, and so I’m surprised it is to an AI system much less to the American computer scientists making these claims.”

Following that debacle, people started putting past winners of the prize through Pangram, and “identified” other stories that were supposedly written by AI. In one case, the author has publicly insisted she did not use it (and I saw a friend vouching for her on social media). Troublingly, the story, which is about enslaved Africans also doesn’t seem to conform to western storytelling norms. In an interview in the Globe and Mail, the author said, “This story is rooted in Vincentian oral storytelling tradition, which by nature is formulaic and relies on repetition, rhythm, and patterned language as core elements of their structure.”

Pangram is the tool of choice these days for making accusations, and someone also used it to suggest that the pope of used AI to write his recent encyclical, Magnifica Humanitas. Pangram said parts of the pope’s commentary on AI were 40% to 100% written by AI, which I seriously doubt. And again, the pope’s language is likely not in line with most of the training data used on these models.

Finally, I have seen multiple stories of writers putting their own human-written work into Pangram to find it flagged as AI, and I have seen it do so with writing that I am also sure is human-written.

People need to be a lot more skeptical about Pangram results — and the results of even worse tools. The sad thing is that Pangram actually is the best tool when tested head-to-head with others. And the ironic thing is that people who are anti-AI are using an AI tool in such a credulous and harmful way.

Dealing with AI writing is a rapidly growing problem for anyone who works with writers, and it’s about more than just detection. If you read one other thing this week, I highly recommend Jane Friedman’s thoughtful piece on policing AI writing in publishing: Writing & Publishing Awards Have Difficult Decisions to Make Regarding AI.

AI-generated podcasts claim to be hosted by doctors

The Podcast Business Journal reports that Inception Point, the company that releases about 10,000 AI-generated podcast episodes each week, has been releasing content with voices that claim to be doctors. At The Podcast Show conference, CEO Jeanine Wright said they will stop the practice but will still have hosts that identify themselves as experts or “having been well read and well studied in a space.” When questioned about whether they would remove the existing shows that pretend to be hosted by doctors, Wright said no.

I’m no expert, but I did find a law firm website that said it is illegal to “impersonate a doctor by advertising or holding oneself out as a doctor or other medical professional without the proper licensure” in every state in the U.S.

(And as an aside, I also learned about “the Dr. Oz paradox”: because of the First Amendment, doctors can give potentially harmful health advice to the general public that would subject them to malpractice liability if they gave it to an actual patient.)

What is WRONG with people?

Another day, another person being stupid with AI. Maybe I shouldn’t be surprised anymore, but the New York Times reports that a buzzy new book about truth in the era of AI includes quotations fabricated by AI.

The author, Steven Rosenbaum, says he accepts responsibility for the errors and is looking into them. And yes. He should accept responsibility. But he should have accepted responsibility — for checking the facts — before he submitted his manuscript.

I’ll never understand what it is about AI that seems to compel people to think they don’t have to read the actual quotations or citations they put in their work.

It’s fine to use AI as a search engine. It’s not fine to not check the original sources because we know that some percent of the time, AI makes things up.

I think the thing that gets me most, though, is the apparent lack of shame. In an astonishing attempt to put a good spin on the situation, his final comment in the article is if the episode “serves as a warning about the risks of A.I.-assisted research and verification, then that is why I wrote the book.”

Mission accomplished, I guess.

UPDATE! I read a follow-up interview with Rosenbaum that made me even more annoyed. The man who put himself forward as an expert on trust in the age of AI told Ars Technica, “he ‘learned a lesson’ and is ‘going to be much more suspicious’ and ‘reticent to trust’ AI outputs going forward.”

How to check for changes in a document

I feel like I owe you a useful tip if you’ve read this far through my rant, so if you ever ask AI to make changes to a document, you can use a simple free tool called DiffChecker to make sure it hasn’t made changes you didn’t want.

I used it just last week to make sure a contract I was signing hadn’t been changed since the last version I read.

More people are discovering AI fact checking

I’ve mentioned AI for fact-checking multiple times in this newsletter, and this week, I saw Wharton professor Ethan Mollick recommend it too. (I find the same thing he mentions below — ChatGPT can be nitpicky about small details that don’t matter.)

Addendum: There’s a disappointing new article in Wired about what a terrible fact checker AI is, but it is filled with flawed arguments:

They used the free models in their tests (something I’ve repeatedly told you to avoid).
They highlighted the slop that comes from Google’s AI Overviews (again, something we should all know is terrible by now).
In some cases, they cited accuracy statistics from 2024 and 2025 (anyone paying attention knows the models are dramatically better now).
They confused hallucination rates on generated output with accuracy in fact-checking.

Again, very disappointing. I’ve been a Wired subscriber since almost when it launched, and I would have expected better.

So again … don’t believe anything you get from an LLM. Especially don’t believe anything you get from a free LLM or Google’s AI Overview. Fact-check everything, and if you choose to use AI, it can be an excellent fact-checker.

To fact-check with AI, enter your text into one of the best paid models with thinking, and say, “Fact check this piece. Show your sources.” Then check every source. It won’t be perfect, but it’s faster and more thorough than entering every fact one by one into a search engine, which also isn’t perfect.

I also sometimes tell it to use a green check emoji for things it thinks are correct, a yellow caution emoji for things it’s not sure about, and a red police car light emoji for things that are clearly wrong. That helps me instantly see the scope of what the problems might be.

Whether you do Google searches or have an LLM give you a list of questionable points with its sources and links, you still have to use your own brain and judgment, and of course, neither system will be able to fact-check things like comments a writer got directly from a source and information behind paywalls (e.g., AP and Chicago style). Google can’t find things that don’t exist on the open web and neither can an LLM.

I think the reason this makes me so angry is that there are plenty of true things that are bad about AI. It has helped children commit suicide, it’s short-circuiting our education system, and it helps stupid and evil people be stupid and evil at scale, just to name a few. But if you don’t tell the truth about what it can do, you are misinforming your readers, leaving them ill-prepared to deal with the challenges they are going to face. And when they eventually realize you’ve been giving them strawman arguments, you’ll have less credibility to make accurate arguments.

Quick Hits

My favorite recent pieces

AI is increasing the need for human oversight, not removing it. In this report from two professional nonfiction writing conferences, PerfectIt said, “The conversation is no longer really about whether AI will be used. It’s about how organizations maintain control, quality, and consistency as AI becomes embedded into increasingly complex workflows.” — PerfectIt

Choosing to stay human — Ethan Mollick

Using AI

How to Cut AI Costs and Climate Impact: Ranking 9 Inputs and Outputs [Level: Beginner. Super practical and useful.] — Orbit Media Studios

18 Ways To Save AI Token Budgets [Level: Intermediate to advanced. An expanded version of the post above.] — Almost Timely

No API? No Problem. Exploring a newish way that non-developers can use AI to efficiently answer complicated questions. [Level: Intermediate] — Phil Simon

Introducing Shortcuts Playground: Create Apple Shortcuts with Claude Code or Codex — MacStories

Claude Prompting Best Practices [Level: Intermediate] — Anthropic

How to Use Claude to Turn Your Handwriting Into a Custom Font or just make any kind of custom font you draw [Level: Beginner] — Inc.

The number of fabricated references in scientific papers increased more than 12x from Jan. 2023 to Feb. 2026. “Fabricated references are embedded in the peer-reviewed literature at scale, and that the rate of fabrication is accelerating.” — The Lancet

You can persuade AI models to accept falsehoods as truth, study shows — The Conversation

The business of AI

Nvidia’s revenue blows past Wall Street expectations as AI boom accelerates — The Guardian

Anthropic set to hit $10.9 billion in revenue in Q2, post its first profitable quarter, source says — CNBC

Microsoft reports expose AI's cost problem: The tech is more expensive than paying human employees — Fortune

Anthropic is paying SpaceX $1.25B per month for compute — Deccan Chronicle

The bubble is getting a lot bubblier, folks — Garbage Day

Education

AI not yet good enough to mark university essays, rewarding ‘style over substance’ — University of Cambridge

English Language Arts Teachers Invite Feedback on Working Framework for Responsible AI Use in Classrooms and Schools — National Council of Teachers of English

Government

David Sacks’s 11th-Hour Plea Led to Trump’s Backtrack on AI Executive Order — Wall Street Journal

State Department Unveils AI-Driven Visa-Scheduling Tool During Secretary Rubio’s India Trip [The platform claims to assign “earlier interview slots to travelers whose trips the algorithm deems most likely to strengthen U.S. economic or strategic interests.] — VisaHQ

White House Approves $9 Billion for Spy Agencies to Catch Up on A.I. [“The C.I.A. and N.S.A. cannot fully deploy the latest models on their classified systems because of a shortage of cutting-edge chips.”] — New York Times

I’m laughing

You can no longer Google the word 'disregard' — TechCrunch (plus, more examples on Bluesky, and more) [I believe this has been mostly fixed now, but it was still funny.]

Departing Meta Staffer Posts Biting Anti-AI Video Internally Amid Mass Layoffs — Mother Jones

Images

OpenAI Introduces Free AI Image Verification Tool Amid Rising Deepfake Concerns — CXO Digitalpulse

Job market

A new economics study argues that the drop in hiring young workers is better explained by the shift to working from home than by AI — SSRN

Meta cut 8,000 workers as Mark Zuckerberg warns ‘success isn’t a given’ in the AI race [About 7,000 employees were also moved to AI roles.] — CNBC

Two seemingly contradictory stories out of Uber. Both are based on a comments from talks, so I wonder about the bigger context:

Uber president says AI spending is getting ‘harder to justify’ — The Verge
Uber CEO says other execs are lying about AI: 'They say it'll be fine' but privately admit millions of jobs are gone — Yahoo Finance

Legal

Illinois Journalists, Actors, and Vocal Artists File Class-Action Lawsuits Against Amazon, Apple, Google, Meta, Microsoft, Nvidia, and Other Companies for Stealing Their Voices — Loevy &b Loevy

MIT Expert Warns Courts "Will Basically Have to Grind to a Halt" as They're Overwhelmed by AI-Generated Lawsuits — Futurism

Neal Kumar Katyal use an AI coach trained on every question asked by a Supreme Court justice in the last 25 years (and more) to prepare for his win before the Supreme Court last year. — TED Talk (also has a transcript)

And here’s a critique arguing that Neal Kumar Katyal actually didn’t do a great job before the Supreme Court even though he won — Reason

Model & Product updates

Gemini app rolling out ‘Extended’ thinking level [and new third-party integrations including Spotify, WhatsApp, and GitHub] — 9to5 Google

At big launch event, Google announces a new model (Gemini 3.5 Flash), new video generation capabilities, a Daily Brief feature, an agent called Gemini Spark, a new agent app called Antigravity 2.0, and plans for a future MacOS app. [One of these agents works 24/7 in the cloud, which seems like a big advantage because you can use it from anywhere and don’t need to keep your computer on and awake.] — Google

Google Search as you know it is over — TechCrunch (Wired speculates whether it should even be called “search” anymore.)

Google reinvents search before AI rivals replace it — Axios

ChatGPT for PowerPoint | Create and edit presentations with ChatGPT — OpenAI

Alibaba's Qwen 3.7 Release Shows Open Source AI Is Still Moving Fast — Startup Fortune

Philosophy

On the pope’s AI encyclical — Carlo Iacono

The pope’s full encyclical, Magnifica Humanitas — The Vatican

AI Agents as Coordination Technology — At the Edges

Publishing

The Economist braces for two internets: one for humans and one for AI agents — Digiday

Bolinda creates ‘AI-cloned’ voice of Barbara Cartland for new audiobooks — The Bookseller

Sudowrite launches AI Feedback feature for fiction writers — Sudowrite

POLITICO agrees to shut down both AI tools at center of landmark arbitration — Washington-Baltimore News Guild

Nonfiction Book Publishers Aren’t Remotely Ready for AI — New York Magazine

Robotics

LimX Luna demo [impressively fluid and complex motion] — YouTube

Robot Hands Are Getting Better — Chris Paxton

Science & Medicine

As a Doctor, I Can Understand the Allure of ChatGPT — New York Times

Security

In the last month, the ~50 groups with access to Mythos Preview have found “more than ten thousand high- or critical-severity vulnerabilities across the most systemically important software in the world.” — Anthropic

Video

Google’s new anything-to-anything AI model is wild — The Verge

Other

OpenAI makes breakthrough on 80-year-old maths problem — The Guardian [Also, the WSJ reports: “The math world is losing its mind over the new solution. This is what AI found, how we missed it—and why it matters.”]

Google DeepMind’s mathematics-optimized framework solved 9 open Erdos problems at the cost of a few hundred dollars per problem — arXiv

AI's Plummeting Prices Are a Software Story, Not a Hardware One [Level: Advanced] — Weighty Thoughts

Can OpenAI’s ‘Master of Disaster’ Fix AI’s Reputation Crisis? — Wired

Starbucks scraps AI inventory tool after nine months — Yahoo Finance

Gemini Becomes No. 2 Consumer AI Referral Source in Q1 2026, Larger Than Every Other ChatGPT Rival Combined — Brightedge

‘AI washing’: firms are scrambling to rebrand themselves as tech-focused — The Guardian

LinkedIn taking measures to throttle AI-generated comments and posts — LinkedIn

Pope Leo encyclical warns AI fueling conflict, urges world to ‘slow down’ — NBC News

AI is forcing McKinsey and its peers to rethink billable-hours pricing as consultants shift more “hourly” work to AI and clients want billing based on results - FT

The social contract of writing [that the writer will have spent more time on a piece than the reader] — jola.dev

8 AI bots now write 50% of X’s Community Notes. [AI notes are rated as more helpful than human-written notes, but because they usually use Grok as the LLM, viewpoints are more centralized and susceptible to Grok’s biases.] — Indicator

Humor makes it easier to forgive AI mistakes — Science Says

Software Developers Say AI Is Rotting Their Brains — 404 Media

Can AI Detectors Be Trusted? The Authors Guild Put Five of Them to the Test — The Authors Guild

What is AI Sidequest?

Are you interested in the intersection of AI with language, writing, and culture? With maybe a little consumer business thrown in? Then you’re in the right place!

I’m Mignon Fogarty: I’ve been writing about language for almost 20 years and was the chair of media entrepreneurship in the School of Journalism at the University of Nevada, Reno. I became interested in AI back in 2022 when articles about large language models started flooding my Google alerts. AI Sidequest is where I write about stories I find interesting. I hope you find them interesting too.

If you loved the newsletter, share your favorite part on social media and tag me so I can engage! [LinkedIn — Facebook — Mastodon]

Written by a human

Are you really so sure?

Fabricated quotes, false AI accusations, and fake doctors — a week of people trusting and distrusting AI for all the wrong reasons.

Anthropic releases Claude Opus 4.8

Are you really so sure?

AI-generated podcasts claim to be hosted by doctors

What is WRONG with people?

How to check for changes in a document

More people are discovering AI fact checking

Quick Hits

My favorite recent pieces

Using AI

Agents

Audio

Bad stuff

The business of AI

Education

Government

I’m laughing

Images

Job market

Legal

Model & Product updates

Philosophy

Publishing

Robotics

Science & Medicine

Security

Video

Other

What is AI Sidequest?