A new Radiology study shows synthetic X-rays are fooling both clinicians and AI models, raising the stakes for provenance across healthcare and robotics.

Medical imaging just ran into its first deepfake crisis. A new Radiology study shows that synthetic X-rays generated by multimodal AI are now indistinguishable from real scans for most radiologists and even other foundation models. That means anyone with access to a hospital archive—or a motivated fraudster—could inject fake fractures, fabricate injuries, or destabilize clinical workflows with a single malicious upload.

What the new study found

Researchers from Mount Sinai tested 17 radiologists across six countries on two datasets: one mixed set with GPT-4o–generated X-rays from multiple body parts, and another chest-only set created with Stanford’s RoentGen diffusion model. They then challenged four leading multimodal models (GPT-4o, GPT-5, Gemini 2.5 Pro, and Llama 4 Maverick) with the same images.

Radiologists spotted GPT-4o deepfakes only 41% of the time when they didn’t know synthetics were included; accuracy rose to 75% when they were warned.
Performance varied wildly: the best human reader hit 92%, the worst 58%. Experience level had no correlation with success.
The AI judges didn’t fare better—accuracy ranged from 57% to 85%, meaning the models that generate imagery also struggle to police it.
Synthetic images shared recognizable “tells”: unnaturally smooth bones, symmetrical lungs, perfectly aligned spines, and fractures that looked too clean or appeared only on one side of a limb.

The research team is already publishing an open deepfake dataset plus training quizzes, but they caution that CT and MRI are the logical next targets. If a 2D chest film can be forged this convincingly, volumetric scans are not far behind.

Why it matters beyond hospitals

This isn’t just a medical story; it’s the same authenticity problem every robotics and AI practitioner will face as synthetic data becomes routine. Consider three immediate spillovers:

Litigation risk. Deepfake evidence could flood insurers and courts with “proof” of injuries that never happened, delaying legitimate claims and spiking compliance costs.
Operational sabotage. A compromised PACS archive is enough to paralyze a hospital. The same attack vector applies to robotics labs that rely on shared datasets or remote teleoperation feeds.
Model drift. Training perception systems on doctored data quietly poisons downstream robots, surgical assistants, or inspection pipelines.

Aswin’s take: build provenance in, not after the fact

Healthcare won’t solve this with better eyeballs alone. Provenance has to live at the point of capture, just like torque sensors live on a robot joint. My playbook for hospital partners (and frankly, any AI-heavy enterprise) looks like this:

Cryptographic signatures on acquisition. Bind each image to the technologist, modality ID, and timestamp using tamper-proof signatures. This mirrors the chain-of-custody work we already do for industrial robots.
Invisible watermarks baked into sensor firmware. If the imaging vendor embeds watermarks at the detector level, any alteration becomes obvious without extra hardware.
Dual-model validation. Pair diagnostic AI with a lightweight discriminator trained solely to spot “too-perfect” anatomical cues. Treat it as a watchdog, not a primary reader.
Red-team drills. Hospitals should run the same kind of adversarial exercises we use in robotics safety audits: inject synthetic files, watch how staff respond, and close the gaps.

Do that and you blunt both the legal exposure and the patient-safety risk before regulations catch up.

What to watch next

Vendors like GE HealthCare, Siemens, and Philips rolling out signed-image pipelines.
Multimodal watermarking standards that span X-ray, CT, MRI, ultrasound, and even robotics perception sensors.
Insurers updating policy language to require provenance for high-value claims.

The headline is simple: synthetic imagery is now good enough to fool experts. If you’re building or buying AI systems that rely on visual data—whether in an operating room or a factory—you’ll need authenticity infrastructure as urgently as you need accuracy.

Source: “Deepfake X-rays are so real even doctors can’t tell the difference,” ScienceDaily, March 26, 2026.

Aswin Sarang

Aswin Sarang is a technology professional and entrepreneur working across robotics, artificial intelligence, and automation. He focuses on building practical systems that bridge engineering, strategy, and real-world deployment, with an emphasis on clarity, scalability, and long-term value. His work spans product development, system integration, and technology consulting, helping organizations navigate complex technical decisions and translate emerging technologies into usable solutions. Known for a first-principles approach, Aswin prioritizes fundamentals over hype and execution over speculation. Beyond technology, he maintains a strong interest in human performance, learning, and personal development, bringing a multidisciplinary perspective to both his professional and creative pursuits.

All Transmissions

TABLE _OF_CONTENTS

Deepfake X-Rays Are Fooling Radiologists. Here’s How Healthcare Should Respond

What the new study found

Why it matters beyond hospitals

Aswin’s take: build provenance in, not after the fact

What to watch next