Beyond PII: How Users Attempt to Estimate and Mitigate Implicit LLM Inference

Published: Sep 15, 2025

Last Updated: Sep 15, 2025

Authors:Synthia Wang, Sai Teja Peddinti, Nina Taft, Nick Feamster

Abstract

Large Language Models (LLMs) such as ChatGPT can infer personal attributes from seemingly innocuous text, raising privacy risks beyond memorized data leakage. While prior work has demonstrated these risks, little is known about how users estimate and respond. We conducted a survey with 240 U.S. participants who judged text snippets for inference risks, reported concern levels, and attempted rewrites to block inference. We compared their rewrites with those generated by ChatGPT and Rescriber, a state-of-the-art sanitization tool. Results show that participants struggled to anticipate inference, performing a little better than chance. User rewrites were effective in just 28\% of cases - better than Rescriber but worse than ChatGPT. We examined our participants' rewriting strategies, and observed that while paraphrasing was the most common strategy it is also the least effective; instead abstraction and adding ambiguity were more successful. Our work highlights the importance of inference-aware design in LLM interactions.

Beyond PII: How Users Attempt to Estimate and Mitigate Implicit LLM Inference

Abstract

Categories