Paper

Community-Aligned Behavior Under Uncertainty: Evidence of Epistemic Stance Transfer in LLMs

Chang, Aiden

Volkova, Svitlana

Gerard, Patrick

2025

Overview

When large language models (LLMs) are aligned to a specific online community, do they exhibit generalizable behavioral patterns that mirror that community's attitudes and responses to new uncertainty, or are they simply recalling patterns from training data? We introduce a framework to test epistemic stance transfer: targeted deletion of event knowledge, validated with multiple probes, followed by evaluation of whether models still reproduce the community's organic response patterns under ignorance. Using Russian-Ukrainian military discourse and U.S. partisan Twitter data, we find that even after aggressive fact removal, aligned LLMs maintain stable, community-specific behavioral patterns for handling uncertainty. These results provide evidence that alignment encodes structured, generalizable behaviors beyond surface mimicry. Our framework offers a systematic way to detect behavioral biases that persist under ignorance, advancing efforts toward safer and more transparent LLM deployments.

Publisher

Cornell University Library, arXiv.org

Subject

Large language models / Mimicry / Uncertainty