"Does this work?" vs "Does this not work?” Are conclusions different even though the LLM was given the same evidence documents?
Yes. Positive vs negative framing leads to more contradictory conclusions than responses from the positive question sampled twice. [2/6]