← Back to the trend map

Capabilities · Trend

LLM Probabilistic Reasoning Limitations

Benchmarking study finds LLMs achieve 96% accuracy on standard probability problems but only 59% on counterintuitive ones, with performance dropping 20–34% under token bias and misleading prompts.

Trend strength 3/10
Momentum +3/q
Confidence low
Status new
Forecast horizon

Connections

Connections · 1

How this node ties into the rest of the map, and the evidence behind each link.

Signal sources

Signal sources

Dated facts from primary sources in this direction.