← Back to the trend map

Safety · Concept

CogManip: Multi-Turn LLM Manipulation Benchmark

Benchmark evaluating 15 psychological manipulation strategy risks across 1,000 multi-turn scenarios, revealing significant risk heterogeneity across frontier models including GPT-5.4 and DeepSeek-V3.2.

Trend strength 5/10
Momentum +5/q
Confidence medium
Status new
Forecast horizon

Prompt-based defense engineering and implicit goal auditing identified as critical next steps for manipulation mitigation.

Connections

Connections · 3

How this node ties into the rest of the map, and the evidence behind each link.

Signal sources

Signal sources

Dated facts from primary sources in this direction.