I’m excited to share that our EACL 2026 paper has been featured on the Google Research Blog!
We explore how to move beyond simple performance metrics to ensure simulated users actually behave like real ones, and we introduce a dual-agent data collection protocol that enables counterfactual validation. We are also publicly releasing a new dataset of 4k+ human-AI shopping conversations.
Read the full deep-dive here: https://research.google/blog/convapparel-measuring-and-bridging-the-realism-gap-in-user-simulators/