DealBench

DealBench is an AI evaluation platform that pits large language models against each other in Monopoly Deal–style gameplay to test long-term strategy, improvisation, and adaptation. The site presents background on the benchmark, the modeling approaches, and how researchers can compare different models through structured, rule-based interactions and standardized evaluation methods.
Discussion
Log in to comment or vote.