Data Colada

[124] "Complexity": 75% of participants missed comprehension questions in AER paper critiquing Prospect Theory

Posted on March 14, 2025March 31, 2025 by Uri Simonsohn

Kahneman and Tversky’s (1979) “Prospect Theory” article is the most cited paper in the history of economics, and it won Kahneman the Nobel Prize in 2002. Among other things, it predicts that people are risk seeking for unlikely gains (e.g., they pay more than $1 for a 1% chance of $100) but risk averse for…

[123] Dear Political Scientists: The binning estimator violates ceteris paribus

Posted on March 5, 2025October 10, 2025 by Uri Simonsohn

This post delves into a disagreement I have with three prominent political scientists, Jens Hainmueller, Jonathan Mummolo, and Yiqing Xu (HMX), on a fundamental methodological question: how to analyze interactions in observational data? In 2019, HMX proposed the "binning estimator" for studying interactions, a technique that is now commonly used by political scientists. I argued…

[122] Arresting Flexibility: A QJE field experiment on police behavior with about 40 outcome variables

Posted on January 7, 2025February 5, 2025 by Uri Simonsohn

A forthcoming paper in the Quarterly Journal of Economics (QJE), "A Cognitive View of Policing" (htm), reports results from a field experiment showing that teaching police officers to "consider different ways of interpreting situations they encounter" led to "reductions in use of force, [and] discretionary arrests" (abstract). In this post I explain why, having spent…

[121] Dear Political Scientists: Don't Bin, GAM Instead

Posted on December 3, 2024March 5, 2025 by Uri Simonsohn

There is a 2019 paper, in the journal Political Analysis (htm), with over 1000 Google cites, titled "How Much Should We Trust Estimates from Multiplicative Interaction Models? Simple Tools to Improve Empirical Practice". The paper is not just widely cited, but is also actually influential. Most political science papers estimating interactions now-a-days, seem to…

[120] Off-Label Smirnov: How Many Subjects Show an Effect in Between-Subjects Experiments?

Posted on September 16, 2024September 16, 2024 by Uri Simonsohn

There is a classic statistical test known as the Kolmogorov-Smirnov (KS) test (Wikipedia). This post is about an off-label use of the KS-test that I don’t think people know about (not even Kolmogorov or Smirnov), and which seems useful for experimentalists in behavioral science and beyond (most useful, I think, for clinical trials and field…

[119] A Hidden Confound in a Psych Methods Pre‑registrations Critique

Posted on September 2, 2024September 2, 2024 by Uri Simonsohn

A forthcoming paper in Psych Methods (.pdf) had a set of coders evaluate 300 pre-registrations in terms of how informative they were about several study attributes (e.g., hypotheses, analysis, DVs). The authors analyzed the subjective codings and concluded that many pre-registrations in psychology, especially those relying on the AsPredicted template, provide insufficient information., Central to…

[118] Harvard’s Gino Report Reveals How A Dataset Was Altered

Posted on July 9, 2024May 4, 2025 by Joe Simmons

As you may know, Harvard professor Francesca Gino is suing us for defamation after (1) we alerted Harvard to evidence of fraud in four studies that she co-authored, (2) Harvard investigated and placed her on administrative leave, and (3) we summarized the evidence in four blog posts. As part of their investigation, Harvard wrote a…

[117] The Impersonator: The Fake Data Were Coming From Inside the Lab

Posted on June 12, 2024June 13, 2024 by Uri Simonsohn

A previous version of this post was supposed to go live in January 2019. But the day before it was scheduled, the Data Colada team (Uri, Leif, and Joe) received an email that we took to be a potential death threat. After discussions with the local police, the FBI, and our families, we decided to…

[116] Our (First?) Day In Court

Posted on May 8, 2024May 7, 2024 by Leif Joe and Uri

“Any update on the lawsuit?” That is the most common question any of us is asked. It is usually preceded by an apologetic preamble, like, “sorry if this is a sensitive question,” or “I don’t know if you’re tired of talking about this, but…” The reality is that, for the most part, our actual sensitivity…

[115] Preregistration Prevalence

Posted on November 13, 2023November 13, 2023 by Uri Simonsohn

Pre-registration is the best and possibly only solution to p-hacking. Ten years ago, pre-registrations were virtually unheard of in psychology, but they have become increasingly common since then. I was curious just how common they have become, and so I collected some data. This post shares the results. The data From the Web of Science…