Statistical Significance, Schmignificance…
How many times has someone asked you, "Is that result statistically significant?" As a researcher, I've probably heard it 6,321 times, +/- 30, with a p-value of .02.
I'm arguing here for some humility and some qualitative thinking about testing, perhaps and especially from those who consider themselves data analysts or quant jocks.
(This is courtesy of a behavioral scientist acquaintance, Kristian Sorensen)
Suppose you have a test idea and compare the mean response rates for your control and experimental groups. Let's say there are 20 subjects in each sample. You use an independent-means t-test and your result is significant (t = 2.7, d.f. = 38, p = 0.01). [Note: Yes, you can have small samples and statistically significant results. This isn't a debate point; it's a matter of fact.]
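To make that concrete, here is a minimal sketch in Python. The samples, means, and spreads are invented purely for illustration; only the t = 2.7, d.f. = 38, p = 0.01 arithmetic comes from the example above.

```python
# Sketch of the example above. The data are invented; with an effect
# this large, 20 subjects per group is usually enough to reach
# significance, which is the only point being made.
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
control = rng.normal(loc=0.20, scale=0.05, size=20)       # control response rates
experimental = rng.normal(loc=0.25, scale=0.05, size=20)  # experimental response rates

# Independent-means t-test, equal variances assumed: d.f. = 20 + 20 - 2 = 38.
t_stat, p_value = stats.ttest_ind(experimental, control)
print(f"t = {t_stat:.2f}, d.f. = 38, p = {p_value:.3f}")

# And the exact arithmetic from the example: t = 2.7 with d.f. = 38
# gives a two-tailed p-value of about 0.01.
print(f"p = {2 * stats.t.sf(2.7, df=38):.3f}")
```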
Mark each of the statements below as “true” or “false.”
- You have absolutely disproved the null hypothesis (that is, there is no difference between the population means). True / False
- You have found the probability of the null hypothesis being true. True / False
- You have absolutely proved your experimental hypothesis (that there is a difference between the population means). True / False
- You can deduce the probability of the experimental hypothesis being true. True / False
- You know, if you decide to reject the null hypothesis, the probability that you are making the wrong decision. True / False
- You have a reliable experimental finding in the sense that if, hypothetically, the experiment were repeated a great number of times, you would obtain a significant result on 99% of occasions. True / False
This quiz was given to 44 psychology students, 39 professors and lecturers of psychology, and 30 statistics teachers. Every professor taught null-hypothesis testing, and every student had passed one or more statistics courses in which it was taught.
80% of the professors and teachers of statistics got at least one of the statements wrong. (For the record, all six are false.)
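Why are they all false? A p-value answers a narrower question than most people think. In standard notation (mine, not the quiz's):

$$ p = \Pr(\text{data at least this extreme} \mid H_0), \qquad \text{not} \qquad \Pr(H_0 \mid \text{data}). $$

Reversing that conditioning takes Bayes' theorem and a prior probability for the null hypothesis, neither of which a t-test supplies. And the test says nothing directly about the odds of replication, which is why the last statement fails too.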
Here is a way to think accurately about significance testing that benefits from being non-technical and humble, and that fosters better conclusions and next steps with donor dollars.
Saying something is statistically significant is akin to saying there is some reason to believe the test idea works. The operational meaning is that we should repeat the test.
This does happen, or at least it did, under the test, retest, and rollout discipline of the old direct-mail days among the larger-volume players.
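As a toy illustration of that operational reading, here is a hedged simulation sketch. Every number in it is invented: the 10% share of test ideas that truly work, the 20% vs. 23% response rates, and the 2,000 letters per arm. It runs each idea through a test and an independent retest, then checks how many "winners" are real at each stage.

```python
# Toy simulation of "statistically significant means: retest it".
# All inputs below are assumptions invented for illustration.
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
n_ideas, n_per_arm, alpha = 10_000, 2_000, 0.05
truly_works = rng.random(n_ideas) < 0.10       # assume 10% of ideas are real
p_treat = np.where(truly_works, 0.23, 0.20)    # real lift vs. no lift

def run_one_test():
    """One two-proportion z-test per idea; returns a 'significant' mask."""
    c = rng.binomial(n_per_arm, 0.20, n_ideas)       # control responses
    t = rng.binomial(n_per_arm, p_treat, n_ideas)    # treatment responses
    pc, pt = c / n_per_arm, t / n_per_arm
    pooled = (c + t) / (2 * n_per_arm)
    se = np.sqrt(pooled * (1 - pooled) * 2 / n_per_arm)
    z = (pt - pc) / se
    return 2 * stats.norm.sf(np.abs(z)) < alpha

sig_first = run_one_test()
sig_retest = run_one_test()                    # the independent retest

print(f"Truly-working share of first-test winners: {truly_works[sig_first].mean():.0%}")
print(f"...of ideas that won the test AND retest:  {truly_works[sig_first & sig_retest].mean():.0%}")
```

With these made-up inputs, a sizable minority of first-test winners are flukes, while ideas that also survive the retest are almost all real. The exact numbers don't matter; the filtering logic of test, retest, rollout does.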
One reason all this matters? Too many wannabe behavioral scientists mistake one experiment's "statistically significant" result for proof of a universal law and established truth. Nonsense. People are messy and complicated; misinterpreting statistical significance and declaring victory suggests they aren't.
Kevin
Just ask any political pollster.