How to Have More Winning Tests?

June 18, 2021 Kevin Schulman, Founder, DonorVoice and DVCanvass

Stop designing tests assuming everyone’s the same. 99.9% of tests are of this variety: the random nth A/B test.

Hidden in many “losing” test results is a test idea that worked for some people and not others.

Here are results of an experiment with donations going to World Vision and prospective donors randomly split into the control (altruism) and test (self-interest).

Nothing to see here. The test lost. Discard, move on, declare failure. That is the best choice if the alternative is to break out the test response rates by random groupings within the test group. If you append lots of 3rd party data or just look at random RFM groupings or demographics or channel behavior you’ll see lots of sub-groups within the test group that beat the control.

There is a 99.9% chance this is random noise, but it may feel like signal. In fact, the human brain is trained to see patterns and explanations, even where none exist.

Consequently, the human bias is to find something that’s actually nothing. But our bias leads us to come up with a reason why it’s something – e.g. “people in our low dollar, high recency RFM bucket loved the test”, or “people that collect stamps and skew to Generation Whatever hated our test.” But if your test wasn’t designed for these groups beforehand, finding these random differences after the fact is a waste of time and money.
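To make the noise problem concrete, here is a minimal simulation sketch. Everything in it is an assumption chosen for illustration (20 arbitrary subgroups, 300 people per arm in each, a 10% response rate, a one-sided 5% cutoff); none of it comes from the World Vision experiment. Control and test have exactly the same true response rate, so any subgroup "win" is pure chance.

```python
import math
import random

def two_proportion_z(gave_control, n_control, gave_test, n_test):
    """z statistic for (test rate - control rate) using a pooled standard error."""
    pooled = (gave_control + gave_test) / (n_control + n_test)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_control + 1 / n_test))
    if se == 0:
        return 0.0
    return (gave_test / n_test - gave_control / n_control) / se

def share_with_false_winner(n_sims=200, n_subgroups=20, n_per_arm=300,
                            true_rate=0.10, z_cutoff=1.645, seed=1):
    """Share of simulated tests where at least one subgroup 'beats' control.

    All parameters are illustrative assumptions. Both arms share the same
    true_rate, so every 'winning' subgroup is a false positive.
    """
    rng = random.Random(seed)
    tests_with_winner = 0
    for _sim in range(n_sims):
        found_winner = False
        for _group in range(n_subgroups):
            control = sum(rng.random() < true_rate for _ in range(n_per_arm))
            test = sum(rng.random() < true_rate for _ in range(n_per_arm))
            if two_proportion_z(control, n_per_arm, test, n_per_arm) > z_cutoff:
                found_winner = True
        tests_with_winner += found_winner
    return tests_with_winner / n_sims

share = share_with_false_winner()
print(f"Simulated tests with at least one 'winning' subgroup: {share:.0%}")
```

With 20 independent subgroups and a 5% false-positive rate each, the chance of at least one spurious winner is roughly 1 - 0.95^20, or about 64%. Slice a dead test enough ways and a "winner" is the expected outcome, not a discovery.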

What about designing your test assuming people will respond differently based on who they are as people? The World Vision test was such a test. The first bar chart is what we expected: no difference if you mash everybody together.

The altruism appeal – dubbed the control because it’s more typical in the sector – had the ask framed as, “Any donation you make will improve the happiness and wellbeing of an African family.” The researchers didn’t think this would work well for everyone; they thought it would work for people high in Agreeableness, an innate personality trait.

The self-interest appeal had the ask framed as, “Research by psychologists shows that donating money to charity increases the happiness and wellbeing of the giver.” Again, this wasn’t a test idea developed to work for everyone, only those low in Agreeableness and higher in the innate Neuroticism personality trait.

Why Personality? It travels well. It’s part of you. It is measurable and targetable, we know how to message to traits and, most importantly, our personality determines much of what we pay attention to, what we consider, and what we act on.

Framing appeals to match who I am versus who I am not seems obvious, except obvious is in short supply, so it ain’t really that obvious.

The experiment included measuring Personality traits after the donation in order to break out donation behavior by trait and by control/test. What did they find when they analyzed the results the intended way – i.e. not lumping together the two trait types and blurring any differences in behavior?

The test won among those low in Agreeableness and higher in Neuroticism and lost for those high in Agreeableness.

Framing matters. But it matters differently for different people. You do have different donor segments but their ‘why’ of giving has zero to do with demographics, behavior, channel or any other internally defined segmentation. It also has zero to do with random, persona clusters that were created by throwing everything into a statistical blender.

Start with why people do what they do, and recognize it isn’t the same answer for everyone while knowing groups of similar people exist. Build a test aimed at a specific group but also include a group you don’t think it will work with to more fully establish cause and effect.
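A pre-planned breakdown of this kind might be read along the following lines. This is a sketch only: the counts are hypothetical numbers invented for illustration (they are not the World Vision results), the segment labels "target" and "comparison" are placeholders, and the simple z statistic is one possible way to check whether the lift differs between the two groups you specified in advance.

```python
import math

# HYPOTHETICAL counts for illustration only: (people who gave, people asked).
# "target" = the group the test was designed for; "comparison" = the group
# you did not expect it to work with. These are NOT real results.
results = {
    ("target", "control"): (40, 1000),
    ("target", "test"): (65, 1000),
    ("comparison", "control"): (60, 1000),
    ("comparison", "test"): (42, 1000),
}

def rate_and_variance(gave, asked):
    """Response rate and the variance of that rate (binomial approximation)."""
    rate = gave / asked
    return rate, rate * (1 - rate) / asked

def lift(segment):
    """Test minus control response rate within one segment, with its variance."""
    control_rate, control_var = rate_and_variance(*results[(segment, "control")])
    test_rate, test_var = rate_and_variance(*results[(segment, "test")])
    return test_rate - control_rate, control_var + test_var

def pooled_rate(arm):
    """Response rate for one arm with both segments mashed together."""
    gave = sum(g for (seg, a), (g, n) in results.items() if a == arm)
    asked = sum(n for (seg, a), (g, n) in results.items() if a == arm)
    return gave / asked

target_lift, target_var = lift("target")
comparison_lift, comparison_var = lift("comparison")
pooled_lift = pooled_rate("test") - pooled_rate("control")

# Interaction: is the lift different between the two pre-specified groups?
interaction = target_lift - comparison_lift
interaction_z = interaction / math.sqrt(target_var + comparison_var)

print(f"Everyone mashed together: {pooled_lift:+.2%}")
print(f"Target group lift:        {target_lift:+.2%}")
print(f"Comparison group lift:    {comparison_lift:+.2%}")
print(f"Interaction z statistic:  {interaction_z:.2f}")
```

In these made-up numbers the pooled result looks like a non-event while the two groups move in opposite directions. The difference from after-the-fact slicing is that only one comparison is made, and it was chosen before the mail dropped.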

The random nth should die a quick death. It won’t. But it should…

Kevin
