Stop the Direct Mail Testing

September 25, 2014      Kevin Schulman, Founder, DonorVoice and DVCanvass

Is your direct mail testing on auto-pilot?  Are you testing out of habit?  We hear a lot of very smart, sophisticated direct marketers working for big non-profit brands tell us exactly that.

If you are one of them it is time to get off the merry-go-round and stop testing (with the current approach).

These same marketers are weighing the time, effort and cost of their habitual, auto-pilot testing against the return it delivers and making the smart decision to cut way back on the number of tests.  It is hard, after all, to beat the control.

To address the “now what?” question that remains if you find yourself in this boat, we put together this 10-point framework for testing.  The guiding principle: non-profits should think of all the time, effort and money going into habitual, rote testing as their pot of money for innovation.

We believe this testing protocol will lead to far fewer, more meaningful tests (a big plus) and more definitive decision making on outcomes (another big plus).

1)      Allocate 25% of your acquisition and house file budget to testing.

2)      Of the 25%, put 10% into incremental improvements and 15% into big, breakthrough ideas.

  • An important corollary here: some of this money should go into researching ideas or paying others to do it.  You can even use the online environment to pre-vet ideas with small, quick tests to gather data.

3)      Set guidelines for expected improvement. 

  • Any incremental idea must deliver a 5% (or better) improvement on the house file and a 10% improvement in acquisition (we will see why they differ in a minute).
  • Any breakthrough idea must deliver a 20% (or better) improvement.

4)      Any idea – incremental or breakthrough – must have a ‘reason to believe’ case made that relies on a theory of how people make decisions, publicly available experimental test results, or past client test results. 

  • The reason-to-believe case must state whether the idea is designed to improve response, average gift, or both – this will be the metric(s) on which performance is evaluated.
  • A major part of this protocol is guided by the view that far more time should be spent generating test ideas, and therefore on creating the necessary “rules” and incentives to produce that outcome.
  • This may very well result in 3 to 5 tests per year.  If they are well conceived and vetted, that is a great outcome.

5)      Determine test volume with math, not arbitrary, “best practice” test panels of 25,000 (or whatever).

  • Use one of many web-based calculators (and the simple statistical formulas underlying them).  Here is one we like, but there are plenty – all free.
  • Input past control performance and the desired improvement (i.e., the 5% or 20%).  Do not use arbitrary 25k and 50k test sizes.
  • An acquisition example: if our control response rate is 1% and we want to be able to flag a 5% improvement – i.e., a response rate greater than 1.05% – as real, the test panel would need to be 626,231 (at 80% power, 95% confidence, two-tailed test).  That is not a typo.  How many acquisition test panels in the history of non-profit direct mail have produced meaningless results because of all the statistical noise?  A sizeable majority, at least.  If we want to be able to flag a 10% improvement – i.e., better than 1.1% – as meaningful, we need a test panel of 157,697.  For most large charities that size is very doable, but only if the math behind it is understood (a quick sketch of that math follows below).
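
For anyone who wants to sanity-check a calculator (or just see where those numbers come from), here is a minimal sketch of the standard two-proportion sample size formula in Python.  The exact result depends on which variance assumption a given calculator uses, so it lands near – not exactly on – the figures above; the 1% response rate and the lift values are the ones from the example.

```python
from statistics import NormalDist

def test_panel_size(control_rate, relative_lift, power=0.80, alpha=0.05):
    """Per-panel size needed to detect a relative lift in response rate
    with a two-tailed, two-proportion test at the given power/confidence."""
    p1 = control_rate                        # e.g. 0.01 for a 1% control
    p2 = control_rate * (1 + relative_lift)  # e.g. 0.0105 for a 5% lift
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)  # ~1.96 for 95% confidence
    z_beta = NormalDist().inv_cdf(power)           # ~0.84 for 80% power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return round((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)

print(test_panel_size(0.01, 0.05))  # roughly 630k-640k pieces per panel
print(test_panel_size(0.01, 0.10))  # roughly 160k pieces per panel
```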

 6)      Do not create a “random nth” control panel that matches the test cell size for comparison.

  • We are not sure how many charities employ this approach, but it can lead to drawing exactly the wrong conclusion on whether the test won or lost.
  • The problem with a “random nth” control panel of equal size to the test – e.g., two panels drawn random nth at 25,000 each – is that it creates a point of comparison with its own statistical noise, and far more of it than the main control carrying all the volume.  A few retorts have surfaced in defense of this practice, but they are simply off-base; the quick sketch below shows the scale of the problem.
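
To put a number on that noise, here is a small sketch comparing the 95% margin of error on a 25,000-piece “random nth” control panel with that of a full-volume control.  The 1% response rate and the 500,000-piece rollout volume are assumptions for illustration only:

```python
from math import sqrt

def margin_of_error(p, n, z=1.96):
    """Approximate 95% margin of error on an observed response rate."""
    return z * sqrt(p * (1 - p) / n)

p = 0.01  # assumed 1% control response rate
for n in (25_000, 500_000):
    print(f"n = {n:>7,}: observed rate wanders +/- {margin_of_error(p, n):.3%}")

# n =  25,000: +/- ~0.123 percentage points -- bigger than the 0.05-0.10 point
#              lifts (5-10% relative) we are trying to detect
# n = 500,000: +/- ~0.028 percentage points
```

In other words, a 25,000-piece control panel can swing by more than the entire lift being tested, which is exactly how a winning test gets called a loser (or the reverse).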

7)      Determine winners and losers with math, not eyeballing it.

  • Use one of many web-based calculators to input test and control performance and statistically declare a winner or loser (a sketch of the underlying math follows below).
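
If you would rather run the math than trust a black box, a minimal sketch of the two-proportion z-test – the same calculation most of those calculators perform – looks like this; the panel sizes and response counts are purely hypothetical:

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_z_test(resp_test, n_test, resp_ctrl, n_ctrl):
    """Two-tailed z-test comparing test vs. control response rates."""
    p_test, p_ctrl = resp_test / n_test, resp_ctrl / n_ctrl
    p_pool = (resp_test + resp_ctrl) / (n_test + n_ctrl)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_test + 1 / n_ctrl))
    z = (p_test - p_ctrl) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Hypothetical: 160,000-piece test with 1,840 responses (1.15%)
# vs. 500,000-piece control with 5,000 responses (1.00%)
z, p = two_proportion_z_test(1_840, 160_000, 5_000, 500_000)
print(f"z = {z:.2f}, p-value = {p:.4f}")  # p < 0.05 => the lift is statistically real
```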

8)      Declare the test a winner or loser.

  • Add the results to the “reason to believe” document and maintain a searchable archive.

 9)      All winners go to full-volume rollout. 

 10)   Losers can be resurfaced and changed with a revised “reason to believe” case.