Every P2P payment app β Venmo, Zelle, PayPal, Cash App β wraps itself in "bank-level encryption" badges, padlock icons, fraud-protection microcopy. The assumption baked into the industry is that visible security communication drives user trust.
But when I send $20 to someone on Venmo, I don't read the encryption badge. I check the profile photo, the username, the last transaction. My trust lives somewhere else entirely.
So which is it? Is the industry right and I'm an outlier, or is the industry wrong? And if it is wrong, by how much? Conjoint analysis can answer that with a number.
If I asked people "how important is encryption to you?" they'd all say "very important". Self-report on trust is useless β everyone performs the rational answer. Conjoint analysis asks instead: which of these two options would you pick? Trade-offs force revealed preferences, and with enough repetitions across many respondents, I can decompose each choice into part-worth utilities per attribute level.
I mocked a neutral βgeneric pay appβ confirmation screen with 6 trust attributes (varied at 2β3 levels each) plus a transaction amount as context. Each participant saw 12 forced-choice tasks drawn from a 24-card D-efficient design. Modelled in Conjointly's free academic tier to recover individual-level utilities β not just aggregate averages.
I'm one person with no funding. A paid panel (Prolific, Qualtrics) was out of reach, so my recruiting pool was people I could reach personally β classmates, friends-of-friends, two family members. Twelve was the number who'd sit with me for a 30-minute Zoom each.
I kept the quantitative method (choice-based conjoint) anyway. A dedicated conjoint tool (I used Conjointly's free academic tier) can still produce interpretable utility estimates at n = 12 β the error bars are wide, but the rank ordering of attribute importance was stable when I held out different participants and re-ran. That's enough to answer "which signals matter most?" with confidence, even if it's not enough to pin down the exact importance scores to the US population.
Orme's rule of thumb says ~63 respondents for aggregate CBC estimation at this design's size (largest attribute = 3 levels). I'm an order of magnitude below that β and I say so in the Limitations section. The goal here was to practice the craft and get a directionally honest answer, not publish a representative study.
Twelve participants recruited from my own network β classmates, friends-of-friends, two family members. All active monthly users of at least one P2P payment app (Venmo, Zelle, Cash App, PayPal). A convenience sample, not representative of the US. The Limitations section names this explicitly.
Twelve steps over six weeks. Each one is broken into the thought that kicked it off, why it mattered, and what I actually did. This is the part I wish other case studies showed me when I was learning.
Neutral "generic pay app" frame β no brand colors, no Venmo blue or Cash App green. Each participant saw 12 tasks assembled from the same component kit; only the six attributes varied.
Every participant saw 4 tasks at each amount ($15 / $75 / $400), drawn from the 24-card D-efficient design and randomized per-participant. Below: one representative prompt per tier, plus the full 12-task layout a participant actually saw.
Built as a small design system so every stimulus was a compositing job from identical pieces. Typography, spacing, and CTA stayed locked β only the trust-signal components swapped in.
Name, photo, history and mutuals together = 86.3% of the decision. Technical reassurance = 13.7%. Stop designing encryption PR pages before the product solves the "is this the right person" problem.
Venmo's username culture optimizes for delight. But the data says a verified legal name moves trust more than every security badge combined. There's a design challenge here: surface legal identity without killing the playful handle.
21% importance. Showing "you've sent $X to this person N times before" right above the confirm button is a low-lift, high-yield change β and none of the four apps I tested make this prominent in the confirm flow.
10.9% vs 6.6%. "5 mutual friends" is a stronger trust signal than a platform-verified checkmark. Social proof beats institutional proof for transaction-level trust.
I expected users to shift toward technical signals at higher amounts. They didn't. At $400 they wanted more identity confirmation, not more cryptography. This has implications for step-up friction design.
Legal name, profile photo, shared-history pill, mutuals. That's the trust stack. The padlock stays, but it earns its keep at 7%.
Conjoint asks what people would do. Nobody actually sent $400 in my study. A follow-up with live-fire behavioral data (click-through on real confirm screens) could either validate or deflate these utilities.
The biggest honest caveat. Orme's rule for CBC wants ~63 respondents for aggregate estimation at this design's size. I'm at 12. Point estimates have wide credible intervals; the rank ordering of attribute importance was stable under leave-one-out jackknifes, but treat specific percentages as rough. A follow-up on a paid panel (Prolific, ~$600 for n β 400) is the natural next step.
My recruits are people in my network β skewing young, urban, college-educated. The 45β59 bracket (n = 2) is statistically meaningless. Nothing here can be generalized to older or less digitally-fluent users. That matters a lot for a trust question, where older users may weight technical signals very differently.
My stimuli used generic "bank-level encryption" wording. A branded trust seal (Norton, McAfee, FDIC) might perform differently. Worth a follow-up to isolate wording effects from badge-presence effects.
"Feel comfortable completing" is a proxy for behavioral intent. Some users might feel cautious but still send the money because they have to. This study measures felt trust, not revealed conversion.
Everything here assumed the recipient was someone the sender recognized. The interesting next question is marketplace payments β Facebook Marketplace, Craigslist, OfferUp. When the recipient is a stranger, does encryption badging finally earn its weight? Or does social context (mutuals, platform reputation) stay dominant?
I'd run the same conjoint with a marketplace framing, same 7 attributes, and test whether the importance ranking scrambles. If it does β interesting. If it doesn't β even more interesting.
β back to work