Voice Notes in X DMs: When They Work and When They Backfire

Voice notes feel personal. But do they actually convert better than text for B2B outreach? We tested it. Here's the honest answer.

ConvoWise
5 min read
Voice Notes in X DMs: When They Work and When They Backfire

Someone on Twitter told you voice notes are "the secret weapon for standing out in DMs."

So you recorded a 2-minute message to a founder you've never talked to, sent it, and... crickets.

Here's the thing: voice notes can work incredibly well. But most people use them in exactly the wrong situations. And when voice notes fail, they fail hard. Like, screenshot-your-message-and-roast-you-on-the-timeline hard. Fun.


The Fundamental Problem

Text is skimmable. Voice isn't.

When someone gets a text DM, they can glance at it in 2 seconds and decide if it's worth their time. When they get a voice note, they have to commit to listening before they know what it's about.

For a busy person, that's a big ask. Especially from a stranger.

Think about your own inbox. How many voice notes from people you don't know do you actually listen to? It's the same reason cold DMs get ignored , trust has to come first. Be honest. (My guess: almost zero.)

That's not because voice is bad. It's because voice requires trust. And trust takes time to build.


When Voice Notes Actually Work

Voice notes shine when there's already a relationship. Even a small one.

After You've Exchanged a Few Messages

You've been going back and forth in text. The conversation is getting longer. Switching to voice feels natural, like you'd do with an actual friend.

"Hey, this is getting long, easier to just say it." That makes sense. That doesn't feel weird.

For Follow-Ups That Need Warmth

Instead of the generic "just checking in" text, a quick voice note feels personal. You can hear that it's genuine, not a copy-paste template.

"Hey, been thinking about what you said about the hiring challenges. Had an idea that might help, let me know if you want to chat about it."

Same message in text? Meh. In voice? Feels like you actually care.

When You're Selling High-Touch Services

If you're a coach, consultant, or anyone whose value is partially in the relationship itself, voice notes are a preview of what working with you is like.

A founder selling SaaS? Probably not necessary. A speaking coach? Voice notes make a lot of sense. For more on video-based DM outreach, check our dedicated guide.

Complex Explanations

Sometimes explaining something in text would take 500 words. In voice, it takes 45 seconds and includes the nuance that text would miss.

But, and this is important, you have to actually know what you're going to say before you hit record. Rambling kills the benefit.


When Voice Notes Backfire

If you're about to do any of these, just... don't.

Cold First Touch

Stranger sends you a voice note. What do you do?

Most people: ignore it entirely. Some people: feel annoyed that you're asking for their time before proving you're worth it.

Our data shows cold voice notes get 60% lower response rates than cold text DMs. Not even close.

Busy Executives

CEOs, founders with big audiences, anyone drowning in messages. They don't have time to listen to every voice note. They need to triage fast.

Text lets them do that. Voice doesn't. You're making their life harder, not easier.

When You Haven't Prepared

You can hear when someone is winging it. The "um"s, the pauses, the rambling to nowhere.

If your voice note sounds worse than your written message would have been, you've made it harder for them to like you, not easier.

Anything Over 60 Seconds

If it takes more than a minute, it should probably be a call. Or a Loom. Not a voice note.

I once got a 4-minute voice note from someone trying to pitch me. Four minutes. I made it 30 seconds before I closed it.


The Voice Note Checklist

Before you hit record, ask yourself:

Do they know who I am? If no, probably stick to text.

Have we already exchanged messages? Voice makes more sense after rapport exists.

Do I know exactly what I'm going to say? If not, prepare first.

Can I say it in under 45 seconds? If not, reconsider the format.

Is my audio quality decent? Background noise = skip.

Hit yes on all five? Go for it. Otherwise, text is probably better.


The Hybrid Approach

Here's what actually works well: voice note + text summary.

Send the voice note, then immediately follow it with a one-line text: "TLDR: had an idea about your hiring situation, 40 sec voice note above if you want the context."

Now they can decide whether to listen based on whether the topic is relevant. Best of both worlds.

We've seen this get 2x the response rate of voice-only messages. People appreciate you respecting their time. For more ways to boost reply rates, see our pattern interrupt DMs guide.


Frequently Asked Questions

Do voice notes work for X outreach?

Sometimes. They feel personal and stand out, but require more commitment from the recipient. Best for warm prospects or follow-ups after you've built some rapport. Our data shows cold voice notes actually underperform cold text by about 60%.

When should I send a voice note instead of text?

After initial text conversation, for personalized follow-ups, complex explanations, and when you're selling high-touch services where the relationship matters. Avoid for cold outreach to busy executives.

How long should a voice note be?

30-45 seconds is ideal. 60 seconds is the absolute max. If it takes longer than that, you should probably be having a call, not sending a voice note. And always know what you're going to say before you hit record.

Want Outreach That Actually Converts?

We test every tactic, including voice notes, so you don't waste time on things that don't work.

Book a Strategy Call →

Keep Reading

[

Related

DM Personalization at Scale

](/blog/x-dm-personalization-at-scale)[

Follow-ups

How to Follow Up Without Annoying

](/blog/follow-up-x-without-annoying)

Ready to book more calls?

Get a free X outreach audit. We will show you exactly how to turn DMs into discovery calls.