iPhone · iOS 26 · on-device

From thought to vault in one tap.

Capture on iPhone. Transcribe on the Apple Neural Engine. Route the markdown to iCloud, Obsidian, your n8n watch-folder, your webhook — wherever your automation already lives.

Download on the App Store $2.99 / month · 7-day free trial · No account
On device
Voice
Action Button · Siri · Shortcut
Transcribe
Apple Neural Engine
iCloud Drive Email Webhook SFTP Shortcut
Why VoiceToMD

Built for people who already have a knowledge base.

VoiceToMD doesn't replace your vault. It feeds it. Voice notes are context, not prompts.

01 / Privacy

Audio never leaves your device.

Full Whisper transcription and speaker diarization run on the Apple Neural Engine. No API. No cloud. No account. Works offline after the model downloads on first launch.

WhisperKit · SpeakerKit · Pyannote v3 / v4
02 / Speed

From thought to vault in one tap.

Bind the Action Button. Press it on a locked phone. FaceID, recording, lock-screen pill — Pause, Stop, Send — without ever opening the app. Siri and Control Center work too.

Action Button · Live Activity · App Intents
03 / Routing

Your automation, your destinations.

iCloud Drive, Email, Webhook, SFTP, Shortcut. Fan out one capture to multiple destinations. Markdown with YAML frontmatter — drop it into anything that already reads markdown.

Multi-destination · YAML frontmatter
How it works

Four states. Zero unlocks.

The whole capture happens on the lock screen. Press, talk, send. The Live Activity walks you from recording to vault without ever opening the app.

01 · Press

Press the Action Button.

FaceID resolves, the mic opens — recording is running before you finish raising the phone.

9:41
Saturday, May 9
Action Button
02 · Record

Lock-screen pill takes over.

Red indicator, profile name, destination subtitle, self-ticking timer. Pause and Stop are 44pt pills.

9:41
Saturday, May 9
Recording
Voice Memo · Anzel Vault
0:34
Pause
Done
03 · Transcribe

Whisper on the Neural Engine.

Audio length, ETA, animated progress. Sub-state "Identifying speakers" when diarization runs.

9:41
Saturday, May 9
Transcribing
Whisper Small · Neural Engine
0:08
04 · Send

Tap Send. Lands in your vault.

Final filename, transcribed pill. Hold sends to History to triage later. Send goes straight to your destination.

9:41
Saturday, May 9
Ready
2026-05-09-notes.md
Hold
Send to Vault
Destinations

Five ways out. Your call.

Configure as many destinations as you need, group them into profiles, and fan one capture out to multiple targets simultaneously.

iCloud Drive

Drop into any folder — including inside an Obsidian vault.

Email

Pre-filled compose sheet, ready for tap-to-send.

Webhook

POST JSON to any URL. Headers, secret token, retries.

SFTP

Key-based auth into your home server, NAS, or VPS.

Shortcut

Hand off to any iOS Shortcut — your automation, your call.

Free includes iCloud Drive and Email. Pro adds Webhook, SFTP, Shortcut, and multi-destination fan-out.

Output

Markdown your vault already understands.

2026-05-09-notes-from-the-project-review.md
---
title: Notes from the project review
created: 2026-05-09T14:32:11-05:00
source: voice
profile: voice-memo
duration_sec: 184
destination: anzel-vault-inbox
device: iPhone
model: openai_whisper-small_216MB
audio_retained: true
audio_path: 2026-05-09-notes.m4a
speakers: 2
speaker_labels: [Chris, Hannah]
tags: [voice-note]
---

**Chris:** The screenshot spec needs to land before
the App Store description. They share vocabulary.

**Hannah:** Agreed. Let's sequence it that way.

Frontmatter your vault already understands.

Drop into Obsidian, Foam, Logseq, Quartz, your static-site generator, or any pipeline that reads YAML. Nothing to preprocess. Nothing to clean up.

frontmatter
Standard YAML keys for title, source, profile, model, destination — everything Dataview and Templater already index.
paragraphs
Breaks where the speech actually paused. Configurable per profile.
speakers
**Speaker:** labels appear only when more than one voice is in the room. One-speaker takes stay clean.
filename
Templated with {date}, {slug}, {profile}. Auto-suffixes on collision.
Pricing

Two tiers. Apple handles billing.

Free

Free

$0 forever
  • 1 profile (Voice Memo)
  • 30 transcriptions / 30-day rolling window
  • iCloud Drive + Email destinations
  • On-device transcription
  • Speaker diarization
  • All quick-capture surfaces — Action Button, Siri, Control Center, Shortcuts
Pro

Pro

$2.99 / month or $29.99 / year
7-day free trial
  • Everything in Free
  • Unlimited profiles
  • Unlimited transcriptions
  • Webhook, SFTP, and Shortcut destinations
  • Multi-destination fan-out
  • Vocabulary priming

No account. No subscription manager. Apple handles billing.

Compared

How VoiceToMD fits next to the alternatives.

VoiceToMD Plaud Otter / Fireflies Apple Voice Memos
Where audio is processed On-device Their cloud Their cloud On-device (no transcription)
Built for PKM / vault users with own LLM stack Knowledge workers, no existing system Meeting professionals Anyone
Output Markdown with YAML frontmatter Their app + 27+ exports SaaS dashboard + PDF Audio file in iCloud
Destination Wherever you want Plaud cloud (lock-in) Their dashboard Manual export
Pricing $30/yr or $2.99/mo $99–240/yr $17–30/seat/mo Free
Hardware iPhone you already own Separate $159 device None None
FAQ

Five questions, answered.

Does VoiceToMD send my audio to a server?
No. WhisperKit and SpeakerKit run entirely on the Apple Neural Engine. The app makes no transcription API calls. Audio is retained locally per-profile (default 5 days) so you can re-transcribe with a different model, or it's deleted on successful send.
What iPhone do I need?
iPhone 12 or later, on iOS 26 or later. iPhone 14 Pro and later get Dynamic Island treatment. iPhone 15 Pro and later have the Action Button — the fastest capture path.
Which Whisper models are available?
Three Argmax-quantized variants — Small (216 MB), Distil-Large (594 MB), and Large-Turbo (954 MB). Pick during onboarding. Switch any time in Settings. Re-transcribe past notes with a different model whenever you want.
Does it work with Obsidian?
Yes. Point a profile's iCloud Drive destination at any folder inside your Obsidian vault. Notes appear as .md files with YAML frontmatter, ready for the Dataview / Templater stack you already have. The same profile can fan out to a webhook simultaneously — your vault and your n8n flow both get a copy.
Is there an Android version?
No, and there are no plans for one. VoiceToMD is built around the Action Button, Live Activity, App Intents, and the Apple Neural Engine — none of which have an Android equivalent worth shipping against.

From thought to vault
in one tap.

No account. No cloud transcription. Just press the button and talk.

Download on the App Store