iPhone · iOS 26 · on-device

From thought to vault in one tap.

Capture on iPhone. Transcribe on the Apple Neural Engine. Route the markdown to iCloud, Obsidian, your n8n watch-folder, your webhook — wherever your automation already lives.

Download on the App Store $2.99 / month · 7-day free trial · No account

On device

Voice

Action Button · Siri · Shortcut

Transcribe

Apple Neural Engine

iCloud Drive Email Webhook SFTP Shortcut

Why VoiceToMD

Built for people who already have a knowledge base.

VoiceToMD doesn't replace your vault. It feeds it. Voice notes are context, not prompts.

01 / Privacy

Audio never leaves your device.

Full Whisper transcription and speaker diarization run on the Apple Neural Engine. No API. No cloud. No account. Works offline after the model downloads on first launch.

WhisperKit · SpeakerKit · Pyannote v3 / v4

02 / Speed

From thought to vault in one tap.

Bind the Action Button. Press it on a locked phone. FaceID, recording, lock-screen pill — Pause, Stop, Send — without ever opening the app. Siri and Control Center work too.

Action Button · Live Activity · App Intents

03 / Routing

Your automation, your destinations.

iCloud Drive, Email, Webhook, SFTP, Shortcut. Fan out one capture to multiple destinations. Markdown with YAML frontmatter — drop it into anything that already reads markdown.

Multi-destination · YAML frontmatter

How it works

Four states. Zero unlocks.

The whole capture happens on the lock screen. Press, talk, send. The Live Activity walks you from recording to vault without ever opening the app.

01 · Press

Press the Action Button.

FaceID resolves, the mic opens — recording is running before you finish raising the phone.

9:41

Saturday, May 9

Action Button

02 · Record

Lock-screen pill takes over.

Red indicator, profile name, destination subtitle, self-ticking timer. Pause and Stop are 44pt pills.

9:41

Saturday, May 9

Recording

Voice Memo · Anzel Vault

0:34

Pause

Done

03 · Transcribe

Whisper on the Neural Engine.

Audio length, ETA, animated progress. Sub-state "Identifying speakers" when diarization runs.

9:41

Saturday, May 9

Transcribing

Whisper Small · Neural Engine

0:08

04 · Send

Tap Send. Lands in your vault.

Final filename, transcribed pill. Hold sends to History to triage later. Send goes straight to your destination.

9:41

Saturday, May 9

Ready

2026-05-09-notes.md

Hold

Send to Vault

Destinations

Five ways out. Your call.

Configure as many destinations as you need, group them into profiles, and fan one capture out to multiple targets simultaneously.

iCloud Drive

Drop into any folder — including inside an Obsidian vault.

Email

Pre-filled compose sheet, ready for tap-to-send.

Webhook

POST JSON to any URL. Headers, secret token, retries.

SFTP

Key-based auth into your home server, NAS, or VPS.

Shortcut

Hand off to any iOS Shortcut — your automation, your call.

Free includes iCloud Drive and Email. Pro adds Webhook, SFTP, Shortcut, and multi-destination fan-out.

Output

Markdown your vault already understands.

2026-05-09-notes-from-the-project-review.md

---
title: Notes from the project review
created: 2026-05-09T14:32:11-05:00
source: voice
profile: voice-memo
duration_sec: 184
destination: anzel-vault-inbox
device: iPhone
model: openai_whisper-small_216MB
audio_retained: true
audio_path: 2026-05-09-notes.m4a
speakers: 2
speaker_labels: [Chris, Hannah]
tags: [voice-note]
---

**Chris:** The screenshot spec needs to land before
the App Store description. They share vocabulary.

**Hannah:** Agreed. Let's sequence it that way.

Frontmatter your vault already understands.

Drop into Obsidian, Foam, Logseq, Quartz, your static-site generator, or any pipeline that reads YAML. Nothing to preprocess. Nothing to clean up.

frontmatter

Standard YAML keys for title, source, profile, model, destination — everything Dataview and Templater already index.

paragraphs

Breaks where the speech actually paused. Configurable per profile.

speakers

**Speaker:** labels appear only when more than one voice is in the room. One-speaker takes stay clean.

filename

Templated with {date}, {slug}, {profile}. Auto-suffixes on collision.

Pricing

Two tiers. Apple handles billing.

Free

$0 forever

1 profile (Voice Memo)
30 transcriptions / 30-day rolling window
iCloud Drive + Email destinations
On-device transcription
Speaker diarization
All quick-capture surfaces — Action Button, Siri, Control Center, Shortcuts

Pro

$2.99 / month or $29.99 / year

7-day free trial

Everything in Free
Unlimited profiles
Unlimited transcriptions
Webhook, SFTP, and Shortcut destinations
Multi-destination fan-out
Vocabulary priming

No account. No subscription manager. Apple handles billing.

Compared

How VoiceToMD fits next to the alternatives.

	VoiceToMD	Plaud	Otter / Fireflies	Apple Voice Memos
Where audio is processed	On-device	Their cloud	Their cloud	On-device (no transcription)
Built for	PKM / vault users with own LLM stack	Knowledge workers, no existing system	Meeting professionals	Anyone
Output	Markdown with YAML frontmatter	Their app + 27+ exports	SaaS dashboard + PDF	Audio file in iCloud
Destination	Wherever you want	Plaud cloud (lock-in)	Their dashboard	Manual export
Pricing	$30/yr or $2.99/mo	$99–240/yr	$17–30/seat/mo	Free
Hardware	iPhone you already own	Separate $159 device	None	None

FAQ

Five questions, answered.

Does VoiceToMD send my audio to a server?

No. WhisperKit and SpeakerKit run entirely on the Apple Neural Engine. The app makes no transcription API calls. Audio is retained locally per-profile (default 5 days) so you can re-transcribe with a different model, or it's deleted on successful send.

What iPhone do I need?

iPhone 12 or later, on iOS 26 or later. iPhone 14 Pro and later get Dynamic Island treatment. iPhone 15 Pro and later have the Action Button — the fastest capture path.

Which Whisper models are available?

Three Argmax-quantized variants — Small (216 MB), Distil-Large (594 MB), and Large-Turbo (954 MB). Pick during onboarding. Switch any time in Settings. Re-transcribe past notes with a different model whenever you want.

Does it work with Obsidian?

Yes. Point a profile's iCloud Drive destination at any folder inside your Obsidian vault. Notes appear as .md files with YAML frontmatter, ready for the Dataview / Templater stack you already have. The same profile can fan out to a webhook simultaneously — your vault and your n8n flow both get a copy.

Is there an Android version?

No, and there are no plans for one. VoiceToMD is built around the Action Button, Live Activity, App Intents, and the Apple Neural Engine — none of which have an Android equivalent worth shipping against.

From thought to vault
in one tap.

No account. No cloud transcription. Just press the button and talk.

Download on the App Store

Built for people who already have a knowledge base.

Audio never leaves your device.

From thought to vault in one tap.

Your automation, your destinations.

Four states. Zero unlocks.

Press the Action Button.

Lock-screen pill takes over.

Whisper on the Neural Engine.

Tap Send. Lands in your vault.

Five ways out. Your call.

iCloud Drive

Email

Webhook

SFTP

Shortcut

Markdown your vault already understands.

Frontmatter your vault already understands.

Two tiers. Apple handles billing.

Free

Pro

How VoiceToMD fits next to the alternatives.

Five questions, answered.

From thought to vaultin one tap.

From thought to vault
in one tap.