Otter is built for meetings: capture what was said, who said it, what was decided. Alfie is built for learning: turn a lecture, talk, or podcast into structured knowledge you can actually retain and act on.
Different jobs. Different outputs. This page makes the choice obvious.
Alfie and Otter look adjacent — but they serve completely different jobs.
Both tools work with audio — but the output they optimise for is completely different.
Otter is purpose-built for this job. For meetings, it's hard to beat.
Audio in. Structured knowledge out. Built for retention.
Why learning synthesis matters: A 90-minute lecture transcript is ~15,000 words of undifferentiated text. Reading it cover-to-cover doesn't mean you've retained it. Imposing a schema — outline → key concepts → recall prompts → next actions — at the processing step is what transforms raw audio into something your brain can actually use.
| Alfie | Otter | |
|---|---|---|
| Primary Input | Lectures, talks, podcasts, interviews — content you need to understand | Live meetings, calls, team discussions |
| Core Output | Structured synthesis: key concepts, outline, recall prompts, next actions | Verbatim transcript + meeting notes / action items |
| Best Use | Learning, studying, retaining ideas from audio you consumed solo | Capturing what was said in a collaborative meeting |
| Output Consistency | Same structured schema every time — predictable, reviewable | Varies; depends on meeting flow and speaker clarity |
| Repeatability | Yes — same format for every note, enabling spaced review | Meeting-dependent; not designed for repeated learning review |
| Ideal Content Types | University lectures, conference talks, documentary audio, online courses, podcasts | Work standups, sales calls, team planning sessions, 1:1s |
| AI Chat | Yes — ask questions about your note, drill into concepts | Limited; focused on meeting Q&A retrieval |
| Privacy | Processed in the US, files not retained after processing | Cloud-stored; used to improve Otter AI models |
| Transcript Export | Yes — .txt with speaker labels and timestamps | Yes — multiple formats |
| Pricing | Free (30 min/mo); Pro from $9/mo annual; Max from $19/mo annual | Free (limited); Pro ~$10/mo; Business plans available |
Here's a short excerpt from a recorded lecture on memory consolidation — and what each tool produces.
Source: Raw transcript excerpt (lecture on memory consolidation)
“…so the hippocampus doesn't store memories permanently — it's more like a temporary buffer. What actually happens during sleep is that the neocortex consolidates the important stuff and the hippocampus can let it go. This is why pulling an all-nighter before an exam is counterproductive — you're encoding without the consolidation step…”
Otter typical output
Speaker 1 [14:32]: …so the hippocampus doesn't store memories permanently — it's more like a temporary buffer…
Discussed: hippocampus function, memory consolidation during sleep
No structured concepts, recall prompts, or follow-up actions generated.
Alfie output (consistent every run)
KEY CONCEPT
Hippocampus = short-term buffer; sleep triggers neocortical consolidation (long-term storage)
RECALL PROMPT
Why does sleep improve learning more than reviewing notes?
NEXT ACTION
Schedule review sessions after sleep, not immediately after lecture
From raw audio. No prep. Same structure every time.
Not for meetings. If your core need is capturing what happened in a team standup or sales call, Otter is purpose-built for that. Alfie is purpose-built for the opposite problem: you have content you want to *learn from* — a lecture, a podcast, a recorded talk — and you need structured understanding, not just a record of what was said.
Yes. Every note includes the full verbatim transcript with speaker labels and timestamps, downloadable as .txt. The synthesis layer is on top of — not instead of — the transcript.
Meeting notes answer: "What did we decide?" Alfie's synthesis answers: "What does this mean and what should I understand?" That means structured outlines, key concept extraction, recall prompts, and follow-up actions — a schema designed for retention, not just record-keeping.
Alfie works best on content with ideas — lectures, conference talks, expert interviews, documentary audio, online course videos, and long-form podcasts. It works less well on casual conversation or multi-speaker back-and-forth (like a team brainstorm), where Otter is stronger.
Audio is processed securely in the United States and is not retained after transcription. Alfie does not use your audio or transcripts to train models.
Pro plan supports files up to 3 hours; Max plan supports up to 6 hours. Both handle long-form academic and professional content without splitting.
You can, and the transcript will be accurate. But the synthesis output is optimized for idea-dense content, not conversational meeting dynamics. If your meeting involves a presentation, expert walkthrough, or keynote-style content, Alfie works great. For general team meetings, Otter is the better fit.
We achieve 95%+ accuracy in identifying speakers, even with similar voices or accents. Perfect for professional interview analysis.
We support a wide range of audio and video formats. Reach out if you don't see your desired format listed.
Audio formats: FLAC, MP4, M4A, MPEG, MP3, AMR, AAC, MPGA, OGG, WAV, WEBM, OGA
Video formats: MP4, AVI, MOV, QUICKTIME, WMV, FLV, WEBM, MKV
It varies based on the length of the file. Most files are transcribed within 1-3 minutes. You'll get instant notifications when your transcript is ready.
We support English, Chinese (Mandarin & Cantonese), Spanish, Japanese, German, French, and more. Automatic language detection is included.
Yes, use our browser-based editor to make corrections on the transcript and speakers before exporting.
Yes, you can cancel your Pro subscription anytime with no questions asked. You'll retain access until the end of your billing period.
Start free, then unlock more when you need it.
Upload a lecture, paste a YouTube URL, or drop in a podcast. Alfie handles transcription, synthesis, and formatting automatically — so every session builds on the last.
No credit card required • 30 minutes free to start