Voice Memo Transcription with Multi-Language Support #1228

1. Type of request

Series of features (functional upgrade to new Voice Memos).

2. Requester

A dedicated “AI Premium” subscriber and user who primarily journals via voice.

3. Describe the problem that the user has

  • The Core Problem (Missing Functionality): Currently, the Voice Memo feature is a “Black Box.” After recording, the only options are “Play” or “Delete.” There is no Transcribe function. Because there is no text output, the app’s core value proposition—reflecting with AI mentors like Marcus Aurelius or Seneca—is completely inaccessible to voice users. The audio sits as dead data that the AI cannot read or analyze.
  • The Configuration Problem (Language Context): If transcription were added, it would also need the following functionality:
    • Language Mix: I keep the App in English, but I speak in Dutch.
    • Language-Switching: I frequently mix languages (e.g., using English Stoic terms like “Virtue” inside Dutch sentences). I need the transcription engine to handle this specific mix without error, rather than forcing everything into one language.
  • The Input Problem: I cannot import external audio files (e.g., from Apple Voice Memos) into the app for adding to a journal entry.

4. Describe the impact that this has on the user

  • Unlocking the App’s Potential: Adding transcription transforms voice memos from simple storage into active journaling material that the AI can help me reflect on, while also letting me read back the journal entries I recorded as voice memos.
  • Workflow Efficiency: A “Transcribe” button allows me to capture thoughts quickly via voice but still get the benefits of a text journal (searchability, AI analysis).
  • Accuracy: Flexible language options ensure the transcript actually matches what I said, preserving the philosophical context of terms like “Virtue” even when speaking in Dutch.

5. Describe the reach of this problem

This impacts every user who prefers speaking over typing, as well as the entire international user base that navigates between their native language and the English app interface.

6. Describe the cost of not doing this request

  • Devalued Premium Subscription: Users pay for “AI Reflection,” but voice users can’t use it. This makes the subscription feel less valuable.
  • Feature Fragmentation: The app feels split in two: a “Smart AI” journal and reflection app and a “Simple” voice recorder.
  • Missed Habits: Users who find typing tedious will simply stop journaling if the voice experience doesn’t offer the same rewards (AI feedback).

7. Describe which business goals this helps

  • Revenue Growth (Premium Driver): High-quality transcription (especially with mixed-language support) is a costly API service. This is the perfect feature to lock behind the AI Premium subscription to drive upgrades.
  • User Retention: Improving the ease of input (voice) increases the frequency of journaling.

8. Describe the evidence that you have on the need for this request

  • Current Workflow Gap: I record a 5-minute thought in response to a journal question or section. I want to ask the AI, “What would Marcus say about this?” I cannot do this because the AI cannot hear the file, nor can I transcribe the voice memo so the AI can process the text.
  • User Constraint: “I use a mix of Dutch and English. I don’t want the transcriber to write a Dutch word that sounds like ‘Virtue’ when I say ‘Virtue’—it needs to understand the context.”

9. Describe if you have any ideas on how we may solve this

  • A. The Core Solution (The Transcribe Button):
    • Update the post-recording menu to include: Play, Transcribe, Delete.
    • Clicking “Transcribe” converts the audio to text and appends it to the entry, unlocking the standard AI reflection tools.
  • B. Language Configuration (The “Clean UI” Approach):
    • Default Settings: Allow me to set a “Default Transcription Language” in the main settings (e.g., Input: Dutch) so the UI remains clean and I don’t have to select a language every time I record and/or transcribe.
    • Language-Switching Logic: Ensure the underlying model is tuned to recognize English words within non-English sentences.
  • C. Import & Export Utility:
    • Allow upload of .mp3/.m4a files to the journal for transcription.
    • When I export a journal entry that contains a voice memo to text or Markdown, also export the audio file with it as .mp3/.wav, and have the text or Markdown file include a relative link to, or a mention of, the audio file.
  • D. Monetization:
    • Include these features as part of the AI Premium tier.
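
To make ideas B and C a bit more concrete, here is a rough Python sketch of how I imagine the logic could work. Everything in it is hypothetical: the `Settings` fields, the function names, and the `audio/` export folder are my own assumptions and not the app's actual code or API. It only shows (1) picking the transcription language from a default setting, (2) checking an imported file against the .mp3/.m4a formats mentioned above, and (3) exporting an entry to Markdown with a relative link to the audio. It copies the audio file as-is and does not convert it to mp3/wav.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Optional
import shutil

# Import formats named in this request (assumption: matched by file extension only).
ALLOWED_IMPORT_EXTENSIONS = {".mp3", ".m4a"}


@dataclass
class Settings:
    """Hypothetical user settings; field names are illustrative."""
    ui_language: str = "en"
    default_transcription_language: Optional[str] = None  # e.g. "nl" for Dutch


def transcription_language(settings: Settings, override: Optional[str] = None) -> str:
    """A per-recording override wins, then the default setting, then the UI language."""
    return override or settings.default_transcription_language or settings.ui_language


def can_import(filename: str) -> bool:
    """Accept only the audio formats listed under idea C."""
    return Path(filename).suffix.lower() in ALLOWED_IMPORT_EXTENSIONS


def export_entry(title: str, text: str, audio_path, out_dir) -> Path:
    """Write <title>.md and copy the audio next to it, linked by a relative path."""
    out_dir = Path(out_dir)
    audio_dir = out_dir / "audio"
    audio_dir.mkdir(parents=True, exist_ok=True)

    # Copy the recording unchanged; format conversion is out of scope here.
    copied = audio_dir / Path(audio_path).name
    shutil.copyfile(audio_path, copied)

    # Use a relative, forward-slash path so the link survives moving the folder.
    rel_link = copied.relative_to(out_dir).as_posix()
    md_path = out_dir / f"{title}.md"
    md_path.write_text(
        f"# {title}\n\n{text}\n\n[Voice memo]({rel_link})\n", encoding="utf-8"
    )
    return md_path
```

With this fallback order, the app UI can stay in English while `transcription_language(Settings(default_transcription_language="nl"))` still resolves to Dutch, which is the "clean UI" behavior described in idea B.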

10. Describe how urgent this is and why

Medium-High. Transcription is now a standard expectation for voice-enabled apps, and it is the minimum needed to unlock the AI features when using voice memos. Adding this closes the gap between the recorder and the AI, making the app fully functional for voice-first users.

5 months ago