Help Center

Frequently
asked.

Everything you need to know about recording, editing, transcription, security, and pricing.

🚀

Getting Started

Open SpokenAudio and tap the Studio tab at the bottom. Tap the large red dome button to begin recording — the VU meters will light up to show your microphone level. Speak clearly and tap the dome again to pause. When you're finished, tap the Stop button (square) and choose Save. Your recording appears in the Files tab.

Tip: Speak at a normal conversational pace. SpokenAudio's transcription is tuned for clear speech — no need to slow down artificially.

SpokenAudio requires three permissions:

  • Microphone — required to record your voice
  • Speech Recognition — required for live transcription (on-device only, no audio transmitted)
  • Face ID / Touch ID — optional, only if you enable the lock screen in Settings

If you denied a permission during setup, go to iPhone Settings → Privacy & Security → enable each permission for SpokenAudio.

Studio is where you record, edit, and listen. It contains the jog wheel, VU meters, OVR/INS toggle, and live transcript.

Files shows all your saved recordings. Tap a row to open it in Studio for playback or editing. Swipe left to rename or delete. Swipe right to export.

Transcripts shows all generated transcripts. These are stored separately from audio — you can delete audio and keep the transcript as a permanent text record.

Tap the Files tab. Your recordings appear in reverse chronological order (newest first). Tap any row to open it in Studio. Use the search bar at the top of the Files tab to search by name.

🎙

Recording

Yes. Tap the dome button to pause — the VU meters stop and the jog wheel becomes active so you can scrub through what you've recorded. Tap the dome again to resume recording from the pause point. You can pause and resume as many times as you need in a single session.

With Pocket, there is no limit — record for as long as you need. The only practical limit is your iPhone's available storage. SpokenAudio displays a storage usage indicator in the Files tab.

SpokenAudio records at 16kHz mono by default — optimised for voice clarity and transcription accuracy. Recordings are exported as standard WAV files compatible with all audio software. High-quality audio (44.1kHz / 48kHz stereo) is planned for the Suite tier in a future update.

Currently SpokenAudio records in mono at 16kHz. Stereo recording is a planned feature for the Suite tier. For voice dictation, mono at 16kHz delivers excellent clarity and keeps file sizes small without any loss in intelligibility.

Yes. iOS automatically routes to the best available microphone. When AirPods or EarPods are connected, SpokenAudio uses the earpiece microphone — which is significantly better at isolating your voice in noisy environments like a car or busy clinic. This is the recommended setup for dictating in loud environments.

✂️

OVR & INS Editing (Patent Pending)

OVR mode lets you re-record over a mistake without re-recording your entire dictation. Here's how it works:

  • Pause recording
  • Set the toggle to OVR
  • Spin the jog wheel left to rewind to just before your mistake
  • Tap the dome to record — your new audio replaces the old from that point forward
  • Your transcript updates automatically in real time as you speak
  • Tap pause when you've corrected the mistake

Everything before the cursor position is completely untouched. Both the audio and the transcript are corrected simultaneously — no manual text editing needed. This is the fastest way to fix a single word or sentence.

INS mode splices new audio into your recording without removing anything. Position the playhead at the point where you want to add content, then record. The new audio is inserted at the cursor and everything after it shifts forward in time — and the transcript updates live as you speak, inserting new text at the correct position. Use INS when you forgot to mention something and need to add a sentence in the middle of a recording.

Example: You said "The patient is prescribed lisinopril" but forgot to mention the dosage. Use INS to add "10mg daily" immediately after "lisinopril" — the new words appear in your transcript in real time, right where they belong, without touching the rest of the recording.

SpokenAudio's patent-pending live speech-to-text synchronization means your transcript stays perfectly aligned with your audio during OVR and INS edits. When you overwrite a section, the old transcript text is removed and replaced with new text as you speak. When you insert, new text appears at the insertion point and existing text shifts forward — all in real time. No other mobile dictation app offers this. The audio, transcript, and modification record all update simultaneously.

Yes. SpokenAudio supports up to 100 undo levels with Pocket. Tap the undo button (circular arrow) in the Studio view to step back one edit at a time. Each OVR and INS edit is tracked in the modification record, which logs a timestamp for every change made to a recording.

Yes, but only while the recording is paused — the toggle is locked during active recording to prevent accidental mode switches. Pause, change the mode, then position the playhead and resume. You can alternate between OVR and INS as many times as needed within a single session.

The Modification Record is a timestamped log of every OVR and INS edit made to a recording — including what was changed and when. Tap the undo/pencil button in the Studio toolbar to view it. This is useful for audit purposes in clinical and legal settings where a chain of evidence is important.

🎛

Jog Wheel & Seek Audio

The jog wheel is SpokenAudio's transport control — the dial in the center of the Studio view. It becomes active when a recording is paused or loaded. Drag your finger clockwise to move forward in the recording, counter-clockwise to rewind. The speed of your drag controls how fast you seek — slow for precision, fast for long jumps.

Hold and tap the REW button (left) to jump instantly to the very beginning. Hold and tap the FF button (right) to jump to the end.

When you scrub through a recording using the jog wheel, Oxide mode plays your own voice back at the seek speed — like a real cassette dictaphone rewinding. This gives you an immediate audio cue for where you are in the recording without having to watch the timer. The faster you spin, the faster your voice plays.

You can switch to Electro mode (a clean electronic sweep tone) or turn seek audio off entirely in Settings → Feedback.

Yes. Go to Settings → Feedback → Seek audio feedback. You have three options:

  • Oxide — your recorded voice played at seek speed (default)
  • Electro — a clean electronic sweep tone
  • Off — silent seek
📝

Transcription

No. SpokenAudio uses Apple's on-device Speech Recognition framework which runs entirely on your iPhone — no internet connection required, no audio transmitted to any server. Your voice never leaves your device during transcription. This is a core design requirement for HIPAA-conscious workflows.

Transcription accuracy depends on your speech clarity, background noise, and microphone quality. Apple's on-device recognition performs very well for clear, conversational speech in quiet environments. Accuracy decreases in loud environments (cars, busy offices). For best results:

  • Use AirPods or EarPods in noisy environments
  • Speak at a natural pace — don't slow down artificially
  • Enunciate medical terms clearly — the recognizer handles common medical vocabulary well
  • Use the word correction feature to fix any errors immediately after recording

Yes. With Pocket, tap any word in the transcript to correct it. A correction panel appears where you can type the correct text. Corrections are logged with a timestamp in the modification record. Low-confidence words are highlighted in the transcript so you can spot potential errors quickly.

Yes. Open any recording from the Files tab. Tap the transcript icon (document icon) in the toolbar. SpokenAudio will process the recording and generate a transcript. This works for recordings made at any time — the transcript is generated on-demand, not just during live recording.

Yes — and this is by design. In the Files tab, swipe left on a recording and choose "Delete Audio Only." The encrypted audio file is removed from your device but the transcript remains permanently in the Transcripts tab. This is useful when you need a text record but don't want to retain the voice recording itself.

📤

Files & Export

With Pocket, SpokenAudio exports three formats:

  • WAV — uncompressed audio, compatible with all audio software and transcription platforms
  • TXT — plain text transcript
  • SRT — timestamped subtitle format, compatible with video editors, court reporting tools, and medical transcription software

MP3, M4A, and DOCX (with track changes) export are planned for the Suite tier in a future update.

In the Files tab, swipe right on any recording row. An Export button appears. Tap it to open the export picker where you can choose WAV, TXT, or SRT (or any combination). Tap Share to open iOS's standard share sheet — send to Mail, Files, AirDrop, or any connected app.

In the Files tab, swipe left on the recording row and tap Rename. Enter the new name and confirm. Recordings are named automatically by date and time when saved — rename them to match your filing system (e.g. patient initials, case number, date of service).

Swipe left on the recording row in the Files tab. You'll see options based on what the recording contains:

  • Delete Audio Only — removes the audio file, keeps the transcript
  • Delete Everything — removes both audio and transcript permanently

Tap the info icon (ⓘ) on any recording for full details and a "Delete Everything" option. Deletion is permanent — there is no recycle bin.

🔐

Security & Privacy

Every recording and transcript is encrypted using AES-256-GCM authenticated encryption before it is written to your device's storage. AES-256-GCM is the same standard used by governments, banks, and Apple itself. Encryption keys are generated per-device and stored exclusively in the iOS Secure Enclave — a dedicated hardware chip that is isolated from the main processor. Neither Create Simple Code LLC nor any third party has access to these keys.

"Encrypted at rest" means your recordings and transcripts are encrypted whenever they are stored on your device. The raw files are unreadable without the decryption key, which is held in the iOS Secure Enclave. When you open a transcript to read it, SpokenAudio decrypts it temporarily in memory for display — but the file on disk always remains encrypted. If someone accessed your device's storage directly, they would see only encrypted data. This is a key requirement for HIPAA compliance and protects your recordings even if the device is lost or stolen.

Never. SpokenAudio has no cloud component. Recordings, transcripts, and preferences are stored exclusively on your device. No data is transmitted to Create Simple Code LLC or any third-party server. SpokenAudio includes no analytics SDKs, no crash reporting, and no advertising. There is no account or login system — nothing to sync.

Enable biometric lock in Settings → Security → Require Face ID (or Touch ID). When active, SpokenAudio locks when you leave the app and requires Face ID or Touch ID to reopen. SpokenAudio does not access, process, or store any biometric data — authentication is handled entirely by iOS. If Face ID fails, you can unlock with your device passcode.

All recordings are encrypted with keys in the Secure Enclave. Without your device passcode and biometric authentication, the encrypted data is inaccessible. For additional security, use Settings → Secure Deletion in SpokenAudio before reporting the device lost — this destroys all encryption keys, making the data permanently unreadable even with physical access to the device.

You can also remotely erase your device via iCloud's Find My feature, which removes all local data including SpokenAudio's encrypted files.

Secure Deletion (Settings → Secure Deletion) destroys all encryption keys stored in the Secure Enclave. Because the encrypted data can only be decrypted with those keys, destroying the keys makes all recordings permanently and irrecoverably unreadable — even with physical access to the storage chips. This is stronger than simply deleting files, which may leave recoverable data on flash storage.

💳

Trial & Pricing

Download SpokenAudio free from the App Store. Full Pocket access starts immediately — no payment information required, no credit card, no signup. You have 7 days to use every feature. After 7 days, the app shows a one-time purchase option for $14.99 to keep access permanently. Your recordings and transcripts are preserved regardless.

No. SpokenAudio Pocket is a one-time purchase of $14.99. No monthly fees. No annual renewal. No recurring charges of any kind. Buy once, own forever. A Suite tier with additional professional features is planned as an optional subscription in a future update — but Pocket will always remain a one-time purchase.

Your purchase is tied to your Apple ID. On any new iPhone, download SpokenAudio and go to Settings → Restore Purchase. SpokenAudio will verify your purchase with Apple and unlock Pocket at no additional charge. You can use the same purchase on all iPhones signed into the same Apple ID.

All your recordings and transcripts are preserved on your device — they are not deleted when the trial expires. You will see a lock screen prompting you to purchase Pocket to regain access. Once you purchase, everything is immediately accessible again.

The SpokenAudio roadmap includes:

  • Suite tier — high-quality audio (44.1/48kHz), MP3 and M4A export, DOCX with track changes, configurable auto-lock
  • Noise isolation — optional voice processing toggle for dictating in noisy environments
  • Enterprise tier — multilingual transcription, EHR/EMR integration, team management, audit log
🏥

HIPAA & Clinical Use

SpokenAudio's architecture meets the technical safeguard requirements of the HIPAA Security Rule — AES-256-GCM encryption, Secure Enclave key storage, no cloud transmission, no third-party SDKs. However, Create Simple Code LLC is a software developer, not a covered entity or business associate under HIPAA. Whether your specific use of SpokenAudio satisfies all HIPAA requirements for your organization depends on your workflows, policies, and compliance obligations. Consult your compliance officer or legal counsel before using SpokenAudio for PHI.

A BAA is not currently available. Because SpokenAudio does not receive, process, or store any data on behalf of covered entities (all data stays on your device), a BAA may not be required for your use case. Consult your HIPAA compliance officer. A BAA is planned as part of the future Enterprise tier for organizational deployments.

SpokenAudio is used by healthcare professionals for clinical dictation. The on-device architecture, AES-256 encryption, and zero-transmission design make it well-suited for sensitive voice notes. That said, compliance with HIPAA and your institution's policies is your responsibility. We recommend reviewing your organization's mobile device and software policies before adopting SpokenAudio for patient care workflows.

No. SpokenAudio contains no analytics, crash reporting, or advertising SDKs. In-app purchases are processed by Apple's StoreKit — Apple handles only the payment transaction and does not receive any recording data. Speech recognition uses Apple's on-device framework which, per Apple's documentation, does not transmit audio when running in on-device mode on iOS 17 and later.

🔧

Troubleshooting

Most likely SpokenAudio does not have microphone permission. Go to iPhone Settings → Privacy & Security → Microphone → toggle SpokenAudio on. Return to the app and try again. If the button still does nothing, force-quit SpokenAudio (swipe up from the home bar and swipe the app away) and reopen it.

Check Speech Recognition permission: iPhone Settings → Privacy & Security → Speech Recognition → enable SpokenAudio. Also confirm you are speaking clearly and close to the microphone. Speech recognition initialises within the first second or two of recording — there may be a brief delay before the first words appear.

Oxide seek audio only plays when a recording is loaded in Studio. If you are in a fresh (empty) session, there is no audio to play back. Open a saved recording from the Files tab, then try the jog wheel. Also check Settings → Feedback → Seek audio feedback is set to Oxide and not Off.

In-app purchases require an active App Store account and internet connection. Check that you are signed into the App Store (iPhone Settings → App Store). If you have already purchased, tap Restore Purchase instead — this will verify your purchase with Apple and unlock Pocket immediately at no charge.

If the lock screen appears unexpectedly, biometric lock may have been enabled in Settings. Go to Settings → Security → Require Face ID and toggle it off. If you cannot get past the lock screen, use your device passcode when prompted after a failed Face ID attempt.

Pull down on the Files list to refresh it. If the recording still doesn't appear, confirm the save completed — if SpokenAudio was force-quit during the save dialog the recording may not have been written. Also check that you're not filtering the list (clear any active search text at the top of the Files tab).

Still have a question?

SpokenAudio is built by a small, dedicated team. Please allow up to 5 business days for a response.

Contact Support Email Us