Transcriber
On-device speech intelligence for Android. Record, transcribe, translate, and clean up without leaving the phone.
From mic to clean transcript, in one pass.
Two engines, one app. Switch per recording.
- Size
- 2.59 GB on disk · 676 MB GPU footprint
- Best for
- Dialectal Arabic, code-switching, Ukrainian
- Strengths
- Holds Gulf Arabic without collapsing to MSA. Streaming decode keeps peak heap ~6 MB regardless of file length.
- Models
- tiny (75 MB) · small (466 MB) · large-v3-turbo q5_0 (574 MB)
- Best for
- Predictable latency · live transcription on weak devices
- Strengths
- Faster cold start. Pairs with sherpa-onnx for stable speaker IDs across hour-long meetings.
Including the ones that usually drift to English.
Multi-select picker. Empty = full auto-detect. One = forced. Multiple = constrained auto — faster and more accurate than letting Gemma consider 100+ languages.
Three ways to figure out who said what.
Speaker N: labels emitted inline by Gemma 4. No extra download.Bonus: when someone says "Hi, I'm Ahmed" in a segment, the detected name propagates to every segment with the same speaker key.
Four presets that read everything first.
Output is rendered Markdown · Share + Delete per output · all four prompts editable in Settings.
Teach it your jargon. Then forget you did.
atrial fibrillation
ICD-10
MeSH · RxNorm
OpenTelemetry
CNCF
ACM CCS
ASTM · NEN
QCS · ДБН
materials, codes
certiorari
Cornell LII Wex
rechtspraak.nl
SEC · BIS · AFM
NBU · QFMA
instruments, regs
Designed for the moment the phone tries to kill the job.
The compute knobs that actually matter.
Find recordings by what was said in them.
Case-insensitive. Auto-titled recordings get real names — Q3 Planning with Engineering Team instead of Recording_2026-05-17_14-23-30.
A complete speech stack — yours, on the phone, under Apache-2.0.
*.bin / *.litertlm import.