Voice Dictation

Voice Dictation lets you speak into your microphone and have the audio automatically transcribed by Gemini AI. The transcribed text is inserted directly into the current Cursor/VS Code chat input, GitHub issue description, or issue comment — no manual copying required.

Requirements

Windows — built-in winmm.dll (nothing to install), or
Linux — ffmpeg on PATH with a default PulseAudio/PipeWire or ALSA input device, or
macOS — ffmpeg on PATH (AVFoundation default microphone) and microphone permission for your IDE
Gemini API key configured in settings (devManager.gemini.apiKey)

Get a free key at aistudio.google.com.

Status Bar Button

A microphone button 🎙 appears in the status bar at the bottom of the IDE when recording is supported on your OS (Windows, or Linux/macOS with ffmpeg installed and a working default microphone).

Click 🎙 to start recording. The icon turns red ⏹.
Speak your message.
Click the red ⏹ to stop. The icon shows a spinner while Gemini transcribes.
The transcribed text is pasted automatically into the current chat input.

The chat panel is focused automatically — you can keep dictating into the same conversation as many times as you want without creating a new chat.

Keyboard shortcuts

Shortcut	Action
`Ctrl+Alt+H`	Toggle recording (start / stop)
`Ctrl+Alt+Shift+H`	Cancel recording

Both shortcuts are bound with when: "!terminalFocus" in Keyboard Shortcuts (search SnakeFlow to inspect or change them), so they do not fire while the integrated terminal has focus (avoid stealing keys from the shell).

Voice Dictation in GitHub Issues

Microphone buttons are also available directly in the New Issue and Edit Issue forms.

Description field

Click 🎙 next to the Description label → speak → click again to stop. The transcribed text is appended to the description textarea.

Comment field (Edit Issue only)

Click 🎙 next to Add comment → speak → stop. Text is appended to the comment textarea.

Language Setting

By default, Gemini auto-detects the spoken language. To force a specific language:

"devManager.gemini.dictationLanguage": "uk"

Value	Language
`auto`	Auto-detect (default)
`uk`	Ukrainian
`en`	English
`de`	German
`fr`	French
`es`	Spanish
`pl`	Polish
`it`	Italian
`pt`	Portuguese
`nl`	Dutch
`cs`	Czech
`ja`	Japanese
`zh`	Chinese
`ko`	Korean

Cost

Voice dictation uses the same Gemini API key as title generation. Pricing for Gemini 2.5 Flash:

Duration	Cost
30 seconds	~$0.00009
5 minutes	~$0.0009
1 hour of dictation	~$0.011

At typical usage (a few minutes per day), the cost is well under $0.05/month.

Settings Reference

Setting	Default	Description
`devManager.gemini.apiKey`	`""`	Gemini API key (required)
`devManager.gemini.model`	`"gemini-2.5-flash"`	Gemini model used for transcription
`devManager.gemini.dictationLanguage`	`"auto"`	Spoken language hint for transcription