Audio Features - SystemSculpt Documentation

Turn any audio into searchable, editable text within Obsidian

Transcribe in 30 seconds

1. Drag audio file into chat or note
2. Click "Transcribe" when prompted
3. Get text with timestamps, speakers, and formatting

Works with recordings, meetings, lectures, podcasts - anything with speech!

What audio transcription does for you

Meeting notes: Record → Transcribe → AI extracts action items
Lecture capture: Never miss important points while note-taking
Interview documentation: Full transcripts with speaker labels
Voice journaling: Speak thoughts, get formatted notes
Content creation: Draft by speaking, polish with AI

Quick start

Record & transcribe

1. Click 🎙️ in ribbon (or Cmd/Ctrl + R)
2. Speak your thoughts
3. Stop recording
4. Transcription starts automatically (a 5-minute recording takes about 30 seconds)
5. Edit or save as note

Transcribe existing files

1. Drag audio file to chat/note
2. Select "Transcribe"
3. Choose format:
   - Standard (paragraphs)
   - Timestamped (SRT)
4. Use transcript immediately

Supported formats

Primary formats

Format	Extension	Best for	Quality
M4A	.m4a	Apple devices, voice	⭐⭐⭐⭐⭐
MP3	.mp3	Universal, podcasts	⭐⭐⭐⭐
WAV	.wav	Pro recording	⭐⭐⭐⭐⭐
WebM	.webm	Browser recording	⭐⭐⭐⭐
OGG	.ogg	Open source	⭐⭐⭐⭐

Recording sources

Works with:

Phone recordings (Voice Memos, etc.)
Zoom/Teams/Meet recordings
Podcast files
WhatsApp/Telegram voice notes
Professional recordings

File limits:

Max size: 200MB
Max length: 2 hours
Larger files: Split first
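
The limits above lend themselves to a quick pre-flight check before uploading. What follows is a minimal sketch, not SystemSculpt's own validation: the function names are hypothetical, "200MB" is assumed to mean 200 × 1024 × 1024 bytes, and the recording's duration is assumed to be known already (for example, from a media player).

```python
import math

MAX_SIZE_BYTES = 200 * 1024 * 1024   # assumption: "200MB" read as binary megabytes
MAX_LENGTH_SECONDS = 2 * 60 * 60     # 2 hours, from the limits above


def check_limits(size_bytes: int, duration_seconds: float) -> list[str]:
    """Return a list of problems; an empty list means the file fits the limits."""
    problems = []
    if size_bytes > MAX_SIZE_BYTES:
        problems.append("file is larger than 200MB")
    if duration_seconds > MAX_LENGTH_SECONDS:
        problems.append("recording is longer than 2 hours")
    return problems


def plan_segments(duration_seconds: float,
                  max_segment_seconds: float = MAX_LENGTH_SECONDS):
    """Split a duration into equal (start, end) segments that each fit the length limit."""
    count = max(1, math.ceil(duration_seconds / max_segment_seconds))
    step = duration_seconds / count
    return [(i * step, (i + 1) * step) for i in range(count)]
```

Under these assumptions, a three-hour recording is planned as two 90-minute segments. The cutting itself would still be done in an audio editor, and a segment that fits the length limit may still need a size check.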

Transcription options

Output formats

Standard transcription

Clean paragraphs with proper punctuation.
Perfect for notes and documentation.

Timestamped (SRT)

00:00:00 --> 00:00:05
Welcome everyone to today's meeting.

00:00:05 --> 00:00:12  
Let's start with our quarterly review.
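
A timestamped transcript in this shape is easy to process further, for example to jump to a moment in the recording. The sketch below parses blocks of the form shown above (a `HH:MM:SS --> HH:MM:SS` line followed by text). As an assumption, it also tolerates the numeric index lines and `,mmm` milliseconds used by standard SRT files. The function name is hypothetical.

```python
import re

# Matches "HH:MM:SS --> HH:MM:SS", optionally with ",mmm" milliseconds as in standard SRT.
TIME_LINE = re.compile(
    r"^(\d{2}):(\d{2}):(\d{2})(?:[,.](\d{3}))?\s*-->\s*"
    r"(\d{2}):(\d{2}):(\d{2})(?:[,.](\d{3}))?\s*$"
)


def _seconds(h, m, s, ms):
    """Convert captured time fields to seconds."""
    return int(h) * 3600 + int(m) * 60 + int(s) + (int(ms) / 1000 if ms else 0)


def parse_timestamped(text: str):
    """Return a list of (start_seconds, end_seconds, caption) tuples."""
    cues = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        match = TIME_LINE.match(line)
        if match:
            fields = match.groups()
            current = [_seconds(*fields[:4]), _seconds(*fields[4:]), []]
            cues.append(current)
        elif not line:
            current = None  # a blank line ends the current cue
        elif current is not None:
            current[2].append(line)  # caption text; index lines before a cue are ignored
    return [(start, end, " ".join(words)) for start, end, words in cues]
```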

Language support

Auto-detected:

English, Spanish, French, German
Italian, Portuguese, Dutch, Russian
Chinese, Japanese, Korean
50+ languages total

Pro tip: Specify language in settings for faster, more accurate results

Real-world workflows

Meeting workflow

## Team Meeting - {{date}}

Recording: [[meeting-audio.m4a]]

### Transcript
[Drops here after processing]

### AI Summary
- Key decisions...
- Action items...
- Next steps...

### Tasks
- [ ] @John - Review proposal (mentioned at 00:15:23)
- [ ] @Sarah - Send report (mentioned at 00:28:45)
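
The task lines in the template above follow a regular pattern, so they can be collected programmatically, for example to build a follow-up list. This is a sketch that assumes tasks are written exactly as `- [ ] @Owner - Task (mentioned at HH:MM:SS)`. The function name is hypothetical and not part of SystemSculpt.

```python
import re

# Pattern for lines like: - [ ] @John - Review proposal (mentioned at 00:15:23)
TASK_LINE = re.compile(
    r"^- \[(?P<done>[ x])\] @(?P<owner>\w+) - (?P<task>.+?) "
    r"\(mentioned at (?P<h>\d{2}):(?P<m>\d{2}):(?P<s>\d{2})\)$"
)


def extract_tasks(note: str):
    """Return one dict per task line with owner, task text, status, and offset in seconds."""
    tasks = []
    for line in note.splitlines():
        match = TASK_LINE.match(line.strip())
        if match:
            tasks.append({
                "owner": match["owner"],
                "task": match["task"],
                "done": match["done"] == "x",
                "seconds": int(match["h"]) * 3600 + int(match["m"]) * 60 + int(match["s"]),
            })
    return tasks
```

The `seconds` value gives the offset into the recording where the task was mentioned, which makes it easy to jump back and verify the context.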

Lecture notes workflow

1. Record lecture on phone
2. Upload to Obsidian
3. Transcribe with timestamps
4. Ask AI to:
   - Summarize key concepts
   - Create study guide
   - Generate quiz questions
   - Explain complex topics

Interview workflow

1. Record interview (phone/recorder)
2. Get speaker-separated transcript
3. Use AI to:
   - Extract key quotes
   - Identify themes
   - Create article outline
   - Generate summary

Voice journaling workflow

Morning routine:
1. Record thoughts (2-5 minutes)
2. Auto-transcribe
3. AI organizes into:
   - Gratitude items
   - Today's priorities  
   - Ideas to explore
   - Mood tracking

Advanced features

Transcription providers

SystemSculpt API (default)

Included with premium
Optimized models
No extra setup

Custom providers

Settings → Transcription:
- Provider: OpenAI/Groq/Custom
- API Key: [Your key]
- Model: Advanced speech recognition

Quality optimization

Best recording practices:

Quiet environment
Speak clearly
6-12 inches from mic
Minimize background noise

File preparation:

Trim silence
Normalize volume
Use lossless formats when possible
Split very long files
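
To illustrate what "trim silence" and "normalize volume" mean in practice, here is a minimal sketch operating on decoded 16-bit PCM samples held in a plain list. The threshold and target peak are illustrative assumptions, and the function names are hypothetical. In a real workflow an audio editor would usually do this.

```python
def trim_silence(samples: list[int], threshold: int = 500) -> list[int]:
    """Drop leading and trailing samples whose absolute value is below the threshold."""
    start = 0
    end = len(samples)
    while start < end and abs(samples[start]) < threshold:
        start += 1
    while end > start and abs(samples[end - 1]) < threshold:
        end -= 1
    return samples[start:end]


def normalize_peak(samples: list[int], target_peak: int = 29490) -> list[int]:
    """Scale samples so the loudest one reaches target_peak (about 90% of 16-bit full scale)."""
    peak = max((abs(s) for s in samples), default=0)
    if peak == 0:
        return list(samples)  # pure silence: nothing to scale
    gain = target_peak / peak
    # Clamp to the 16-bit range in case target_peak is set above full scale.
    return [max(-32768, min(32767, round(s * gain))) for s in samples]
```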

Batch processing

Multiple files at once:

1. Select all audio files
2. Right-click → "Transcribe with SystemSculpt"
3. Each file produces its own transcript

Perfect for interview series, lecture courses
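
Before starting a batch, it can help to filter a folder listing down to the formats from the "Primary formats" table. The sketch below does only that. The function name is hypothetical, and matching extensions case-insensitively is an assumption.

```python
from pathlib import PurePosixPath

# Extensions taken from the "Primary formats" table.
SUPPORTED_EXTENSIONS = {".m4a", ".mp3", ".wav", ".webm", ".ogg"}


def select_audio_files(paths: list[str]) -> list[str]:
    """Keep only paths whose extension is a supported audio format."""
    return [
        path for path in paths
        if PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS
    ]
```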

Tips & tricks

Speed up workflow

Shortcuts and time-savers:

Cmd/Ctrl + R: Start/stop recording
Drag & drop for instant processing
Create templates for common formats

Smart organization:

Audio/
├── Recordings/
│   └── 2024-01-15-meeting.m4a
├── Transcripts/
│   └── 2024-01-15-meeting.md
└── Summaries/
    └── 2024-01-15-action-items.md
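
The folder layout above can be expressed as a small path-mapping helper. This is a sketch under two assumptions: recording names start with a `YYYY-MM-DD` date, and summaries are named `<date>-action-items.md` as in the tree. The function name is hypothetical.

```python
from pathlib import PurePosixPath


def organized_paths(recording: str, root: str = "Audio") -> dict[str, str]:
    """Map a recording filename to its recording, transcript, and summary paths."""
    name = PurePosixPath(recording).name
    stem = PurePosixPath(name).stem
    date_prefix = stem[:10]  # assumes names start with YYYY-MM-DD, as in the tree above
    base = PurePosixPath(root)
    return {
        "recording": str(base / "Recordings" / name),
        "transcript": str(base / "Transcripts" / f"{stem}.md"),
        "summary": str(base / "Summaries" / f"{date_prefix}-action-items.md"),
    }
```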

Accuracy tips

✅ DO:

Have one person speak at a time
Use a good microphone
Record in quiet space
Process soon after recording

❌ DON'T:

Record in noisy environments
Have multiple people talk over each other
Use extremely compressed files
Expect 100% accuracy (always review)

Integration ideas

With templates:

## Voice Note - {{date:HH:mm}}
![[voice-{{date}}.m4a]]

### Transcript
<!-- Transcription appears here -->

### Key Points
- 

### Next Actions
- [ ]
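
To show how the `{{date}}` and `{{date:HH:mm}}` placeholders in the template above expand, here is a minimal sketch of the substitution. It assumes Moment.js-style format tokens and a default format of `YYYY-MM-DD`, and it supports only a small subset of tokens. In Obsidian the template engine does this for you; the function name here is hypothetical.

```python
import re
from datetime import datetime

# Small subset of Moment.js-style tokens mapped to strftime codes (assumption).
TOKENS = {"YYYY": "%Y", "MM": "%m", "DD": "%d", "HH": "%H", "mm": "%M", "ss": "%S"}
TOKEN_RE = re.compile("|".join(TOKENS))
PLACEHOLDER_RE = re.compile(r"\{\{date(?::([^}]+))?\}\}")


def render_template(template: str, now: datetime,
                    default_format: str = "YYYY-MM-DD") -> str:
    """Replace {{date}} and {{date:FORMAT}} placeholders with the formatted time."""
    def replace(match):
        fmt = match.group(1) or default_format
        # Translate each known token to its strftime equivalent in one pass.
        strftime_format = TOKEN_RE.sub(lambda token: TOKENS[token.group()], fmt)
        return now.strftime(strftime_format)

    return PLACEHOLDER_RE.sub(replace, template)
```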

With AI chat:

1. Transcribe meeting
2. Drop transcript into chat
3. Ask, for example:
   - "Extract all action items with owners"
   - "Create project timeline from discussion"
   - "Identify risks mentioned"

Common issues

Problem	Solution
"Transcription failed"	Check internet, file format, size limits
"Poor accuracy"	Improve audio quality, reduce noise
"Wrong language"	Manually specify in settings
"Takes too long"	Large files need time, try smaller segments

Processing details

How it works

1. Audio uploads securely to SystemSculpt
2. AI models process speech-to-text
3. Text returns formatted
4. Original deleted from servers
5. You get transcript in Obsidian

Privacy & security

Encrypted transmission
No permanent storage
Process completes in minutes
Your data stays yours

Performance

5-min recording: ~30 seconds
30-min recording: ~2 minutes
1-hour recording: ~3-5 minutes
Varies with server load
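
For lengths between the figures above, a rough estimate can be read off by linear interpolation. This is only a sketch: it assumes processing time grows linearly between the listed points, which the documentation does not state, and it ignores server load. The function name is hypothetical.

```python
# (recording minutes, low estimate, high estimate) in minutes of processing time,
# taken from the figures above.
REFERENCE_POINTS = [(5, 0.5, 0.5), (30, 2.0, 2.0), (60, 3.0, 5.0)]


def estimate_processing_minutes(recording_minutes: float) -> tuple[float, float]:
    """Linearly interpolate a (low, high) processing-time estimate in minutes."""
    points = REFERENCE_POINTS
    if recording_minutes <= points[0][0]:
        # Shorter than the first reference point: scale it down proportionally.
        scale = recording_minutes / points[0][0]
        return (points[0][1] * scale, points[0][2] * scale)
    for (x0, lo0, hi0), (x1, lo1, hi1) in zip(points, points[1:]):
        if recording_minutes <= x1:
            t = (recording_minutes - x0) / (x1 - x0)
            return (lo0 + t * (lo1 - lo0), hi0 + t * (hi1 - hi0))
    # Longer than the last reference point: extend the last segment's slope.
    (x0, lo0, hi0), (x1, lo1, hi1) = points[-2], points[-1]
    t = (recording_minutes - x0) / (x1 - x0)
    return (lo0 + t * (lo1 - lo0), hi0 + t * (hi1 - hi0))
```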

Next steps

Document Processing - Extract from PDFs too
Premium Overview - All premium features
Try it now: Record a voice note with Cmd/Ctrl + R!

🎙️ Pro tip: Start with short recordings to test your setup, then tackle longer content!