Live Transcription

UniFlow's Live Transcription captures audio in real time and converts it to text using Deepgram's Nova-2 speech recognition engine. Whether you're in a lecture hall, a Zoom call, or a study group, every word is transcribed, timestamped, and automatically indexed into UniMind's knowledge graph — making every lecture searchable and citable.

Plan requirement: Live Transcription is available on Starter, Pro, and Max plans. Free users can purchase Transcription Time Packs for on-demand access.

Getting Started

Click Live Transcription in the sidebar (or the microphone icon in the title bar)
Select the course this session belongs to (optional but recommended for AI context)
Click Start Recording
Grant microphone permission when prompted
Begin speaking or playing audio — text appears in real time

How It Works

Audio Pipeline

Microphone capture — UniFlow uses the Web Audio API with an AudioWorklet processor for low-latency, high-fidelity audio capture at 24 kHz
Noise processing — Configurable input gain, noise suppression, echo cancellation, and auto gain control
Streaming to Deepgram — Raw PCM audio is streamed over WebSocket to Deepgram's Nova-2 model
Real-time results — Interim (partial) results appear instantly; final results are confirmed when each sentence completes
Utterance detection — Deepgram's UtteranceEnd event detects natural pauses between speakers or topics

Transcription Display

The transcription notebook shows text in a clean, scrollable interface:

Interim text appears in a lighter style as you speak, updating in real time
Finalized text is locked in once Deepgram confirms the sentence
Automatic segmentation groups sentences into paragraphs (every 5 sentences by default)
Timestamps are recorded for every segment

Key Features

Bilingual Support

UniFlow supports transcription in multiple languages:

English (default) — Optimized for academic lectures
Chinese — Full Mandarin support
Automatic language detection based on your settings

Microphone Sensitivity Profiles

Choose the right profile for your environment:

Profile	Best For	Settings
Standard	Quiet classrooms, office	Moderate gain, noise suppression on
Sensitive	Large lecture halls, distant speaker	Higher gain, noise suppression off for clarity
Noisy	Cafés, outdoor, group study	Balanced gain, full noise suppression and echo cancellation

Configure this in Settings > General > Microphone Sensitivity.

Recording Controls

Pause / Resume — Temporarily pause without ending the session
Stop — End the recording and finalize the transcript
Auto-recovery — If the microphone stream is lost (e.g., device disconnected), UniFlow automatically attempts to reconnect

Session Management

Each transcription is saved as a session with:

Session title (editable)
Associated course
Creation date and duration
Full transcript with timestamps
Recording status (recording, paused, completed)

You can browse all sessions in the Transcription Library, filter by course, and search across transcripts.

Classroom Q&A Mode

One of UniFlow's most powerful features is the Classroom Q&A Mode, accessible via the Mini Chat panel during a live transcription session.

What It Does

When enabled, the AI reads your live transcription in real time and helps you:

Identify the professor's latest question from the transcript
Generate a ready-to-speak answer in bilingual format (English + Chinese)
Prepare classroom contributions with the right tone — confident but not overly formal

How to Use It

Start a transcription session
Click the chat icon to open the Mini Chat drawer
Toggle Classroom Q&A Mode on
Use quick actions like:
- "Help me answer" — Finds the latest question and generates a response
- "Class participation" — Prepares a discussion contribution
- "Add a point" — Builds on the current discussion
- "Summarize discussion" — Summarizes key points with your position
- "Ask a question" — Prepares a thoughtful question to ask
- "Respond to classmate" — Crafts a response to another student's point

Answer Format

All Classroom Q&A responses follow a strict bilingual format:

**English:**
I think the main factor here is... because...

**中文：**
我认为这里的主要因素是……因为……

The tone is calibrated to sound like a well-prepared student speaking in class — clear, structured, and natural.

Normal Mode

When Classroom Q&A Mode is off, the Mini Chat works as a general AI assistant with access to the live transcript. You can ask:

"What's being discussed right now?"
"Are there any tasks assigned?"
"Explain that last part"
"Summarize this lecture"
"Highlight the key points"
"Generate study notes"

Cloud Sync

Transcription sessions are automatically synced to the cloud:

Access your transcripts on any device
Transcribed content is indexed into UniMind's knowledge graph
Every word from every lecture becomes searchable and citable when you use UniMind

Transcription Time & Billing

Plan	Included Time	Additional Time
Free	None (purchase Time Packs)	Available via Time Packs
Starter	Included monthly quota	Purchase Time Packs for more
Pro	Larger monthly quota	Purchase Time Packs for more
Max	Generous monthly quota	Purchase Time Packs for more

Transcription Time Packs

Time Packs are one-time purchases that add extra high-precision transcription time to your account:

Work with any plan (including Free)
Valid for 180 days from purchase
Stack with your subscription's included time
Purchase from the Pricing page

Tips

Associate sessions with courses — This enables UniMind to use lecture content when answering questions about that course
Use the Sensitive profile in large halls — It captures distant audio better by reducing noise suppression
Keep the app in focus — Some browsers throttle background audio processing; keeping UniFlow in the foreground ensures the best capture quality
Review and edit after class — Transcripts can be edited for accuracy before they're indexed

Troubleshooting

Microphone not detected

Check that your microphone is connected and not muted
Ensure UniFlow has microphone permission in your system settings
Try selecting a different audio input device in your system preferences
Restart the app

Transcription quality is poor

Switch to the Sensitive microphone profile for distant speakers
Ensure you're in a relatively quiet environment
Check your internet connection — Deepgram requires a stable WebSocket connection
Try moving closer to the audio source

Connection drops during recording

UniFlow includes automatic reconnection logic. If the Deepgram WebSocket disconnects:

Audio chunks are buffered locally (up to 100 chunks)
The client attempts to reconnect automatically
Buffered audio is replayed once the connection is restored

If reconnection fails, you'll see an error indicator. Try stopping and restarting the session.

What's Next

UniMind Engine — Ask AI questions using your lecture transcripts as context
Canvas Sync — Link transcription sessions to specific courses
Document Editor — Export transcripts or lecture notes to a document
Keyboard Shortcuts — Quick access to transcription controls