Overview
KrispVivaVadAnalyzer is a Voice Activity Detection (VAD) analyzer that uses the Krisp VIVA SDK to detect speech in audio streams. It provides high-accuracy speech detection with support for multiple sample rates.
Installation
Prerequisites
You need a Krisp VIVA VAD model file (.kef extension). Set the model path via:
- The
model_pathconstructor parameter, or - The
KRISP_VIVA_VAD_MODEL_PATHenvironment variable
Constructor Parameters
Path to the Krisp model file (
.kef extension). If not provided, uses the
KRISP_VIVA_VAD_MODEL_PATH environment variable.Frame duration in milliseconds. Must be 10, 15, 20, 30, or 32ms.
Audio sample rate in Hz. Must be 8000, 16000, 32000, 44100, or 48000.
Voice Activity Detection parameters object
Usage Example
Technical Details
Sample Rate Requirements
The analyzer supports five sample rates:- 8000 Hz
- 16000 Hz
- 32000 Hz
- 44100 Hz
- 48000 Hz
Model Requirements
- Model files must have a
.kefextension - Model path can be specified via constructor or environment variable
- Model is loaded once during initialization
Notes
- High-accuracy speech detection using Krisp VIVA SDK
- Supports multiple sample rates (8kHz to 48kHz)
- Requires external
.kefmodel file - Thread-safe for pipeline processing
- Automatic session management
- Configurable frame duration