Overview

Krisp’s VIVA SDK provides three capabilities for Pipecat applications:
  • Voice Isolation — Filter out background noise and voices from the user’s audio input stream, yielding clearer audio for fewer false interruptions and better transcription.
  • Turn Detection — Determine when a user has finished speaking using Krisp’s streaming turn detection model, as an alternative to the Smart Turn model.
  • Voice Activity Detection — Detect speech in audio streams using Krisp’s VAD model, supporting sample rates from 8kHz to 48kHz.
You can use any combination of these features together.

  • KrispVivaFilter Reference: API reference for voice isolation
  • KrispVivaTurn Reference: API reference for turn detection
  • KrispVivaVadAnalyzer Reference: API reference for voice activity detection
  • Krisp VIVA Example: Complete example with Krisp features
  • Krisp Developers: Get the Krisp SDK and API key

Prerequisites

To complete this setup, you will need a Krisp developer account, from which you can download the Python SDK and models and generate an API key.
Get started on the Krisp developers website.

Setup

Download the Python SDK and Models

  1. Log in to the Krisp developer portal
  2. Navigate to the Server SDK Version Tab
  3. Find the latest version of the Python SDK:
    • Download the SDK
    • Download the Voice Isolation models (for voice isolation)
    • Download the Turn Detection models (for turn detection)

Install the Python wheel file

  1. First, unzip the SDK files you downloaded in the previous step. In the unzipped folder, you will find a dist folder containing the Python wheel file you will need to install.
  2. Install the Python wheel file that matches your platform. For example, on macOS ARM64 with Python 3.12 you would install:
    uv pip install /PATH_TO_DOWNLOADED_SDK/krisp-viva-uar-python-sdk-1.8.0/dist/krisp_audio-1.8.0-cp312-cp312-macosx_12_0_arm64.whl
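Each wheel filename encodes an interpreter tag and a platform tag. If you're unsure which wheel matches your machine, a quick standard-library check (not part of the Krisp SDK) prints the identifiers to look for:

```python
# Print the interpreter tag (e.g. cp312) and platform details that
# appear in wheel filenames, to help pick the matching wheel.
import platform
import sys

py_tag = f"cp{sys.version_info.major}{sys.version_info.minor}"
print(f"Python tag: {py_tag}")
print(f"OS: {platform.system()}, architecture: {platform.machine()}")
```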
    

Generate an API key

  1. In the Krisp developer portal, generate an API key for your application.
An API key (KRISP_VIVA_API_KEY) is required for Krisp SDK v1.6.1 and later; older SDK versions do not require one.

Set up environment variables

  1. Unzip the models you downloaded in the first step.
  2. For voice isolation, choose a model:
    • krisp-viva-pro: Mobile, Desktop, Browser (WebRTC, up to 32kHz)
    • krisp-viva-tel: Telephony, Cellular, Landline, Mobile, Desktop, Browser (up to 16kHz)
    Note: the full model filename includes a version suffix, for example krisp-viva-tel-v2.kef.
  3. In your .env file, add the environment variables for the features you’re using:
# Krisp SDK API key (required for SDK v1.6.1+)
KRISP_VIVA_API_KEY=your_api_key_here

# Voice isolation model path
KRISP_VIVA_FILTER_MODEL_PATH=/PATH_TO_UNZIPPED_MODELS/krisp-viva-tel-v2.kef

# Turn detection model path
KRISP_VIVA_TURN_MODEL_PATH=/PATH_TO_UNZIPPED_MODELS/krisp-viva-tt-v2.kef

# Voice activity detection model path (optional)
KRISP_VIVA_VAD_MODEL_PATH=/PATH_TO_UNZIPPED_MODELS/krisp-viva-vad-v2.kef
Each feature uses a different model. Set KRISP_VIVA_FILTER_MODEL_PATH for voice isolation, KRISP_VIVA_TURN_MODEL_PATH for turn detection, and KRISP_VIVA_VAD_MODEL_PATH for voice activity detection.
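Before launching a bot, you can confirm the variables resolve to real files. The sketch below is illustrative only; the check_krisp_env helper is hypothetical and not part of Pipecat or the Krisp SDK:

```python
# Sanity-check sketch: verify the Krisp environment variables are set
# and that each model path points to an existing file.
import os

MODEL_VARS = [
    "KRISP_VIVA_FILTER_MODEL_PATH",  # voice isolation
    "KRISP_VIVA_TURN_MODEL_PATH",    # turn detection
    "KRISP_VIVA_VAD_MODEL_PATH",     # voice activity detection
]

def check_krisp_env() -> list[str]:
    """Return a list of problems; an empty list means the env looks usable."""
    problems = []
    if not os.getenv("KRISP_VIVA_API_KEY"):
        problems.append("KRISP_VIVA_API_KEY is not set (required for SDK v1.6.1+)")
    for name in MODEL_VARS:
        path = os.getenv(name)
        if path and not os.path.isfile(path):
            problems.append(f"{name} points to a missing file: {path}")
    return problems

if __name__ == "__main__":
    for problem in check_krisp_env():
        print(problem)
```

Only the model variables for the features you actually use need to be set; unset model paths are skipped.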

Test the integration

You’re ready to test the integration! Try running the Krisp VIVA foundation example, which demonstrates both voice isolation and turn detection together.
Learn how to run foundational examples in Pipecat.

Voice Isolation

KrispVivaFilter isolates the user’s voice by filtering out background noise and other voices in real-time audio streams. Add it to any transport via the audio_in_filter parameter.
from pipecat.audio.filters.krisp_viva_filter import KrispVivaFilter
from pipecat.transports.base_transport import TransportParams
from pipecat.transports.smallwebrtc.transport import SmallWebRTCTransport

transport = SmallWebRTCTransport(
    webrtc_connection=webrtc_connection,
    params=TransportParams(
        audio_in_enabled=True,
        audio_in_filter=KrispVivaFilter(),  # Enable Krisp voice isolation
        audio_out_enabled=True,
    ),
)
See the KrispVivaFilter reference for configuration options.

Turn Detection

KrispVivaTurn uses Krisp’s streaming turn detection model to determine when a user has finished speaking. Unlike the Smart Turn model, which analyzes audio in batches, KrispVivaTurn processes each audio frame in real time. Configure it as a user turn stop strategy:
from pipecat.audio.turn.krisp_viva_turn import KrispVivaTurn
from pipecat.audio.vad.silero import SileroVADAnalyzer
from pipecat.processors.aggregators.llm_response_universal import (
    LLMContextAggregatorPair,
    LLMUserAggregatorParams,
)
from pipecat.turns.user_stop import TurnAnalyzerUserTurnStopStrategy
from pipecat.turns.user_turn_strategies import UserTurnStrategies

user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
    context,
    user_params=LLMUserAggregatorParams(
        user_turn_strategies=UserTurnStrategies(
            stop=[TurnAnalyzerUserTurnStopStrategy(
                turn_analyzer=KrispVivaTurn()
            )]
        ),
        vad_analyzer=SileroVADAnalyzer(),
    ),
)
See the KrispVivaTurn reference for configuration options.

Voice Activity Detection

KrispVivaVadAnalyzer detects speech in audio streams using Krisp’s VAD model. It supports sample rates from 8kHz to 48kHz, making it suitable for a wide range of applications including telephony and high-quality audio. Configure it as a VAD analyzer:
from pipecat.audio.vad.krisp_viva_vad import KrispVivaVadAnalyzer
from pipecat.audio.vad.vad_analyzer import VADParams
from pipecat.processors.aggregators.llm_response_universal import (
    LLMContextAggregatorPair,
    LLMUserAggregatorParams,
)

user_aggregator, assistant_aggregator = LLMContextAggregatorPair(
    context,
    user_params=LLMUserAggregatorParams(
        vad_analyzer=KrispVivaVadAnalyzer(params=VADParams(stop_secs=0.2)),
    ),
)
See the KrispVivaVadAnalyzer reference for configuration options.