Specifications — Overview

Transcription & Translation

All video meetings and voice calls are automatically transcribed using PanOps' self-hosted transcription engine — no third-party transcription service ever receives your data. Transcripts are translated to the CEO's preferred language and made fully searchable.

← Back to Specifications

Self-Hosted Transcription

PanOps uses a state-of-the-art open-weight transcription model, running on PanOps-operated GPU infrastructure. This is a critical design decision: your audio recordings never leave your secure storage to be processed by a third-party API. The PanOps transcription engine runs inside the same infrastructure boundary as your data, and transcripts are written directly to your tenant's encrypted database.

This approach eliminates an entire class of data exposure risk that exists with cloud transcription services like Rev.ai, Assembly AI, or Deepgram — providers that would receive copies of your executive communications for processing.

What Gets Transcribed

Transcription is applied to all audio-bearing communication: video meetings and voice/SMS platform call recordings.

  • Microsoft Teams video meeting recordings
  • Zoom Meetings cloud recordings
  • Google Meet recordings (via Google Workspace connector)
  • RingCentral voice call recordings
  • Dialpad call recordings
  • Zoom Phone call recordings
  • OpenPhone call recordings

Language & Translation

The PanOps transcription engine is natively multilingual — it can transcribe and translate audio in over 90 languages in a single pass. PanOps uses this capability to support international teams: if employees speak in Spanish, Mandarin, French, or any supported language, their audio is automatically transcribed and translated to the CEO's configured preferred language.

Language preference is configured per customer during onboarding and stored in the customer's configuration record. All transcripts delivered to the AI model are in the CEO's preferred language.

Transcription
90+ Languages
The PanOps transcription engine supports transcription in over 90 languages, with strong performance across major world languages.
Translation
Translated to CEO's Language
Non-English (or non-preferred-language) audio is translated in the same transcription pass. No separate translation service required.

Pipeline Overview

The transcription pipeline is event-driven and fully automated. When a recording is downloaded to customer storage, an S3 event triggers the transcription queue. GPU workers pick up jobs, run the PanOps transcription engine, and write the completed transcript directly to the customer's secure database. The entire process requires no manual intervention.

GPU instances scale to zero when no recordings are queued, keeping compute costs near zero during periods of low or no recording volume. Estimated cost is $50–150/month depending on the volume of meetings and calls.


View full technical specifications →