Does the Hearably SDK require server-side infrastructure?

No. The entire Hearably SDK runs client-side in the browser using Web Audio API and WebGPU. No audio is sent to any server, which means zero server costs, zero latency, and zero privacy concerns.

What browsers does the Hearably SDK support?

The DSP engine works in all modern browsers that support Web Audio API (Chrome, Edge, Firefox, Safari). AI features require WebGPU (Chrome 113+, Edge 113+). The SDK degrades gracefully.

How is API pricing structured?

Three models: per-MAU (pay per monthly active user), flat licensing (unlimited users, fixed fee), and custom enterprise licensing. Contact enterprise@hearably.app for a quote.

Can we white-label the SDK?

Yes. Growth and Enterprise plans include custom branding. The SDK can be white-labeled with your product's styling. Enterprise plans offer source code escrow.

API & SDK

Embed audio intelligence
in your product

Professional-grade DSP engine, AI transcription, and text-to-speech — available as an embeddable JavaScript SDK. Runs entirely client-side via Web Audio API. Zero server load, zero per-minute fees.

Request API Access All Enterprise Solutions

Capabilities

Three engines, one SDK

🎛️

DSP Engine

Professional audio processing pipeline

Volume boost up to 800% with look-ahead limiter
10-band parametric EQ (20Hz - 20kHz)
3-band Linkwitz-Riley crossover (250Hz / 4kHz)
Per-band compression with auto-ratio
Noise reduction (biquad + RNNoise)
Under 10ms processing latency

📝

Transcription Engine

Whisper-powered speech-to-text

90+ languages with auto-detection
Real-time streaming or batch processing
Word-level timestamps
SRT, VTT, and JSON output formats
~150 MB model, cached in browser
Runs on WebGPU (Chrome 113+)

🗣️

Voice Engine

Natural text-to-speech synthesis

Kokoro TTS — near-ElevenLabs quality
7 languages, multiple voice presets
82M parameters, ~160 MB download
24kHz audio output
Streaming or batch generation
Runs entirely client-side

Integration

Simple, powerful API

A few lines of JavaScript to add professional audio enhancement to any web application.

app.js

import { HearablySDK } from '@hearably/sdk';

// Initialize with your API key
const hearably = new HearablySDK({ apiKey: 'hby_...' });

// Enhance any audio element
const enhancer = hearably.enhance(audioElement, {
  volumeBoost: 4.0,       // 400% boost
  eq: 'voice-clarity',     // Preset or custom bands
  noiseReduction: true,   // Enable denoising
  limiter: { ceiling: -1 } // dBFS ceiling
});

// Real-time transcription
const captions = await hearably.transcribe(audioElement, {
  language: 'auto',        // Auto-detect language
  onSegment: (seg) => console.log(seg.text)
});

Architecture

Client-side by design

The entire SDK runs in the browser. No audio is sent to any server. This means zero server costs, zero latency, and zero privacy concerns for your users.

Web Audio API

DSP engine uses native Web Audio API nodes (BiquadFilter, DynamicsCompressor, AudioWorklet) for near-zero latency processing. No WASM overhead for core audio.

WebGPU Inference

AI models (Whisper, Kokoro) run via WebGPU for GPU-accelerated inference. Falls back to WASM on unsupported devices. Models are downloaded once and cached.

Web Workers

All AI inference runs in Web Workers off the main thread. Your UI stays responsive while transcription and TTS process in the background.

Zero Server Load

Since everything runs client-side, your infrastructure costs stay flat regardless of usage. No audio processing servers, no GPU instances, no scaling concerns.

Use Cases

Built for product teams

Video Conferencing Platforms

Embed volume boost, noise reduction, and voice clarity directly into your conferencing app. Give users per-participant audio control without any server-side processing.

E-Learning Platforms

Add live captions and audio enhancement to your LMS. Meet ADA/WCAG requirements with a single SDK integration. Students get personal audio controls on every piece of content.

Podcast & Media Platforms

Offer AI transcription, auto-captioning, and audio mastering as features of your platform. Replace server-side processing with client-side SDK — eliminate per-minute API costs.

Accessibility Solutions

Build assistive technology products with professional-grade audio enhancement. Hearing-adaptive EQ, live captions, and voice boost — all accessible via a clean API.

Pricing

Flexible licensing for every scale

Starter

Per-MAU

Pay per monthly active user. Ideal for products with variable or growing usage. Volume discounts at scale.

DSP Engine (all features)
Up to 10,000 MAU
Email support
Standard SLA

POPULAR

Growth

Flat License

Flat monthly fee, unlimited users. Predictable costs for products with established user bases.

DSP + Transcription Engines
Unlimited MAU
Priority support (24h SLA)
Custom branding

Enterprise

Custom

Full platform access with dedicated support. White-label, custom models, and on-premise deployment options.

All three engines
Unlimited MAU
Dedicated account manager
Custom SLA & support
Source code escrow

Add audio intelligence to your product

Professional DSP, AI transcription, and TTS — all running client-side. Zero server load, zero per-minute fees.

Request API Access

Include your use case and estimated MAU. We'll respond within 24 hours.

Embed audio intelligencein your product