Parse Audio

Transcribe audio files with speaker detection. The API returns the full transcript, timestamps, and suggested questions about the content.

🎙️
Speaker Detection
Identifies and labels different speakers
⏱️
Timestamps
Word-level timing for precise navigation
🌍
Multi-Language
Transcription in 50+ languages
example.js
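// Assumes the page contains an <input type="file"> element.
const fileInput = document.querySelector('input[type="file"]');

// Send the audio as multipart/form-data under the "file" field.
const formData = new FormData();
formData.append('file', fileInput.files[0]);

const response = await fetch(
  'https://api.skimming.ai/source/v1/api/parse/audio',
  {
    method: 'POST',
    headers: {
      // Replace YOUR_API_KEY with your key; fetch sets the multipart Content-Type itself.
      'Authorization': 'Bearer YOUR_API_KEY',
    },
    body: formData
  }
);
// Surface HTTP errors instead of trying to read a failed response as a transcript.
if (!response.ok) {
  throw new Error(`Request failed with HTTP ${response.status}`);
}
const data = await response.json();
console.log(data.success.textContent);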
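
For server-side use, the same request can be sketched in Node.js 18 or later, where `fetch`, `FormData`, and `Blob` are available as globals. The helper names `buildParseAudioRequest` and `parseAudioFile` are assumptions made for illustration and are not part of any official SDK. The URL and the `file` field name are taken from the browser example above.

```javascript
// Sketch only: buildParseAudioRequest and parseAudioFile are hypothetical helpers.
// The URL and the 'file' form field mirror the browser example above.
const PARSE_AUDIO_URL = 'https://api.skimming.ai/source/v1/api/parse/audio';

// Build the request separately from sending it so it can be inspected or tested.
function buildParseAudioRequest(apiKey, audioBlob, filename) {
  if (!apiKey) {
    throw new Error('An API key is required');
  }
  const formData = new FormData();
  formData.append('file', audioBlob, filename);
  return {
    url: PARSE_AUDIO_URL,
    init: {
      method: 'POST',
      // Do not set Content-Type by hand: fetch adds the multipart boundary itself.
      headers: { Authorization: `Bearer ${apiKey}` },
      body: formData,
    },
  };
}

// Read a local file, upload it, and return the parsed JSON body.
async function parseAudioFile(path, apiKey) {
  const { readFile } = await import('node:fs/promises');
  const { basename } = await import('node:path');
  const bytes = await readFile(path);
  const { url, init } = buildParseAudioRequest(apiKey, new Blob([bytes]), basename(path));
  const response = await fetch(url, init);
  if (!response.ok) {
    throw new Error(`Parse Audio request failed with HTTP ${response.status}`);
  }
  return response.json();
}
```

A call such as `parseAudioFile('./episode.mp3', process.env.SKIMMING_API_KEY)` would then resolve to the JSON body; the environment variable name is also an assumption.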

What's in the API

The features below cover transcription, speaker labeling, word timing, language support, and generated questions.

High-Accuracy Transcription

Speech recognition that converts audio into accurate, searchable text.

Speaker Diarization

Automatically identify and label different speakers throughout the recording.

Word-Level Timestamps

Precise timing information for each word enables navigation to specific audio moments.

50+ Language Support

Transcribe audio in over 50 languages with automatic language detection.

Smart Content Questions

AI-generated questions based on transcript content for deeper exploration.

Universal Format Support

Process MP3, WAV, OGG, AAC, FLAC, and M4A files with consistent results.
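
Since only certain formats are listed, a client might check the file extension before uploading. The following is a minimal sketch: the extension list is taken from the formats named above, while the check itself and the names `SUPPORTED_AUDIO_EXTENSIONS`, `getExtension`, and `isSupportedAudioFile` are assumptions. The API may apply its own validation, which this does not replace.

```javascript
// The six extensions come from the format list above; the check itself is a
// client-side convenience and an assumption, not documented API behavior.
const SUPPORTED_AUDIO_EXTENSIONS = new Set(['mp3', 'wav', 'ogg', 'aac', 'flac', 'm4a']);

function getExtension(filename) {
  const dot = filename.lastIndexOf('.');
  // No dot, or only a leading dot (e.g. ".hidden"), means no usable extension.
  return dot > 0 ? filename.slice(dot + 1).toLowerCase() : '';
}

function isSupportedAudioFile(filename) {
  return SUPPORTED_AUDIO_EXTENSIONS.has(getExtension(filename));
}
```

In a browser this could guard the upload with `isSupportedAudioFile(fileInput.files[0].name)` before the request is sent.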

Common Use Cases

See how developers are using this API to solve real-world problems.

Podcast Processing

Transcribe episodes for show notes and SEO.

Meeting Notes

Convert recorded meetings to searchable text.

Interview Transcription

Transcribe interviews with speaker labels.

Voice Memo Conversion

Turn voice memos into written notes.
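
For use cases such as interviews and meeting notes, speaker labels and timestamps can be combined into a readable transcript. The sketch below assumes each transcript segment looks like `{ speaker, start, text }` with `start` in seconds. That shape is hypothetical, because this page does not show the response schema, so the field names will likely need adapting.

```javascript
// Hypothetical segment shape: { speaker: 'Speaker 1', start: 0.0, text: 'Hello' }.
// The real response schema is not shown on this page, so adapt the field names.

// Convert a time in seconds to HH:MM:SS, dropping fractional seconds.
function formatTimestamp(totalSeconds) {
  const whole = Math.floor(totalSeconds);
  const hours = Math.floor(whole / 3600);
  const minutes = Math.floor((whole % 3600) / 60);
  const seconds = whole % 60;
  return [hours, minutes, seconds].map((n) => String(n).padStart(2, '0')).join(':');
}

// Merge consecutive segments from the same speaker into a single turn.
function mergeSpeakerTurns(segments) {
  const turns = [];
  for (const segment of segments) {
    const last = turns[turns.length - 1];
    if (last && last.speaker === segment.speaker) {
      last.text += ' ' + segment.text;
    } else {
      turns.push({ speaker: segment.speaker, start: segment.start, text: segment.text });
    }
  }
  return turns;
}

// Render one line per turn: "[HH:MM:SS] Speaker: text".
function formatTranscript(segments) {
  return mergeSpeakerTurns(segments)
    .map((turn) => `[${formatTimestamp(turn.start)}] ${turn.speaker}: ${turn.text}`)
    .join('\n');
}
```

Each turn keeps the start time of its first segment, so a line in the output can be used to jump to the matching point in the audio.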

Technical Specifications

Everything you need to know to integrate this API.

Endpoint: /v1/source/v1/api/parse/audio

HTTP Method: POST

Authentication: Bearer Token (API Key)

Rate Limit: Based on subscription tier
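
Because request limits depend on the subscription tier, a client may want to retry when a limit is hit. The sketch below uses exponential backoff and assumes that rate-limited requests are answered with HTTP 429. This page does not confirm that status code, and the names `backoffDelayMs` and `fetchWithRetry` as well as the default delays are illustrative choices.

```javascript
// Assumption: rate-limited requests are answered with HTTP 429. The page only
// says limits depend on the subscription tier, so verify the actual status code.

function backoffDelayMs(attempt, baseMs = 500, capMs = 8000) {
  // attempt 0 -> baseMs, attempt 1 -> 2 * baseMs, and so on, capped at capMs.
  return Math.min(capMs, baseMs * 2 ** attempt);
}

// fetchImpl and sleep are injectable so the retry logic can be tested offline.
async function fetchWithRetry(url, init, { maxRetries = 3, fetchImpl = fetch, sleep } = {}) {
  const wait = sleep ?? ((ms) => new Promise((resolve) => setTimeout(resolve, ms)));
  for (let attempt = 0; ; attempt += 1) {
    const response = await fetchImpl(url, init);
    // Return on any non-429 status, or once the retry budget is used up.
    if (response.status !== 429 || attempt >= maxRetries) {
      return response;
    }
    await wait(backoffDelayMs(attempt));
  }
}
```

With the defaults, a request is sent at most four times, with waits of 500 ms, 1000 ms, and 2000 ms between attempts.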

Frequently Asked Questions

Haven’t found your answer? Contact our support team.

Is speaker diarization included?

What audio quality is needed?

What's the max audio length?