Paragraphs
Automatic paragraph formatting with paragraphs=true
Paragraph formatting adds line breaks to a transcript based on natural pauses, topic changes, and other conversational cues in the audio. Without paragraph formatting, the result would be a long unstructured block of text that's difficult to read or skim through.
How paragraph detection works
Aldea doesn't insert line breaks at fixed intervals or arbitrary word counts. Instead, it analyzes different signals from the audio, such as:
-
Pause duration: Extended silences between utterances often indicate a transition. In most cases, a speaker finalizing one thought and beginning another. Aldea's STT models use these pauses as signals for potential paragraph boundaries.
-
Change in topic and context: Aldea's models evaluate whether the content that comes before and after a break represents a shift in the subject. For example, a speaker who pauses briefly to breathe mid-sentence won't trigger a paragraph break, but a speaker who finishes discussing Q1 revenue and moves to hiring plans will.
Paragraph formatting ensures that the final transcript feels natural, aligns with the speaker's intent, and mirrors how the conversation actually flowed.
Usage
Add paragraphs=true as a query parameter:
curl -X POST "https://api.aldea.ai/v1/listen?paragraphs=true" \
-H "Authorization: Bearer YOUR_ALDEA_API_KEY" \
--data-binary @audio.wavYou can combine it with other formatting parameters such as smart formatting for fully polished output in a single request:
curl -X POST "https://api.aldea.ai/v1/listen?paragraphs=true&smart_format=true&numerals=true" \
-H "Authorization: Bearer YOUR_ALDEA_API_KEY" \
--data-binary @audio.wavWhen to use paragraph formatting:
Paragraph formatting adds structure to your transcript, which is important for:
- Meeting and call transcripts: It makes it easier to scan though long-form recordings as reviewers can jump to specific sections instead of reading the entire transcript.
- Content publishing: Podcast transcripts, interview write-ups, and lecture notes require structured paragraphs before publishing or sharing.
- Accessibility: Screen readers and assistive technologies navigate structured text more effectively.
- Downstream summaries: If you feed transcripts into a summarization model, paragraph boundaries give the model natural chunks to work with, which improves summary quality.