find every breath. inventory them. decide what to do.
source
drop a dialogue recording (WAV / AIFF / FLAC)
detection looks for non-voice, non-silence regions adjacent to speech · capped at 180s
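a minimal sketch of the detection rule described above, not the tool's actual implementation. it assumes the audio has already been reduced to one level in dB per frame. the function name, the two thresholds, and the frame length are hypothetical, and "capped at 180s" is read here as "only the first 180 s are analysed".

```python
from typing import List, Tuple


def find_breath_candidates(
    levels_db: List[float],
    frame_sec: float = 0.02,     # hypothetical frame length
    silence_db: float = -60.0,   # hypothetical: below this a frame is silence
    voice_db: float = -30.0,     # hypothetical: at or above this a frame is voice
    max_sec: float = 180.0,      # assumption: analysis stops after 180 s
) -> List[Tuple[float, float]]:
    """Return (start_sec, end_sec) spans that are neither voice nor silence
    and that touch a voice frame on at least one side."""
    # Apply the cap: look only at frames inside the first max_sec seconds.
    n = min(len(levels_db), round(max_sec / frame_sec))

    # Label every frame as silence, voice, or "other" (the breath candidates).
    labels = []
    for db in levels_db[:n]:
        if db < silence_db:
            labels.append("silence")
        elif db >= voice_db:
            labels.append("voice")
        else:
            labels.append("other")

    regions = []
    i = 0
    while i < n:
        if labels[i] != "other":
            i += 1
            continue
        # Extend j to the end of this run of "other" frames.
        j = i
        while j < n and labels[j] == "other":
            j += 1
        # Keep the run only if speech sits directly before or after it.
        voice_before = i > 0 and labels[i - 1] == "voice"
        voice_after = j < n and labels[j] == "voice"
        if voice_before or voice_after:
            regions.append((i * frame_sec, j * frame_sec))
        i = j
    return regions


if __name__ == "__main__":
    demo = [-80, -80, -45, -45, -20, -20, -45, -80, -45, -80]
    for start, end in find_breath_candidates(demo, frame_sec=1.0):
        print(f"candidate {start:.1f}s to {end:.1f}s")
```

fixed dB thresholds keep the sketch short. a real detector would more likely use a voice activity model or spectral features to separate breaths from speech and room noise.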
system log
>ready