REST API for Stereo (2) Channels

Perform ASR (automatic speech recognition) on a stereo WAV file by transcribing the left and right channels separately. Only two channels are currently supported. This is convenient for call center recordings, where each speaker is recorded on a separate channel.

Request URL

https://api.iapp.co.th/asr/stereo

Request Method

POST

Headers

Parameter Name | Description | Notes
apikey | Your API key, obtained from https://ai.iapp.co.th | Required

Request Parameters (Form Data)

Parameter Name | Description | Notes
file | Speech audio file, in stereo WAV format (2 channels) only. | Required
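
Because the endpoint accepts stereo WAV files only, it can help to check the channel count locally before uploading. The following is a minimal sketch using Python's standard `wave` module; the helper name `is_stereo_wav` is hypothetical and not part of the API, and `wave` reads only uncompressed PCM WAV files, so other encodings are reported as invalid here.

```python
import wave


def is_stereo_wav(path):
    """Return True if `path` is a WAV file that `wave` can read and has 2 channels."""
    try:
        with wave.open(path, "rb") as wav:
            return wav.getnchannels() == 2
    except (wave.Error, EOFError):
        # The file is not a WAV that the `wave` module can parse
        # (wrong header, unsupported encoding, or truncated).
        return False
```

A missing file still raises `FileNotFoundError`, so that case can be told apart from a file that exists but is not a stereo WAV.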

Return Values (JSON)

When speaker diarization is enabled, the response contains the following fields:

Parameter Name | Description
channel | Source channel: 'left' or 'right'.
speech | Array of JSON objects, each holding transcribed text and its confidence score. The array can contain multiple entries, because the transcription is split at silences longer than 1 second.
start_second | Time, in seconds, at which the speaker's speech begins.
stop_second | Time, in seconds, at which the speaker's speech ends.
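
The fields above can be combined into a time-ordered, two-speaker transcript. The sketch below is illustrative only and rests on assumptions not confirmed by this page: that the response body is a JSON array of segment objects, and that each entry in `speech` stores its text under a key named `text`. Both the function name `build_transcript` and the `text` key are hypothetical, so adjust them to the actual response.

```python
import json


def build_transcript(response_text):
    """Turn a response body into transcript lines ordered by start time.

    Assumes (hypothetically) a JSON array of segments, each with
    `channel`, `speech`, `start_second` and `stop_second`, where every
    `speech` entry keeps its text under a `text` key.
    """
    segments = json.loads(response_text)
    lines = []
    for seg in sorted(segments, key=lambda s: s["start_second"]):
        # Join the pieces that were split at silences within this segment.
        text = " ".join(item["text"] for item in seg["speech"])
        lines.append(
            f'[{seg["start_second"]:.1f}-{seg["stop_second"]:.1f}] '
            f'{seg["channel"]}: {text}'
        )
    return lines
```

Sorting by `start_second` interleaves the two channels, which gives a conversation-style view of a call where each speaker is on a separate channel.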

Sample Request

curl --location --request POST 'https://api.iapp.co.th/asr/stereo' \
--header 'apikey: {your api key}' \
--form 'file=@"/Users/kobkrit/Desktop/ETC/2ppl.wav"'
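
The same request can be sketched in Python with only the standard library. This is a minimal illustration, not an official client: the helper name `build_stereo_request` is hypothetical, and the `audio/wav` part content type is an assumption (the curl sample does not specify one). The function only builds the request, so it can be inspected before anything is sent.

```python
import urllib.request
import uuid

API_URL = "https://api.iapp.co.th/asr/stereo"  # endpoint taken from the curl sample


def build_stereo_request(api_key, filename, wav_bytes):
    """Build (without sending) a multipart/form-data POST with the WAV as `file`."""
    boundary = uuid.uuid4().hex
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        b"Content-Type: audio/wav\r\n\r\n",  # assumed content type for the file part
        wav_bytes,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    headers = {
        "apikey": api_key,
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


# Usage (performs a real network call, so it is left commented out):
# with open("call.wav", "rb") as f:
#     req = build_stereo_request("YOUR_API_KEY", "call.wav", f.read())
# with urllib.request.urlopen(req) as resp:
#     print(resp.read().decode("utf-8"))
```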