Gulf Arabic Conversational Telephone Speech contains recordings of 975 Gulf Arabic speakers taking part in spontaneous telephone conversations in Colloquial Gulf Arabic. A total of 976 conversation sides are provided (one speaker appears on two distinct calls). The average duration per side is about 5.7 minutes. This corpus was collected and transcribed in 2004 by Appen Pty Ltd. (Appen), Sydney, Australia, working under a U.S. Government contract.
Each single-channel file represents just one side of a normal conversation. The "devtest" set is a relatively balanced (representative) sample drawn from the total pool of collected calls. It was chosen through a test-set selection process applied by the National Institute of Standards and Technology (NIST), using demographic, phone and audit information provided by Appen.
*
Gulf Arabic Conversational Telephone Speech, Transcripts contains transcripts of 975 Gulf Arabic speakers taking part in spontaneous telephone conversations in Colloquial Gulf Arabic. A total of 976 conversation sides are provided (one speaker appears on two distinct calls). The data was collected and transcribed in 2004 by Appen Pty Ltd., Sydney, Australia, working under a U.S. Government contract.
Each transcript file is a tab-delimited flat table, where each line contains information and text for a single contiguous utterance, presented via the following fields:
- beginning time stamp in seconds, in square brackets ("[5.7189]")
- ending time stamp in seconds, in square brackets
- channel/speaker-ID ("A:" or "B:")
- "consonant skeleton" orthography for the utterance, in UTF-8
- "diacritized" orthography for the utterance, in ASCII
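The field layout above can be read with a few lines of Python. This is a minimal sketch, not a tool shipped with the corpus: the names `Utterance` and `parse_line` are hypothetical, and it assumes every line has exactly five tab-separated fields, that both time stamps are plain decimal numbers in square brackets, and that the channel field is literally "A:" or "B:". Real transcript files may contain lines that deviate from this, so the parser raises an error rather than guessing.

```python
import re
from typing import NamedTuple


class Utterance(NamedTuple):
    """One transcript line (hypothetical container, not part of the corpus)."""
    start: float       # beginning time stamp, in seconds
    end: float         # ending time stamp, in seconds
    channel: str       # "A" or "B", with the trailing colon removed
    skeleton: str      # "consonant skeleton" orthography (UTF-8)
    diacritized: str   # "diacritized" orthography (ASCII)


# Assumption: time stamps look like "[5.7189]", a decimal number in brackets.
_TIME_RE = re.compile(r"^\[(\d+(?:\.\d+)?)\]$")


def _parse_time(field: str) -> float:
    match = _TIME_RE.match(field.strip())
    if match is None:
        raise ValueError(f"unexpected time stamp field: {field!r}")
    return float(match.group(1))


def parse_line(line: str) -> Utterance:
    """Split one tab-delimited transcript line into its five fields."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 5:
        raise ValueError(f"expected 5 tab-separated fields, got {len(fields)}")
    start, end, channel, skeleton, diacritized = fields
    channel = channel.strip()
    if channel not in ("A:", "B:"):
        raise ValueError(f"unexpected channel field: {channel!r}")
    return Utterance(
        start=_parse_time(start),
        end=_parse_time(end),
        channel=channel[:-1],
        skeleton=skeleton,
        diacritized=diacritized,
    )
```

To read a whole file, open it with `encoding="utf-8"` (the consonant skeleton field is UTF-8) and call `parse_line` on each non-empty line. The difference `end - start` then gives the duration of each utterance in seconds.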