User Tools

Site Tools


segmentation_into_utterances

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
segmentation_into_utterances [2015/05/08 21:38]
sm2842 created
segmentation_into_utterances [2021/02/11 16:44] (current)
Line 1: Line 1:
 +=====Utterance segmentation=====
 Utterance segmentation can more straightforward in a corpus of spoken data than a corpus of written data, because much of the utterance segmentation may have been already been carried out by the transcriber.  Utterance segmentation can more straightforward in a corpus of spoken data than a corpus of written data, because much of the utterance segmentation may have been already been carried out by the transcriber. 
  
Line 4: Line 5:
  
   -Any sentence-final period is treated as an utterance delimiter. Even when two full NP-VP sentences are transcribed as being separated by a comma, for whatever reason, these should be treated as a single utterance.   -Any sentence-final period is treated as an utterance delimiter. Even when two full NP-VP sentences are transcribed as being separated by a comma, for whatever reason, these should be treated as a single utterance.
- 
   -Speaker switches are also utterance delimiters. This is non-trivial, as full NP-VP sentences can sometimes be co-constructed by speakers in conversation. Nevertheless, co-construction by two or more speakers of a single utterance will not be assumed in this corpus for ease of analysis.   -Speaker switches are also utterance delimiters. This is non-trivial, as full NP-VP sentences can sometimes be co-constructed by speakers in conversation. Nevertheless, co-construction by two or more speakers of a single utterance will not be assumed in this corpus for ease of analysis.
- 
   -False starts and speech errors are included in the full utterance which follows, if the full utterance is produced by the same speaker. If there is an incomplete sentence followed by a speaker switch, the incomplete sentence should form its own utterance.   -False starts and speech errors are included in the full utterance which follows, if the full utterance is produced by the same speaker. If there is an incomplete sentence followed by a speaker switch, the incomplete sentence should form its own utterance.
  
segmentation_into_utterances.1431121135.txt.gz ยท Last modified: 2021/02/11 17:01 (external edit)