For partial words, use target hypothesis.
Example
So uhm <,> then we all <.> dec </.> they all decided they wanted to go to the disco like but I had no money
Token | So | uhm | then | we | all | dec | they | all | decided |
---|---|---|---|---|---|---|---|---|---|
Tag | RB | UH | RB | PRP | RB | VVD | PRP | RB | VVD |
Sometimes, it may be difficult to use target hypothesis. In these cases, see UNCLEAR.
Words that contain null-to-low semantic value are tagged as discourse markers (i.e. UH). These words are usually affirmative responses, where the words contain less semantic value than their alternative usage. For example, well in “oh well” no longer contains the sense of well as in “the child behaved well”.
Examples
Oh right
Token | Oh | right |
---|---|---|
Tag | UH | UH |
Ah cool
Token | Ah | cool |
---|---|---|
Tag | UH | UH |
He rang her alright
Token | He | rang | her | alright |
---|---|---|---|---|
Tag | PRP | VVD | PRP | UH |
Function: “retroactive focusing power, but more importantly, […] they can be interpreted as countering potential inferences, objections, or doubts” (Miller & Weinert, 1995)
Since clause-final 'like' is extremely common, and does not (a) appear in the same distribution, and (b) have the same function as other forms of 'like', they should be tagged as UH.
All the people were out like.
Token | All | the | people | were | out | like |
---|---|---|---|---|---|---|
Tag | PDT | TD | NNS | VBD | IN | UH |
ye
Example
Did she go out with ye.
Token | Did | she | go | out | with | ye |
---|---|---|---|---|---|---|
Tag | VVD | PRP | VV | IN | IN | PRP |
UNCLEAR
Either use target hypothesis or the tag XX. N.B. XX is also used in the Switchboard Corpus for partial words, and unclear parts of speech. Here, we tag partial words using target hypothesis. If the partial word is unclear, then proceed to tag as XX.
Example
Did you go UNCLEAR
Token | Did | you | go | UNCLEAR |
---|---|---|---|---|
Tag | VVD | PRP | VV | XX |