Following [9], DFs can be classified based on how the actual utterance must be modified to obtain the intended fluent utterance, i.e., the utterance a speaker would produce if asked to repeat his or her utterance. The types can be characterized by the type of editing required.
SHE UH GOT REAL LUCKY THOUGH --> SHE GOT REAL LUCKY THOUGH
IT'S A IT'S A FAIRLY LARGE COMMUNITY --> IT'S A FAIRLY LARGE COMMUNITY
I DID YOU HAPPEN TO SEE ... --> DID YOU HAPPEN TO SEE ...
We know from prior work [9] that these three types of
DF are the most frequent across a variety of spontaneous speech corpora,
accounting for
over 85% of DF tokens in the Switchboard corpus.
See [9] for a description of other, less frequent, types
of DF that are not modeled explicitly in our LM.
For example, we are not modeling word substitutions or speech errors.