Hi,
Is it normal that in an open-vocabulary LM (built with the "-unk"
option) the <unk> token is present as unigram, but not in bigrams and
trigrams?
(Sorry if this is a silly question, but I am not so familiar with
language models, and I was told that it would not be the case with other
toolkits).
Thanks again,
Amélie
--
--------------------------------------------------------------------
Amélie DELTOUR
ENSIMAG / Universität Karlsruhe
E-mail : amelie.deltour at ADDRESS HIDDEN
--------------------------------------------------------------------
Click here to go to the SRILM home page.