Open-vocabulary LM

From: Amélie DELTOUR <amelie.deltour at ADDRESS HIDDEN>
Date: Tue, 25 Feb 2003 17:13:00 +0100

Hi,
Is it normal that in an open-vocabulary LM (built with the "-unk"
option) the <unk> token is present as a unigram, but not in any bigrams
or trigrams?
(Sorry if this is a silly question, but I am not very familiar with
language models, and I was told that this would not be the case with
other toolkits.)
Thanks again,

Amélie

--
--------------------------------------------------------------------
Amélie DELTOUR
ENSIMAG / Universität Karlsruhe
E-mail : amelie.deltour at ADDRESS HIDDEN
--------------------------------------------------------------------
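
One way to confirm the observation described in the message is to count, for each n-gram order, how many entries of the model file contain <unk>. The following is only a minimal sketch. It assumes the model was written in the standard ARPA text format (sections headed "\1-grams:", "\2-grams:", and so on, each entry being a log probability, the words, and an optional backoff weight). The function name count_unk_by_order and the tiny SAMPLE model are hypothetical and exist only for illustration.

```python
import re
from collections import Counter


def count_unk_by_order(lines, unk="<unk>"):
    """Count the n-grams containing `unk` for each order of an ARPA-format LM.

    `lines` is any iterable of text lines (an open file, a list of strings).
    Returns a dict mapping n-gram order -> number of entries containing `unk`.
    """
    counts = Counter()
    order = None  # order of the section currently being read, if any
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        header = re.fullmatch(r"\\(\d+)-grams:", line)
        if header:
            order = int(header.group(1))
            counts[order] += 0  # make sure the order is reported even if zero
            continue
        if line.startswith("\\"):  # "\data\" or "\end\"
            order = None
            continue
        if order is None:  # "ngram N=count" lines in the header
            continue
        fields = line.split()
        words = fields[1:1 + order]  # skip the log probability, drop the backoff
        if unk in words:
            counts[order] += 1
    return dict(counts)


# Hypothetical toy model, not output from any real toolkit run.
SAMPLE = """\
\\data\\
ngram 1=4
ngram 2=2

\\1-grams:
-0.9\t</s>
-99\t<s>\t-0.3
-0.7\t<unk>
-0.6\thello\t-0.3

\\2-grams:
-0.3\t<s> hello
-0.3\thello </s>

\\end\\
"""

if __name__ == "__main__":
    print(count_unk_by_order(SAMPLE.splitlines()))
```

To run the same check on a real model, pass an open file handle instead of the sample lines. A result with a nonzero count for order 1 and zeros for the higher orders corresponds to the situation described above.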
