Search SRILM-USER Archives

Re: cahce based models

From: Andreas Stolcke <stolcke at ADDRESS HIDDEN>
Date: Wed, 07 Mar 2007 17:27:24 PST

In message <3BE78265-2376-4D96-8AB4-547D82E15E92 at ADDRESS HIDDEN>you wrote:
> Hi Hani,
>
> if I'm correctly interpreting your question, the LM subclass CacheLM
> provides a simple cache component implementation.
>
> Word probability is boosted if the very same word occured in a window
> of the last N words (more occurencies yield higher probability). You
> get ngram to interpolate whatever model you're using with a cache
> component using -cache. The source code of this one is very
> straightforward if you're interested in the details.
>
> If you're looking for the original papers, Kuhn and De Mori published
> on this in 1990 (as to my knowledge at least).
>
> Hope this helps.
>
> Cheers from Aachen,
>
> Juri

Thanks for this dead-on response!

At risk of stating the obvious, the code for CacheLM is in
$SRILM/lm/src/CacheLM.cc, and is quite short and easy to follow.

Best,

Andreas

>
> On 8. Mar, 2007, at 01:17, Hani Safadi wrote:
>
> > Hi,
> > I would like to get more information on the cache-based models
> > implemented in SRILM. and how to use them.
> > The paper briefly mentions them, and there is no information in the
> > man pages.
> > Thanks
> > --
> > Looking forward to hearing from you.
> > Best wishes,
> > Hani Safadi
>

Click here to go to the SRILM home page.