[SRILM User List] how are the probabilities computed in ngram-count
    Saman Noorzadeh 
    saman_2004 at yahoo.com
       
    Tue Apr 10 01:29:37 PDT 2012
    
    
  
Hello
I am getting confused about the models that ngram-count make:
ngram-count -order 2  -write-vocab vocabulary.voc -text mytext.txt   -write model1.bo
ngram-count -order 2  -read model1.bo -lm model2.BO
forexample: (the text is very large and these words are just a sample)
in model1.bo:
cook   14 
cook was 1
in model2.BO:
-1.904738  cook was 
my question is that the probability of 'cook was' bigram should be log10(1/14), but ngram-count result shows: log(1/80)== -1.9047
how is these probabilities computed?
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://www.speech.sri.com/pipermail/srilm-user/attachments/20120410/991f8ee3/attachment.html>
    
    
More information about the SRILM-User
mailing list