<br>Dear SRILM List Members,<br><br>I was experimenting with the "-use-server" option of ngram and it appears to work for "-ppl" calculations from text but I was receiving different numbers when working with count files. With some debugging, I realized that this was due to the server receiving <unk> tokens from the client. <br>
<br>I made the following modification:<br><br>line 352, LM.cc, version 1.5.7:<br> //vocab.getIndices(words, wids, order + 1, vocab.unkIndex());<br> vocab.addWords(words, wids, order + 1);<br><br>and I am able to get the same results with or without using a server.<br>
<br>I have not checked whether this will effect "-cache-served-ngrams" policy or whether this may have other impacts on the results.<br><br>Regards,<br>Ergun<br><br clear="all">Ergun Bicici<br>Koc University<br>
<br>