<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">On 5/25/2013 1:37 PM, Sander Maijers
wrote:<br>
</div>
<blockquote cite="mid:51A120F8.8050602@student.ru.nl" type="cite">Hi,
<br>
<br>
Should one surround the sentences in the sentences file for
ngram's '-ppl' with &lt;s&gt; sos and &lt;/s&gt; eos tokens? They
are in the LM.
<br>
<br>
I have tested it just now, and it seems that the sentence boundary
markers are inferred by ngram when left out, and adopted when put
in. Where is this documented?
<br>
</blockquote>
<br>
In the <a
href="http://www.speech.sri.com/projects/srilm/manpages/ngram.1.html">man
page</a>. The relevant options are:<br>
<br>
-no-sos<br>
Disable the automatic insertion of start-of-sentence
tokens for sentence probability computation. The
probability of the initial word is thus computed with
an empty context.<br>
<br>
-no-eos<br>
Disable the automatic insertion of end-of-sentence
tokens for sentence probability computation.
End-of-sentence is thus excluded from the total
probability.<br>
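<br>
As a rough illustration of what these two options change, here is a toy
Python sketch (not SRILM code). The bigram table, the probabilities, and
the function name sentence_logprob are all made up for the example. The
logic only mirrors the man page text above and the behaviour described in
the question, where &lt;s&gt; and &lt;/s&gt; already present in the input
are kept rather than inserted a second time.<br>
<br>

```python
import math

SOS, EOS = "<s>", "</s>"

# Hypothetical toy probabilities, chosen only for illustration.
UNIGRAM = {"hello": 0.5, "world": 0.5}
BIGRAM = {
    (SOS, "hello"): 0.8,
    ("hello", "world"): 0.4,
    ("world", EOS): 0.5,
}


def sentence_logprob(words, add_sos=True, add_eos=True):
    """Return (log10 probability, number of scored tokens) for one sentence.

    add_sos=False plays the role of -no-sos, add_eos=False that of -no-eos.
    """
    words = list(words)
    # Insert boundary markers only when the input does not already have them.
    if add_sos and (not words or words[0] != SOS):
        words.insert(0, SOS)
    if add_eos and (not words or words[-1] != EOS):
        words.append(EOS)

    logprob = 0.0
    scored = 0
    context = None  # None means "empty context"
    for word in words:
        if word == SOS:
            # The start marker only provides context; it is not scored itself.
            context = SOS
            continue
        if context is None:
            p = UNIGRAM[word]  # first word with an empty context
        else:
            p = BIGRAM[(context, word)]
        logprob += math.log10(p)
        scored += 1
        context = word
    return logprob, scored


if __name__ == "__main__":
    sentence = ["hello", "world"]
    print(sentence_logprob(sentence))
    print(sentence_logprob([SOS] + sentence + [EOS]))
    print(sentence_logprob(sentence, add_sos=False))
    print(sentence_logprob(sentence, add_eos=False))
```

<br>
In this toy model the first two calls give the same score, since explicit
markers are simply adopted. With add_sos=False the first word is scored
from the unigram table, and with add_eos=False the end-of-sentence term is
left out, so one fewer token is counted.<br>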
<br>
<br>
Andreas<br>
<br>
<blockquote cite="mid:51A120F8.8050602@student.ru.nl" type="cite">
<br>
Best,
<br>
Sander
<br>
_______________________________________________
<br>
SRILM-User site list
<br>
<a class="moz-txt-link-abbreviated" href="mailto:SRILM-User@speech.sri.com">SRILM-User@speech.sri.com</a>
<br>
<a class="moz-txt-link-freetext" href="http://www.speech.sri.com/mailman/listinfo/srilm-user">http://www.speech.sri.com/mailman/listinfo/srilm-user</a>
<br>
</blockquote>
<br>
</body>
</html>