Thank you, sir, for your help.<br><br>I have another, rather basic question.<br>After obtaining the probability distribution over words, I built another language model. When I compared my previous LM (built from my class definition with no [p] values) with the newly created LM, I found no difference.<br>
<br>I may be misunderstanding the basic theory (I have only read about it in books), or I may be doing something wrong in one of the steps.<br><br>These are the steps I am currently following:<br><br>[1] replace-words-with-classes classes=atis_sphinx.def addone=1 normalize=1 outfile=countExpansion compound_LM.txt<br>
<br>[2] replace-words-with-classes classes=countExpansion compound_LM.txt > output_text_with_classes <br><br>[3] ngram-count -text output_text_with_classes classes=countExpansion -lm class_based_model_2.lm<br><br>I also tried: ngram-count -text output_text_with_classes -lm class_based_model_2.lm<br>
<br>Please let me know where I am going wrong.<br><br>I am really sorry for the basic question.<br><br>Thank you.<br><br><div class="gmail_quote">On Wed, Jun 6, 2012 at 12:26 AM, Andreas Stolcke <span dir="ltr"><<a href="mailto:stolcke@icsi.berkeley.edu" target="_blank">stolcke@icsi.berkeley.edu</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div bgcolor="#FFFFFF" text="#000000">
You can build class-based LMs using your own class assignments.<br>
<br>
Step 2 works with a classfile with or without probabilities (the
probs are optional in the format).<br>
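<br>
To illustrate why step 2 is unaffected by the probabilities, here is a minimal Python sketch (not SRILM code). It assumes a simplified class file with one single-word expansion per line and an optional probability in the second column; the function names, the CITY class, and the sample sentence are hypothetical.<br>
<br>
```python
# Hypothetical sketch, not SRILM code: single-word class expansions only.

def parse_classes(lines):
    """Map each word to its class label, skipping the optional probability."""
    word_to_class = {}
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        label, rest = fields[0], fields[1:]
        # A leading numeric field followed by at least one word is taken
        # to be the optional expansion probability and is dropped.
        if len(rest) >= 2:
            try:
                float(rest[0])
                rest = rest[1:]
            except ValueError:
                pass
        # Simplification: only single-word expansions are handled.
        if len(rest) == 1:
            word_to_class.setdefault(rest[0], label)
    return word_to_class


def replace_words(sentence, word_to_class):
    """Replace every class member in the sentence with its class label."""
    return " ".join(word_to_class.get(w, w) for w in sentence.split())


# The same (made-up) class definition, once without and once with probabilities.
classes_without_p = ["CITY boston", "CITY denver"]
classes_with_p = ["CITY 0.75 boston", "CITY 0.25 denver"]

text = "flights from boston to denver"
tagged_without_p = replace_words(text, parse_classes(classes_without_p))
tagged_with_p = replace_words(text, parse_classes(classes_with_p))

print(tagged_without_p)
print(tagged_without_p == tagged_with_p)
```
<br>
Under these assumptions the class-tagged text is identical in both cases, so an n-gram model counted from that text alone would be identical as well. The expansion probabilities only come into play where the class file itself is consulted, for instance through the -classes option of ngram.<br>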
<br>
For step 3, you need some probability distribution over the words to
obtain a proper language model.<br>
For example, use the "uniform-classes" script to insert uniform
probabilities for those class assignments that don't have any.<br>
If you have a large training set, you can run <br>
<br>
replace-words-with-classes classes=&lt;classfile&gt; addone=1
normalize=1 outfile=OUTFILE TEXTFILE<br>
<br>
to count the number of times each word occurs and estimate class
expansion probabilities (written to OUTFILE).<br>
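<br>
As a rough illustration of the two options, the Python sketch below computes uniform expansion probabilities and add-one smoothed, normalized ones. It is not the SRILM implementation: it assumes single-word class members and that every listed member receives the added count, and the function names, the CITY class, and the sample text are all hypothetical.<br>
<br>
```python
# Hypothetical sketch, not SRILM code: P(word | class) for single-word members.
from collections import Counter


def uniform_probs(class_members):
    """Give each of the N members of a class the probability 1/N."""
    return {
        label: {word: 1 / len(words) for word in words}
        for label, words in class_members.items()
    }


def expansion_probs(class_members, tokens, addone=1):
    """Estimate P(word | class) from counts, adding `addone` to each count."""
    counts = Counter(tokens)
    probs = {}
    for label, words in class_members.items():
        # Smooth: unseen members still get a nonzero count.
        smoothed = {word: counts[word] + addone for word in words}
        # Normalize so the probabilities within a class sum to 1.
        total = sum(smoothed.values())
        probs[label] = {word: c / total for word, c in smoothed.items()}
    return probs


members = {"CITY": ["boston", "denver", "dallas"]}
tokens = "flights from boston to denver and boston to boston".split()

uniform = uniform_probs(members)
estimated = expansion_probs(members, tokens)

# Counts are boston=3, denver=1, dallas=0; after adding one they become
# 4, 2 and 1, so the estimates are 4/7, 2/7 and 1/7.
print(uniform["CITY"])
print(estimated["CITY"])
```
<br>
With little training data the uniform assignment may be the safer choice; with more data the count-based estimate reflects how often each member actually occurs.<br>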
<br>
Andreas<div><div class="h5"><br>
<br>
On 6/5/2012 1:37 AM, Shammur Absar Chowdhury wrote:
</div></div><blockquote type="cite"><div><div class="h5">Hello
<div><br>
</div>
<div>I am new to SRILM and am only now learning
about language models. My aim is to build a class-based language
model from a given class definition.<br>
<br>
So far I have used the three commands below, taken from <a href="http://www.speech.sri.com/pipermail/srilm-user/2010q1/000843.html" target="_blank">http://www.speech.sri.com/pipermail/srilm-user/2010q1/000843.html</a><br>
<br>
<br>
1. ngram-class -vocab vocab.txt \<br>
-text LM.txt \<br>
-numclasses 16 \<br>
-classes classfile<br>
2. replace-words-with-classes classes=classfile LM.txt >
Output_text_with_classes<br>
3. ngram-count -text Output_text_with_classes -lm
Class_based_model<br>
<br>
<br>
As far as I understand, the first command induces the
classes automatically. What if I want SRILM to use my own class
labels and their word lists to build the class model? I tried
formatting my classes in the class format before running the
second step, but the format seems to require a probability p for
each entry, which I cannot supply in my manually created class
file.<br>
<br>
Could anyone please help me, point me in the right direction, or
suggest some reading?<br>
Thank you.<br>
<br>
Shammur Absar Chowdhury<br>
<br>
</div>
<br>
<br>
</div></div><pre>_______________________________________________
SRILM-User site list
<a href="mailto:SRILM-User@speech.sri.com" target="_blank">SRILM-User@speech.sri.com</a>
<a href="http://www.speech.sri.com/mailman/listinfo/srilm-user" target="_blank">http://www.speech.sri.com/mailman/listinfo/srilm-user</a></pre>
</blockquote>
<br>
</div>
</blockquote></div><br><br clear="all"><br>-- <br>Shammur Absar Chowdhury<br><br>