Re: class-based language model

From: ilya oparin <ioparin at ADDRESS HIDDEN>
Date: Wed, 13 Sep 2006 12:49:02 +0100 (BST)

Hi,

I am not sure exactly what you dislike about the debugging output you get with the "-debug 2" option, but I would add the "-numclasses" option. If it is set to zero, class merging is suppressed altogether, as stated in the ngram-class manual. Maybe then you will get the output you expect.
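
For reference, the overall workflow asked about below (induce classes, train on class-mapped text, evaluate) might look roughly like the sketch that follows. The flags are written from my reading of the SRILM man pages (ngram-class, replace-words-with-classes, ngram-count, ngram), so please check them against your installed version. The names testfile.txt, classes.txt, class.counts, train.classes.txt and class.lm, the target of 100 classes, and the trigram order are placeholders I made up for illustration. By default the script only prints the commands; set RUN=1 to execute them.

```shell
#!/bin/sh
# Sketch of a class-based LM pipeline with SRILM.
# File names, NUMCLASSES=100 and -order 3 are illustrative assumptions,
# not values taken from the thread.
set -eu

TRAIN=${TRAIN:-textfile.txt}
TEST=${TEST:-testfile.txt}
NUMCLASSES=${NUMCLASSES:-100}

# Print each step; execute it only when RUN=1 (requires SRILM on PATH).
step() {
    echo "+ $*"
    if [ "${RUN:-0}" = 1 ]; then
        sh -c "$*"
    fi
}

# 1. Induce word classes from the training text.
step "ngram-class -text $TRAIN -numclasses $NUMCLASSES -classes classes.txt -class-counts class.counts"

# 2. Map every word in the training text to its class label.
step "replace-words-with-classes classes=classes.txt $TRAIN > train.classes.txt"

# 3. Train an ordinary N-gram model over the class-mapped text.
step "ngram-count -order 3 -text train.classes.txt -lm class.lm"

# 4. Evaluate on word-level test text, expanding classes via classes.txt.
step "ngram -order 3 -lm class.lm -classes classes.txt -ppl $TEST"
```

The perplexity figures printed by ngram-class with -debug 2 are diagnostics of the clustering itself; the number you normally report is the one from the final ngram -ppl step.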

Martha Yifiru <marthayifiru at ADDRESS HIDDEN> wrote:

Hi all,

Is there a tutorial or introduction on how to develop a class-based language model (where the classes are induced automatically by ngram-class)? I thought that ngram-class is used to induce the classes automatically, and that language model training and evaluation then follow using ngram-count and ngram, respectively. So I used the following command:
    
    ngram-class -debug 2 -text textfile.txt -class-counts classcount_file -classes input_to_ngram

But the result is not what I expected. It even gave me perplexity values.

Would you please give me some ideas on how to develop a class-based language model?

Waiting to hear from you, I remain.
Martha.

_______________________________________
Address:

Martha Yifiru Tachbelie
Sedanstrasse 24
20146 Hamburg
Germany
Tel. +49 40 52721540    

best regards,
Ilya
