ngram-class with -incremental + -save-maxclasses

Matt Lease mlease at cs.brown.edu
Fri Mar 28 12:03:25 PDT 2008


What is the behavior of -save-maxclasses for ngram-class when 
-incremental is used?  My understanding of -incremental is that C, as 
specified by -numclasses, determines the number of classes for the entire 
run (or C+1 while a new word is being merged into the existing C 
classes), in which case -save-maxclasses would seem not to add anything 
(i.e. perhaps it's only intended for V^3 clustering).
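
To make the reasoning concrete, here is a toy sketch of how the number of 
classes would evolve under the two merging strategies.  It is not SRILM 
code: the function names are made up, it ignores the -save interval, and it 
assumes that -save-maxclasses K only allows saving once the class count is 
K or less, which is my reading of the option rather than something I have 
verified.

```python
def class_counts_full(vocab_size, num_classes):
    """Class count after each merge when every word starts in its own class."""
    # V classes shrink by one per merge until only C remain.
    return list(range(vocab_size - 1, num_classes - 1, -1))


def class_counts_incremental(vocab_size, num_classes):
    """Class count after each merge when words are added one at a time."""
    # Each remaining word briefly makes C+1 classes; one merge restores C.
    return [num_classes] * (vocab_size - num_classes)


def saved_iterations(counts, max_classes):
    """Iterations (1-based) at which a save would be allowed.

    Assumption: saving is allowed only when the class count is <= max_classes.
    """
    return [i for i, n in enumerate(counts, start=1) if n <= max_classes]


# Toy sizes: V = 10 words, C = 3 target classes, K = 5.
print(saved_iterations(class_counts_full(10, 3), 5))         # [5, 6, 7]
print(saved_iterations(class_counts_incremental(10, 3), 5))  # [1, 2, 3, 4, 5, 6, 7]
```

Under these assumptions the threshold selects a trailing window of 
iterations for full merging, but for incremental merging it is either 
always or never satisfied, which is why the option looks redundant there.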

If one wanted to get different clusterings with the greedy algorithm 
without re-running each from scratch, it looks like you can use the 
-class-counts option and then feed this counts file into a subsequent 
invocation of ngram-class.  For example, run it initially with C=1000, 
then feed the output class counts into a second invocation with C=500, 
say.  Is this the correct procedure?

Thanks!


