SRILM-USER Archives

question about vocabulary

From: lavecchia <Caroline.Lavecchia at ADDRESS HIDDEN>
Date: Tue, 04 May 2004 16:18:11 +0200

Hello everybody,

I would like to know whether the SRILM toolkit can generate a vocabulary
restricted to, for example, the 20000 most frequent words of a corpus.

I know that the -write-vocab option of ngram-count writes out a vocabulary,
but that vocabulary contains every word in the corpus.

Thanks in advance, and sorry for my bad English,

Caroline L.
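
For illustration, the kind of frequency-limited vocabulary asked about above could be built outside the toolkit. The following is a minimal sketch in Python, not an SRILM feature: the function names `top_k_vocab` and `write_vocab` are hypothetical, and it assumes the corpus is plain text with whitespace-separated tokens. Ties in frequency are broken alphabetically so the output is deterministic.

```python
from collections import Counter


def top_k_vocab(lines, k):
    """Return the k most frequent whitespace-separated tokens in `lines`."""
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    # Rank by descending count; break ties alphabetically for stable output.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def write_vocab(words, path):
    """Write one word per line (hypothetical helper for this sketch)."""
    with open(path, "w", encoding="utf-8") as f:
        for word in words:
            f.write(word + "\n")


if __name__ == "__main__":
    import sys

    # Assumed usage: script.py CORPUS VOCAB_OUT [K], with K defaulting to 20000.
    corpus_path, vocab_path = sys.argv[1], sys.argv[2]
    k = int(sys.argv[3]) if len(sys.argv) > 3 else 20000
    with open(corpus_path, encoding="utf-8") as f:
        write_vocab(top_k_vocab(f, k), vocab_path)
```

The resulting file lists one word per line. As far as I know, a word list of this kind can then be given to ngram-count through its -vocab option so that counting is restricted to those words, but check the ngram-count manual page for the exact behavior in your SRILM version.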
