Divider
  Speech Technology and Research Laboratory
  People
  Current Research Activities
  Past Research Activities
  Publications
  Career Opportunities
  Seminars
  Technologies for License
  In the News
  Contact Us
  STAR Search
  Information and Computing Sciences Division
SpacerAbout UsDividerR and D DivisionsDividerCareersDividerNewsroomDividerContact UsDividerSRI HomeSpacer

Spacer
         
  SRI Logo

Search SRILM-USER Archives

Match: Format: Sort by:
Search:

Re: limitations in ngram-merge

From: Andreas Stolcke <stolcke at ADDRESS HIDDEN>
Date: Wed, 08 Feb 2006 23:18:03 PST

In message <159323335F97074D9A594D676652B06754A028 at ADDRESS HIDDENabo>you wrote:
> Hi
>
> We are currently having a problem with the merging of count files using ngram
> -merge.
> It seems that there is a limitation in the size of the resulting file of 2GB.
> Can you give us some information if this is limitation is due to the program
> or if it is a limitation due to the configuration of our system. We are runni
> ng ngram-merge in a PIV 2,66GHz 1GB RAM in Suse 10.0.

It's probably an OS limitation.  SRILM uses level-2 I/O functions
(see fopen(3)).  

We have certainly handled files larger than 2 GB on our Linux machines.
But those files that are usually gzipped (ending in .gz).  SRILM
doesn't read or write those directly, since the I/O is to a pipe
that talks to the gzip program.  Maybe you can try using gzipped files
in your case too.

--Andreas

Click here to go to the SRILM home page.

 

About Us  Vertical divider  R&D Divisions  Divider  Careers  Divider  Newsroom  Divider  Contact Us
©2006 SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025-3493
SRI International is an independent, nonprofit corporation. Privacy policy

Last modified Nov 21, 2008