Divider
  Speech Technology and Research Laboratory
  People
  Current Research Activities
  Past Research Activities
  Publications
  Career Opportunities
  Seminars
  Technologies for License
  In the News
  Contact Us
  STAR Search
  Information and Computing Sciences Division
SpacerAbout UsDividerR and D DivisionsDividerCareersDividerNewsroomDividerContact UsDividerSRI HomeSpacer

Spacer
         
  SRI Logo

Search SRILM-USER Archives

Match: Format: Sort by:
Search:

Re: pfsg-format

From: Andreas Stolcke <stolcke at ADDRESS HIDDEN>
Date: Thu, 25 Mar 2004 09:33:16 PST

In message <4063152D.3060201 at ADDRESS HIDDEN>you wrote:
> Hi !
> I've got one question about the pfsg format : is the transition cost,
> between 2 states, considered to be 10000.5 times the log-probability of
> the bigram corresponding to the 2 states ?

correct.

> Because, when I use a language model made from an ARPA file (by using
> the NgramLM class) to compute the probability of a word (my language
> model is based on letters) and when I use a language model made from a
> PFSG file (I convert the ARPA thanks to the make-ngram-pfsg script and
> then by using the LatticeLM class), I don't have the same
> log-probability from both representations. Why is there a difference ?
> Since I convert the ARPA file into a PFSG file, it should be the same.

How big are the differences?  there will be some discrepancy due to
rounding the scaled log probabilities to an integer, but it should
be a small error.

--Andreas

Click here to go to the SRILM home page.

 

About Us  Vertical divider  R&D Divisions  Divider  Careers  Divider  Newsroom  Divider  Contact Us
©2006 SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025-3493
SRI International is an independent, nonprofit corporation. Privacy policy

Last modified Nov 21, 2008