Divider
  Speech Technology and Research Laboratory
  People
  Current Research Activities
  Past Research Activities
  Publications
  Career Opportunities
  Seminars
  Technologies for License
  In the News
  Contact Us
  STAR Search
  Information and Computing Sciences Division
SpacerAbout UsDividerR and D DivisionsDividerCareersDividerNewsroomDividerContact UsDividerSRI HomeSpacer

Spacer
         
  SRI Logo

Search SRILM-USER Archives

Match: Format: Sort by:
Search:

bug in lattice-tool?

From: ilya oparin <ioparin at ADDRESS HIDDEN>
Date: Wed, 8 Nov 2006 08:28:43 +0000 (GMT)

Andreas,

We've possibly found a bug in lattice-tool. Here, in
Brno, we work with th Czech language that has
diacritized letters. So, lattice-tool does everything
well with all the calculations until it comes to
matching of the best path with the reference file to
get number of del, subs and ins - and finally WER. It
appears that if both files are in ISO encoding and
there is a diacritized letter in the reference, it can
be matched to a non-diacritized word in the output,
that is actually a different word. So, the WER goes
down significantly from what really is (and what is
correctly output by HResults in HTK).

best regards,
Ilya

Send instant messages to your online friends http://uk.messenger.yahoo.com

Click here to go to the SRILM home page.

 

About Us  Vertical divider  R&D Divisions  Divider  Careers  Divider  Newsroom  Divider  Contact Us
©2006 SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025-3493
SRI International is an independent, nonprofit corporation. Privacy policy

Last modified Nov 21, 2008