<html>
<head>
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
</head>
<body text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">On 5/1/2013 5:04 PM, Maria Eskevich
wrote:<br>
</div>
<blockquote
cite="mid:2E4F3A95-77B9-4CA9-8DA7-E617E0DCC8A5@computing.dcu.ie"
type="cite">
<meta http-equiv="Content-Type" content="text/html;
charset=ISO-8859-1">
Dear Andreas,
<div><br>
</div>
<div>You were right, I haven't had the gawk installed. Thanks for
the help.</div>
<div><br>
</div>
<div>Could I please ask another question. If I have a file with
lattice in HTK format, is it possible to get the 1-best list
with corresponding timing and probability information?</div>
<div>As I understood the option -acoustic-mesh should keep this
information, but I don't see any writing option that would
combine the 2 things together. Basically I need the 1 best list
with time/confidence scores information for each word.</div>
<div><br>
</div>
<div>Maybe some additional changes with the following command line
can help?</div>
<div>./lattice-tool -in-lattice file.lat -read-htk
-viterbi-decode -acoustic-mesh<br>
</div>
</blockquote>
<br>
There is currently no option to dump all the acoustic information
(scores, alignments) out in nbest format, although the information
is available internally.<br>
<br>
But for getting the 1-best version of that information there is a
workaround. You can generate confusion networks with a low
posterior scaling factor. That will force the 1-best in the CN to
be the same as the 1-best in a Viterbi decoding. But using the
the -acoustic-mesh option you can then read off the time alignments
and score information. Try<br>
<br>
lattice-tool -read-htk -in-lattice LATTICEFILE -acoustic-mesh
-write-mesh CNFILE -posterior-scale 0.01<br>
<br>
and postprocess CNFILE. It will contain stuff like <br>
<br>
align 3 he 1 we 0 ate 0 me 0 h 0 u 0 t 0 you're 0 if_you_have 0 say
0 pete 0 you 0 deep 0 aid 0 you'd 0 is 0 i 0 they 0 c 0 a 0 keep 0 q
0 t. 0 a. 0 these 0 p. 0 oh 0 uh 0 lee 0 hee 0 she 0 really 0 indeed
0 hehe 0 he'd 0 are_you 0 heat 0 eight 0 or 0 but_you 0 to_be 0
uhhuh 0 a._i. 0 [laugh] 0 p 0 it's 0 see 0 but 0 e 0 hate 0 but_he 0
re 0 i_mean 0 neat 0 i_see 0 and_he 0 ee 0 uh_you 0 need 0 yeah_you
0 maybe 0 and_it 0 v 0 okay 0 v. 0 eee 0 do_you 0 e. 0 hes 0 g 0 mm
0 he's 0 easy 0 may 0 any 0 pay 0 if 0 b. 0 they'd 0 you_you 0 hey 0
beep 0 it 0 c. 0 gee 0 if_you 0 be 0 three 0 if_he 0 b 0 is_it 0 eat
0 d 0 d. 0 eighty 0<br>
info 3 he 0.04 0.26 -165.916 -2.62371 :#[hh]iy,0.08:hh[iy]#,0.18: :<br>
info 3 we 0.07 0.23 -160.802 -2.66818 :#[w]iy,0.03:w[iy]#,0.20: :<br>
info 3 ate 0.03 0.27 -173.698 -2.75711 :#[ey]t,0.15:ey[t]#,0.12: :<br>
....<br>
<br>
Because of the low posterior scaling, all the posterior probability
is on the 1-best word ("he" in this case). Then you find the "info"
record associated with that word an it will give you the start time,
duration, acoustic and LM scores, and pronunciation and phone
durations (the format is defined in the wlat-format(5) man page).<br>
<br>
You can safely add a pruning option if the CN construction takes too
long, since you are only interested in the 1-best output.<br>
<br>
Andreas<br>
<br>
<br>
<br>
<br>
<br>
<blockquote
cite="mid:2E4F3A95-77B9-4CA9-8DA7-E617E0DCC8A5@computing.dcu.ie"
type="cite">
<div>
<br>
Best,</div>
<div>Maria</div>
<div><br>
<div>
<div>On 30 Apr 2013, at 23:25, Andreas Stolcke <<a
moz-do-not-send="true"
href="mailto:stolcke@icsi.berkeley.edu">stolcke@icsi.berkeley.edu</a>>
wrote:</div>
<br class="Apple-interchange-newline">
<blockquote type="cite">
<meta content="text/html; charset=ISO-8859-1"
http-equiv="Content-Type">
<div text="#000000" bgcolor="#FFFFFF">
<div class="moz-cite-prefix">On 4/30/2013 5:02 AM, Maria
Eskevich wrote:<br>
</div>
<blockquote
cite="mid:797ADC62-F98A-4813-A2B2-54E4B2889DD4@computing.dcu.ie"
type="cite">
<meta http-equiv="Content-Type" content="text/html;
charset=ISO-8859-1">
<br>
Dear Andreas,
<div><br>
</div>
<div>I downloaded the 1.7 version of SRILM and followed
the instruction for installation (checked with INSTALL
file details and <a moz-do-not-send="true"
href="http://www1.icsi.berkeley.edu/%7Ewooters/SRILM/3%20Install%2807F18266%29.html">http://www1.icsi.berkeley.edu/~wooters/SRILM/3%20Install(07F18266).html</a>).</div>
<div><br>
</div>
<div>My system is macosx 10.8.3, processor 2.9 Ghz Intel
Core i7.</div>
</blockquote>
The compiler warnings are not a problem. Verify that the
binaries in $SRILM/bin/macosx are runnable, e.g., ngram
-version.<br>
If that's not the case then there is some problem with
your compiler or linker and you should shared your
complete log output -- hopefully some macosx expert can
help.<br>
<br>
The tests could be failing because you don't have gawk
installed.<br>
<br>
Andreas<br>
<br>
<br>
</div>
</blockquote>
</div>
<br>
</div>
</blockquote>
<br>
</body>
</html>