<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body text="#000000" bgcolor="#FFFFFF">

    <div class="moz-cite-prefix">On 5/1/2013 5:04 PM, Maria Eskevich

      wrote:<br>

    </div>

    <blockquote

      cite="mid:2E4F3A95-77B9-4CA9-8DA7-E617E0DCC8A5@computing.dcu.ie"

      type="cite">

      <meta http-equiv="Content-Type" content="text/html;

        charset=ISO-8859-1">

      Dear Andreas,

      <div><br>

      </div>

      <div>You were right, I haven't had the gawk installed. Thanks for

        the help.</div>

      <div><br>

      </div>

      <div>Could I please ask another question. If I have a file with

        lattice in HTK format, is it possible to get the 1-best list

        with corresponding timing and probability information?</div>

      <div>As I understood the option -acoustic-mesh should keep this

        information, but I don't see any writing option that would

        combine the 2 things together. Basically I need the 1 best list

        with time/confidence scores information for each word.</div>

      <div><br>

      </div>

      <div>Maybe some additional changes with the following command line

        can help?</div>

      <div>./lattice-tool -in-lattice file.lat -read-htk

         -viterbi-decode -acoustic-mesh<br>

      </div>

    </blockquote>

    <br>

    There is currently no option to dump all the acoustic information

    (scores, alignments) out in nbest format, although the information

    is available internally.<br>

    <br>

    But for getting the 1-best version of that information there is a

    workaround.   You can generate confusion networks with a low

    posterior scaling factor.  That will force the 1-best in the CN to

    be the same as the 1-best in a  Viterbi decoding.   But using the

    the -acoustic-mesh option you can then read off the time alignments

    and score information.  Try<br>

    <br>

        lattice-tool -read-htk -in-lattice LATTICEFILE  -acoustic-mesh

    -write-mesh CNFILE -posterior-scale 0.01<br>

    <br>

    and postprocess CNFILE.  It will contain stuff like <br>

    <br>

    align 3 he 1 we 0 ate 0 me 0 h 0 u 0 t 0 you're 0 if_you_have 0 say

    0 pete 0 you 0 deep 0 aid 0 you'd 0 is 0 i 0 they 0 c 0 a 0 keep 0 q

    0 t. 0 a. 0 these 0 p. 0 oh 0 uh 0 lee 0 hee 0 she 0 really 0 indeed

    0 hehe 0 he'd 0 are_you 0 heat 0 eight 0 or 0 but_you 0 to_be 0

    uhhuh 0 a._i. 0 [laugh] 0 p 0 it's 0 see 0 but 0 e 0 hate 0 but_he 0

    re 0 i_mean 0 neat 0 i_see 0 and_he 0 ee 0 uh_you 0 need 0 yeah_you

    0 maybe 0 and_it 0 v 0 okay 0 v. 0 eee 0 do_you 0 e. 0 hes 0 g 0 mm

    0 he's 0 easy 0 may 0 any 0 pay 0 if 0 b. 0 they'd 0 you_you 0 hey 0

    beep 0 it 0 c. 0 gee 0 if_you 0 be 0 three 0 if_he 0 b 0 is_it 0 eat

    0 d 0 d. 0 eighty 0<br>

    info 3 he 0.04 0.26 -165.916 -2.62371 :#[hh]iy,0.08:hh[iy]#,0.18: :<br>

    info 3 we 0.07 0.23 -160.802 -2.66818 :#[w]iy,0.03:w[iy]#,0.20: :<br>

    info 3 ate 0.03 0.27 -173.698 -2.75711 :#[ey]t,0.15:ey[t]#,0.12: :<br>

    ....<br>

    <br>

    Because of the low posterior scaling, all the posterior probability

    is on the 1-best word ("he" in this case).  Then you find the "info"

    record associated with that word an it will give you the start time,

    duration, acoustic and LM scores, and pronunciation and phone

    durations (the format is defined in the wlat-format(5) man page).<br>

    <br>

    You can safely add a pruning option if the CN construction takes too

    long, since you are only interested in the 1-best output.<br>

    <br>

    Andreas<br>

    <br>

    <br>

    <br>

    <br>

    <br>

    <blockquote

      cite="mid:2E4F3A95-77B9-4CA9-8DA7-E617E0DCC8A5@computing.dcu.ie"

      type="cite">

      <div>

        <br>

        Best,</div>

      <div>Maria</div>

      <div><br>

        <div>

          <div>On 30 Apr 2013, at 23:25, Andreas Stolcke <<a

              moz-do-not-send="true"

              href="mailto:stolcke@icsi.berkeley.edu">stolcke@icsi.berkeley.edu</a>>

            wrote:</div>

          <br class="Apple-interchange-newline">

          <blockquote type="cite">

            <meta content="text/html; charset=ISO-8859-1"

              http-equiv="Content-Type">

            <div text="#000000" bgcolor="#FFFFFF">

              <div class="moz-cite-prefix">On 4/30/2013 5:02 AM, Maria

                Eskevich wrote:<br>

              </div>

              <blockquote

                cite="mid:797ADC62-F98A-4813-A2B2-54E4B2889DD4@computing.dcu.ie"

                type="cite">

                <meta http-equiv="Content-Type" content="text/html;

                  charset=ISO-8859-1">

                <br>

                Dear Andreas,

                <div><br>

                </div>

                <div>I downloaded the 1.7 version of SRILM and followed

                  the instruction for installation (checked with INSTALL

                  file details and  <a moz-do-not-send="true"

href="http://www1.icsi.berkeley.edu/%7Ewooters/SRILM/3%20Install%2807F18266%29.html">http://www1.icsi.berkeley.edu/~wooters/SRILM/3%20Install(07F18266).html</a>).</div>

                <div><br>

                </div>

                <div>My system is macosx 10.8.3, processor 2.9 Ghz Intel

                  Core i7.</div>

              </blockquote>

              The compiler warnings are not a problem.   Verify that the

              binaries in $SRILM/bin/macosx are runnable, e.g., ngram

              -version.<br>

              If that's not the case then there is some problem with

              your compiler or linker and you should shared your

              complete log output -- hopefully some macosx expert can

              help.<br>

              <br>

              The tests could be failing because you don't have gawk

              installed.<br>

              <br>

              Andreas<br>

              <br>

              <br>

            </div>

          </blockquote>

        </div>

        <br>

      </div>

    </blockquote>

    <br>

  </body>

</html>