Elizabeth Shriberg

Publications

Elizabeth Shriberg, Andreas Stolcke, Suman Ravuri (2013). Addressee Detection for Dialog Systems Using Temporal and Spectral Dimensions of Speaking Style, Proc. Interspeech 2013, Lyon, pp. 2559-2563.

Malcolm Slaney, Elizabeth Shriberg, Jui-Ting Huang (2013). Pitch-Gesture Modeling Using Subband Autocorrelation Change Detection, Proc. Interspeech 2013, Lyon, pp. 1911-1915.

Nigel Ward, Steven Werner, David Novick, Tatsuya Kawahara, Elizabeth Shriberg, Louis-Philippe Morency, Catharine Oertel (2013). The Similar Segments in Social Speech Task, Proceedings of the MediaEval Workshop, 2013.

L. Heck, D. Hakkani-Tur, M. Chinthakunta, G. Tur, R. Iyer, P. Parthasarathy, L. Stifelman, E. Shriberg, A. Fidler (2013). Multimodal Conversational Search and Browse, First Workshop on Speech, Language, and Audio in Multimedia, Marseille.

H. Lee, A. Stolcke, and E. Shriberg (2013). Using Out-of-Domain Data for Lexical Addressee Detection in Human-Human-Computer Dialog, Proc. NAACL, Atlanta 2013.

E. Shriberg, A. Stolcke, D. Hakkani-Tur, L. Heck (2012), Learning When to Listen: Detecting System-Addressed Speech in Human-Human-Computer Dialog, Proc. Interspeech, 2012.

Kornel Laskowski & Elizabeth Shriberg (2012), Corpus-Independent History Compression for Stochastic Turn-Taking Models, Proc. IEEE ICASSP, pp. 4937-4940, Kyoto.

A. Stolcke, A. Mandal, & E. Shriberg (2012), Speaker Recognition With Region-Constrained MLLR Transforms, Proc. IEEE ICASSP, pp. 4397-4400, Kyoto.

E. Shriberg & A. Stolcke (2011), Language-independent constrained cepstral features for speaker recognition, Proc. IEEE ICASSP, pp. 5296-5299, Prague.

Marcel Kockmann, Luciana Ferrer, Lukas Burget, Elizabeth Shriberg & Jan Cernocky (2011), Recent Progress in Prosodic Speaker Verification, Proc. IEEE ICASSP, pp. 4556-4559, Prague.

D. Hakkani-Tur, G. Tur, L. Heck, & E. Shriberg (2011), Bootstrapping Domain Detection Using Query Click Logs for New Domains, Proc. Interspeech, Florence.

M. H. Sanchez, L. Ferrer, E. Shriberg, & A. Stolcke (2011), Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training, Proc. Interspeech, pp. 141-144, Florence.

N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg & A. Stolcke (2011), The SRI NIST 2010 Speaker Recognition Evaluation System, Proc. IEEE ICASSP, pp. 5292-5295, Prague.

M. Graciarena, M. Delplanche, E. Shriberg & A. Stolcke (2011), Bird Species Recognition Combining Acoustic and Sequence Modeling, Proc. IEEE ICASSP, pp. 341-344, Prague.

A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, & E. Shriberg (2010), Improving Language Recognition with Multilingual Phone Recognition and Speaker Adaptation Transforms, Proc. Odyssey Speaker and Language Recognition Workshop, pp. 256-262, Brno, Czech Republic.

M. Graciarena, M. Delplanche, E. Shriberg, A. Stolcke, & L. Ferrer (2010), Acoustic Front-end Optimization for Bird Species Recognition, Proc. IEEE ICASSP, Dallas, pp. 293-296.

William Horton, Daniel Spieler, and Elizabeth Shriberg (2010). A corpus analysis of patterns of age-related change in conversational speech. Psychology and Aging.

Dilek Hakkani-Tur, Gokhan Tur, Benoit Favre, and Elizabeth Shriberg (2010). Finding the Structure of Documents. In D. Bikel & I. Zitouni (Eds.) Multilingual Natural Language Applications: From Theory to Practice, Prentice Hall.

G. Tur et al. (2010). The CALO Meeting Assistant System. IEEE Transactions on Audio, Speech, and Language Processing.

H. Franco, H. Bratt, R. Rossier, R. Rao, E. Shriberg, V. Abrash, K. Precoda (2010). EduSpeak: A speech recognition and pronunciation scoring toolkit for computer-aided language learning applications. Language Testing, Volume 27, Number 3, pp. 401-418.

K. Laskowski and E. Shriberg (2010). Comparing the contributions of context and prosody in text-independent dialog act recognition. Proc. ICASSP, Dallas, Texas, March 2010, pp. 5374-5377.

L. Ferrer, N. Scheffer, E. Shriberg (2010). A comparison of approaches for modeling prosodic features in speaker recognition. Proc. ICASSP, Dallas, Texas, March 2010, pp. 4414-4417.

J. Kolar, Y. Liu, and E. Shriberg (2010). Speaker adaptation of language and prosodic models for automatic dialog act segmentation of speech. Speech Communication, Volume 52, Issue 3, pp. 236-245.

L. Ferrer, K. Sonmez, & E. Shriberg (2009). An anticorrelation kernel for subsystem training in multiple classifier systems. Journal of Machine Learning Research, Vol. 10, pp. 2079-2114.

E. Shriberg, S. Kajarekar, N. Scheffer (2009). Does Session Variability Compensation in Speaker Recognition Model Intrinsic Variation Under Mismatched Conditions? Proc. Interspeech, Brighton, UK, September 2009, pp. 1551-1554.

K. Laskowski and E. Shriberg (2009). Modeling Other Talkers for Improved Dialog Act Recognition in Meetings. Proc. Interspeech, Brighton, UK, September 2009, pp. 2783-2786.

M. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, S. Kajarekar (2009). Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification. Proc. Interspeech, Brighton, UK, September 2009, pp. 2015-2018.

E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet (2009). Prosodic similarities of dialog act boundaries across speaking styles. In Linguistic Patterns in Spontaneous Speech, Shu-Chuan Tseng (Ed.), Language and Linguistics Monograph Series A25. Taipei: Institute of Linguistics, Academia Sinica, pp. 213-239.

T. Bocklet and E. Shriberg (2009). Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection, Proc. ICASSP, Taipei, Taiwan.

S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, T. Bocklet (2009). The SRI NIST 2008 Speaker Recognition Evaluation System, Proc. ICASSP, Taipei, Taiwan.

B. Favre, D. Hakkani-Tur, and E. Shriberg (2009). Syntactically-Informed Models for Comma Prediction, Proc. ICASSP, Taipei, Taiwan.

J. Kolar, Y. Liu, and E. Shriberg (2009). Genre Effects on Automatic Sentence Segmentation of Speech: A Comparison of Broadcast News and Broadcast Conversations, Proc. ICASSP, Taipei, Taiwan.

E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, F. Goodman (2008). Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification, Proc. Interspeech, Brisbane, Australia, pp. 609-612.

E. Shriberg & A. Stolcke (2008), The Case for Automatic Higher-Level Features in Forensic Speaker Recognition, Proc. Interspeech, Brisbane, Australia, pp. 1509-1512. [Note: Invited overview paper for Special Session on Forensic Speaker Recognition, organized by Geoff Morrison].

L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg (2008), System combination using auxiliary information for speaker verification. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, Nevada.

M. Ostendorf et al. (2008), Speech segmentation and spoken document processing. IEEE Signal Processing Magazine, Vol. 25, Issue 3, pp. 59-69.

F. Yang, G. Tur, and E. Shriberg (2008), Exploiting dialog act tagging and prosodic information for action item identification. Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, Nevada.

E. Shriberg, L. Ferrer, S. Kajarekar, N. Scheffer, A. Stolcke, & M. Akbacak (2008), Detecting Nonnative Speech Using Speaker Recognition Approaches. Proc. Odyssey Speaker and Language Recognition Workshop, Stellenbosch, South Africa.

L. Ferrer, K. Sonmez, and E. Shriberg (2008), An anticorrelation kernel for improved system combination in speaker verification. Proc. Odyssey Speaker and Language Recognition Workshop, Stellenbosch, South Africa.

G. Myers, G. Tur, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, D. Hakkani-Tur (2008). Multimedia Information Extraction Roadmap. In Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, VA.

G. Tur, A. Stolcke, L. Voss, J. Dowding, B. Favre, R. Fernandez, M. Frampton, M. Frandsen, C. Frederickson, M. Graciarena, D. Hakkani-Tur, D. Kintzing, K. Leveque, S. Mason, J. Niekrasz, S. Peters, M. Purver, K. Riedhammer, E. Shriberg, J. Tien, D. Vergyri, & F. Yang (2008), The CALO Meeting Speech Recognition and Understanding System, Proc. IEEE Spoken Language Technology Workshop, pp. 69-72, Goa, India.

E. E. Shriberg (2007), Higher Level Features in Speaker Recognition. In C. Muller (Ed.) Speaker Classification I. Volume 4343 of Lecture Notes in Computer Science / Artificial Intelligence. Springer: Heidelberg / Berlin / New York, pp. 241-259.

A. Stolcke, S. Kajarekar, L. Ferrer, & E. Shriberg (2007), Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms, IEEE Transactions on Audio, Speech, and Language Processing, Special issue on speaker and language recognition. 15(7), 1987-1998.

S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, B. Favre (2007), Cross-Genre Feature Comparisons for Spoken Sentence Segmentation. International Journal of Semantic Computing, Volume 1, Issue 3, pp. 335-346.

S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, B. Favre (2007), Cross-Genre Feature Comparisons for Spoken Sentence Segmentation. (An earlier version of the preceding paper). Proceedings International Conference on Semantic Computing, September 2007, Irvine, CA, pp. 265-271.

E. Shriberg and L. Ferrer (2007), A Text-Constrained Prosodic System for Speaker Verification. In Proceedings Interspeech, pp. 1226-1229, Antwerp.

J. Kolar, Y. Liu, and E. Shriberg (2007), Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings. In Proceedings Interspeech, pp. 1621-1624, Antwerp.

L. Ferrer, K. Sonmez, and E. Shriberg (2007), A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification. In Proceedings Interspeech, pp. 738-741, Antwerp.

G. Tur, E. Shriberg, A. Stolcke, S. Kajarekar (2007), Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification. In Proceedings Interspeech, pp. 2049-2052, Antwerp.

F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, A. Stolcke (2007), Detecting Deception Using Critical Segments. In Proceedings Interspeech, pp. 2281-2284, Antwerp.

J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, N. Mirghafori (2007), Prosodic Features and Feature Selection for Multi-Lingual Sentence Segmentation. In Proceedings Interspeech, pp. 2585-2588, Antwerp.

S. Cuendet, E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur (2007), An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings. Proceedings SIGIR Workshop on Searching Conversational Spontaneous Speech, 23-27 July, Amsterdam, Netherlands, pp. 37-43.

S. Cuendet, D. Hakkani-Tur, E. Shriberg (2007), Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech. Proceedings MLMI 2007, June, Brno, Czech Republic.

Y. Liu and E. Shriberg (2007), Comparing Evaluation Metrics for Sentence Boundary Detection. Proc. IEEE ICASSP, Honolulu, Hawaii.

L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez (2007), Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition. Proc. IEEE ICASSP, Honolulu, Hawaii.

M. Graciarena, S. Kajarekar, A. Stolcke, and E. Shriberg (2007), Noise Robust Speaker Identification for Spontaneous Arabic Speech. Proc. IEEE ICASSP, Honolulu, Hawaii.

M. Magimai-Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori (2007), Entropy Based Classifier Combination for Sentence Segmentation. Proc. IEEE ICASSP, Honolulu, Hawaii.

Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, & M. Harper (2006), Enriching Speech Recognition with Automatic Detection of Sentence Boundaries and Disfluencies. IEEE Trans. Audio, Speech and Language Processing 14(5), 1526-1540.

Y. Liu and E. Shriberg (2006), More Than Words Can Say: Using Prosody to Find Sentence Boundaries in Speech. 4th ASA/ASJ Joint Meeting Lay Language Papers. Popular version of paper IaSC2, 4th ASA/ASJ Joint Meeting, Honolulu, HI.

O. Cetin and E. Shriberg (2006), Analysis of Overlaps in Meetings by Dialog Factors, Hot Spots, Speakers, and Collection Site: Insights for Automatic Speech Recognition. Proc. ICSLP, pp. 293-296, Pittsburgh.

J. Kolar, E. Shriberg, Y. Liu (2006), On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings. Proc. ICSLP, pp. 2014-2017, Pittsburgh.

F. Enos, S. Benus, R. Cautin, M. Graciarena, J. Hirschberg and E. Shriberg (2006), Personality Factors in Human Deception Detection: Comparing Human to Machine Performance. Proc. ICSLP, pp. 813-816, Pittsburgh.

M. Zimmermann, D. Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, Y. Liu (2006), The ICSI+ Multi-Lingual Sentence Segmentation System. Proc. ICSLP, pp. 117-120, Pittsburgh.

F. Weng, S. Varges, B. Raghunathan, F. Ratiu, H. Pon-Barry, B. Lathrop, Q. Zhang, H. Bratt, T. Scheideck, K. Xu, M. Purver, R. Mishra, A. Lien, M. Raya, S. Peters, Y. Meng, J. Russell, L. Cavedon, E. Shriberg, H. Schmidt, R. Prieto (2006), CHAT: A Conversational Helper for Automotive Tasks. Proc. ICSLP, pp. 1061-1064, Pittsburgh.

Y. Liu, N. V. Chawla, M. P. Harper, E. Shriberg, & A. Stolcke (2006), A study in machine learning from imbalanced data for sentence boundary detection in speech, Computer Speech and Language 20(4), 468-494.

J. Kolar, E. Shriberg, Y. Liu (2006), Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings. Proc. International Conference on Text, Speech, and Dialogue (TSD), Czech Republic.

S. S. Kajarekar, H. Bratt, E. Shriberg, & R. de Leon (2006), A Study of Intentional Voice Modifications for Evading Automatic Speaker Recognition. Proc. IEEE Odyssey 2006 Speaker and Language Recognition Workshop, San Juan, Puerto Rico.

O. Cetin and E.E. Shriberg (2006), Overlap in Meetings: ASR Effects and Analysis by Dialog Factors, Speakers, and Collection Site. MLMI06 (3rd Joint Workshop on Multimodal and Related Machine Learning Algorithms), Washington DC.

M. Zimmermann, D. Hakkani-Tur, E. Shriberg, A. Stolcke (2006). Text based Dialog Act Classification for Multiparty Meetings. MLMI06 (3rd Joint Workshop on Multimodal and Related Machine Learning Algorithms), Washington DC.

L. Ferrer, E. Shriberg, S. S. Kajarekar, A. Stolcke, K. Sonmez, A. Venkataraman, & H. Bratt (2006), The Contribution of Cepstral and Stylistic Features to SRI's 2005 NIST Speaker Recognition Evaluation System. Proc. IEEE ICASSP, Toulouse.

M. Graciarena, E. Shriberg, A. Stolcke, F. Enos, J. Hirschberg, S. Kajarekar (2006), Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection. Proc. IEEE ICASSP, Toulouse.

O. Cetin and E. Shriberg (2006). Speaker Overlaps and ASR Errors in Meetings: Effects Before, During, and After the Overlap. Proc. IEEE ICASSP, Toulouse.

M. Zimmermann, A. Stolcke, & E. Shriberg (2006), Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings. Proc. IEEE ICASSP, Toulouse.

S. Benus, F. Enos, J. Hirschberg, E. Shriberg (2006). Pauses in Deceptive Speech. Speech Prosody 2006, Dresden.

M. Zimmermann, Y. Liu, E. Shriberg, & A. Stolcke (2005), A* based Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings. Proc. IEEE Speech Recognition and Understanding Workshop, Cancun.

E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, & A. Stolcke (2005), Modeling Prosodic Feature Sequences for Speaker Recognition. Speech Communication 46(3-4), 455-472.

E. E. Shriberg (2005). Spontaneous Speech: How People Really Talk, and Why Engineers Should Care. Proc. Eurospeech, pp. 1781-1784, Lisbon. [Overview paper to accompany keynote address].

J. Hirschberg, S. Benus, J. M. Brenier, F. Enos, S. Friedman, S. Gilman, C. Girand, M. Graciarena, A. Kathol, L. Michaelis, B. Pellom, E. Shriberg, & A. Stolcke (2005), Distinguishing Deceptive from Non-Deceptive Speech. Proc. Eurospeech, Lisbon.

D. Jones, W. Shen, E. Shriberg, A. Stolcke, T. Kamm, & D. Reynolds (2005), Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech. Proc. Eurospeech, Lisbon.

A. Venkataraman, Y. Liu, E. Shriberg, & A. Stolcke (2005), Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data? Proc. Eurospeech, Lisbon.

Y. Liu, E. Shriberg, A. Stolcke, & M. Harper (2005), Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection. Proc. Eurospeech, Lisbon.

A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, & A. Venkataraman (2005), MLLR Transforms as Features in Speaker Recognition. Proc. Eurospeech, Lisbon.

M. Zimmermann, Y. Liu, E. Shriberg, & A. Stolcke (2005), Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings. Proc. MLMI, Edinburgh.

Y. Liu, A. Stolcke, E. Shriberg, and M. Harper (2005), Using Conditional Random Fields for Sentence Boundary Detection in Speech, Proc. ACL, Ann Arbor, MI, pp. 451-458.

B. Wrede, S. Bhagat, R. Dhillon, E. Shriberg (2005). Meeting Recorder Project: Hot Spot Labeling Guide. ICSI Technical Report TR-05-004.

C. Walker, S. Strassel, E. Shriberg, Y. Liu, J. Ang, H. Lee (2005). LDC2005T24: MDE RT-04 Training Data Text/Annotations (LDC Metadata Annotation Corpus).

M. Ostendorf, E. Shriberg, & A. Stolcke (2005). Human Language Technology: Opportunities and Challenges. Overview paper to accompany special double session on Human Language Technology (organized by the authors). Proc. ICASSP 2005, Philadelphia.

J. Ang, Y. Liu, & E. Shriberg (2005). Automatic Dialog Act Segmentation and Classification in Multiparty Meetings. Proc. ICASSP 2005, Philadelphia, pp. 1061-1064.

S. S. Kajarekar, L. Ferrer, E. Shriberg, K. Sonmez, A. Stolcke, A. Venkataraman, and J. Zheng (2005), SRI's 2004 NIST Speaker Recognition Evaluation System, Proc. IEEE ICASSP, Philadelphia, vol. 1, pp. 173-176.

Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, M. Harper (2005). Structural Metadata Research in the EARS Program. Proc. ICASSP 2005, Philadelphia.

L. Chen, Y. Liu, M. Harper, E. Shriberg (2004). Multimodal Model Integration For Sentence Unit Detection. Proc. Intl. Conf. Multimodal Interfaces (ICMI), State College, PA.

E. Shriberg, L. Ferrer, A. Venkataraman, S. Kajarekar (2004). SVM Modeling of "SNERF-Grams" for Speaker Recognition. Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.

Y. Liu, E. Shriberg, A. Stolcke, & M. Harper (2004), Using Machine Learning to Cope with Imbalanced Classes in Natural Speech: Evidence from Sentence Boundary and Disfluency Detection. Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.

H. Cheng, H. Bratt, R. Mishra, E. Shriberg, S. Upson, J. Chen, F. Weng, S. Peters, L. Cavedon, J. Niekrasz (2004). A Wizard of Oz Framework for Collecting Spoken Human-Computer Dialogs. Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.

F. Weng, et al. (2004). A Conversational Dialog System for Cognitively Overloaded Users. Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.

Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin, & M. Harper (2004), The ICSI-SRI-UW Metadata Extraction System. Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.

Michel Galley, Kathleen McKeown, Julia Hirschberg, Elizabeth Shriberg (2004). Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies. Proc. 42nd Meeting of the ACL, July 21-26, Barcelona.

Y. Liu, A. Stolcke, E. Shriberg, & M. Harper (2004), Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech. Proc. Conf. on Empirical Methods in Natural Language Processing, Barcelona.

S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg, and A. Stolcke (2004), Modeling NERFs for Speaker Recognition. Proc. Odyssey 04 Speaker and Language Recognition Workshop, pp. 51-56, Toledo, Spain.

D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, & E. Shriberg (2004). Improving Automatic Sentence Boundary Detection with Confusion Networks. Proc. HLT-NAACL, May 2004, Boston, pp. 69-72.

E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey (2004). The ICSI Meeting Recorder Dialog Act (MRDA) Corpus. Proc. 5th SIGdial Workshop on Discourse and Dialogue, M. Strube and C. Sidner (Eds.), April 30 - May 1, Cambridge, MA, pp. 97-100.

E. Shriberg and A. Stolcke (2004). Direct Modeling of Prosody: An Overview of Applications in Automatic Speech Processing. Proc. International Conference on Speech Prosody 2004, Nara, Japan.

R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg (2004). Meeting Recorder Project: Dialog Act Labeling Guide. ICSI Technical Report TR-04-002.

E. Shriberg and A. Stolcke (2004). Prosody Modeling for Automatic Speech Recognition and Understanding. Mathematical Foundations of Speech and Language Processing. M. Johnson, S. Khudanpur, M. Ostendorf and R. Rosenfeld (Editors), IMA Volumes in Mathematics and Its Applications, Vol 138, Springer-Verlag, New York, pp. 105-114.

B. Wrede and E. Shriberg (2003), The Relationship Between Dialogue Acts and Hot Spots in Meetings. Proc. IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands, pp. 180-185.

S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke, & R. R. Gadde (2003), Speaker Recognition Using Prosodic and Lexical Features. Proc. IEEE Speech Recognition and Understanding Workshop, St. Thomas, U.S. Virgin Islands.

Y. Liu, E. Shriberg, & A. Stolcke (2003), Automatic disfluency identification in conversational speech using multiple knowledge sources. Proc. Eurospeech, Geneva.

B. Wrede and E. Shriberg (2003), Spotting "Hotspots" in Meetings: Human Judgments and Prosodic Cues. Proc. Eurospeech, Geneva, pp. 2805-2808.

S. Bhagat, H. Carvey and E. Shriberg (2003), Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts in Multiparty Meetings. Proc. International Congress of Phonetic Sciences, Barcelona.

L. Ferrer, H. Bratt, V. R. R. Gadde, S. Kajarekar, E. Shriberg, K. Sonmez, A. Stolcke, & A. Venkataraman (2003), Modeling duration patterns for speaker recognition. Proc. Eurospeech, Geneva.

Dustin Hillard, Mari Ostendorf, and Elizabeth Shriberg (2003), Detection of Agreement vs. Disagreement in Meetings: Training with Unlabeled Data. Proc. HLT-NAACL Conference, Edmonton, Canada, May 2003.

S. Kajarekar et al. (2003), TalkPrinting: Improving Speaker Recognition by Modeling Stylistic Features. Proc. First NSF/NIJ Symposium on Intelligence and Security Informatics, H. Chen et al. (Eds.), Tucson, AZ. Published as Lecture Notes in Computer Science, Vol. 2665, pp. 350-354.

L. Ferrer, E. Shriberg, and A. Stolcke (2003), A prosody-based approach to end-of-utterance detection that does not require speech recognition. Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Hong Kong.

D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, & E. Shriberg (2003), Prosodic Knowledge Sources for Automatic Speech Recognition. Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Hong Kong.

A. Venkataraman, L. Ferrer, A. Stolcke, & E. Shriberg (2003), Training a Prosody-based Dialog Act Tagger from Unlabeled Data. Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Hong Kong.

A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, C. Wooters (2003), The ICSI Meeting Corpus. Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Hong Kong.

N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart, A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, & C. Wooters (2003), Meetings about meetings: research at ICSI on speech in multiparty conversations. Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing, Hong Kong.

A. Venkataraman, A. Stolcke, & E. Shriberg (2002), Automatic Dialog Act Tagging with Minimal Supervision. Proc. 9th Australian International Conference on Speech Science and Technology, Melbourne.

Ang, J., Dhillon, R., Krupski, A., Shriberg, E. and Stolcke, A. (2002), Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog. Proc. Intl. Conf. on Spoken Language Processing, vol. 3, pp. 2037-2040, Denver.

L. Ferrer, E. Shriberg, and A. Stolcke (2002), Is the Speaker Done Yet? Faster and More Accurate End-of-Utterance Detection Using Prosody in Human-Computer Dialog. Proc. Intl. Conf. on Spoken Language Processing, vol. 3, pp. 2061-2064, Denver.

Baron, D., Shriberg, E. and Stolcke, A. (2002), Automatic Punctuation and Disfluency Detection in Multi-Party Meetings Using Prosodic and Lexical Cues. Proc. Intl. Conf. on Spoken Language Processing, vol. 2, pp. 949-952, Denver.

Weber, F., Manganaro, L., Peskin, B. & Shriberg, E. (2002). Using Prosodic and Lexical Information for Speaker Identification. Proc. ICASSP, vol. 1, pp. 141-144, Orlando.

Shriberg, E. E. (2001). To "Errrr" is Human: Ecology and Acoustics of Speech Disfluencies. Journal of the International Phonetic Association 31(1), 153-169, Cambridge University Press.

Shriberg, E. & Stolcke, A. (2001). Prosody Modeling for Automatic Speech Understanding: An Overview of Recent Research at SRI. Proc. ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, pp. 13-16, Red Bank, NJ. (An updated version of this paper appears as Shriberg & Stolcke, 2002.)

Shriberg, E., Stolcke, A. & Baron, D. (2001). Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech. Proc. ISCA Tutorial and Research Workshop on Prosody in Speech Recognition and Understanding, pp. 139-146, Red Bank, NJ.

Shriberg, E., Stolcke, A. & Baron, D. (2001). Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation. Proc. EUROSPEECH, vol. 2, pp. 1359-1362, Aalborg, Denmark.

Tur, G., Hakkani-Tur, D., Stolcke, A. & Shriberg, E. (2001). Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation, Computational Linguistics 27(1), 31-57.

N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin, T. Pfau, E. Shriberg, & A. Stolcke (2001), The Meeting Project at ICSI, Proc. of HLT 2001, First International Conference on Human Language Technology Research, pp. 246-252, San Diego, CA.

C. Teixeira, H. Franco, E. Shriberg, K. Precoda, K. Sonmez (2001). Evaluation of Speaker's Degree of Nativeness Using Text-Independent Prosodic Features. Proceedings of the Workshop on Multilingual Speech and Language Processing, Aalborg, Denmark.

Sonmez, K., Plauche, M., Shriberg, E. and Franco, H. (2000). Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR. Proc. International Conference on Spoken Language Processing, vol. 1, pp. 325-328, Beijing.

C. Teixeira, H. Franco, E. Shriberg, K. Precoda, and K. Sonmez (2000). Prosodic features for automatic text-independent evaluation of degree of nativeness for language learners. Proc. International Conference on Spoken Language Processing, vol. 3, pp. 187-190, Beijing.

A. Stolcke, H. Bratt, J. Butzberger, H. Franco, V. R. Rao Gadde, M. Plauche, C. Richey, E. Shriberg, K. Sonmez, F. Weng, J. Zheng (2000), The SRI March 2000 Hub-5 Conversational Speech Transcription System. Proc. NIST Speech Transcription Workshop, College Park, MD.

Sonmez, K., Plauche, M., Shriberg, E. and Franco, H. (2000). Consonant discrimination in elicited and spontaneous speech. Proc. NIST Speech Transcription Workshop, College Park, MD.

E. Shriberg, A. Stolcke, D. Hakkani-Tur, & G. Tur (2000), Prosody-Based Automatic Segmentation of Speech into Sentences and Topics, Speech Communication 32(1-2), 127-154 (Special Issue on Accessing Information in Spoken Audio).

A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. Van Ess-Dykema, & M. Meteer (2000), Dialogue act modeling for automatic tagging and recognition of conversational speech, Computational Linguistics 26(3), 339-373.

Z. Rivlin, D. Appelt, R. Bolles, A. Cheyer, D. Hakkani-Tur, D. Israel, L. Julia, D. Martin, G. Myers, K. Nitz, B. Sabata, A. Sankar, E. Shriberg, K. Sonmez, A. Stolcke, & G. Tur (2000), MAESTRO: Conductor of Multimedia Analysis Technologies, Communications of the ACM 43(2) (Special Issue on News on Demand).

D. Hakkani-Tur, G. Tur, A. Stolcke, & E. Shriberg (1999), Combining Words and Prosody for Information Extraction from Speech. Proc. EUROSPEECH, vol. 5, pp. 1991-1994, Budapest.

Shriberg E. (1999). Phonetic Consequences of Speech Disfluency. Symposium on The Phonetics of Spontaneous Speech (S. Greenberg and P. Keating, organizers), Proc. International Congress of Phonetic Sciences, vol. 1, pp. 619-622, San Francisco.

Plauche, M. and Shriberg, E. (1999). Data-Driven Subclassification of Disfluent Repetitions Based on Prosodic Features. Proc. International Congress of Phonetic Sciences, vol. 2, pp. 1513-1516, San Francisco.

A. Stolcke, E. Shriberg, D. Hakkani-Tur, & G. Tur (1999), Modeling the Prosody of Hidden Events for Improved Word Recognition. Proc. EUROSPEECH, vol. 1, pp. 307-310, Budapest.

A. Stolcke, E. Shriberg, D. Hakkani-Tur, G. Tur, Z. Rivlin, K. Sonmez (1999), Combining Words and Speech Prosody for Automatic Topic Segmentation. Proc. DARPA Broadcast News Workshop, pp. 61-64, Herndon, VA.

Shriberg, E., Bates, R., Stolcke, A., Taylor, P., Jurafsky, D., Ries, K., Coccaro, N., Martin, R., Meteer, M., Van Ess-Dykema, C. (1998). Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech? In M. Swerts and J. Hirschberg (eds.) Special Double Issue on Prosody and Conversation. Language and Speech 41(3-4), 439-487.

Kemal Sonmez, Elizabeth Shriberg, Larry Heck & Mitchel Weintraub (1998). Modeling Dynamic Prosodic Variation for Speaker Verification. Proc. Intl. Conf. on Spoken Language Processing, vol. 7, pp. 3189-3192, Sydney, Australia.

Harry Bratt, Leo Neumeyer, Elizabeth Shriberg, & Horacio Franco (1998). Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies. Proc. Intl. Conf. on Spoken Language Processing, vol. 4, pp. 1539-1542, Sydney, Australia.

E. Shriberg & A. Stolcke (1998). How Far Do Speakers Back Up In Repairs? A Quantitative Model. Proc. Intl. Conf. on Spoken Language Processing, vol. 5, pp. 2183-2186, Sydney, Australia.

A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche, G. Tur, & Y. Lu (1998), Automatic Detection of Sentence Boundaries and Disfluencies Based on Recognized Words. Proc. Intl. Conf. on Spoken Language Processing, vol. 5, pp. 2247-2250, Sydney, Australia.

Eklund, R. & Shriberg, E. (1998). Crosslinguistic Disfluency Modeling: A Comparative Analysis of Swedish and American English Human-Human and Human-Machine Dialogues. Proc. Intl. Conf. on Spoken Language Processing, vol. 6, pp. 2631-2634, Sydney, Australia.

Jurafsky, D., Shriberg, E., Fox, B. & Curl, T. (1998). Lexical, Prosodic, and Syntactic Cues for Dialog Acts. Proceedings of ACL/COLING 98 Workshop on Discourse Relations and Discourse Markers, pp. 114-120, Montreal.

Stolcke, A., Shriberg, E., Bates, R., Coccaro, N., Jurafsky, D., Martin, R., Meteer, M., Ries, K., Taylor, P., Van Ess-Dykema, C. (1998). Dialog Act Modeling for Conversational Speech. Papers from the 1998 AAAI Spring Symposium, Technical Report SS-98-01, pp. 98-105, AAAI Press, Menlo Park, CA.

Jurafsky, D., Shriberg, E.E., & Biasca, D. (1997). Switchboard SWBD-DAMSL Shallow Discourse Function Annotation Coders Manual, Draft 13. Institute of Cognitive Science Technical Report 97-02, University of Colorado, Boulder.

Jurafsky, D., Bates, R., Coccaro, N., Martin, R., Meteer, M., Ries, K., Shriberg, E., Stolcke, A., Taylor, P., Van Ess-Dykema, C. (1997). Switchboard Discourse Language Modeling Project Report. In LVCSR Summer Research Workshop Technical Reports - Research Note 30, Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD.

Jurafsky, D., Bates, R., Coccaro, N., Martin, R., Meteer, M., Ries, K., Shriberg, E., Stolcke, A., Taylor, P., & Van Ess-Dykema, C. (1997). Automatic Detection of Discourse Structure for Speech Recognition and Understanding. Proc. 1997 IEEE Workshop on Speech Recognition and Understanding, pp. 88-95, Santa Barbara.

Shriberg, E.E., Bates, R.A., & Stolcke, A. (1997). A prosody-only decision-tree model for disfluency detection. Proc. Eurospeech 97, vol. 5, pp. 2383-2386, Rhodes, Greece.

Sonmez, K., Heck, L., Weintraub, M. & Shriberg, E.E. (1997). A lognormal tied mixture model of pitch for prosody-based speaker recognition. Proc. Eurospeech 97, vol. 3, pp. 1391-1394, Rhodes, Greece.

Shriberg, E.E., Bates, R.A., & Stolcke, A. (1996). Integrated acoustic and language modeling of speech disfluencies. Journal of the Acoustical Society of America 100(4), 2848 [abstract].

Shriberg, E.E. (1996). Disfluencies in SWITCHBOARD. Proc. International Conference on Spoken Language Processing, Addendum, pp. 11-14, Philadelphia, PA.

Shriberg, E. E. & Stolcke, A. (1996). Word predictability after hesitations: A corpus-based study. Proc. International Conference on Spoken Language Processing, vol. 3, pp. 1868-1871, Philadelphia, PA.

Shriberg, E.E., Ladd, D.R., Terken, J.M.B., and Stolcke, A. (1996). Modeling pitch range variation within and across speakers: Predicting F0 targets when 'speaking up'. Proc. International Conference on Spoken Language Processing, Addendum, pp. 1-4, Philadelphia, PA.

Stolcke, A. & Shriberg, E.E. (1996). Automatic linguistic segmentation of conversational speech. Proc. International Conference on Spoken Language Processing, vol. 2, pp. 1005-1008, Philadelphia, PA.

Rosenfeld, R., Agarwal, R., Byrne, B., Iyer, R., Liberman, M., Shriberg, E., Unverferth, J., Vergyri, D., Vidal, E. (1996). Error analysis and disfluency modeling in the Switchboard domain. Proc. International Conference on Spoken Language Processing, Philadelphia, PA.

Ostendorf, M., Byrne, B., Bacchiani, M., Finke, M., Gunawardana, A., Ross, K., Roweis, S., Shriberg, E., Talkin, D., Waibel, A., Wheatley, B., Zeppenfeld, T. (1996). Modeling systematic variations in pronunciation via a language-dependent hidden speaking mode. In Research Notes No. 24, 1996 LVCSR Summer Research Workshop Technical Reports, Center for Language and Speech Processing, Johns Hopkins University.

Stolcke, A. & Shriberg, E.E. (1996). Statistical language modeling for speech disfluencies. Proc. International Conference on Acoustics, Speech and Signal Processing, vol. 1, pp. 405-408, Atlanta, GA.

Weintraub, M., Aksu, Y., Dharanipragada, S., Khudanpur, S., Ney, H., Prange, J., Stolcke, A., Jelinek, F. and Shriberg, E. (1996). LM95 project report: Fast training and portability. In Research Note No. 1. Center for Language and Speech Processing, Johns Hopkins University, Baltimore, MD.

Shriberg, E.E. (1995). Acoustic properties of disfluent repetitions. Proc. International Congress of Phonetic Sciences, vol. 4, pp. 384-387, Stockholm, Sweden.

Dahl, D., Bates, M., Brown, M., Fisher, W., Hunicke-Smith, K., Pallett, D., Pao, C., Rudnicky, A., and Shriberg, E. (1994). Expanding the scope of the ATIS task: the ATIS-3 corpus. Proc. ARPA Workshop on Human Language Technology, pp. 43-48, Plainsboro, NJ.

Shriberg, E.E. (1994). Preliminaries to a Theory of Speech Disfluencies. PhD thesis, University of California at Berkeley.

Shriberg, E.E. & Lickley, R.J. (1993). Intonation of clause-internal filled pauses. Phonetica 50, 172-179.

Shriberg, E.E. (1992). Perceptual restoration of filtered vowels with added noise. Language and Speech 35(1-2), 127-136.

Bear, J., Dowding, J., Shriberg, E.E. & Price, P.J. (1993). A system for labeling self-repairs in speech. SRI Technical Note 522.

Nakatani, C.H. & Shriberg, E.E. (1993). Proposal for labeling disfluencies in ToBI. Paper presented at the Third ToBI Labeling Workshop, Ohio State University.

Shriberg, E.E., Bear, J. & Dowding, J. (1992). Automatic detection and correction of repairs in human-computer dialog. Proc. DARPA Speech and Natural Language Workshop, M. Marcus, (ed.), pp. 419-424, Harriman, NY.

Shriberg, E.E., Wade, E. & Price, P.J. (1992). Human-machine problem solving using spoken language systems (SLS): Factors affecting performance and user satisfaction. Proc. DARPA Speech and Natural Language Workshop, M. Marcus, (ed.), pp. 49-54, Harriman, NY.

Shriberg, E.E. & Lickley, R.J. (1992). Intonation of clause-internal filled pauses. Proc. 2nd International Conference on Spoken Language Processing, pp. 991-994, Banff, Alberta, Canada.

Shriberg, E.E. & Lickley, R.J. (1992). The relationship of filled-pause F0 to prosodic context. In Proceedings of the IRCS Workshop on Prosody in Natural Speech, Technical Report IRCS-92-37, 201-209, University of Pennsylvania, Institute for Research in Cognitive Science, Philadelphia, PA.

Bear, J., Dowding, J. & Shriberg, E.E. (1992). Integrating multiple knowledge sources for detection and correction of repairs in human-computer dialog. Proc. Annual Meeting of the Association for Computational Linguistics, pp. 56-63, Newark, Delaware.

Butzberger, J.W., Murveit, H., Shriberg, E.E. & Price, P.J. (1992). Spontaneous speech effects in large vocabulary speech recognition applications. Proc. DARPA Speech and Natural Language Workshop, M. Marcus (ed.), Morgan Kaufmann, pp. 339-343.

Price, P.J., Hirschman, L., Shriberg, E.E. & Wade, E. (1992). Subject-based evaluation measures for interactive spoken language systems. Proc. DARPA Speech and Natural Language Workshop, M. Marcus (ed.), pp. 34-39, Harriman, NY.

Wade, E., Shriberg, E.E. & Price, P.J. (1992). User behaviors affecting speech recognition. Proc. 2nd International Conference on Spoken Language Processing, pp. 995-998, Banff, Alberta, Canada.

Shriberg, E.E. & Ohala, J.J. (1991). "Correction" in the perception of filtered vowels. Journal of the Acoustical Society of America 89, 8SP6.

Ohala, J.J. & Shriberg, E.E. (1990). Hyper-correction in speech perception. Proc. International Conference on Spoken Language Processing, pp. 405-408, Kobe, Japan.

Shriberg, E.E. (1990). Hypercorrection in vowel identification as evidence against the direct realist view of speech perception. Unpublished M.A. thesis, University of California, Berkeley.

Snow, C.E., Cancino, H., Gonzales, P. & Shriberg, E.E. (1989a). Giving formal definitions: An oral language correlate of school literacy. In D. Bloome (ed.), Classrooms and Literacy. Norwood, NJ: Ablex.

Snow, C.E., Cancino, H., Gonzales, P. & Shriberg, E.E. (1989b). Second language learners' formal definitions: An oral language correlate of school literacy. In D. Bloome (ed.), Literacy in Functional Settings. Norwood, NJ: Ablex.

Hafter, E.R., Buell, T.N., Basiji, D. & Shriberg, E.E. (1988a). Discrimination of direction for complex sounds presented in the free-field. In Basic Issues in Hearing, H. Duifhuis & J.W. Horst (eds.), San Diego: Academic Press.

Hafter, E.R., Buell, T.N., Basiji, D. & Shriberg, E.E. (1988b). Discrimination of direction in the free-field. Journal of the Acoustical Society of America 83, S122.

©2011 SRI International, 333 Ravenswood Avenue, Menlo Park, CA 94025-3493
SRI International is an independent, nonprofit corporation.

Last modified Feb 03, 2014