Corpus-Independent History Compression for Stochastic Turn-Taking Models,
Proc. IEEE ICASSP, pp. 4937-4940, Kyoto.
A. Stolcke, A. Mandal, & E. Shriberg (2012),
Speaker Recognition With Region-Constrained MLLR Transforms,
Proc. IEEE ICASSP, pp. 4397-4400, Kyoto.
E. Shriberg & A. Stolcke (2011),
Language-independent constrained cepstral features for speaker recognition,
Proc. IEEE ICASSP, pp. 5296-5299, Prague.
Marcel Kockmann, Luciana Ferrer, Lukas Burget, Elizabeth Shriberg & Jan Cernocky (2011),
Recent Progress in Prosodic Speaker Verification,
Proc. IEEE ICASSP, pp. 4556-4559, Prague.
D. Hakkani-Tur, G. Tur, L. Heck, & E. Shriberg (2011),
Bootstrapping Domain Detection Using Query Click Logs for New Domains,
Proc. Interspeech, Florence.
M. H. Sanchez, L. Ferrer, E. Shriberg, & A. Stolcke (2011),
Constrained Cepstral Speaker Recognition Using Matched UBM and JFA Training,
Proc. Interspeech, pp. 141-144, Florence.
N. Scheffer, L. Ferrer, M. Graciarena, S. Kajarekar, E. Shriberg & A. Stolcke (2011),
The SRI NIST 2010 Speaker Recognition Evaluation System,
Proc. IEEE ICASSP, pp. 5292-5295, Prague.
M. Graciarena, M. Delplanche, E. Shriberg & A. Stolcke (2011),
Bird Species Recognition Combining Acoustic and Sequence Modeling,
Proc. IEEE ICASSP, pp. 341-344, Prague.
A. Stolcke, M. Akbacak, L. Ferrer, S. Kajarekar, C. Richey, N. Scheffer, & E. Shriberg (2010),
Improving Language Recognition with Multilingual Phone Recognition and
Speaker Adaptation Transforms,
Proc. Odyssey Speaker and Language Recognition Workshop,
pp. 256-262, Brno, Czech Republic.
M. Graciarena, M. Delplanche, E. Shriberg, A. Stolcke, & L. Ferrer (2010),
Acoustic Front-end Optimization for Bird Species Recognition,
Proc. IEEE ICASSP, Dallas, pp. 293-296.
William Horton, Daniel Spieler, and Elizabeth Shriberg (2010). A corpus analysis of patterns of age-related change in conversational speech. Psychology and Aging.
Dilek Hakkani-Tur, Gokhan Tur, Benoit Favre, and Elizabeth Shriberg (2010). Finding the Structure of Documents. In D. Bikel & I. Zitouni (Eds.) Multilingual Natural Language Applications: From Theory to Practice,
Prentice Hall.
G. Tur et al. (2010). The CALO Meeting Assistant System. IEEE Transactions on Audio, Speech, and Language Processing.
H. Franco, H. Bratt, R. Rossier, R. Rao, E. Shriberg, V. Abrash, K. Precoda (2010).
EduSpeak: A speech recognition and pronunciation scoring toolkit for computer-aided language learning applications.
Language Testing, Volume 27, Number 3, pp. 401-418.
K. Laskowski and E. Shriberg (2010).
Comparing the contributions of context and prosody in text-independent dialog act recognition.
Proc. ICASSP, Dallas, Texas, March 2010, pp. 5374-5377.
L. Ferrer, N. Scheffer, E. Shriberg (2010).
A comparison of approaches for modeling prosodic features in speaker recognition.
Proc. ICASSP, Dallas, Texas, March 2010, pp. 4414-4417.
J. Kolar, Y. Liu, and E. Shriberg (2010).
Speaker adaptation of language and prosodic models
for automatic dialog act segmentation of speech.
Speech Communication, Volume 52, Issue 3,
pp. 236-245.
L. Ferrer, K. Sonmez, & E. Shriberg (2009).
An anticorrelation kernel for subsystem training in multiple classifier systems.
Journal of Machine Learning Research, Vol. 10, pp. 2079-2114.
E. Shriberg, S. Kajarekar, N. Scheffer (2009).
Does Session Variability Compensation in Speaker Recognition
Model Intrinsic Variation Under Mismatched Conditions?
Proc. Interspeech, Brighton, UK, September 2009, pp. 1551-1554.
K. Laskowski and E. Shriberg (2009).
Modeling Other Talkers for Improved Dialog Act Recognition in Meetings.
Proc. Interspeech, Brighton, UK, September 2009, pp. 2783-2786.
M. Graciarena, T. Bocklet, E. Shriberg, A. Stolcke, S. Kajarekar (2009).
Feature-Based and Channel-Based Analyses of Intrinsic Variability in Speaker Verification.
Proc. Interspeech, Brighton, UK, September 2009, pp. 2015-2018.
E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur, and S. Cuendet (2009).
Prosodic similarities of dialog act boundaries across speaking styles.
In Linguistic Patterns in Spontaneous Speech, Shu-Chuan Tseng (Ed.),
Language and Linguistics Monograph Series A25. Taipei: Institute of Linguistics, Academia Sinica, pp. 213-239.
T. Bocklet and E. Shriberg (2009).
Speaker Recognition Using Syllable-Based Constraints for Cepstral Frame Selection,
Proc. ICASSP, Taipei, Taiwan.
S. Kajarekar, N. Scheffer, M. Graciarena, E. Shriberg, A. Stolcke, L. Ferrer, T. Bocklet (2009).
The SRI NIST 2008 Speaker Recognition Evaluation System,
Proc. ICASSP, Taipei, Taiwan.
B. Favre, D. Hakkani-Tur, and E. Shriberg (2009).
Syntactically-Informed Models for Comma Prediction,
Proc. ICASSP, Taipei, Taiwan.
J. Kolar, Y. Liu, and E. Shriberg (2009).
Genre Effects on Automatic Sentence Segmentation of Speech:
A Comparison of Broadcast News and Broadcast Conversations,
Proc. ICASSP, Taipei, Taiwan.
E. Shriberg, M. Graciarena, H. Bratt, A. Kathol, S. Kajarekar, H. Jameel, C. Richey, F. Goodman (2008).
Effects of Vocal Effort and Speaking Style on Text-Independent Speaker Verification,
Proc. Interspeech, Brisbane, Australia, pp. 609-612.
E. Shriberg & A. Stolcke (2008),
The Case for Automatic Higher-Level Features in Forensic Speaker
Recognition, Proc. Interspeech, Brisbane, Australia,
pp. 1509-1512. [Note: Invited overview paper for Special Session on
Forensic Speaker Recognition, organized by Geoff Morrison].
L. Ferrer, M. Graciarena, A. Zymnis, and E. Shriberg (2008),
System combination using auxiliary information for speaker verification.
Proc. IEEE International Conference on Acoustics,
Speech and Signal Processing, Las Vegas, Nevada.
M. Ostendorf et al. (2008),
Speech segmentation and spoken document processing.
IEEE Signal Processing Magazine,
Vol. 25, Issue 3, pp. 59-69.
F. Yang, G. Tur, and E. Shriberg (2008),
Exploiting dialog act tagging and prosodic information for action item identification.
Proc. IEEE International Conference on Acoustics,
Speech and Signal Processing, Las Vegas, Nevada.
E. Shriberg, L. Ferrer, S. Kajarekar, N. Scheffer, A. Stolcke,
& M. Akbacak (2008),
Detecting Nonnative Speech Using Speaker Recognition Approaches.
Proc. Odyssey Speaker and Language Recognition Workshop,
Stellenbosch, South Africa.
L. Ferrer, K. Sonmez, and E. Shriberg (2008),
An anticorrelation kernel for improved system combination in speaker verification.
Proc. Odyssey Speaker and Language Recognition Workshop,
Stellenbosch, South Africa.
G. Myers, G. Tur, L. Voss, B. Bolles, S. Kajarekar, E. Shriberg, D. Hakkani-Tur (2008). Multimedia Information Extraction Roadmap.
In Proceedings of the AAAI Fall Symposium on Multimedia Information Extraction, Arlington, VA.
G. Tur, A. Stolcke, L. Voss, J. Dowding, B. Favre, R. Fernandez, M. Frampton,
M. Frandsen, C. Frederickson, M. Graciarena, D. Hakkani-Tur, D. Kintzing,
K. Leveque, S. Mason, J. Niekrasz, S. Peters, M. Purver, K. Riedhammer,
E. Shriberg, J. Tien, D. Vergyri, & F. Yang (2008),
The CALO Meeting Speech Recognition and Understanding System,
Proc. IEEE Spoken Language Technology Workshop, pp. 69-72, Goa, India.
E. E. Shriberg (2007),
Higher Level Features in Speaker Recognition.
In C. Muller (Ed.)
Speaker Classification I.
Volume 4343 of Lecture Notes in Computer Science / Artificial Intelligence. Springer: Heidelberg / Berlin / New York, pp. 241-259.
A. Stolcke, S. Kajarekar, L. Ferrer, & E. Shriberg (2007),
Speaker Recognition with Session Variability Normalization Based on
MLLR Adaptation Transforms,
IEEE Transactions on Audio, Speech, and Language Processing,
Special issue on speaker and language recognition.
15(7), 1987-1998.
S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, B. Favre (2007),
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation.
International Journal of Semantic Computing, Volume 1, Issue 3,
pp. 335-346.
S. Cuendet, D. Hakkani-Tur, E. Shriberg, J. Fung, B. Favre (2007),
Cross-Genre Feature Comparisons for Spoken Sentence Segmentation.
(An earlier version of the preceding paper).
Proceedings International Conference on Semantic Computing, September 2007, Irvine, CA, pp. 265-271.
E. Shriberg and L. Ferrer (2007),
A Text-Constrained Prosodic System for Speaker Verification.
In Proceedings Interspeech, pp. 1226-1229, Antwerp.
J. Kolar, Y. Liu, and E. Shriberg (2007),
Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings.
In Proceedings Interspeech, pp. 1621-1624, Antwerp.
L. Ferrer, K. Sonmez, and E. Shriberg (2007),
A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification.
In Proceedings Interspeech, pp. 738-741, Antwerp.
G. Tur, E. Shriberg, A. Stolcke, S. Kajarekar (2007),
Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification.
In Proceedings Interspeech, pp. 2049-2052, Antwerp.
F. Enos, E. Shriberg, M. Graciarena, J. Hirschberg, A. Stolcke (2007),
Detecting Deception Using Critical Segments.
In Proceedings Interspeech, pp. 2281-2284, Antwerp.
J. Fung, D. Hakkani-Tur, M. Magimai-Doss, E. Shriberg, S. Cuendet, N. Mirghafori (2007),
Prosodic Features and Feature Selection for Multi-Lingual Sentence Segmentation.
In Proceedings Interspeech, pp. 2585-2588, Antwerp.
S. Cuendet, E. Shriberg, B. Favre, J. Fung, D. Hakkani-Tur (2007),
An Analysis of Sentence Segmentation Features for Broadcast News, Broadcast Conversations, and Meetings.
Proceedings SIGIR Workshop on Searching Conversational Spontaneous Speech, 23-27 July, Amsterdam, Netherlands, pp. 37-43.
S. Cuendet, D. Hakkani-Tur, E. Shriberg (2007),
Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech.
Proceedings MLMI 2007, June, Brno, Czech Republic.
Y. Liu and E. Shriberg (2007),
Comparing Evaluation Metrics for Sentence Boundary Detection.
Proc. IEEE ICASSP,
Honolulu, Hawaii.
L. Ferrer, E. Shriberg, S. Kajarekar, and K. Sonmez (2007),
Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition.
Proc. IEEE ICASSP,
Honolulu, Hawaii.
M. Graciarena, S. Kajarekar, A. Stolcke, and E. Shriberg (2007),
Noise Robust Speaker Identification for Spontaneous Arabic Speech.
Proc. IEEE ICASSP,
Honolulu, Hawaii.
M. Magimai-Doss, D. Hakkani-Tur, O. Cetin, E. Shriberg, J. Fung, and N. Mirghafori (2007),
Entropy Based Classifier Combination for Sentence Segmentation.
Proc. IEEE ICASSP,
Honolulu, Hawaii.
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, & M. Harper (2006),
Enriching Speech Recognition with Automatic Detection of Sentence Boundaries
and Disfluencies.
IEEE Trans. Audio, Speech and Language Processing
14(5), 1526-1540.
Y. Liu and E. Shriberg (2006),
More Than Words Can Say: Using Prosody to Find Sentence Boundaries in Speech.
4th ASA/ASJ Joint Meeting Lay Language Papers.
Popular version of paper IaSC2, 4th ASA/ASJ Joint Meeting, Honolulu, HI.
O. Cetin and E. Shriberg (2006),
Analysis of Overlaps in Meetings by Dialog Factors, Hot Spots, Speakers, and Collection Site: Insights for Automatic Speech Recognition.
Proc. ICSLP, pp. 293-296, Pittsburgh.
J. Kolar, E. Shriberg, Y. Liu (2006),
On Speaker-Specific Prosodic Models for Automatic Dialog Act Segmentation of Multi-Party Meetings.
Proc. ICSLP, pp. 2014-2017, Pittsburgh.
F. Enos, S. Benus, R. Cautin, M. Graciarena, J. Hirschberg and E. Shriberg (2006),
Personality Factors in Human Deception Detection: Comparing Human to Machine Performance.
Proc. ICSLP, pp. 813-816, Pittsburgh.
M. Zimmermann, D. Tur, J. Fung, N. Mirghafori, L. Gottlieb, E. Shriberg, Y. Liu (2006),
The ICSI+ Multi-Lingual Sentence Segmentation System.
Proc. ICSLP, pp. 117-120, Pittsburgh.
F. Weng, S. Varges, B. Raghunathan, F. Ratiu, H. Pon-Barry, B. Lathrop, Q. Zhang, H. Bratt, T. Scheideck, K. Xu, M. Purver, R. Mishra, A. Lien, M. Raya, S. Peters, Y. Meng, J. Russell, L. Cavedon, E. Shriberg, H. Schmidt, R. Prieto (2006),
CHAT: A Conversational Helper for Automotive Tasks.
Proc. ICSLP, pp. 1061-1064, Pittsburgh.
Y. Liu, N. V. Chawla, M. P. Harper, E. Shriberg, & A. Stolcke (2006),
A study in machine learning from imbalanced data for sentence boundary
detection in speech,
Computer Speech and Language 20(4), 468-494.
J. Kolar, E. Shriberg, Y. Liu (2006),
Using Prosody for Automatic Sentence Segmentation of Multi-Party Meetings.
Proc. International Conference on Text, Speech, and Dialogue (TSD), Czech Republic.
S. S. Kajarekar, H. Bratt, E. Shriberg, & R. de Leon (2006),
A Study of Intentional Voice Modifications for
Evading Automatic Speaker Recognition.
Proc. IEEE Odyssey 2006 Speaker and Language Recognition Workshop,
San Juan, Puerto Rico.
O. Cetin and E.E. Shriberg (2006),
Overlap in Meetings: ASR Effects and Analysis by
Dialog Factors, Speakers, and Collection Site.
MLMI06 (3rd Joint Workshop on Multimodal
and Related Machine Learning Algorithms), Washington
DC.
M. Zimmermann, D. Hakkani-Tur, E. Shriberg, A. Stolcke (2006).
Text based Dialog Act Classification for Multiparty Meetings.
MLMI06 (3rd Joint Workshop on Multimodal
and Related Machine Learning Algorithms), Washington
DC.
L. Ferrer, E. Shriberg, S. S. Kajarekar, A. Stolcke, K. Sonmez,
A. Venkataraman, & H. Bratt (2006),
The Contribution of Cepstral and Stylistic Features to SRI's 2005 NIST
Speaker Recognition Evaluation System.
Proc. IEEE ICASSP, Toulouse.
M. Graciarena, E. Shriberg, A. Stolcke, F. Enos, J. Hirschberg, S. Kajarekar
(2006),
Combining Prosodic, Lexical and Cepstral Systems for Deceptive Speech Detection.
Proc. IEEE ICASSP,
Toulouse.
O. Cetin and E. Shriberg (2006).
Speaker Overlaps and ASR Errors in Meetings: Effects Before, During,
and After the Overlap.
Proc. IEEE ICASSP, Toulouse.
M. Zimmermann, A. Stolcke, & E. Shriberg (2006),
Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings.
Proc. IEEE ICASSP,
Toulouse.
S. Benus, F. Enos, J. Hirschberg, E. Shriberg (2006).
Pauses in Deceptive Speech.
Speech Prosody 2006, Dresden.
M. Zimmermann, Y. Liu, E. Shriberg, & A. Stolcke (2005),
A* based Joint Segmentation and Classification of Dialog Acts in Multiparty
Meetings.
Proc. IEEE Speech Recognition and Understanding Workshop, Cancun.
E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, & A. Stolcke (2005),
Modeling Prosodic Feature Sequences for Speaker Recognition.
Speech Communication 46(3-4), 455-472.
E. E. Shriberg (2005).
Spontaneous Speech: How People Really Talk, and Why Engineers Should Care.
Proc. Eurospeech, pp. 1781-1784, Lisbon. [Overview paper to accompany keynote address].
J. Hirschberg, S. Benus, J. M. Brenier, F. Enos, S. Friedman,
S. Gilman, C. Girand, M. Graciarena, A. Kathol, L. Michaelis,
B. Pellom, E. Shriberg, & A. Stolcke (2005),
Distinguishing Deceptive from Non-Deceptive Speech.
Proc. Eurospeech, Lisbon.
D. Jones, W. Shen, E. Shriberg, A. Stolcke, T. Kamm, & D. Reynolds (2005),
Two Experiments Comparing Reading with Listening for Human Processing of
Conversational Telephone Speech.
Proc. Eurospeech, Lisbon.
A. Venkataraman, Y. Liu, E. Shriberg, & A. Stolcke (2005),
Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?
Proc. Eurospeech, Lisbon.
Y. Liu, E. Shriberg, A. Stolcke, & M. Harper (2005),
Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency
Detection.
Proc. Eurospeech, Lisbon.
A. Stolcke, L. Ferrer, S. Kajarekar, E. Shriberg, & A. Venkataraman (2005),
MLLR Transforms as Features in Speaker Recognition.
Proc. Eurospeech, Lisbon.
M. Zimmermann, Y. Liu, E. Shriberg, & A. Stolcke (2005),
Toward Joint Segmentation and Classification of Dialog Acts in Multiparty
Meetings.
Proc. MLMI,
Edinburgh.
Y. Liu, A. Stolcke, E. Shriberg, and M. Harper (2005),
Using Conditional Random Fields for Sentence Boundary Detection in Speech,
Proc. ACL, Ann Arbor, MI, pp. 451-458.
B. Wrede, S. Bhagat, R. Dhillon, E. Shriberg (2005).
Meeting Recorder Project: Hot Spot Labeling Guide.
ICSI Technical Report TR-05-004.
C. Walker, S. Strassel, E. Shriberg, Y. Liu, J. Ang, H. Lee (2005).
LDC2005T24: MDE RT-04 Training Data Text/Annotations (LDC Metadata Annotation Corpus).
M. Ostendorf, E. Shriberg, & A. Stolcke (2005).
Human Language Technology: Opportunities and Challenges.
Overview paper to accompany special double session on Human Language Technology (organized by the authors). Proc. ICASSP 2005, Philadelphia.
J. Ang, Y. Liu, & E. Shriberg (2005).
Automatic Dialog Act Segmentation and Classification in Multiparty Meetings.
Proc. ICASSP 2005, Philadelphia, pp. 1061-1064.
S. S. Kajarekar, L. Ferrer, E. Shriberg, K. Sonmez, A. Stolcke,
A. Venkataraman, and J. Zheng (2005),
SRI's 2004 NIST Speaker Recognition Evaluation System,
Proc. IEEE ICASSP, Philadelphia, vol. 1, pp. 173-176.
Y. Liu, E. Shriberg, A. Stolcke, B. Peskin, J. Ang, D. Hillard, M. Ostendorf, M. Tomalin, P. Woodland, M. Harper (2005).
Structural Metadata Research in the EARS Program.
Proc. ICASSP 2005, Philadelphia.
L. Chen, Y. Liu, M. Harper, E. Shriberg (2004).
Multimodal Model Integration For Sentence Unit Detection.
Proc. Intl. Conf. Multimodal Interfaces (ICMI), State College, PA.
E. Shriberg, L. Ferrer, A. Venkataraman, S. Kajarekar (2004).
SVM Modeling of "SNERF-Grams" for Speaker Recognition.
Proc. Intl. Conf. on Spoken Language Processing,
Jeju, Korea.
Y. Liu, E. Shriberg, A. Stolcke, & M. Harper (2004),
Using Machine Learning to Cope with Imbalanced Classes in Natural Speech:
Evidence from Sentence Boundary and Disfluency Detection.
Proc. Intl. Conf. on Spoken Language Processing,
Jeju, Korea.
H. Cheng, H. Bratt, R. Mishra, E. Shriberg, S. Upson, J. Chen,
F. Weng, S. Peters, L. Cavedon, J. Niekrasz (2004).
A Wizard of Oz Framework for Collecting Spoken Human-Computer Dialogs.
Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.
F. Weng, et al. (2004). A Conversational Dialog System for Cognitively Overloaded Users.
Proc. Intl. Conf. on Spoken Language Processing, Jeju, Korea.
Y. Liu, E. Shriberg, A. Stolcke, D. Hillard, M. Ostendorf, B. Peskin,
& M. Harper (2004),
The ICSI-SRI-UW Metadata Extraction System.
Proc. Intl. Conf. on Spoken Language Processing,
Jeju, Korea.
Michel Galley, Kathleen McKeown, Julia Hirschberg, Elizabeth Shriberg (2004).
Identifying Agreement and Disagreement in Conversational Speech: Use of Bayesian Networks to Model Pragmatic Dependencies.
Proc. 42nd Meeting of the ACL, July 21-26, Barcelona.
Y. Liu, A. Stolcke, E. Shriberg, & M. Harper (2004),
Comparing and Combining Generative and Posterior Probability Models:
Some Advances in Sentence Boundary Detection in Speech.
Proc. Conf. on Empirical Methods in Natural Language
Processing,
Barcelona.
S. Kajarekar, L. Ferrer, K. Sonmez, J. Zheng, E. Shriberg,
and A. Stolcke (2004),
Modeling NERFs for Speaker Recognition.
Proc. Odyssey 04 Speaker and Language Recognition Workshop,
pp. 51-56, Toledo, Spain.
D. Hillard, M. Ostendorf, A. Stolcke, Y. Liu, & E. Shriberg (2004).
Improving Automatic Sentence Boundary Detection with Confusion Networks.
Proc. HLT-NAACL, May 2004, Boston, pp. 69-72.
E. Shriberg, R. Dhillon, S. Bhagat, J. Ang, and H. Carvey (2004).
The ICSI Meeting Recorder Dialog Act (MRDA) Corpus.
Proc. 5th SIGdial Workshop on Discourse and Dialogue, M. Strube and C. Sidner (Eds.), April 30 - May 1, Cambridge, MA, pp. 97-100.
E. Shriberg and A. Stolcke (2004).
Direct Modeling of Prosody: An Overview of Applications in Automatic
Speech Processing. Proc. International
Conference on Speech Prosody 2004, Nara, Japan.
R. Dhillon, S. Bhagat, H. Carvey, and E. Shriberg (2004).
Meeting Recorder Project: Dialog Act Labeling Guide.
ICSI Technical Report TR-04-002.
E. Shriberg and A. Stolcke (2004).
Prosody Modeling for Automatic Speech Recognition and Understanding.
Mathematical Foundations of Speech and Language Processing.
M. Johnson, S. Khudanpur, M. Ostendorf and R. Rosenfeld (Editors),
IMA Volumes in Mathematics and Its Applications, Vol 138, Springer-Verlag, New York, pp. 105-114.
B. Wrede and E. Shriberg (2003),
The Relationship Between Dialogue Acts and Hot Spots in Meetings.
Proc. IEEE Speech Recognition and Understanding Workshop,
St. Thomas, U.S. Virgin Islands, pp. 180-185.
S. Kajarekar, L. Ferrer, A. Venkataraman, K. Sonmez, E. Shriberg, A. Stolcke,
& R. R. Gadde (2003),
Speaker Recognition Using Prosodic and Lexical Features.
Proc. IEEE Speech Recognition and Understanding Workshop,
St. Thomas, U.S. Virgin Islands.
Y. Liu, E. Shriberg, & A. Stolcke (2003),
Automatic disfluency identification in conversational speech using multiple
knowledge sources.
Proc. Eurospeech,
Geneva.
B. Wrede and E. Shriberg (2003),
Spotting "Hotspots" in Meetings: Human Judgments and Prosodic Cues.
Proc. Eurospeech,
Geneva, pp. 2805-2808.
S. Bhagat, H. Carvey and E. Shriberg (2003),
Automatically Generated Prosodic Cues to Lexically Ambiguous Dialog Acts
in Multiparty Meetings.
Proc. International Congress of Phonetic Sciences,
Barcelona.
L. Ferrer, H. Bratt, V. R. R. Gadde, S. Kajarekar, E. Shriberg, K. Sonmez,
A. Stolcke, & A. Venkataraman (2003),
Modeling duration patterns for speaker recognition.
Proc. Eurospeech,
Geneva.
Dustin Hillard, Mari Ostendorf, and Elizabeth Shriberg (2003),
Detection of Agreement vs. Disagreement in Meetings:
Training with Unlabeled Data.
Proc. HLT-NAACL Conference, Edmonton, Canada, May 2003.
S. Kajarekar et al. (2003), TalkPrinting: Improving Speaker
Recognition by Modeling Stylistic Features. Proc. First NSF/NIJ
Symposium on Intelligence and Security Informatics, H. Chen et al.
(Eds.), Tucson, AZ. Published as Lecture Notes in Computer
Science, Vol. 2665, pp. 350-354.
L. Ferrer, E. Shriberg, and A. Stolcke (2003),
A prosody-based approach to end-of-utterance detection that does not
require speech recognition.
Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing,
Hong Kong.
D. Vergyri, A. Stolcke, V. R. R. Gadde, L. Ferrer, & E. Shriberg (2003),
Prosodic Knowledge Sources for Automatic Speech Recognition.
Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing,
Hong Kong.
A. Venkataraman, L. Ferrer, A. Stolcke, & E. Shriberg (2003),
Training a Prosody-based Dialog Act Tagger from Unlabeled Data.
Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing,
Hong Kong.
A. Janin, D. Baron, J. Edwards, D. Ellis, D. Gelbart, N. Morgan,
B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, C. Wooters (2003),
The ICSI Meeting Corpus.
Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing,
Hong Kong.
N. Morgan, D. Baron, S. Bhagat, H. Carvey, R. Dhillon, J. Edwards, D. Gelbart,
A. Janin, A. Krupski, B. Peskin, T. Pfau, E. Shriberg, A. Stolcke, &
C. Wooters (2003),
Meetings about meetings: research at ICSI on speech in multiparty conversations.
Proc. IEEE Intl. Conf. on Acoustics, Speech and Signal Processing,
Hong Kong.
A. Venkataraman, A. Stolcke, & E. Shriberg (2002),
Automatic Dialog Act Tagging with Minimal Supervision.
Proc. 9th Australian International Conference on Speech Science
and Technology, Melbourne.
Ang, J., Dhillon, R., Krupski, A., Shriberg, E. and Stolcke, A. (2002),
Prosody-Based Automatic Detection of Annoyance and Frustration
in Human-Computer Dialog.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 3, pp. 2037-2040, Denver.
L. Ferrer, E. Shriberg, and A. Stolcke (2002),
Is the Speaker Done Yet? Faster and More Accurate
End-of-Utterance Detection Using Prosody in Human-Computer Dialog.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 3, pp. 2061-2064, Denver.
Baron, D., Shriberg, E. and Stolcke, A. (2002),
Automatic Punctuation and Disfluency Detection in Multi-Party
Meetings Using Prosodic and Lexical Cues.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 2, pp. 949-952, Denver.
Weber, F., Manganaro, L., Peskin, B. & Shriberg, E. (2002).
Using Prosodic and Lexical Information for Speaker Identification.
Proc. ICASSP, vol. 1, pp. 141-144, Orlando.
Shriberg, E. E. (2001).
To "Errrr" is Human: Ecology and Acoustics of Speech Disfluencies.
Journal of the International Phonetic Association 31(1), 153-169,
Cambridge University Press.
Shriberg, E. & Stolcke, A. (2001).
Prosody Modeling for Automatic Speech Understanding: An Overview of
Recent Research at SRI.
Proc. ISCA Tutorial and Research
Workshop on Prosody in Speech Recognition and Understanding,
pp. 13-16, Red Bank, NJ.
(An updated version of this paper appears as Shriberg & Stolcke, 2002.)
Shriberg, E., Stolcke, A. & Baron, D. (2001).
Can Prosody Aid the Automatic Processing of Multi-Party Meetings?
Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech.
Proc. ISCA Tutorial and Research Workshop on Prosody in
Speech Recognition and Understanding, pp. 139-146, Red Bank, NJ.
Shriberg, E., Stolcke, A. & Baron, D. (2001).
Observations on Overlap: Findings and Implications for
Automatic Processing of Multi-Party Conversation.
Proc. EUROSPEECH, vol. 2, pp. 1359-1362, Aalborg, Denmark.
Tur, G., Hakkani-Tur, D., Stolcke, A. & Shriberg, E. (2001).
Integrating Prosodic and Lexical Cues for Automatic Topic Segmentation,
Computational Linguistics 27(1), 31-57.
N. Morgan, D. Baron, J. Edwards, D. Ellis, D. Gelbart, A. Janin,
T. Pfau, E. Shriberg, & A. Stolcke (2001),
The Meeting Project at ICSI,
Proc. of HLT 2001, First International Conference on Human
Language Technology Research, pp. 246-252, San Diego, CA.
C. Teixeira, H. Franco, E. Shriberg, K. Precoda, K. Sonmez (2001).
Evaluation of Speaker's Degree of Nativeness Using Text-Independent
Prosodic Features.
Proceedings of the Workshop on Multilingual
Speech and Language Processing, Aalborg, Denmark.
Sonmez, K., Plauche, M., Shriberg, E. and Franco, H. (2000).
Consonant discrimination in elicited and spontaneous
speech: A case for signal-adaptive front ends in ASR.
Proc. International Conference on Spoken Language Processing,
vol. 1, pp. 325-328, Beijing.
C. Teixeira, H. Franco, E. Shriberg, K. Precoda, and K. Sonmez (2000).
Prosodic features for automatic text-independent evaluation of degree of
nativeness for language learners.
Proc. International Conference on Spoken Language Processing,
vol. 3, pp. 187-190, Beijing.
A. Stolcke, H. Bratt, J. Butzberger, H. Franco, V. R. Rao Gadde, M. Plauche,
C. Richey, E. Shriberg, K. Sonmez, F. Weng, J. Zheng (2000),
The SRI March 2000 Hub-5 Conversational Speech Transcription System.
Proc. NIST Speech Transcription Workshop,
College Park, MD.
Sonmez, K., Plauche, M., Shriberg, E. and Franco, H. (2000).
Consonant discrimination in elicited and spontaneous
speech.
Proc. NIST Speech Transcription Workshop,
College Park, MD, 2000.
E. Shriberg, A. Stolcke, D. Hakkani-Tur, & G. Tur (2000),
Prosody-Based Automatic Segmentation of Speech into Sentences and Topics,
Speech Communication 32(1-2), 127-154
(Special Issue on Accessing Information in Spoken Audio).
A. Stolcke, K. Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky,
P. Taylor, R. Martin, C. Van Ess-Dykema, & M. Meteer (2000),
Dialogue act modeling for automatic tagging and recognition of
conversational speech,
Computational Linguistics 26(3), 339-373.
Z. Rivlin, D. Appelt, R. Bolles, A. Cheyer, D. Hakkani-Tur, D. Israel,
L. Julia, D. Martin, G. Myers, K. Nitz, B. Sabata, A. Sankar, E. Shriberg,
K. Sonmez, A. Stolcke, & G. Tur (2000),
MAESTRO:
Conductor of Multimedia Analysis Technologies,
Communications of the ACM
43(2) (Special Issue on News on Demand).
D. Hakkani-Tur, G. Tur, A. Stolcke, & E. Shriberg (1999),
Combining Words and Prosody for Information Extraction from Speech.
Proc. EUROSPEECH, vol. 5, pp. 1991-1994, Budapest.
Shriberg E. (1999).
Phonetic Consequences of Speech Disfluency. Symposium on The
Phonetics of Spontaneous Speech (S. Greenberg and P. Keating,
organizers), Proc. International Congress of Phonetic
Sciences, vol. 1, pp. 619-622, San Francisco.
Plauche, M. and Shriberg, E. (1999).
Data-Driven Subclassification of Disfluent Repetitions Based on Prosodic Features.
Proc. International Congress of Phonetic Sciences,
vol. 2, pp. 1513-1516, San Francisco.
A. Stolcke, E. Shriberg, D. Hakkani-Tur, & G. Tur (1999),
Modeling the Prosody of Hidden Events for Improved Word Recognition.
Proc. EUROSPEECH, vol. 1, pp. 307-310, Budapest.
A. Stolcke, E. Shriberg, D. Hakkani-Tur, G. Tur, Z. Rivlin, K. Sonmez (1999),
Combining Words and Speech Prosody for Automatic Topic Segmentation.
Proc. DARPA Broadcast News Workshop, pp. 61-64, Herndon, VA.
Shriberg, E., Bates, R., Stolcke, A., Taylor, P., Jurafsky, D., Ries,
K., Coccaro, N., Martin, R., Meteer, M., Van Ess-Dykema, C. (1998).
Can Prosody Aid the Automatic Classification of Dialog Acts in
Conversational Speech?
In M. Swerts and J. Hirschberg (eds.) Special Double Issue on Prosody
and Conversation.
Language and Speech 41(3-4), 439-487.
Kemal Sonmez, Elizabeth Shriberg, Larry Heck & Mitchel Weintraub
(1998).
Modeling Dynamic Prosodic Variation for Speaker Verification.
Proc. Intl. Conf. on Spoken Language Processing, vol. 7,
pp. 3189-3192, Sydney, Australia.
Harry Bratt, Leo Neumeyer, Elizabeth Shriberg, & Horacio Franco (1998).
Collection and Detailed Transcription of a Speech Database for
Development of Language Learning Technologies.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 4, pp. 1539-1542, Sydney, Australia.
E. Shriberg & A. Stolcke (1998).
How Far Do Speakers Back Up In Repairs? A Quantitative Model.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 5, pp. 2183-2186, Sydney, Australia.
A. Stolcke, E. Shriberg, R. Bates, M. Ostendorf, D. Hakkani, M. Plauche,
G. Tur, & Y. Lu (1998),
Automatic Detection of Sentence Boundaries and Disfluencies Based on
Recognized Words.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 5, pp. 2247-2250, Sydney, Australia.
Eklund, R. & Shriberg, E. (1998).
Crosslinguistic Disfluency Modeling: A Comparative Analysis of
Swedish and American English Human-Human and Human-Machine Dialogues.
Proc. Intl. Conf. on Spoken Language Processing,
vol. 6, pp. 2631-2634, Sydney, Australia.
Jurafsky, D., Shriberg, E., Fox, B. & Curl, T. (1998).
Lexical, Prosodic, and Syntactic Cues for Dialog Acts.
Proceedings of ACL/COLING 98 Workshop on Discourse Relations and
Discourse Markers, pp. 114-120, Montreal.
Stolcke, A., Shriberg, E., Bates, R., Coccaro, N., Jurafsky, D.,
Martin, R., Meteer, M., Ries, K., Taylor, P., Van Ess-Dykema,
C. (1998).
Dialog Act Modeling for Conversational Speech.
Papers from the 1998 AAAI Spring Symposium, Technical Report SS-98-01,
pp. 98-105, AAAI Press, Menlo Park, CA.
Jurafsky, D., Shriberg, E.E., & Biasca, D. (1997).
Switchboard SWBD-DAMSL Shallow Discourse Function Annotation Coders Manual,
Draft 13.
Institute of Cognitive Science Technical Report 97-02,
University of Colorado, Boulder.
Jurafsky, D., Bates, R., Coccaro, N., Martin, R., Meteer, M., Ries, K., Shriberg, E., Stolcke, A.,
Taylor, P., Van Ess-Dykema, C. (1997).
Switchboard Discourse Language Modeling Project Report.
In LVCSR Summer Research Workshop Technical Reports - Research Note 30,
Center for Speech and Language Processing,
Johns Hopkins University, Baltimore, MD.
Jurafsky, D., Bates, R., Coccaro, N., Martin, R., Meteer, M.,
Ries, K., Shriberg, E., Stolcke, A., Taylor, P., & Van Ess-Dykema,
C. (1997).
Automatic Detection of Discourse Structure for Speech Recognition and
Understanding.
Proc. 1997 IEEE Workshop on Speech Recognition
and Understanding, pp. 88-95, Santa Barbara.
Shriberg, E.E., Bates, R.A., & Stolcke, A. (1997).
A prosody-only decision-tree model for disfluency detection.
Proc. Eurospeech 97,
vol. 5, pp. 2383-2386, Rhodes, Greece.
Sonmez, K., Heck, L., Weintraub, M. & Shriberg, E.E. (1997).
A lognormal tied mixture model of pitch for prosody-based speaker recognition.
Proc. Eurospeech 97, vol. 3, pp. 1391-1394, Rhodes, Greece.
Shriberg, E.E., Bates, R.A., & Stolcke, A. (1996).
Integrated acoustic and language modeling of speech disfluencies.
Journal of the Acoustical Society of America 100(4), 2848 [abstract].
Shriberg, E.E. (1996).
Disfluencies in SWITCHBOARD.
Proc. International Conference on Spoken Language Processing,
Addendum, pp. 11-14, Philadelphia, PA.
Shriberg, E. E. & Stolcke, A. (1996).
Word predictability after hesitations: A corpus-based study.
Proc. International Conference on Spoken Language
Processing, vol. 3, pp. 1868-1871, Philadelphia, PA.
Shriberg, E.E., Ladd, D.R., Terken, J.M.B., and Stolcke, A. (1996).
Modeling pitch range variation within and across speakers: Predicting F0
targets when 'speaking up'. Proc. International Conference on
Spoken Language Processing, Addendum, pp. 1-4, Philadelphia, PA.
Stolcke, A. & Shriberg, E.E. (1996).
Automatic linguistic segmentation of conversational speech.
Proc. International Conference on Spoken Language Processing,
vol. 2, pp. 1005-1008, Philadelphia, PA.
Rosenfeld, R., Agarwal, R., Byrne, B., Iyer, R., Liberman, M.,
Shriberg, E., Unverferth, J., Vergyri, D., Vidal, E. (1996). Error
analysis and disfluency modeling in the Switchboard domain.
Proc. International Conference on Spoken Language Processing,
Philadelphia, PA.
Ostendorf, M., Byrne, B., Bacchiani, M., Finke, M., Gunawardana, A.,
Ross, K., Roweis, S., Shriberg, E., Talkin, D., Waibel, A., Wheatley,
B., Zeppenfeld, T. (1996).
Modeling systematic variations in pronunciation via a language-dependent
hidden speaking mode.
In Research Notes No. 24,
1996 LVCSR Summer Research Workshop Technical Reports,
Center for Language and Speech Processing, Johns Hopkins University.
Stolcke, A. & Shriberg, E.E. (1996).
Statistical language modeling for speech disfluencies.
Proc. International Conference on Acoustics,
Speech and Signal Processing, vol. 1, pp. 405-408, Atlanta, GA.
Weintraub, M., Aksu, Y., Dharanipragada, S., Khudanpur, S., Ney, H., Prange, J.,
Stolcke, A., Jelinek, F. and Shriberg, E. (1996). LM95 project report: Fast
training and portability. In Research Note No. 1. Center for
Language and Speech Processing, Johns Hopkins University, Baltimore, MD.
Shriberg, E.E. (1995).
Acoustic properties of disfluent repetitions.
Proc. International Congress of Phonetic Sciences, 4, pp. 384-387,
Stockholm, Sweden.
Dahl, D., Bates, M., Brown, M., Fisher, W., Hunicke-Smith, K., Pallett, D., Pao, C., Rudnicky, A., and Shriberg, E. (1994).
Expanding the scope of the ATIS task: the ATIS-3 corpus.
Proc. ARPA Workshop on Human Language Technology, pp. 43-48,
Plainsboro, NJ.
Shriberg, E.E. (1994).
Preliminaries to a Theory of Speech Disfluencies.
PhD thesis, University of California at Berkeley.
Shriberg, E.E. & Lickley, R.J. (1993). Intonation of clause-internal
filled pauses. Phonetica 50, 172-179.
Shriberg, E.E. (1992). Perceptual restoration of filtered vowels with
added noise. Language and Speech 35(1-2), 127-136.
Bear, J., Dowding, J., Shriberg, E.E. & Price, P.J. (1993).
A system for labeling self-repairs in speech. SRI Technical Note 522.
Nakatani, C.H. & Shriberg, E.E. (1993). Proposal for labeling
disfluencies in ToBI. Paper presented at the Third ToBI Labeling
Workshop, Ohio State University.
Shriberg, E.E., Bear, J. & Dowding, J. (1992).
Automatic detection and correction of repairs in human-computer dialog.
Proc. DARPA Speech and Natural Language Workshop, M. Marcus
(ed.), pp. 419-424, Harriman, NY.
Shriberg, E.E., Wade, E. & Price, P.J. (1992). Human-machine problem
solving using spoken language systems (SLS): Factors affecting
performance and user satisfaction. Proc. DARPA Speech and Natural
Language Workshop, M. Marcus (ed.), pp. 49-54, Harriman, NY.
Shriberg, E.E. & Lickley, R.J. (1992). Intonation of clause-internal filled pauses.
Proc. 2nd International Conference on Spoken Language Processing,
pp. 991-994, Banff, Alberta, Canada.
Shriberg, E.E. & Lickley, R.J. (1992). The relationship of
filled-pause F0 to prosodic context. In Proceedings of the IRCS
Workshop on Prosody in Natural Speech, Technical Report IRCS-92-37,
201-209, University of Pennsylvania, Institute for Research in
Cognitive Science, Philadelphia, PA.
Bear, J., Dowding, J. & Shriberg, E.E. (1992).
Integrating multiple knowledge sources for detection and correction of repairs in human-computer dialog.
Proc. Annual Meeting of the Association for Computational
Linguistics, pp. 56-63, Newark, Delaware.
Butzberger, J.W., Murveit, H., Shriberg, E.E. & Price,
P.J. (1992). Spontaneous speech effects in large vocabulary speech
recognition applications. Proc. DARPA Speech and Natural Language
Workshop, M. Marcus (ed.), Morgan Kaufmann, pp. 339-343.
Price, P.J., Hirschman, L., Shriberg, E.E. & Wade,
E. (1992). Subject-based evaluation measures for interactive spoken
language systems. Proc. DARPA Speech and Natural Language Workshop,
M. Marcus (ed.), pp. 34-39, Harriman, NY.
Wade, E., Shriberg, E.E. & Price, P.J. (1992). User behaviors
affecting speech recognition.
Proc. 2nd International Conference on Spoken Language Processing,
pp. 995-998, Banff, Alberta, Canada.
Shriberg, E.E. & Ohala, J.J. (1991). "Correction" in the perception of
filtered vowels.
Journal of the Acoustical Society of America 89, 8SP6.
Ohala, J.J. & Shriberg, E.E. (1990). Hyper-correction in speech
perception. Proc. International Conference on Spoken Language
Processing, pp. 405-408, Kobe, Japan.
Shriberg, E.E. (1990). Hypercorrection in vowel identification as
evidence against the direct realist view of speech
perception. Unpublished M.A. thesis, University of California,
Berkeley.
Snow, C.E., Cancino, H., Gonzales, P. & Shriberg, E.E. (1989a). Giving
formal definitions: An oral language correlate of school literacy. In
D. Bloome (ed.), Classrooms and Literacy. Norwood, NJ: Ablex.
Snow, C.E., Cancino, H., Gonzales, P. & Shriberg, E.E. (1989b). Second
language learners' formal definitions: An oral language correlate of
school literacy. In D. Bloome (ed.), Literacy in Functional
Settings. Norwood, NJ: Ablex.
Hafter, E.R., Buell, T.N., Basiji, D. & Shriberg,
E.E. (1988a). Discrimination of direction for complex sounds presented
in the free-field. In Basic Issues in Hearing, H. Duifhuis &
J.W. Horst (eds.), San Diego: Academic Press.
Hafter, E.R., Buell, T.N., Basiji, D. & Shriberg,
E.E. (1988b). Discrimination of direction in the free-field.
Journal of the Acoustical Society of America 83, S122.