WO2004042697A3 - Multi-lingual speech recognition with cross-language context modeling - Google Patents

Multi-lingual speech recognition with cross-language context modeling Download PDF

Info

Publication number
WO2004042697A3
WO2004042697A3 PCT/US2003/035010 US0335010W WO2004042697A3 WO 2004042697 A3 WO2004042697 A3 WO 2004042697A3 US 0335010 W US0335010 W US 0335010W WO 2004042697 A3 WO2004042697 A3 WO 2004042697A3
Authority
WO
WIPO (PCT)
Prior art keywords
cross
speech recognition
different
words
context modeling
Prior art date
Application number
PCT/US2003/035010
Other languages
French (fr)
Other versions
WO2004042697A2 (en
Inventor
Johan Schalkwyk
Original Assignee
Speechworks Int Inc
Johan Schalkwyk
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=32175698&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=WO2004042697(A3) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Speechworks Int Inc, Johan Schalkwyk filed Critical Speechworks Int Inc
Priority to AU2003287490A priority Critical patent/AU2003287490A1/en
Publication of WO2004042697A2 publication Critical patent/WO2004042697A2/en
Publication of WO2004042697A3 publication Critical patent/WO2004042697A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units

Abstract

An approach to multi-lingual speech recognition that permits different words in an utterance to be from different languages. Words from different languages are represented using different sets of sub-word units that are each associate with the corresponding language. Despite the use of different sets of sub-word units, the approach enables use of cross-word context at boundaries between words from different languages (cross-language context) to select appropriate variants of the subword units to match the context.
PCT/US2003/035010 2002-11-04 2003-11-04 Multi-lingual speech recognition with cross-language context modeling WO2004042697A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2003287490A AU2003287490A1 (en) 2002-11-04 2003-11-04 Multi-lingual speech recognition with cross-language context modeling

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/287,438 2002-11-04
US10/287,438 US7149688B2 (en) 2002-11-04 2002-11-04 Multi-lingual speech recognition with cross-language context modeling

Publications (2)

Publication Number Publication Date
WO2004042697A2 WO2004042697A2 (en) 2004-05-21
WO2004042697A3 true WO2004042697A3 (en) 2004-07-22

Family

ID=32175698

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/035010 WO2004042697A2 (en) 2002-11-04 2003-11-04 Multi-lingual speech recognition with cross-language context modeling

Country Status (3)

Country Link
US (1) US7149688B2 (en)
AU (1) AU2003287490A1 (en)
WO (1) WO2004042697A2 (en)

Families Citing this family (102)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8214196B2 (en) 2001-07-03 2012-07-03 University Of Southern California Syntax-based statistical translation model
US7194369B2 (en) * 2001-07-23 2007-03-20 Cognis Corporation On-site analysis system with central processor and method of analyzing
WO2004001623A2 (en) 2002-03-26 2003-12-31 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
US7716050B2 (en) * 2002-11-15 2010-05-11 Voice Signal Technologies, Inc. Multilingual speech recognition
US7200557B2 (en) * 2002-11-27 2007-04-03 Microsoft Corporation Method of reducing index sizes used to represent spectral content vectors
TWI224771B (en) * 2003-04-10 2004-12-01 Delta Electronics Inc Speech recognition device and method using di-phone model to realize the mixed-multi-lingual global phoneme
US8548794B2 (en) 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
DE10334400A1 (en) * 2003-07-28 2005-02-24 Siemens Ag Method for speech recognition and communication device
US7502731B2 (en) * 2003-08-11 2009-03-10 Sony Corporation System and method for performing speech recognition by utilizing a multi-language dictionary
WO2005050958A2 (en) * 2003-11-14 2005-06-02 Voice Signal Technologies, Inc. Installing language modules in a mobile communication device
US7689404B2 (en) * 2004-02-24 2010-03-30 Arkady Khasin Method of multilingual speech recognition by reduction to single-language recognizer engine components
WO2005089340A2 (en) * 2004-03-15 2005-09-29 University Of Southern California Training tree transducers
US8296127B2 (en) 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
US8666725B2 (en) 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
US8036893B2 (en) * 2004-07-22 2011-10-11 Nuance Communications, Inc. Method and system for identifying and correcting accent-induced speech recognition difficulties
US7640159B2 (en) * 2004-07-22 2009-12-29 Nuance Communications, Inc. System and method of speech recognition for non-native speakers of a language
US7430503B1 (en) * 2004-08-24 2008-09-30 The United States Of America As Represented By The Director, National Security Agency Method of combining corpora to achieve consistency in phonetic labeling
US7406408B1 (en) * 2004-08-24 2008-07-29 The United States Of America As Represented By The Director, National Security Agency Method of recognizing phones in speech of any language
US7711542B2 (en) * 2004-08-31 2010-05-04 Research In Motion Limited System and method for multilanguage text input in a handheld electronic device
JP5452868B2 (en) 2004-10-12 2014-03-26 ユニヴァーシティー オブ サザン カリフォルニア Training for text-to-text applications that use string-to-tree conversion for training and decoding
EP1693828B1 (en) * 2005-02-21 2008-01-23 Harman Becker Automotive Systems GmbH Multilingual speech recognition
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
US7643985B2 (en) * 2005-06-27 2010-01-05 Microsoft Corporation Context-sensitive communication and translation methods for enhanced interactions and understanding among speakers of different languages
US7697827B2 (en) 2005-10-17 2010-04-13 Konicek Jeffrey C User-friendlier interfaces for a camera
US8467672B2 (en) * 2005-10-17 2013-06-18 Jeffrey C. Konicek Voice recognition and gaze-tracking for a camera
US10319252B2 (en) 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
EP1960997B1 (en) 2005-12-08 2010-02-10 Nuance Communications Austria GmbH Speech recognition system with huge vocabulary
US7890325B2 (en) * 2006-03-16 2011-02-15 Microsoft Corporation Subword unit posterior probability for measuring confidence
US7966173B2 (en) * 2006-03-22 2011-06-21 Nuance Communications, Inc. System and method for diacritization of text
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US8886518B1 (en) 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
US8086463B2 (en) * 2006-09-12 2011-12-27 Nuance Communications, Inc. Dynamically generating a vocal help prompt in a multimodal application
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
US7873517B2 (en) * 2006-11-09 2011-01-18 Volkswagen Of America, Inc. Motor vehicle with a speech interface
US20080126093A1 (en) * 2006-11-28 2008-05-29 Nokia Corporation Method, Apparatus and Computer Program Product for Providing a Language Based Interactive Multimedia System
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8290775B2 (en) * 2007-06-29 2012-10-16 Microsoft Corporation Pronunciation correction of text-to-speech systems between different spoken languages
US8051061B2 (en) * 2007-07-20 2011-11-01 Microsoft Corporation Cross-lingual query suggestion
US7953591B2 (en) * 2007-07-26 2011-05-31 International Business Machines Corporation Automatically identifying unique language independent keys correlated with appropriate text strings of various locales by key search
US7949515B2 (en) * 2007-07-26 2011-05-24 International Business Machines Corporation Automatically identifying unique language independent keys correlated with appropriate text strings of various locales by value and key searches
US8244534B2 (en) * 2007-08-20 2012-08-14 Microsoft Corporation HMM-based bilingual (Mandarin-English) TTS techniques
US8209164B2 (en) * 2007-11-21 2012-06-26 University Of Washington Use of lexical translations for facilitating searches
US20090171663A1 (en) * 2008-01-02 2009-07-02 International Business Machines Corporation Reducing a size of a compiled speech recognition grammar
US7917488B2 (en) * 2008-03-03 2011-03-29 Microsoft Corporation Cross-lingual search re-ranking
US8536976B2 (en) 2008-06-11 2013-09-17 Veritrix, Inc. Single-channel multi-factor authentication
US8166297B2 (en) 2008-07-02 2012-04-24 Veritrix, Inc. Systems and methods for controlling access to encrypted data stored on a mobile device
WO2010051342A1 (en) 2008-11-03 2010-05-06 Veritrix, Inc. User authentication for social networks
US8442833B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Speech processing with source location estimation using signals from two or more microphones
US8442829B2 (en) * 2009-02-17 2013-05-14 Sony Computer Entertainment Inc. Automatic computation streaming partition for voice recognition on multiple processors with limited memory
US8788256B2 (en) * 2009-02-17 2014-07-22 Sony Computer Entertainment Inc. Multiple language voice recognition
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8190420B2 (en) * 2009-08-04 2012-05-29 Autonomy Corporation Ltd. Automatic spoken language identification based on phoneme sequence patterns
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
EP3091535B1 (en) 2009-12-23 2023-10-11 Google LLC Multi-modal input on an electronic device
US11416214B2 (en) 2009-12-23 2022-08-16 Google Llc Multi-modal input on an electronic device
US9177545B2 (en) * 2010-01-22 2015-11-03 Mitsubishi Electric Corporation Recognition dictionary creating device, voice recognition device, and voice synthesizer
US10417646B2 (en) 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
US9798653B1 (en) * 2010-05-05 2017-10-24 Nuance Communications, Inc. Methods, apparatus and data structure for cross-language speech adaptation
US8543374B2 (en) * 2010-08-12 2013-09-24 Xerox Corporation Translation system combining hierarchical and phrase-based models
US9239829B2 (en) * 2010-10-01 2016-01-19 Mitsubishi Electric Corporation Speech recognition device
US8352245B1 (en) * 2010-12-30 2013-01-08 Google Inc. Adjusting language models
US8296142B2 (en) 2011-01-21 2012-10-23 Google Inc. Speech recognition using dock context
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
WO2013003772A2 (en) * 2011-06-30 2013-01-03 Google Inc. Speech recognition using variable-length context
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8825481B2 (en) * 2012-01-20 2014-09-02 Microsoft Corporation Subword-based multi-level pronunciation adaptation for recognizing accented speech
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
US9336771B2 (en) * 2012-11-01 2016-05-10 Google Inc. Speech recognition using non-parametric models
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9594744B2 (en) * 2012-11-28 2017-03-14 Google Inc. Speech transcription including written text
CN103971678B (en) * 2013-01-29 2015-08-12 腾讯科技(深圳)有限公司 Keyword spotting method and apparatus
KR102084646B1 (en) * 2013-07-04 2020-04-14 삼성전자주식회사 Device for recognizing voice and method for recognizing voice
US8768704B1 (en) 2013-09-30 2014-07-01 Google Inc. Methods and systems for automated generation of nativized multi-lingual lexicons
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
CN105793920B (en) * 2013-11-20 2017-08-08 三菱电机株式会社 Voice recognition device and sound identification method
US9842592B2 (en) 2014-02-12 2017-12-12 Google Inc. Language models using non-linguistic context
US10339920B2 (en) * 2014-03-04 2019-07-02 Amazon Technologies, Inc. Predicting pronunciation in speech recognition
US9412365B2 (en) 2014-03-24 2016-08-09 Google Inc. Enhanced maximum entropy models
US9858922B2 (en) 2014-06-23 2018-01-02 Google Inc. Caching speech recognition scores
US9299347B1 (en) 2014-10-22 2016-03-29 Google Inc. Speech recognition using associative mapping
US10134394B2 (en) 2015-03-20 2018-11-20 Google Llc Speech recognition using log-linear model
US9953631B1 (en) 2015-05-07 2018-04-24 Google Llc Automatic speech recognition techniques for multiple languages
US10229674B2 (en) 2015-05-15 2019-03-12 Microsoft Technology Licensing, Llc Cross-language speech recognition and translation
US9886958B2 (en) * 2015-12-11 2018-02-06 Microsoft Technology Licensing, Llc Language and domain independent model based approach for on-screen item selection
US9978367B2 (en) 2016-03-16 2018-05-22 Google Llc Determining dialog states for language models
US10832664B2 (en) 2016-08-19 2020-11-10 Google Llc Automated speech recognition using language models that selectively use domain-specific model components
US10810485B2 (en) * 2016-09-13 2020-10-20 Intel Corporation Dynamic context-selective convolutional neural network for time series data classification
EP3520038A4 (en) 2016-09-28 2020-06-03 D5A1 Llc Learning coach for machine learning system
US10311860B2 (en) 2017-02-14 2019-06-04 Google Llc Language model biasing system
WO2018175098A1 (en) 2017-03-24 2018-09-27 D5Ai Llc Learning coach for machine learning system
WO2018194960A1 (en) * 2017-04-18 2018-10-25 D5Ai Llc Multi-stage machine learning and recognition
US11321612B2 (en) 2018-01-30 2022-05-03 D5Ai Llc Self-organizing partially ordered networks and soft-tying learned parameters, such as connection weights
US11475875B2 (en) * 2018-10-26 2022-10-18 Sriram Chakravarthy Method and system for implementing language neutral virtual assistant
CN110517668B (en) * 2019-07-23 2022-09-27 普强时代(珠海横琴)信息技术有限公司 Chinese and English mixed speech recognition system and method
KR20210078829A (en) * 2019-12-19 2021-06-29 엘지전자 주식회사 Artificial intelligence apparatus and method for recognizing speech with multiple languages

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002027535A1 (en) * 2000-09-28 2002-04-04 Intel Corporation Method and system for expanding a word graph to a phone graph based on a cross-word acoustical model to improve continuous speech recognition
US20030009335A1 (en) * 2001-07-05 2003-01-09 Johan Schalkwyk Speech recognition with dynamic grammars

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4980918A (en) 1985-05-09 1990-12-25 International Business Machines Corporation Speech recognition system with efficient storage and rapid assembly of phonological graphs
US5268990A (en) 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
US5241619A (en) 1991-06-25 1993-08-31 Bolt Beranek And Newman Inc. Word dependent N-best search method
US5510981A (en) * 1993-10-28 1996-04-23 International Business Machines Corporation Language translation apparatus and method using context-based translation models
US6304841B1 (en) * 1993-10-28 2001-10-16 International Business Machines Corporation Automatic construction of conditional exponential models from elementary features
NZ294659A (en) 1994-11-01 1999-01-28 British Telecomm Method of and apparatus for generating a vocabulary from an input speech signal
DE19510083C2 (en) 1995-03-20 1997-04-24 Ibm Method and arrangement for speech recognition in languages containing word composites
US6278973B1 (en) 1995-12-12 2001-08-21 Lucent Technologies, Inc. On-demand language processing system and method
SG49804A1 (en) * 1996-03-20 1998-06-15 Government Of Singapore Repres Parsing and translating natural language sentences automatically
US5991720A (en) 1996-05-06 1999-11-23 Matsushita Electric Industrial Co., Ltd. Speech recognition system employing multiple grammar networks
US5875426A (en) 1996-06-12 1999-02-23 International Business Machines Corporation Recognizing speech having word liaisons by adding a phoneme to reference word models
US6088669A (en) 1997-01-28 2000-07-11 International Business Machines, Corporation Speech recognition with attempted speaker recognition for speaker model prefetching or alternative speech modeling
US6078886A (en) 1997-04-14 2000-06-20 At&T Corporation System and method for providing remote automatic speech recognition services via a packet network
EP0979497A1 (en) 1997-10-08 2000-02-16 Koninklijke Philips Electronics N.V. Vocabulary and/or language model training
US6085160A (en) * 1998-07-10 2000-07-04 Lernout & Hauspie Speech Products N.V. Language independent speech recognition
US6438520B1 (en) 1999-01-20 2002-08-20 Lucent Technologies Inc. Apparatus, method and system for cross-speaker speech recognition for telecommunication applications
US6912499B1 (en) * 1999-08-31 2005-06-28 Nortel Networks Limited Method and apparatus for training a multilingual speech model set
AU7938300A (en) 1999-10-06 2001-05-10 Lernout And Hauspie Speech Products N.V. Attribute-based word modeling
EP1285434A1 (en) 2000-05-23 2003-02-26 Thomson Licensing S.A. Dynamic language models for speech recognition

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002027535A1 (en) * 2000-09-28 2002-04-04 Intel Corporation Method and system for expanding a word graph to a phone graph based on a cross-word acoustical model to improve continuous speech recognition
US20030009335A1 (en) * 2001-07-05 2003-01-09 Johan Schalkwyk Speech recognition with dynamic grammars

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ANDREJ ET AL: "Crosslingual Speech Recognition with Multilingual Acoustic Models Based on Agglomerative and Tree-Based Triphone Clustering", EUROSPEECH 2001 - 7TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY, vol. 4, 3 September 2001 (2001-09-03) - 7 September 2001 (2001-09-07), Aalborg, Denmark, pages 2725, XP007004958 *
KANTHAK S, NEY H: "Multilingual Acoustic Modeling Using Graphemes", PROC. EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY, vol. 2, September 2003 (2003-09-01), Geneva, Switzerland, pages 1145 - 1148, XP002276941 *
SCHULTZ AND A WAIBEL T: "EXPERIMENTS ON CROSS-LANGUAGE ACOUSTIC MODELING", EUROSPEECH 2001 - 7TH EUROPEAN CONFERENCE ON SPEECH COMMUNICATION AND TECHNOLOGY, vol. 4, 3 September 2001 (2001-09-03) - 7 September 2001 (2001-09-07), Aalborg, Denmark, pages 2721, XP007004957 *
SIXTUS ACHIM ET AL: "From within-word model search to across-word model search in large vocabulary continuous speech recognition", COMPUT SPEECH LANG;COMPUTER SPEECH AND LANGUAGE APRIL 2002, vol. 16, no. 2, April 2002 (2002-04-01), pages 245 - 271, XP007005547 *
WARD, T. ET AL.: "TOWARDS SPEECH UNDERSTANDING ACROSS MULTIPLE LANGUAGES", 1998 INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING - ICSLP98, 30 November 1998 (1998-11-30) - 4 December 1998 (1998-12-04), Sydney, Australia, pages F400, XP007000277 *
ZHIRONG WANG ET AL: "Towards Universal Speech Recognition", PROCEEDINGS FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, 14 October 2002 (2002-10-14), Los Alamitos, CA, USA, pages 247 - 252, XP010624324 *

Also Published As

Publication number Publication date
US20040088163A1 (en) 2004-05-06
US7149688B2 (en) 2006-12-12
AU2003287490A8 (en) 2004-06-07
AU2003287490A1 (en) 2004-06-07
WO2004042697A2 (en) 2004-05-21

Similar Documents

Publication Publication Date Title
WO2004042697A3 (en) Multi-lingual speech recognition with cross-language context modeling
Saadane et al. A conventional orthography for Algerian Arabic
Yuan et al. Automatic phonetic segmentation using boundary models.
TW200638337A (en) Using a spoken utterance for disambiguation of spelling inputs into a speech recognition system
Grézl et al. Study of probabilistic and bottle-neck features in multilingual environment
WO2006086511A8 (en) Method and apparatus utilizing voice input to resolve ambiguous manually entered text input
WO2007118020A3 (en) Method and system for managing pronunciation dictionaries in a speech application
WO2006062707A3 (en) System and method for speech recognition-enabled automated call routing
WO2009026270A3 (en) Hmm-based bilingual (mandarin-english) tts techniques
WO2004086359A3 (en) System for speech recognition and correction, correction device and method for creating a lexicon of alternatives
DE69922104D1 (en) Speech recognizer with vocabulary adaptable through spelled word input
EP1205908A3 (en) Pronunciation of new input words for speech processing
WO2008067562A3 (en) Multimodal speech recognition system
EP1217609A3 (en) Speech recognition
WO2007005884A3 (en) Generating chinese language couplets
TW200601263A (en) Apparatus and method for synthesized audible response to an utterance in speaker-independent voice recognition
CA2304251A1 (en) System and method for creating a language grammar
WO2009040790A3 (en) Method and system for spell checking
AU2003295682A1 (en) Multilingual speech recognition
WO2009006081A3 (en) Pronunciation correction of text-to-speech systems between different spoken languages
WO2003058603A3 (en) System and method for speech recognition by multi-pass recognition generating refined context specific grammars
EP1291848A3 (en) Multilingual pronunciations for speech recognition
WO2006070373A3 (en) A system and a method for representing unrecognized words in speech to text conversions as syllables
TW200630958A (en) Method and device of speech recognition and language-understanding analysis and nature-language dialogue system using the method
WO2007117814A3 (en) Voice signal perturbation for speech recognition

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Country of ref document: JP