CA2198306A1 - Method and apparatus for an improved language recognition system - Google Patents

Method and apparatus for an improved language recognition system

Info

Publication number
CA2198306A1
CA2198306A1 CA002198306A CA2198306A CA2198306A1 CA 2198306 A1 CA2198306 A1 CA 2198306A1 CA 002198306 A CA002198306 A CA 002198306A CA 2198306 A CA2198306 A CA 2198306A CA 2198306 A1 CA2198306 A1 CA 2198306A1
Authority
CA
Canada
Prior art keywords
language
word
model
machines
language recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002198306A
Other languages
French (fr)
Other versions
CA2198306C (en
Inventor
Hiyan Alshawi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia of America Corp
Original Assignee
Lucent Technologies Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lucent Technologies Inc filed Critical Lucent Technologies Inc
Publication of CA2198306A1 publication Critical patent/CA2198306A1/en
Application granted granted Critical
Publication of CA2198306C publication Critical patent/CA2198306C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/193Formal grammars, e.g. finite state automata, context free grammars or word networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams

Abstract

Methods and apparatus for a language model and language recognition systems are disclosed. The method utilizes a plurality of probabilistic finite state machines having the ability to recognize a pair of sequences, one sequence scanned leftwards, the other scanned rightwards. Each word in the lexicon of the language model is associated with one or more such machines which model the semantic relations between the word and other words. Machine transitions create phrases from a set of word string hypotheses, and incrementally calculate costs related to the probability that such phrases represent the language to be recognized. The cascading lexical head machines utilized in the methods and apparatus capture the structural associations implicit in the hierachical organization of a sentence, resulting in a language model and language recognition systems that combine the lexical sensitivity of N-gram models with the structural properties of dependency grammar.
CA002198306A 1996-04-10 1997-02-24 Method and apparatus for an improved language recognition system Expired - Fee Related CA2198306C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/631,874 US5870706A (en) 1996-04-10 1996-04-10 Method and apparatus for an improved language recognition system
US631,874 1996-04-10

Publications (2)

Publication Number Publication Date
CA2198306A1 true CA2198306A1 (en) 1997-10-11
CA2198306C CA2198306C (en) 2001-05-01

Family

ID=24533134

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002198306A Expired - Fee Related CA2198306C (en) 1996-04-10 1997-02-24 Method and apparatus for an improved language recognition system

Country Status (4)

Country Link
US (1) US5870706A (en)
EP (1) EP0801378B1 (en)
CA (1) CA2198306C (en)
DE (1) DE69717899T2 (en)

Families Citing this family (84)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB9625284D0 (en) * 1996-12-04 1997-01-22 Canon Kk A data processing method and apparatus for identifying a classification to which data belongs
FR2763775B1 (en) * 1997-05-23 1999-08-13 France Telecom METHOD OF VISUALIZING PATHS WITHIN A GRAPHIC REPRESENTATION OF A NETWORK
US6195631B1 (en) * 1998-04-15 2001-02-27 At&T Corporation Method and apparatus for automatic construction of hierarchical transduction models for language translation
JP3252815B2 (en) * 1998-12-04 2002-02-04 日本電気株式会社 Continuous speech recognition apparatus and method
US6397181B1 (en) * 1999-01-27 2002-05-28 Kent Ridge Digital Labs Method and apparatus for voice annotation and retrieval of multimedia data
GB9904663D0 (en) * 1999-03-01 1999-04-21 Canon Kk Apparatus and method for generating processor usable data from natural langage input data
US7082396B1 (en) * 1999-04-30 2006-07-25 At&T Corp Methods and apparatus for rapid acoustic unit selection from a large speech corpus
US6697780B1 (en) * 1999-04-30 2004-02-24 At&T Corp. Method and apparatus for rapid acoustic unit selection from a large speech corpus
US7369994B1 (en) 1999-04-30 2008-05-06 At&T Corp. Methods and apparatus for rapid acoustic unit selection from a large speech corpus
AU5451800A (en) * 1999-05-28 2000-12-18 Sehda, Inc. Phrase-based dialogue modeling with particular application to creating recognition grammars for voice-controlled user interfaces
US20020032564A1 (en) * 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
US6904405B2 (en) * 1999-07-17 2005-06-07 Edwin A. Suominen Message recognition using shared language model
US9076448B2 (en) * 1999-11-12 2015-07-07 Nuance Communications, Inc. Distributed real time speech recognition system
US7392185B2 (en) 1999-11-12 2008-06-24 Phoenix Solutions, Inc. Speech based learning/training system using semantic decoding
US7050977B1 (en) * 1999-11-12 2006-05-23 Phoenix Solutions, Inc. Speech-enabled server for internet website and method
US7725307B2 (en) 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
JP2003515778A (en) * 1999-12-02 2003-05-07 トムソン ライセンシング ソシエテ アノニム Speech recognition method and apparatus using different language models
JP4465564B2 (en) * 2000-02-28 2010-05-19 ソニー株式会社 Voice recognition apparatus, voice recognition method, and recording medium
US7024350B2 (en) * 2000-07-20 2006-04-04 Microsoft Corporation Compact easily parseable binary format for a context-free grammer
US20020082868A1 (en) * 2000-12-27 2002-06-27 Pories Walter J. Systems, methods and computer program products for creating and maintaining electronic medical records
US6961694B2 (en) * 2001-01-22 2005-11-01 Microsoft Corporation Method and apparatus for reducing latency in speech-based applications
US6813616B2 (en) 2001-03-07 2004-11-02 International Business Machines Corporation System and method for building a semantic network capable of identifying word patterns in text
US7426505B2 (en) * 2001-03-07 2008-09-16 International Business Machines Corporation Method for identifying word patterns in text
US7177792B2 (en) * 2001-05-31 2007-02-13 University Of Southern California Integer programming decoder for machine translation
US8214196B2 (en) * 2001-07-03 2012-07-03 University Of Southern California Syntax-based statistical translation model
US20030061046A1 (en) * 2001-09-27 2003-03-27 Qingwei Zhao Method and system for integrating long-span language model into speech recognition system
US6892176B2 (en) * 2001-12-18 2005-05-10 Matsushita Electric Industrial Co., Ltd. Hash function based transcription database
WO2004001623A2 (en) * 2002-03-26 2003-12-31 University Of Southern California Constructing a translation lexicon from comparable, non-parallel corpora
AU2003271083A1 (en) * 2002-10-08 2004-05-04 Matsushita Electric Industrial Co., Ltd. Language model creation/accumulation device, speech recognition device, language model creation method, and speech recognition method
US7711545B2 (en) * 2003-07-02 2010-05-04 Language Weaver, Inc. Empirical methods for splitting compound words with application to machine translation
US8548794B2 (en) * 2003-07-02 2013-10-01 University Of Southern California Statistical noun phrase translation
WO2005017768A1 (en) * 2003-08-15 2005-02-24 Silverbrook Research Pty Ltd Improving accuracy in searching digital ink
AU2004265700B2 (en) * 2003-08-15 2008-10-02 Silverbrook Research Pty Ltd Natural language recognition using distributed processing
WO2005089340A2 (en) * 2004-03-15 2005-09-29 University Of Southern California Training tree transducers
US8296127B2 (en) * 2004-03-23 2012-10-23 University Of Southern California Discovery of parallel text portions in comparable collections of corpora and training using comparable texts
US8666725B2 (en) * 2004-04-16 2014-03-04 University Of Southern California Selection and use of nonstatistical translation components in a statistical machine translation framework
US20130304453A9 (en) * 2004-08-20 2013-11-14 Juergen Fritsch Automated Extraction of Semantic Content and Generation of a Structured Document from Speech
US7584103B2 (en) * 2004-08-20 2009-09-01 Multimodal Technologies, Inc. Automated extraction of semantic content and generation of a structured document from speech
US7912699B1 (en) 2004-08-23 2011-03-22 At&T Intellectual Property Ii, L.P. System and method of lattice-based search for spoken utterance retrieval
WO2006042321A2 (en) * 2004-10-12 2006-04-20 University Of Southern California Training for a text-to-text application which uses string to tree conversion for training and decoding
US8200495B2 (en) * 2005-02-04 2012-06-12 Vocollect, Inc. Methods and systems for considering information about an expected response when performing speech recognition
US7937396B1 (en) 2005-03-23 2011-05-03 Google Inc. Methods and systems for identifying paraphrases from an index of information items and associated sentence fragments
US8886517B2 (en) 2005-06-17 2014-11-11 Language Weaver, Inc. Trust scoring for language translation systems
US8676563B2 (en) 2009-10-01 2014-03-18 Language Weaver, Inc. Providing human-generated and machine-generated trusted translations
US7974833B2 (en) 2005-06-21 2011-07-05 Language Weaver, Inc. Weighted system of expressing language information using a compact notation
US7389222B1 (en) 2005-08-02 2008-06-17 Language Weaver, Inc. Task parallelization in a text-to-text system
US7813918B2 (en) * 2005-08-03 2010-10-12 Language Weaver, Inc. Identifying documents which form translated pairs, within a document collection
US7624020B2 (en) * 2005-09-09 2009-11-24 Language Weaver, Inc. Adapter for allowing both online and offline training of a text to text system
US7937265B1 (en) 2005-09-27 2011-05-03 Google Inc. Paraphrase acquisition
US10319252B2 (en) * 2005-11-09 2019-06-11 Sdl Inc. Language capability assessment and training apparatus and techniques
US8943080B2 (en) 2006-04-07 2015-01-27 University Of Southern California Systems and methods for identifying parallel documents and sentence fragments in multilingual document collections
US7831423B2 (en) * 2006-05-25 2010-11-09 Multimodal Technologies, Inc. Replacing text representing a concept with an alternate written form of the concept
WO2007150005A2 (en) 2006-06-22 2007-12-27 Multimodal Technologies, Inc. Automatic decision support
US8886518B1 (en) 2006-08-07 2014-11-11 Language Weaver, Inc. System and method for capitalizing machine translated text
US8433556B2 (en) 2006-11-02 2013-04-30 University Of Southern California Semi-supervised training for statistical word alignment
US9122674B1 (en) 2006-12-15 2015-09-01 Language Weaver, Inc. Use of annotations in statistical machine translation
US8468149B1 (en) 2007-01-26 2013-06-18 Language Weaver, Inc. Multi-lingual online community
US8615389B1 (en) 2007-03-16 2013-12-24 Language Weaver, Inc. Generation and exploitation of an approximate language model
US8831928B2 (en) * 2007-04-04 2014-09-09 Language Weaver, Inc. Customizable machine translation service
US8825466B1 (en) 2007-06-08 2014-09-02 Language Weaver, Inc. Modification of annotated bilingual segment pairs in syntax-based machine translation
US8463610B1 (en) * 2008-01-18 2013-06-11 Patrick J. Bourke Hardware-implemented scalable modular engine for low-power speech recognition
JP5530729B2 (en) * 2009-01-23 2014-06-25 本田技研工業株式会社 Speech understanding device
CA2754257C (en) * 2009-03-06 2016-05-10 Biomedical Device Consultants and Laboratories of Colorado, LLC Fatigue testing system for prosthetic devices
US8990064B2 (en) 2009-07-28 2015-03-24 Language Weaver, Inc. Translating documents based on content
US8380486B2 (en) 2009-10-01 2013-02-19 Language Weaver, Inc. Providing machine-generated translations and corresponding trust levels
US10417646B2 (en) * 2010-03-09 2019-09-17 Sdl Inc. Predicting the cost associated with translating textual content
KR101154011B1 (en) * 2010-06-07 2012-06-08 주식회사 서비전자 System and method of Multi model adaptive and voice recognition
US9118669B2 (en) 2010-09-30 2015-08-25 Alcatel Lucent Method and apparatus for voice signature authentication
US8959102B2 (en) 2010-10-08 2015-02-17 Mmodal Ip Llc Structured searching of dynamic structured document corpuses
CN102122506B (en) * 2011-03-08 2013-07-31 天脉聚源(北京)传媒科技有限公司 Method for recognizing voice
CN102117335B (en) * 2011-03-25 2014-01-22 天脉聚源(北京)传媒科技有限公司 Method for retrieving multimedia information
US11003838B2 (en) 2011-04-18 2021-05-11 Sdl Inc. Systems and methods for monitoring post translation editing
US8914290B2 (en) 2011-05-20 2014-12-16 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US8694303B2 (en) 2011-06-15 2014-04-08 Language Weaver, Inc. Systems and methods for tuning parameters in statistical machine translation
US8886515B2 (en) 2011-10-19 2014-11-11 Language Weaver, Inc. Systems and methods for enhancing machine translation post edit review processes
US8942973B2 (en) 2012-03-09 2015-01-27 Language Weaver, Inc. Content page URL translation
US10261994B2 (en) 2012-05-25 2019-04-16 Sdl Inc. Method and system for automatic management of reputation of translators
US9152622B2 (en) 2012-11-26 2015-10-06 Language Weaver, Inc. Personalized machine translation via online adaptation
US9978395B2 (en) 2013-03-15 2018-05-22 Vocollect, Inc. Method and system for mitigating delay in receiving audio stream during production of sound from audio stream
US9280970B1 (en) * 2013-06-25 2016-03-08 Google Inc. Lattice semantic parsing
US9213694B2 (en) 2013-10-10 2015-12-15 Language Weaver, Inc. Efficient online domain adaptation
US9530404B2 (en) 2014-10-06 2016-12-27 Intel Corporation System and method of automatic speech recognition using on-the-fly word lattice generation with word histories
DE102015006662B4 (en) * 2015-05-22 2019-11-14 Audi Ag Method for configuring a voice control device
US10714121B2 (en) 2016-07-27 2020-07-14 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4718094A (en) * 1984-11-19 1988-01-05 International Business Machines Corp. Speech recognition system
EP0238698B1 (en) * 1986-03-27 1991-08-14 International Business Machines Corporation Method and apparatus for generating word model baseforms for speech recognition
US4866778A (en) * 1986-08-11 1989-09-12 Dragon Systems, Inc. Interactive speech recognition apparatus
DE3723078A1 (en) * 1987-07-11 1989-01-19 Philips Patentverwaltung METHOD FOR DETECTING CONTINUOUSLY SPOKEN WORDS
US5033088A (en) * 1988-06-06 1991-07-16 Voice Processing Corp. Method and apparatus for effectively receiving voice input to a voice recognition system
US5222187A (en) * 1989-12-29 1993-06-22 Texas Instruments Incorporated Grammar-based checksum constraints for high performance speech recognition circuit
JP2836159B2 (en) * 1990-01-30 1998-12-14 株式会社日立製作所 Speech recognition system for simultaneous interpretation and its speech recognition method
US5202952A (en) * 1990-06-22 1993-04-13 Dragon Systems, Inc. Large-vocabulary continuous speech prefiltering and processing system
US5297040A (en) * 1991-10-23 1994-03-22 Franklin T. Hu Molecular natural language processing system
US5233681A (en) * 1992-04-24 1993-08-03 International Business Machines Corporation Context-dependent speech recognizer using estimated next word context
US5434777A (en) * 1992-05-27 1995-07-18 Apple Computer, Inc. Method and apparatus for processing natural language
US5390279A (en) * 1992-12-31 1995-02-14 Apple Computer, Inc. Partitioning speech rules by context for speech recognition
US5384892A (en) * 1992-12-31 1995-01-24 Apple Computer, Inc. Dynamic language model for speech recognition
CA2151371A1 (en) * 1992-12-31 1994-07-21 Yen-Lu Chow Recursive finite state grammar
CA2126380C (en) * 1993-07-22 1998-07-07 Wu Chou Minimum error rate training of combined string models
US5434906A (en) * 1993-09-13 1995-07-18 Robinson; Michael J. Method and apparatus for processing an incoming call in a communication system
US5615296A (en) * 1993-11-12 1997-03-25 International Business Machines Corporation Continuous speech recognition and voice response system and method to enable conversational dialogues with microprocessors
US5621859A (en) * 1994-01-19 1997-04-15 Bbn Corporation Single tree method for grammar directed, very large vocabulary speech recognizer
US5584024A (en) * 1994-03-24 1996-12-10 Software Ag Interactive database query system and method for prohibiting the selection of semantically incorrect query parameters
US5615286A (en) * 1995-05-05 1997-03-25 Bell Communications Research, Inc. Method for determining a most likely sequence of states

Also Published As

Publication number Publication date
EP0801378B1 (en) 2002-12-18
US5870706A (en) 1999-02-09
DE69717899T2 (en) 2003-08-21
CA2198306C (en) 2001-05-01
EP0801378A2 (en) 1997-10-15
DE69717899D1 (en) 2003-01-30
EP0801378A3 (en) 1998-09-30
MX9702521A (en) 1997-10-31

Similar Documents

Publication Publication Date Title
CA2198306A1 (en) Method and apparatus for an improved language recognition system
Jelinek Statistical methods for speech recognition
Bahl et al. A maximum likelihood approach to continuous speech recognition
WO2000033211A3 (en) Automatic segmentation of a text
EP0867858A3 (en) Pronunciation generation in speech recognition
EP0874353A3 (en) Pronunciation generation in speech recognition
KR900018909A (en) Language recognition method and language recognizer training method
ATE203119T1 (en) LANGUAGE RECOGNITION SYSTEM FOR COMPOUND WORD LANGUAGES
Riley et al. Automatic generation of detailed pronunciation lexicons
Hakkinen et al. N-gram and decision tree based language identification for written words
Brousseau et al. French speech recognition in an automatic dictation system for translators: the transtalk project.
White Natural language understanding and speech recognition
Lyu et al. Golden Mandarin (III)-A user-adaptive prosodic-segment-based Mandarin dictation machine for Chinese language with very large vocabulary
Black et al. Analysis of unknown words through morphological decomposition
Heeman et al. Incorporating POS tagging into language modeling
Heeman et al. Deriving phrase-based language models
Furui Speech recognition-past, present, and future
JP2965529B2 (en) Voice recognition device
Yokoyama et al. Unsupervised class-based language model adaptation for spontaneous speech recognition
Fosler-Lussier A tutorial on pronunciation modeling for large vocabulary speech recognition
Smaïli et al. An hybrid language model for a continuous dictation prototype
Prieto et al. Automatic learning of structural language models
Pieraccini et al. Factorization of language constraints in speech recognition
JPS6229796B2 (en)
Kuhn et al. Recent results in automatic learning rules for semantic interpretation.

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed