CA2115210A1 - Interactive Computer System Recognizing Spoken Commands - Google Patents

Interactive Computer System Recognizing Spoken Commands

Info

Publication number
CA2115210A1
CA2115210A1 CA2115210A CA2115210A CA2115210A1 CA 2115210 A1 CA2115210 A1 CA 2115210A1 CA 2115210 A CA2115210 A CA 2115210A CA 2115210 A CA2115210 A CA 2115210A CA 2115210 A1 CA2115210 A1 CA 2115210A1
Authority
CA
Canada
Prior art keywords
active
state
series
computer program
vocabulary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA2115210A
Other languages
French (fr)
Other versions
CA2115210C (en
Inventor
Joseph C. Andreshak
Gregg H. Daggett
John Karat
John Lucassen
Stephen E. Levy
Robert L. Mack
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Publication of CA2115210A1 publication Critical patent/CA2115210A1/en
Application granted granted Critical
Publication of CA2115210C publication Critical patent/CA2115210C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Abstract

An interactive computer system having a processor executing a target computer program, and having a speech recognizer for converting an utterance into a command signal for the target computer program. The target computer program has a series of active program states occurring over a series of time periods. At least a first active-state image is displayed for a first active state occurring during a first time period. At least one object displayed in the first active-state image is identified, and a list of one or more first active-state commands identifying functions which can be performed in the first active state of the target computer program is generated from the identified object. A first active-state vocabulary of acoustic command models for the first active state comprises the acoustic command models from a system vocabulary representing the first active-state commands. A speech recognizer measures the value of at least one feature of an utterance during each of a series of successive time intervals within the first time period to produce a series of feature signals. The measured feature signals are compared to each of the acoustic command models in the first active-state vocabulary to generate a match score for the utterance and each acoustic command model. The speech recognizer outputs a command signal corresponding to the command model from the first active-state vocabulary having the best match score.
CA002115210A 1993-04-21 1994-02-08 Interactive computer system recognizing spoken commands Expired - Fee Related CA2115210C (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US5095093A 1993-04-21 1993-04-21
US050,950 1993-04-21

Publications (2)

Publication Number Publication Date
CA2115210A1 true CA2115210A1 (en) 1994-10-22
CA2115210C CA2115210C (en) 1997-09-23

Family

ID=21968512

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002115210A Expired - Fee Related CA2115210C (en) 1993-04-21 1994-02-08 Interactive computer system recognizing spoken commands

Country Status (8)

Country Link
US (1) US5664061A (en)
EP (1) EP0621531B1 (en)
JP (1) JP2856671B2 (en)
KR (1) KR970006403B1 (en)
CN (1) CN1086484C (en)
AT (1) ATE185203T1 (en)
CA (1) CA2115210C (en)
DE (1) DE69420888T2 (en)

Families Citing this family (102)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5983179A (en) * 1992-11-13 1999-11-09 Dragon Systems, Inc. Speech recognition system which turns its voice response on for confirmation when it has been turned off without confirmation
US5764852A (en) * 1994-08-16 1998-06-09 International Business Machines Corporation Method and apparatus for speech recognition for distinguishing non-speech audio input events from speech audio input events
ATE210311T1 (en) * 1995-01-18 2001-12-15 Koninkl Philips Electronics Nv METHOD AND DEVICE FOR GENERATING A HUMAN/MACHINE DIALOGUE WITH OPERATOR INTERVENTION
JP3750150B2 (en) * 1995-03-30 2006-03-01 三菱電機株式会社 Mobile communication terminal
JPH09149157A (en) * 1995-11-24 1997-06-06 Casio Comput Co Ltd Communication terminal equipment
GB9602701D0 (en) * 1996-02-09 1996-04-10 Canon Kk Image manipulation
US5960395A (en) 1996-02-09 1999-09-28 Canon Kabushiki Kaisha Pattern matching method, apparatus and computer readable memory medium for speech recognition using dynamic programming
US5819225A (en) * 1996-05-30 1998-10-06 International Business Machines Corporation Display indications of speech processing states in speech recognition system
US5867817A (en) * 1996-08-19 1999-02-02 Virtual Vision, Inc. Speech recognition manager
US6654955B1 (en) * 1996-12-19 2003-11-25 International Business Machines Corporation Adding speech recognition libraries to an existing program at runtime
US5897618A (en) * 1997-03-10 1999-04-27 International Business Machines Corporation Data processing system and method for switching between programs having a same title using a voice command
US6192338B1 (en) * 1997-08-12 2001-02-20 At&T Corp. Natural language knowledge servers as network resources
EP0962014B1 (en) * 1997-12-30 2003-11-12 Koninklijke Philips Electronics N.V. Speech recognition device using a command lexicon
US6298324B1 (en) * 1998-01-05 2001-10-02 Microsoft Corporation Speech recognition system with changing grammars and grammar help command
US6301560B1 (en) * 1998-01-05 2001-10-09 Microsoft Corporation Discrete speech recognition system with ballooning active grammar
JP4562910B2 (en) * 1998-03-23 2010-10-13 マイクロソフト コーポレーション Operating system application program interface
US6144938A (en) 1998-05-01 2000-11-07 Sun Microsystems, Inc. Voice user interface with personality
FI981154A (en) * 1998-05-25 1999-11-26 Nokia Mobile Phones Ltd Voice identification procedure and apparatus
US7082391B1 (en) * 1998-07-14 2006-07-25 Intel Corporation Automatic speech recognition
US6243076B1 (en) 1998-09-01 2001-06-05 Synthetic Environments, Inc. System and method for controlling host system interface with point-of-interest data
FR2783625B1 (en) * 1998-09-21 2000-10-13 Thomson Multimedia Sa SYSTEM INCLUDING A REMOTE CONTROL DEVICE AND A VOICE REMOTE CONTROL DEVICE OF THE DEVICE
US6240347B1 (en) 1998-10-13 2001-05-29 Ford Global Technologies, Inc. Vehicle accessory control with integrated voice and manual activation
US6928614B1 (en) 1998-10-13 2005-08-09 Visteon Global Technologies, Inc. Mobile office with speech recognition
US6230129B1 (en) * 1998-11-25 2001-05-08 Matsushita Electric Industrial Co., Ltd. Segment-based similarity method for low complexity speech recognizer
US6192343B1 (en) 1998-12-17 2001-02-20 International Business Machines Corporation Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms
US7206747B1 (en) 1998-12-16 2007-04-17 International Business Machines Corporation Speech command input recognition system for interactive computer display with means for concurrent and modeless distinguishing between speech commands and speech queries for locating commands
US8275617B1 (en) 1998-12-17 2012-09-25 Nuance Communications, Inc. Speech command input recognition system for interactive computer display with interpretation of ancillary relevant speech query terms into commands
US6233560B1 (en) 1998-12-16 2001-05-15 International Business Machines Corporation Method and apparatus for presenting proximal feedback in voice command systems
US6937984B1 (en) 1998-12-17 2005-08-30 International Business Machines Corporation Speech command input recognition system for interactive computer display with speech controlled display of recognized commands
US7567677B1 (en) * 1998-12-18 2009-07-28 Gateway, Inc. Noise reduction scheme for a computer system
US6230135B1 (en) 1999-02-02 2001-05-08 Shannon A. Ramsay Tactile communication apparatus and method
US6408301B1 (en) 1999-02-23 2002-06-18 Eastman Kodak Company Interactive image storage, indexing and retrieval system
US6345254B1 (en) * 1999-05-29 2002-02-05 International Business Machines Corp. Method and apparatus for improving speech command recognition accuracy using event-based constraints
US6421655B1 (en) * 1999-06-04 2002-07-16 Microsoft Corporation Computer-based representations and reasoning methods for engaging users in goal-oriented conversations
US6308157B1 (en) * 1999-06-08 2001-10-23 International Business Machines Corp. Method and apparatus for providing an event-based “What-Can-I-Say?” window
US6871179B1 (en) * 1999-07-07 2005-03-22 International Business Machines Corporation Method and apparatus for executing voice commands having dictation as a parameter
US6374226B1 (en) 1999-08-06 2002-04-16 Sun Microsystems, Inc. System and method for interfacing speech recognition grammars to individual components of a computer program
US6510414B1 (en) 1999-10-05 2003-01-21 Cisco Technology, Inc. Speech recognition assisted data entry system and method
US6594630B1 (en) * 1999-11-19 2003-07-15 Voice Signal Technologies, Inc. Voice-activated control for electrical device
US7319962B2 (en) * 1999-12-24 2008-01-15 Medtronic, Inc. Automatic voice and data recognition for implanted medical device instrument systems
FR2803927B1 (en) * 2000-01-14 2002-02-22 Renault METHOD AND DEVICE FOR CONTROLLING EQUIPMENT ON-VEHICLE USING VOICE RECOGNITION
US7047196B2 (en) * 2000-06-08 2006-05-16 Agiletv Corporation System and method of voice recognition near a wireline node of a network supporting cable television and/or video delivery
US6408277B1 (en) 2000-06-21 2002-06-18 Banter Limited System and method for automatic task prioritization
US8290768B1 (en) 2000-06-21 2012-10-16 International Business Machines Corporation System and method for determining a set of attributes based on content of communications
US9699129B1 (en) 2000-06-21 2017-07-04 International Business Machines Corporation System and method for increasing email productivity
US6510410B1 (en) * 2000-07-28 2003-01-21 International Business Machines Corporation Method and apparatus for recognizing tone languages using pitch information
CN1272698C (en) * 2000-10-11 2006-08-30 佳能株式会社 Information processing device, information processing method, and storage medium
US7644057B2 (en) 2001-01-03 2010-01-05 International Business Machines Corporation System and method for electronic communication management
US8095370B2 (en) 2001-02-16 2012-01-10 Agiletv Corporation Dual compression voice recordation non-repudiation system
DE10115899B4 (en) * 2001-03-30 2005-04-14 Siemens Ag Method for creating computer programs by means of speech recognition
US7610547B2 (en) * 2001-05-04 2009-10-27 Microsoft Corporation Markup language extensions for web enabled recognition
US20020178182A1 (en) * 2001-05-04 2002-11-28 Kuansan Wang Markup language extensions for web enabled recognition
US7409349B2 (en) * 2001-05-04 2008-08-05 Microsoft Corporation Servers for web enabled speech recognition
US7506022B2 (en) * 2001-05-04 2009-03-17 Microsoft.Corporation Web enabled recognition architecture
US7203188B1 (en) 2001-05-21 2007-04-10 Estara, Inc. Voice-controlled data/information display for internet telephony and integrated voice and data communications using telephones and computing devices
US7020841B2 (en) 2001-06-07 2006-03-28 International Business Machines Corporation System and method for generating and presenting multi-modal applications from intent-based markup scripts
EP1405159A1 (en) * 2001-07-03 2004-04-07 Koninklijke Philips Electronics N.V. Interactive display and method of displaying a message
US7711570B2 (en) * 2001-10-21 2010-05-04 Microsoft Corporation Application abstraction with dialog purpose
US8229753B2 (en) 2001-10-21 2012-07-24 Microsoft Corporation Web server controls for web enabled recognition and/or audible prompting
US20040034529A1 (en) * 2002-08-14 2004-02-19 Hooper Howard Gaines Multifunction printer that converts and prints voice data
US7421390B2 (en) * 2002-09-13 2008-09-02 Sun Microsystems, Inc. Method and system for voice control of software applications
US7389230B1 (en) 2003-04-22 2008-06-17 International Business Machines Corporation System and method for classification of voice signals
US20040230637A1 (en) * 2003-04-29 2004-11-18 Microsoft Corporation Application controls for speech enabled recognition
US7363060B2 (en) * 2003-05-02 2008-04-22 Nokia Corporation Mobile telephone user interface
US20050187913A1 (en) 2003-05-06 2005-08-25 Yoram Nelken Web-based customer service interface
US8495002B2 (en) 2003-05-06 2013-07-23 International Business Machines Corporation Software tool for training and testing a knowledge base
US20050009604A1 (en) * 2003-07-11 2005-01-13 Hsien-Ta Huang Monotone voice activation device
US8160883B2 (en) 2004-01-10 2012-04-17 Microsoft Corporation Focus tracking in dialogs
US7552055B2 (en) * 2004-01-10 2009-06-23 Microsoft Corporation Dialog component re-use in recognition systems
CN1691581B (en) * 2004-04-26 2010-04-28 彭诗力 Multi-pattern matching algorithm based on characteristic value
CN100403255C (en) * 2005-03-17 2008-07-16 英华达(上海)电子有限公司 Method of using voice to operate game
JP4667138B2 (en) * 2005-06-30 2011-04-06 キヤノン株式会社 Speech recognition method and speech recognition apparatus
KR100632400B1 (en) * 2005-11-11 2006-10-11 한국전자통신연구원 Apparatus and method for input/output using voice recognition
US8229733B2 (en) * 2006-02-09 2012-07-24 John Harney Method and apparatus for linguistic independent parsing in a natural language systems
US20080213047A1 (en) * 2006-08-21 2008-09-04 Bryant Corwin J Systems and methods for liner tensioning in pipeline rehabilitation
WO2008136081A1 (en) * 2007-04-20 2008-11-13 Mitsubishi Electric Corporation User interface device and user interface designing device
US8150699B2 (en) * 2007-05-17 2012-04-03 Redstart Systems, Inc. Systems and methods of a structured grammar for a speech recognition command system
US8538757B2 (en) * 2007-05-17 2013-09-17 Redstart Systems, Inc. System and method of a list commands utility for a speech recognition command system
US8620652B2 (en) * 2007-05-17 2013-12-31 Microsoft Corporation Speech recognition macro runtime
US20080312929A1 (en) * 2007-06-12 2008-12-18 International Business Machines Corporation Using finite state grammars to vary output generated by a text-to-speech system
US7962344B2 (en) * 2007-06-29 2011-06-14 Microsoft Corporation Depicting a speech user interface via graphical elements
US8165886B1 (en) 2007-10-04 2012-04-24 Great Northern Research LLC Speech interface system and method for control and interaction with applications on a computing system
US8595642B1 (en) 2007-10-04 2013-11-26 Great Northern Research, LLC Multiple shell multi faceted graphical user interface
CN101436404A (en) * 2007-11-16 2009-05-20 鹏智科技(深圳)有限公司 Conversational biology-liked apparatus and conversational method thereof
US8958848B2 (en) * 2008-04-08 2015-02-17 Lg Electronics Inc. Mobile terminal and menu control method thereof
KR20090107365A (en) * 2008-04-08 2009-10-13 엘지전자 주식회사 Mobile terminal and its menu control method
US8762963B2 (en) * 2008-12-04 2014-06-24 Beck Fund B.V. L.L.C. Translation of programming code
KR101528266B1 (en) * 2009-01-05 2015-06-11 삼성전자 주식회사 Portable terminal and method for offering application thereof
US8606578B2 (en) * 2009-06-25 2013-12-10 Intel Corporation Method and apparatus for improving memory locality for real-time speech recognition
CN101976186B (en) * 2010-09-14 2013-04-03 方正科技集团苏州制造有限公司 Voice recognition method of computer and computer
US20120089392A1 (en) * 2010-10-07 2012-04-12 Microsoft Corporation Speech recognition user interface
US20120155663A1 (en) * 2010-12-16 2012-06-21 Nice Systems Ltd. Fast speaker hunting in lawful interception systems
WO2012169679A1 (en) * 2011-06-10 2012-12-13 엘지전자 주식회사 Display apparatus, method for controlling display apparatus, and voice recognition system for display apparatus
WO2013022135A1 (en) * 2011-08-11 2013-02-14 Lg Electronics Inc. Electronic device and method of controlling the same
US10186262B2 (en) * 2013-07-31 2019-01-22 Microsoft Technology Licensing, Llc System with multiple simultaneous speech recognizers
US9653073B2 (en) * 2013-11-26 2017-05-16 Lenovo (Singapore) Pte. Ltd. Voice input correction
US9589564B2 (en) * 2014-02-05 2017-03-07 Google Inc. Multiple speech locale-specific hotword classifiers for selection of a speech locale
KR102281178B1 (en) * 2014-07-09 2021-07-23 삼성전자주식회사 Method and apparatus for recognizing multi-level speech
US11741951B2 (en) * 2019-02-22 2023-08-29 Lenovo (Singapore) Pte. Ltd. Context enabled voice commands
CN110598671B (en) * 2019-09-23 2022-09-27 腾讯科技(深圳)有限公司 Text-based avatar behavior control method, apparatus, and medium
CN117255988A (en) * 2021-03-01 2023-12-19 苹果公司 Virtual object placement based on finger expression
US20230169967A1 (en) * 2021-11-30 2023-06-01 Google Llc Dynamic assistant suggestions during assistant browsing

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CH644246B (en) * 1981-05-15 1900-01-01 Asulab Sa SPEECH-COMMANDED WORDS INTRODUCTION DEVICE.
JPS58195957A (en) * 1982-05-11 1983-11-15 Casio Comput Co Ltd Program starting system by voice
US4704696A (en) * 1984-01-26 1987-11-03 Texas Instruments Incorporated Method and apparatus for voice control of a computer
US4980918A (en) * 1985-05-09 1990-12-25 International Business Machines Corporation Speech recognition system with efficient storage and rapid assembly of phonological graphs
US4759068A (en) * 1985-05-29 1988-07-19 International Business Machines Corporation Constructing Markov models of words from multiple utterances
US4776016A (en) * 1985-11-21 1988-10-04 Position Orientation Systems, Inc. Voice control system
US4839634A (en) * 1986-12-01 1989-06-13 More Edward S Electro-optic slate for input/output of hand-entered textual and graphic information
PH24865A (en) * 1987-03-24 1990-12-26 Ibm Mode conversion of computer commands
US4931950A (en) * 1988-07-25 1990-06-05 Electric Power Research Institute Multimedia interface and method for computer system
US5157384A (en) * 1989-04-28 1992-10-20 International Business Machines Corporation Advanced user interface
DE3928049A1 (en) * 1989-08-25 1991-02-28 Grundig Emv VOICE-CONTROLLED ARCHIVE SYSTEM
EP0438662A2 (en) * 1990-01-23 1991-07-31 International Business Machines Corporation Apparatus and method of grouping utterances of a phoneme into context-de-pendent categories based on sound-similarity for automatic speech recognition
JPH04163618A (en) * 1990-10-26 1992-06-09 Oki Electric Ind Co Ltd Sound operation computer
US5182773A (en) * 1991-03-22 1993-01-26 International Business Machines Corporation Speaker-independent label coding apparatus
EP0576628A1 (en) * 1991-08-02 1994-01-05 Broderbund Software, Inc. System for interactve performance and animation of prerecorded audiovisual sequences

Also Published As

Publication number Publication date
DE69420888T2 (en) 2000-04-27
ATE185203T1 (en) 1999-10-15
CA2115210C (en) 1997-09-23
US5664061A (en) 1997-09-02
EP0621531A1 (en) 1994-10-26
KR970006403B1 (en) 1997-04-28
DE69420888D1 (en) 1999-11-04
JPH06348452A (en) 1994-12-22
CN1086484C (en) 2002-06-19
JP2856671B2 (en) 1999-02-10
CN1105464A (en) 1995-07-19
EP0621531B1 (en) 1999-09-29

Similar Documents

Publication Publication Date Title
CA2115210A1 (en) Interactive Computer System Recognizing Spoken Commands
US5983186A (en) Voice-activated interactive speech recognition device and method
US6553342B1 (en) Tone based speech recognition
US4969194A (en) Apparatus for drilling pronunciation
US4811399A (en) Apparatus and method for automatic speech recognition
CN107972028B (en) Man-machine interaction method and device and electronic equipment
CN100587806C (en) Speech recognition method and apparatus thereof
EP0805434A3 (en) Method and system for speech recognition using continuous density hidden Markov models
CA2213699A1 (en) A communication system and method using a speaker dependent time-scaling technique
JP2000501847A (en) Method and apparatus for obtaining complex information from speech signals of adaptive dialogue in education and testing
AU4049389A (en) Language training
GB2278708A (en) Children's speech training aid
CA2181205A1 (en) Discriminative Utterance Verification for Connected Digits Recognition
JPH096389A (en) Voice recognition interactive processing method and voice recognition interactive device
CA2077728A1 (en) Speech coding apparatus having speaker dependent prototypes generated from a nonuser reference data
WO1998011537A3 (en) Process for the multilingual use of a hidden markov sound model in a speech recognition system
Rudnicky et al. Interactive problem solving with speech
EP0755046A3 (en) Speech recogniser using a hierarchically structured dictionary
US5278911A (en) Speech recognition using a neural net
US5440661A (en) Time series association learning
EP0916972A3 (en) Speech recognition method and speech recognition device
Chanjaradwichai et al. Design and evaluation of a non-verbal voice-controlled cursor for point-and-click tasks
JP3083915U (en) Dog emotion discrimination device based on phonetic feature analysis of call
JPH07168520A (en) Education device for languages with device for discriminating learning skillfulness
JP2685429B2 (en) Voice recognition device

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed