CN105989840A - 自然语言语音服务环境中的混合处理的系统及方法 - Google Patents
自然语言语音服务环境中的混合处理的系统及方法 Download PDFInfo
- Publication number
- CN105989840A CN105989840A CN201610183358.1A CN201610183358A CN105989840A CN 105989840 A CN105989840 A CN 105989840A CN 201610183358 A CN201610183358 A CN 201610183358A CN 105989840 A CN105989840 A CN 105989840A
- Authority
- CN
- China
- Prior art keywords
- natural language
- sounding
- equipment
- audio coding
- computer system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2203/00—Indexing scheme relating to G06F3/00 - G06F3/048
- G06F2203/038—Indexing scheme relating to G06F3/038
- G06F2203/0381—Multimodal input, i.e. interface arrangements enabling the user to issue commands by simultaneous use of input devices of different nature, e.g. voice plus gesture on digitizer
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
Abstract
本发明可以提供一种自然语言语音服务环境中的混合处理的系统及方法,该自然语言语音服务环境包括多个多模式设备。具体地,混合处理通常可以包括多个多模式设备合作地解释和处理包括在一个或多个多模式请求中的一个或多个自然语言发声。例如,虚拟路由器可以接收包括自然语言发声所对应的编码的音频的各种消息,该自然语言发声包含在提供给设备中的一个或多个设备的多模式交互中。然后虚拟路由器可以分析编码的音频以选择自然语言发声的最干净的样本,并与环境中的一个或多个其它设备通信以确定多模式交互的目的。接着虚拟路由器可以根据多模式交互的目的协调解析多模式交互。
Description
本申请是申请日为2010年9月16日、申请号为201080061105.8、发明名称为“自然语言语音服务环境中的混合处理的系统及方法”的中国专利申请的分案申请。
技术领域
本发明涉及包括多个多模式设备的自然语言语音服务环境中的混合处理,其中,自然语言语音服务环境中的混合处理可以包括多个多模式设备合作地解释和处理包括在一个或多个多模式请求中的一个或多个自然语言发声。
背景技术
近年来,随着技术的进步,在许多人的日常生活中,消费性电子设备已变得几乎无处不在。为了满足手机、导航设备、嵌入式设备及其它这样的设备的功能性和移动性的发展而引起的日益增长的需求,许多设备除了核心应用之外,还提供丰富的特性和功能。然而,更强大的功能性也带来了折中,包括通常抑制用户充分利用其电子设备的所有性能的学习曲线(learning curve)。例如,许多现有的电子设备包括可能不是特别方便用户使用的复杂的人机界面,这会妨碍许多技术的大规模市场采用。此外,繁琐的界面还经常会导致难以发现或使用所需的特性(例如,因为菜单复杂或导航起来很冗长繁琐)。因此,许多用户往往不使用或甚至不知道其设备的许多潜在性能。
因此,电子设备的增加的功能性经常倾向于被浪费,正如市场调查表明,许多用户只使用给定设备上的可用特性或应用的仅一小部分。此外,在无线联网和宽带接入越来越普遍的社会中,消费者自然倾向于要求其电子设备具有无缝移动性能。因此,随着消费者对更简单的与电子设备交互的机制的需求加强,妨碍快速和集约化交互的繁琐界面成为重点关注的问题。然而,对使用以直观方式的技术的机制的日益增长的需求仍未得到充分满足。
一种趋于简化电子设备中的人机交互的方法已包括使用语音识别软件,该语音识别软件具有使用户利用不熟悉的、未知的或难以使用的特性的潜力。例如,最近由Navteq公司进行的提供各种应用(例如汽车导航和基于网络的应用)中所使用的数据的一项调查表明,语音识别通常在电子设备的消费者所最期望的特性中占首位。即便如此,现有的语音用户界面在实际工作时,仍要求用户大量学习。
例如,许多现有的语音用户界面只支持根据特定的命令与控制序列或语法所制定的请求。此外,许多现有的语音用户界面因为不准确的语音识别而导致用户受挫或不满。类似地,通过迫使用户提供预先建立的命令或关键字来以系统能理解的方式传递请求,现有的语音用户界面未有效地使用户参与富有成效的、合作的对话中来解析请求并促进会话朝着令人满意的目标进行(例如,当用户可能对特定需要、可用信息、设备性能等不确定时)。因此,现有的语音用户界面往往遭受各种各样的缺点,包括对使用户以合作和会话的方式参与对话有很大的局限性。
另外,许多现有的语音用户界面达不到利用分布在不同领域、设备和应用程序中的信息以解析基于自然语言语音的输入。因此,现有的语音用户界面遭受到局限于一组有限的应用程序(所述语音用户界面针对所述应用程序设计)或局限于存在有所述语音用户界面的设备。尽管技术进步已使用户经常使用几个设备来满足其各种需要,但现有的语音用户界面未使用户充分摆脱设备限制。例如,用户可能对与不同应用程序和设备相关的服务感兴趣,但现有的语音用户界面往往限制用户访问其认为合适的应用程序和设备。此外,在任一给定时间,用户通常只能实际携带有限数量的设备,而在各种情况下,可能会需要用户设备所关联的、与当前正使用的内容或服务不同的内容或服务。
因此,虽然用户往往具有不同需要,其中在各种背景或环境中可能需要与不同设备相关的内容或服务,但现有的语音技术往往不足以提供这样的综合环境:用户可请求几乎与任何设备或网络相关的内容或服务。因此,现有的语音服务环境中对信息可用性和设备交互机制的限制往往妨碍用户以直观的、自然的且有效的方式体验技术。例如,当用户希望使用给定电子设备执行给定功能、但不一定知道如何去执行该功能时,用户通常不能参与与设备的合作的、多模式的交互以仅发出自然语言的词来请求该功能。
此外,使用不具有语音识别性能的电子设备执行相对简单的功能通常是很繁琐的。例如,购买新的手机铃音往往是相对简单的过程,但用户通常必须导航几个菜单和按许多不同的按钮来完成该过程。因此,很显然,如果用户能够使用自然语言来利用隐藏的或难以使用的功能,则与电子设备的交互可以有效得多。现有的系统具有这些问题和其它问题。
发明内容
根据本发明的一个方面,一种自然语言语音服务环境中的混合处理的系统及方法可以解决现有系统的前述问题中的一个或多个问题。特别地,自然语言语音服务环境中的混合处理通常可以包括多个多模式设备合作地解释和处理包括在一个或多个多模式请求中的一个或多个自然语言发声,如本申请中更详细地描述。
根据本发明的一个方面,自然语言语音服务环境中的混合处理通常可以包括多个多模式设备合作地解释和处理包括在一个或多个多模式请求中的一个或多个自然语言发声。例如,虚拟路由器可以接收包括自然语言发声所对应的编码的音频的各种消息,自然语言发声包含在提供给所述设备中的一个或多个设备的多模式交互中。然后虚拟路由器可以分析编码的音频以选择自然语言发声的最干净的样本,并与环境中的一个或多个其它设备通信以确定多模式交互的目的。然后虚拟路由器可以根据多模式交互的目的协调解析多模式交互。
根据本发明的一个方面,一种自然语言语音服务环境中的混合处理的方法可以包括电子设备检测至少一个多模式交互,其中,所述多模式交互至少包括自然语言发声。然后可以将包含与所述多模式交互相关的信息的一个或多个消息传递给与所述电子设备通信的虚拟路由器,其中,所述电子设备通过消息传送接口将所述一个或多个消息传递给所述虚拟路由器。该方法还可以包括所述电子设备通过消息传送接口接收包含与所述多模式交互的目的相关的信息的一个或多个消息。因此,所述电子设备可以根据包含在从所述虚拟路由器所接收的一个或多个消息中的信息解析所述多模式交互。
根据本发明的一个方面,一种自然语言语音服务环境中的混合处理的方法可以包括与虚拟路由器通信的一个或多个电子设备。具体地,所述电子设备可以配置为检测至少一个至少包括自然语言发声的多模式交互,并通过消息传送接口将包含与所述多模式交互相关的信息的一个或多个消息传递给虚拟路由器。然后所述电子设备可以通过所述消息传送接口从所述虚拟路由器接收包含与所述多模式交互的目的相关的信息的一个或多个消息,且所述电子设备根据包含在从所述虚拟路由器所接收的一个或多个消息中的信息解析所述多模式交互。
根据以下附图和详细的描述,本发明的其它目的和优势将变得明显。
附图说明
图1示出根据本发明的一个方面的可用于自然语言语音服务环境中的混合处理的示例性语音支持设备的框图;
图2示出根据本发明的一个方面的自然语言语音服务环境中的混合处理的示例性系统的框图;
图3示出根据本发明的一个方面的初始化合作执行自然语言语音服务环境中的混合处理的各种设备的示例性方法的流程图;
图4到图5示出根据本发明的一个方面的自然语言语音服务环境中的混合处理的示例性方法的流程图。
具体实施方式
根据本发明的一个方面,图1示出可用于自然语言语音服务环境中的混合处理的示例性语音支持(voice-enabled)设备100的框图。从本申请所要提供的进一步描述中将了解到,图1所示的语音支持设备100通常可包括输入设备112或输入设备112的组合,输入设备112可以使用户以多模式方式与语音支持设备100交互。具体地,输入设备112通常可包括至少一个语音输入设备112(例如,麦克风)和至少一个非语音输入设备112(例如,鼠标、触摸屏显示器、滚轮选择器(wheel selector)等)的任一适当组合。因此,输入设备112可以包括具有接收基于语音的输入和基于非语音的输入的机构的电子设备的任一适当组合(例如,连接到车载通信设备、个人导航设备、手机、VoIP(互联网语音传输协议)节点、个人电脑、媒体设备、嵌入式设备、服务器或其它电子设备中的一个或多个设备的麦克风)。
在一个实现中,语音支持设备100可以使用户参与各种多模式会话性交互,该语音支持设备100可以以形式自由且合作的方式处理所述多模式会话交互,以执行各种任务、解析各种查询或者解析多模式交互中所包括的各种自然语言请求。例如,在一个实现中,语音支持设备100可以包括各种自然语言处理部件,所述自然语言处理部件至少包括连接到一个或多个输入设备112的语音点击模块,如在2009年2月20日提交的、名称为“System and Method for ProcessingMulti-Modal Device Interactions in a Natural Language Voice ServicesEnvironment”、序号为12/389,678的共同代决的美国专利申请中的更详细描述,该美国专利申请的全部内容通过引用结合在本申请中。因此,在本申请中将进行更详细地描述,一个或多个输入设备112和语音点击模块可以共同配置为处理用户和语音支持设备100之间的各种多模式交互。
例如,在一个实现中,多模式交互可以包括至少一个自然语言发声,其中,可将自然语言发声转换为电子信号。然后可将该电子信号提供给自动语音识别器(ASR)120,ASR 120也可以被称为语音识别引擎120和/或多通道语音识别引擎120。响应于接收该发声所对应的电子信号,ASR 120可以产生该发声的一个或多个初步解释,并将该初步解释提供给会话语言处理器130。此外,在一个实现中,多模式交互可以包括与一个或多个输入设备112的一个或多个非语音交互(例如,按钮推动、多点触控手势、焦点或关注的焦点选择等)。因此,语音点击模块可以从非语音交互中提取上下文,并将上下文提供给会话语言处理器130(即通过图1中所示的虚线)以在产生发声的解释中使用。因此,如以下更详细描述的,会话语言处理器130可以分析发声和任何伴随的非语音交互,以确定与语音支持设备100的多模式交互的目的。
在一个实现中,如上所述,语音支持设备100可以包括各种可支持形式自由的发声和/或其它形式的非语音设备交互的自然语言处理部件,这可将用户从与制定命令、查询或其它请求的方式有关的限制中解放出来。因此,用户可使用任一说话方式来将发声提供给语音输入设备112,且还可将其它非语音交互提供给非语音输入设备112,以通过语音支持设备100请求可用的任何内容或服务。例如,在一个实现中,响应于语音输入设备112接收到发声,可使用在2008年7月8日被授权为第7,398,209号美国专利的、名称为“Systems and Methods forResponding to Natural Language Speech Utterance”的序号为10/452,147的美国专利申请和在2003年6月15日提交的、名称为“Mobile Systems and Methods forResponding to Natural Language Speech Utterance”的序号为10/618,633的共同代决的美国专利申请中所描述的技术来处理该发声,上述美国专利申请的全部内容通过引用结合在本申请中。此外,用户可以与非语音输入设备112中的一个或多个进行交互,以提供按钮推动、多点触控手势、焦点或关注的焦点选择或其它非语音设备交互,这可以提供与自然语言发声和/或所请求的内容或服务相关的进一步上下文信息或其它信息。
在一个实现中,语音支持设备100可以连接到一个或多个附加系统,该一个或多个附加系统可以配置为与语音支持设备100合作以解释或处理包括自然语言发声和/或非语音设备交互的组合的多模式交互。例如,如下文结合图2的更详细的描述,所述一个或多个附加系统可以包括一个或多个具有与语音支持设备100相似的自然语言处理性能的多模式语音支持设备、一个或多个具有数据检索和/或任务执行性能的非语音设备以及协调语音支持设备100和所述附加系统之间的交互的虚拟路由器。因此,语音支持设备100可以包括到包括多个多模式设备的综合自然语言语音服务环境的接口,其中,用户可以通过任一多模式设备来请求可用内容或服务。
例如,在一个实现中,会话语言处理器130可以包括星座模型132b,星座模型132b提供与语音服务环境中可用的内容、服务、应用程序、目的确定性能和其它特性相关的知识,如在2008年5月27日提交的、名称为“System andMethod for an Integrated,Multi-Modal,Multi-Device Natural Language VoiceServices Environment”、序号为12/127,343的共同代决的美国专利申请中所述的,该美国专利申请的全部内容通过引用结合在本申请中。因此,语音支持设备100可以利用各种系统和多模式设备所共享的与自然语言处理性能、上下文、在前交互、领域知识、短期知识、长期知识和认知模型相关的知识,为解析语音支持设备100所接收的多模式交互提供合作环境。
在一个实现中,输入设备112及其连接的语音点击模块可以配置为持续监控语音支持设备100所接收的一个或多个多模式交互。特别地,输入设备112和语音点击模块可以持续监控一个或多个自然语言发声和/或一个或多个可区别的非语音设备交互,所述一个或多个自然语言发声和/或一个或多个可区别的非语音设备交互可以共同提供用以检索内容、执行任务、调用服务或命令、或处理任何其它合适请求的相关上下文。因此,响应于检测到一个或多个多模式交互,输入设备112和/或语音点击模块可以发信号来向该语音支持设备100通知已接收到发声和/或非语音交互。例如,在一个实现中,非语音交互可以提供用以增强识别、解释和理解所伴随的发声的上下文,此外,该发声可以提供用以提高所伴随的非语音交互的解释的进一步的上下文。因此,发声和非语音交互可以共同提供相关上下文,各种自然语言处理部件可以使用该相关上下文来确定包括发声和非语音交互的多模式交互的目的。
在一个实现中,如上所述,可以在ASR 120上发起处理包括在多模式交互中的发声,其中,ASR 120可以产生该发声的一个或多个初步解释。在一个实现中,为了产生该发声的初步解释,ASR 120可以配置为利用一种或多种动态识别语法和/或声学模型识别来自该发声的一个或多个音节、词、语段或其它声学特性。例如,在一个实现中,ASR 120可以根据语音听写技术来使用动态识别语法和/或声学模型识别来自发声的一连串音位,如在2009年12月15日被授权为第7,634,409号美国专利的、名称为“Dynamic Speech Sharpening”、序号为11/513,269的美国专利申请所述的,该美国专利申请的全部内容通过引用结合在本申请中。此外,动态识别语法和/或声学模型可以包括不重读的中元音(例如,“弱读元音”),所述不重读的中元音可以减小用于识别发声的一连串音位的搜索空间。
此外,在一个实现中,ASR 120可以配置为多通道语音识别引擎120,如在2009年12月29日被授权为第7,640,160号美国专利的、名称为“Systems andMethods for Responding to Natural Language Speech Utterance”、序号为11/197,504的美国专利申请所述的,该美国专利申请的全部内容通过引用结合在本申请中。多通道语音识别引擎120可以配置为首先调用初级语音识别引擎以产生发声的第一转译,且还可选地随后调用一个或多个次级语音识别引擎以产生发声的一个或多个第二转译。在一个实现中,可利用一大列表的听写语法产生第一转译,而可利用虚拟听写语法或其它动态识别语法产生第二转译,该虚拟听写语法具有词汇表外的字、源自会话历史的减少的词汇表的诱导词。例如,在一个实现中,如果第一转译的可信度不能达到或超过一阈值,则可以调用次级语音识别引擎以增强初级语音识别引擎的解释。然而,显然,多通道语音识别引擎120可利用技术的任一适当组合来解释该发声,该任一适当组合形成用于该发声的源自多个转译通道的初步解释(例如,可以不管第一转译的可信度如何,都调用次级语音识别引擎,或者初级语音识别引擎和/或次级语音识别引擎可以使用相同的或针对特定的解释上下文优化的识别语法,等等)。
因此,在一个实现中,可以针对不同的语言、上下文、领域、内存限制和/或其它合适的标准,优化ASR 120中所使用的动态识别语法。例如,在一个实现中,语音支持设备100可以包括一个或多个为特定上下文或领域提供内容或服务的应用程序134,例如导航应用程序134。因此,响应于ASR 120将导航确定为发声的最可能的上下文,可以针对各种物理的、时间的、方向的或其它地理的特性,优化动态识别语法(例如,如在2007年12月11日提交的、名称为“System and Method for Providing a Natural Language Voice User Interface in anIntegrated Voice Navigation Services Environment”、序号为11/954,064的共同代决的美国专利申请所述,该美国专利申请的全部内容通过引用结合在本申请中)。在另一示例中,包含词“交通(traffic)”的发声可能有不同的解释,这取决于用户是否意指导航上下文(即路面交通)、音乐上下文(即二十世纪六十年代的摇滚乐队)或电影上下文(即Steven Soderbergh的电影)。因此,在ASR 120中所使用的识别语法可以动态地适用于优化任何给定发声的准确识别(例如,响应于错误地解释包含特定词或语句的发声,可将错误解释从识别语法中去除,以防止重复发生该错误解释)。
在一个实现中,响应于ASR 120使用一种或多种上述技术产生包括在多模式交互中的发声的初步解释,ASR 120可以将初步解释提供给会话语言处理器130。会话语言处理器130通常可包括各种自然语言处理部件,所述各种自然语言处理部件可以配置为对人与人的会话或交互建模。因此,会话语言处理器130可调用自然语言处理部件中的一个或多个来进一步分析发声的初步解释和任何伴随的非语音交互,以确定语音支持设备100所接收的多模式交互的目的。
在一个实现中,会话语言处理器120可以调用目的确定引擎130a,该目的确定引擎配置为确定语音支持设备100所接收的多模式交互的目的。在一个实现中,目的确定引擎130a可调用为确定目的而提供长期和短期语义知识的知识增强的语音识别引擎,如在2005年8月29日提交的、名称为“Mobile Systemsand Methods of Supporting Natural Language Human-Machine Interactions”、序号为11/212,693的共同代决的美国专利申请所述,其全部内容通过引用结合在本申请中。例如,在一个实现中,语义知识可以基于:源自一个或多个与用户的在前交互的个性化认知模型,源自一个或多个与各种不同用户的在前交互的一般认知模型,和/或源自与用户、语音支持设备100相关的环境和/或语音服务环境(例如,环境噪声特性、位置敏感信息等)的环境认知模型。
此外,目的确定引擎132a可调用上下文跟踪引擎132d,以确定多模式交互的上下文。例如,可将源自多模式交互中的自然语言发声和/或非语音交互的任何上下文推送到与上下文跟踪引擎132d相关的上下文堆栈,其中,该上下文堆栈可以包括各种条目,可根据一个或多个从认知模型所确定的上下文和当前多模式交互的上下文加权或排列所述各种条目。因此,上下文跟踪引擎132d可以确定上下文堆栈中的一个或多个与当前多模式交互所相关的信息匹配的条目,以确定当前多模式交互的最可能的上下文。然后上下文跟踪引擎132d可将该最可能的上下文提供给目的确定引擎132a,该目的确定引擎可以基于最可能的上下文确定多模式交互的目的。
此外,根据最可能的上下文,目的确定引擎132a可以参考星座模型132b,以确定是否调用语音服务环境中的各种系统或多模式设备中的任何一种。例如,如上所述,星座模型132b可以提供通过所述各种系统和多模式设备可用的目的确定性能、领域知识、语义知识、认知模型和其它信息。因此,目的确定引擎132a可以参考星座模型132b,以确定其它系统和/或多模式设备中的一个或多个是否应当参与确定多模式交互的目的。例如,响应于星座模型132b指示其它系统和/或多模式设备中的一个或多个具有针对最可能的上下文的优化的自然语言处理性能,目的确定引擎132a可将与多模式交互相关的信息转发给这样的系统和/或多模式设备,这样的系统和/或多模式设备然后可确定多模式交互的目的并将目的确定返回到语音支持设备100。
在一个实现中,会话语言处理器130可配置为使用户参与一个或多个合作性会话中以解析目的或处理多模式交互,如在2006年10月16日提交的、名称为“System and Method for a Cooperative Conversational Voice User Interface”、序号为11/580,926的共同代决的美国专利申请所述,该美国专利申请的全部内容通过引用结合在本申请中。特别地,会话语言处理器130通常可确定多模式交互的会话目标,其中,该会话目标可以从分析发声、非语音交互、最可能的上下文和/或确定的目的而确定。因此,针对多模式交互所确定的的会话目标通常可控制会话语言处理器130和用户之间的合作性会话。例如,会话语言处理器130通常可使用户参与一个或多个查询会话、教导会话和/或探索性会话中以解析或处理多模式交互。
具体地,响应于确定出会话目标涉及检索离散信息或执行特定功能,则会话语言处理器130可以使用户参与查询回话。因此,在合作性查询会话中,用户可以将会话导向实现特定会话目标,而会话语言处理器130可以发起一个或多个查询、任务、命令或其它请求以实现该目标,从而支持会话中的用户。响应于多模式交互的目的的歧义性或不确定性,会话语言处理器130可以使用户参与教导会话以解决歧义性或不确定性(例如,噪声或用词错误干扰解释发声,无法对多个可能的上下文消除歧义,等等)。因此,在合作性教导会话中,会话语言处理器130可以将会话导向弄清多模式交互的目的(例如,产生通过输出设备114所提供的反馈),而用户可以调节会话并提供附加的多模式交互以弄清目的。响应于确定具有适当可信度的多模式交互的目的,其中该目的指示歧义的或不确定的目标,则会话语言处理器130可使用户参与探索性会话以解析该目标。在合作性的探索性会话中,会话语言处理器130和用户可以共享领导者和支持者的角色,其中,在会话过程中可以改进或完善歧义的或不确定的目标。
因此,会话语言处理器130通常可参与到一个或多个合作性会话中以为语音支持设备100所接收的多模式交互确定目的和解析特定目标。然后,会话语言处理器130可以发起一个或多个促成针对多模式交互所确定的目的和目标的查询、任务、命令或其它请求。例如,在一个实现中,会话语言处理器130可以调用:一个或多个代理132c,所述一个或多个代理具有处理特定领域或应用程序134中的请求的性能;语音搜索引擎132f,所述语音搜索引擎具有检索多模式交互中请求的信息的性能(例如,从一个或多个数据存储库136、网络或连接到语音支持设备100的其它信息源);或者一个或多个其他系统或多模式设备,所述其他系统或多模式设备具有用于促进多模式交互的目的和目标(例如,如从星座模型132b所确定的)的合适的处理性能。
此外,在一个实现中,会话语言处理器130可以调用涉及发起以处理多模式交互的查询、任务、命令或其它请求的广告应用程序134,其中,广告应用程序134可以配置为选择一个或多个可能与多模式交互的目的和/或目标相关的广告,如在2007年2月6日提交的、名称为“System and Method for Selecting andPresenting Advertisements Based on Natural Language Processing of Voice-BasedInput”、序号为11/671,526的共同代决的美国专利申请所述,该美国专利申请的全部内容通过引用结合在本申请中。
在一个实现中,响应于接收来自针对多模式交互所处理的查询、任务、命令或其它请求的任一合适组合的结果,会话语言处理器130可将该结果格式化以通过输出设备114显示给用户。例如,该结果可被格式化为自然语言发声,该自然语言发声可转换成电信号并通过连接至输出设备114的扬声器提供给用户,或者该结果可以被可视地呈现在连接至输出设备114的显示器上,或者该结果可以采用任何其它合适的方式(例如,该结果可以指示是否成功地执行了特定任务或命令,或者该结果可以包括响应于一个或多个查询而检索到的信息,或者如果该结果是歧义的或不完整的,则其可包括制定随后的多模式交互的请求,等等)。
此外,在一个实现中,会话语言处理器130可以包括错误识别引擎132e,该错误识别引擎配置为确定会话语言处理器130是否错误地确定了多模式交互的目的。在一个实现中,响应于在时间上接近于在前的多模式交互提供的一个或多个随后的多模式交互,错误识别引擎132e可以确定会话语言处理器130错误地确定了目的,如在2009年11月17日被授权为第7,620,549号美国专利的、名称为“System and Method of Supporting Adaptive Misrecognition inConversational Speech”、序号为11/200,164的美国专利申请所述的,该美国专利的全部内容通过引用结合在本申请中。例如,错误识别引擎132e可以监控包括停用词的一个或多个随后的多模式交互、推翻当前请求、或者指示未识别或误识别的事件。然后错误识别引擎132e可以为与ASR 120和/或会话语言处理器130相关的各种部件确定一个或多个调整参数,以改进随后的解释。
因此,如上文更详细的描述,语音支持设备100通常可以包括可用于自然语言语音服务环境中的混合处理的各种自然语言处理部件和性能。具体地,语音支持设备100可以配置为确定包括自然语言发声和/或非语音交互的任一适当组合的各种多模式交互的目的,并根据所确定的目的处理一个或多个查询、任务、命令或其它请求。此外,如上所述和如下更详细的描述,一个或多个其它系统和/或多模式设备可以参与为多模式交互确定目的和处理查询、任务、命令或其它请求,以提供混合处理方法,其中,语音支持设备100与各种其它系统和多模式设备可以各执行部分处理以确定目的,以及以合作的方式处理多模式交互。例如,在一个实现中,合作处理的多模式交互可以包括致力于内容(例如,到特定的人物或实体)的请求,其中,可以使用在2009年11月10日提交的、名称为“System and Method for Providing a Natural Language ContentDedication Service”、序号为61/259,820的美国临时专利申请中所描述的技术处理多模式交互,该美国临时专利申请的全部内容通过引用结合在本申请中。
根据本发明的一个方面,图2示出自然语言语音服务环境中的混合处理的示例性系统的框图。具体地,图2中所示的系统通常可以包括类似于上文结合图1描述的语音支持设备的语音支持客户端设备210。例如,语音支持客户端设备210可以包括分别用于接收自然语言多模式交互和提供自然语言多模式交互的响应的输入设备和输出设备215a的任一适当组合。此外,语音支持客户端设备210可以包括自动语音识别器(ASR)220a,ASR 220a配置为产生输入设备215a所接收的自然语言发声的一个或多个初步解释且还配置为将所述初步解释提供给会话语言处理器230a。
在一个实现中,语音支持客户端设备210上的会话语言处理器230a可以包括一个或多个自然语言处理部件,可调用所述一个或多个自然语言处理部件以确定语音支持客户端设备210所接收的多模式交互的目的。然后会话语言处理器230a可以发起一个或多个查询、任务、命令或其它请求以解析所确定的目的。例如,会话语言处理器230a可以调用一个或多个应用程序234a以处理特定领域中的请求,查询一个或多个数据存储库236a以检索多模式交互中请求的信息,或参与一个或多个与语音支持客户端设备210的用户的合作性会话以解析所确定的目的。此外,如上文结合图1所述,语音支持客户端设备210还可与一个或多个具有用于发起查询、任务、命令或其它请求以解析多模式交互的目的的适当的处理性能的其它系统或多模式设备合作。
具体地,为了在混合处理环境中与其它系统或多模式设备合作,语音支持客户端设备210可以使用消息传送接口250a来与虚拟路由器260进行通信,其中,消息传送接口250a通常可包括轻客户端(或瘦客户端),轻客户端为语音支持客户端设备210提供向虚拟路由器260发送输入和从虚拟路由器260接收输出的机制。此外,虚拟路由器260还可以包括消息传送接口250b,消息传送接口250b提供用以与一个或多个附加的语音支持设备270a-n、一个或多个非语音设备280a-n和语音支持服务器240通信的机制。另外,尽管图2将消息传送接口250a和消息传送接口250b示为与其所通信地连接的设备分开的部件,但是显然,这样的图示仅为了便于描述,因为可将消息传送接口250a-b设置为实施在图2中所示的各种设备上的板上部件,以促进混合处理环境中的各种设备之间的通信。
例如,在一个实现中,实施在语音支持客户端设备210上的消息传送接口250a可以将来自语音支持客户端设备210的输入在一个或多个XML消息内发送给虚拟路由器260,其中,所述输入可以包括自然语言发声所对应的编码的音频、自然语言发声的初步解释、多点触控手势、焦点或关注的焦点选择和/或其它多模式交互所对应的数据。在一个实现中,然后虚拟路由器260可以使用会话语言处理器230c进一步处理该输入,会话语言处理器230c具有语音识别、目的确定、适应性错误识别和/或其它自然语言处理的性能。此外,会话语言处理器230c可以包括与通过混合处理环境中的各种设备可用的内容、服务、应用程序、自然语言处理性能和其它特性相关的知识。
因此,在一个实现中,虚拟路由器260还可以通过消息传送接口250b来与语音支持设备270、非语音设备280和/或语音支持服务器240通信,以协调处理从语音支持客户端设备210所接收的输入。例如,根据与混合处理环境中的各种设备的特性和性能相关的知识,虚拟路由器260可以确定设备中具有用于解析从语音支持客户端设备210所接收的输入的目的的合适的特性和/或性能的一个或多个设备。然后虚拟路由器260可通过相应的消息传送接口250b来将该输入的一个或多个要素转发给所确定的设备,其中,可以调用所确定的设备以对从虚拟路由器260所转发的输入的要素进行任何适当的处理。在一个实现中,然后所确定的设备可通过相应的消息传送接口250b将任何处理结果返回给虚拟路由器260,其中,虚拟路由器260可以校对处理结果并通过消息传送接口250a将结果返回给语音支持客户端设备210。
因此,虚拟路由器260可通过消息传送接口250a-b与混合处理环境中的任何可用设备通信,以协调多模式交互或从语音客户端设备210所接收的其它自然语言输入的合作性混合处理。例如,在一个实现中,合作性混合处理可用于提高嵌入式处理架构中的性能,在该嵌入式处理架构中,语音支持客户端设备210包括有限数量的资源(例如,语音支持客户端设备210可以为具有有限数量的内存储器或用于自然语言处理的其他专用资源的移动设备)。因此,当语音支持客户端设备210具有嵌入式处理架构时,语音支持客户端设备210的一个或多个部件可以配置为优化板上自然语言处理的效率,以减少或消除瓶颈、长响应时间或性能降级。
例如,在一个实现中,优化板上自然语言处理的效率可以包括将ASR 220a配置成使用具有词汇表外的字、源自会话历史的减少的词汇表的诱导词的虚拟听写语法或其它动态识别语法(例如,为特定语言、上下文、领域、内存限制和/或其它合适的标准而优化的语法)。在另一示例中,板上应用程序234a和/或数据存储库236a可以与为语音支持客户端设备210提供特定特性和性能的嵌入式应用程序组相关联。例如,可以将语音支持客户端设备210嵌入在汽车车载通信系统、个人导航设备、全球定位系统、手机或用户经常请求基于位置的服务的另一设备中。因此,在这种情况下,可以优化嵌入式应用程序组中的板上应用程序234a和数据存储库236a,以提供可在板上有效处理的某些基于位置的服务(例如,目的地输入、导航、地图控制、音乐搜索、免提拨号等)。
此外,尽管可针对嵌入式架构中的效率优化语音支持客户端设备210的部件,但用户可以请求混合处理环境中可用的任何合适的内容、服务、应用程序和/或其它特性,且在混合处理环境中的其它设备可以共同提供自然语言处理性能,以为语音支持客户端设备210补充嵌入式自然语言处理性能。例如,语音支持客户端设备210可使用嵌入式自然语言处理性能,对特定的多模式交互进行初步处理(例如,板上ASR 220a可以进行高级虚拟听写以部分地转译多模式交互中的发声,板上会话语言处理器230a可以确定多模式交互的初步目的等),其中,可将初步处理的结果提供给虚拟路由器260以进一步处理。
在一个实现中,响应于确定出板上性能不能适当地解释交互(例如,如果部分转译的可信度不满足特定阈值),或者响应于确定出应在板外处理交互(例如,如果初步解释指示交互涉及要求在语音支持服务器240上进行大型计算的本地搜索请求),语音支持客户端设备210还可以将多模式交互所对应的输入传送给虚拟路由器260。因此,虚拟路由器260可以捕获从语音支持客户端设备210所接收的输入,并协调提供自然语言处理性能的语音支持设备270和语音支持服务器240以及提供检索数据或执行任务的性能的非语音设备280之间的进一步处理。此外,响应于虚拟路由器260调用语音支持设备270中的一个或多个,可以优化提供给语音支持设备270的输入,以适应从被调用的语音支持设备270所请求的处理(例如,为了避免过度征用处理资源,特定的语音支持设备270可被提供部分转译或初步解释,且针对给定上下文或领域解析目的)。
可替选地,响应于虚拟路由器260调用语音支持服务器240,提供给语音支持设备270的输入还可以包括自然语言发声所对应的编码的音频和任何其它与多模式交互相关的数据。具体地,如图2所示,除了语音支持服务器240可包括消除了语音支持客户端设备210所经受的限制的大量处理资源之外,语音支持服务器240可以具有类似于语音支持客户端设备210的自然语言处理架构。因此,当语音支持服务器240在多模式交互的混合处理中协作时,可以将自然语言发声所对应的编码的音频和任何其它与多模式交互相关的数据提供给语音支持服务器240,以最大化语音支持服务器240正确确定多模式交互的目的的可能性(例如,ASR 220b可以进行多通道语音识别,以产生自然语言发声的准确转译,会话语言处理器230b可以在任意数量的不同上下文或领域中所进行的目的确定之间进行仲裁,等等)。因此,总之,在图2所示的环境中所进行的混合处理技术通常可以包括各种不同的设备,这些设备可以包括或不包括自然语言性能、合作地确定特定多模式交互的目的以及采取行动以解析该目的。
尽管已在嵌入式处理架构的背景下具体地描述了上述的合作性混合处理技术,但这种技术不一定局限于嵌入式处理架构。具体地,同样的技术可以应用在具有各种设备的任何适当的语音服务环境中,所述各种设备可以协作以发起查询、任务、命令或其它请求以解析多模式交互的目的。此外,在一个实现中,语音支持客户端设备210可以包括适当数量的存储器或可致力于自然语言处理的其它资源(例如,语音支持客户端设备210可以为台式电脑或可处理自然语言而基本不降低性能的其它设备)。在这种情况下,语音支持客户端设备210的部件中的一个或多个部件可以配置为以可以在嵌入式架构中导致瓶颈、长响应时间或性能降级的方式优化板上自然语言处理。例如,在一个实现中,优化板上自然语言处理可以包括将ASR 220a配置成:除了和/或替代在嵌入式处理架构中所使用的虚拟听写语法,使用一大列表的听写语法。
尽管如此,如下文参照图3~图5的更详细的描述,合作性混合处理技术可以是基本相似的,不管语音支持客户端设备210是具有嵌入式架构还是具有非嵌入式架构。具体地,不管语音支持客户端设备210的架构如何,合作性混合处理都可以包括语音支持客户端设备210可选地执行自然语言多模式交互的初步处理以及通过消息传送接口250a将多模式交互所对应的输入传送给虚拟路由器260以进一步处理。可替选地(或另外),合作性混合处理可以包括虚拟路由器260通过消息传送接口250b协调混合环境中的各种设备间的输入的进一步处理以及随后通过消息传送接口250a将任何处理结果返回给语音支持客户端设备210。
根据本发明的各个方面,图3示出用于初始化合作执行自然语言语音服务环境中的混合处理的各种设备的示例性方法的流程图。具体地,如上所述,混合处理环境通常可以包括可合作处理自然语言多模式交互的各种不同设备间的通信。例如,在一个实现中,在混合处理环境中的各种设备可以包括虚拟路由器,该虚拟路由器具有用于与一个或多个语音支持设备、一个或多个非语音设备和/或语音支持服务器通信的一个或多个消息传送接口。因此,在一个实现中,图3所示的方法可以用于初始化混合处理环境中的通信,以开启随后的在混合处理环境中的任一特定设备上所接收的一个或多个自然语言多模式交互的合作处理。
在一个实现中,在混合处理环境中的各种设备可以配置为连续地监听或监控相应的输入设备,以确定自然语言多模式交互是否发生。因此,图3所示的方法可以用于校准、同步或初始化连续监听自然语言多模式交互的各种设备。例如,如上文参照图2所述,混合处理环境中的虚拟路由器、语音支持设备、非语音设备、语音支持服务器和/或其它设备可以配置为提供各种不同的性能或服务,其中,图3所示的初始化方法可以用于确保混合处理环境获得适当的信号,以处理任一特定的自然语言多模式交互并适当调用设备中的一个或多个设备来合作处理自然语言多模式交互。此外,可以调用图3所示的且本申请所述的方法,以注册混合处理环境中的各种设备、注册添加到混合处理环境的新设备、公布领域、服务、目的确定性能和/或其它在注册的设备上所支持的特性、同步注册的设备的本地计时和/或初始化混合处理环境中的设备的任何其它适当的方面。
在一个实现中,初始化混合处理环境中的各种设备可以包括操作310,其中,可以为混合处理环境中的每个设备建立设备监听器。在操作310中所建立的设备监听器通常可以包括可在各种设备上执行的指令、固件或其它例程的任一适当组合,以确定性能、特性、所支持的领域或其它与设备相关的信息。例如,在一个实现中,在操作310中所建立的设备监听器可以配置为使用针对辅助计算机设备所设计的通用即插即用协议与相应的设备通信,但很显然,可合适地替代与各种设备通信的任一适当机制。
响应于为混合处理环境中所注册的每个设备建立设备监听器(或响应于为混合处理环境中所新注册的任一设备建立设备监听器),在操作320中,可以同步设备监听器。具体地,每个注册设备可以具有为输入的自然语言多模式交互指示本地计时的内部时钟或其它计时机制,其中,操作320可以用于根据相应设备的内部时钟或计时机制来同步操作310中所建立的设备监听器。因此,在一个实现中,在操作320中的使设备监听器同步可以包括每个设备监听器公布与相应设备的内部时钟或本地计时相关的信息。例如,设备监听器可以向虚拟路由器公布与内部时钟或本地计时相关的信息,从而虚拟路由器可以随后协调在混合处理环境中的设备中的一个或多个设备上所接收的自然语言多模式交互的合作混合处理。然而,显然,可将与混合处理环境中的各种设备的内部时钟或本地计时相关的信息公布给其它语音支持设备、其它非语音设备、语音支持服务器和/或任何其它可参与提供给混合处理环境的自然语言多模式交互的合作处理的适当设备。
在一个实现中,响应于为在混合处理环境中所注册的各种设备建立和同步设备监听器,在操作330中,设备监听器可以连续地监听或监控相应地所注册设备上的相应设备,以检测与一个或多个自然语言多模式交互相关的信息。例如,设备监听器可以配置为:响应于检测输入的自然语言发声、与输入的自然语言发声相关的焦点或关注的焦点选择和/或与输入的自然语言多模式交互相关的另一交互或交互序列,来检测自然语言多模式交互的发生。此外,操作330还可以包括适当的设备监听器捕获自然语言发声和/或与自然语言发声相关的非语音设备交互。
在一个实现中,接着在操作340中,可以分析捕获的自然语言发声和相关的非语音设备交互,以管理混合处理环境中的随后的合作处理。在一个实现中,例如,操作340可以确定是一个设备监听器还是多个设备监听器捕获与在操作330中所检测的自然语言多模式交互相关的信息。具体地,如上所述,混合处理环境通常可以包括合作处理自然语言多模式交互的各种不同设备,从而可将与自然语言多模式交互相关的信息提供给混合处理环境中的一个或多个设备。因此,操作340可以确定是一个设备监听器还是多个设备监听器捕获与自然语言多模式交互相关的信息,以便确定混合处理环境是否需要使信号在捕获与多模式交互相关的信息的各种设备监听器之间同步。
例如,与混合处理环境进行交互的用户可以浏览呈现在非语音显示设备上呈现的网页,并提供请求更多关于购买网页上所显示的产品的信息的自然语言多模式交互。然后用户可以使用鼠标、键盘或其它非语音输入设备来在该网页上选择包含该产品名称的文本,并将自然语言发声提供给麦克风或其它语音支持设备,例如“这在亚马逊网站上有售吗?”在本示例中,在操作330中,与非语音显示设备相关的设备监听器可以检测关于该产品名称的文本选择,且在操作330中,与语音支持设备相关的设备监听器可以进一步检测询问产品可购性的自然语言发声。此外,在一个实现中,用户可能在多个语音支持设备的适用范围内,这可导致多个设备监听器捕获自然语言发声所对应的不同信号(例如,交互可以发生在语音支持手机、语音支持车载设备和/或其它语音支持设备的范围内,这取决于混合处理环境中的各种设备的布置和配置)。
因此,如本申请更详细的描述,响应于操作340确定多个设备监听器捕获与自然语言多模式交互相关的信息,可以发起使与多个设备监听器所接收的多模式交互相关的不同信号同步的一系列操作。另一方面,响应于操作340确定仅一个设备监听器捕获与自然语言多模式交互相关的信息,可在操作390中处理自然语言多模式交互,而不执行使不同信号同步的一系列操作(即,该一个设备监听器提供与多模式交互相关的全部输入信息,使得可在操作390中发起交互的混合处理,而不使不同的输入信号同步)。然而,在一个实现中,响应于一个设备监听器捕获自然语言发声和一个或多个非语音交互,也可发起该系列同步操作,以将与自然语言多模式交互相关的不同信号对齐,如本申请更详细的描述。
如上所述,接收到与在操作330中所检测的自然语言多模式交互相关的输入的设备监听器可以具有内部时钟或其它本地计时机制。因此,响应于在操作340中确定一个或多个设备监听器捕获与自然语言多模式交互相关的不同信号,可在操作350中发起针对不同信号的该系列同步操作。具体地,操作350可以包括一个或多个设备监听器根据与相应设备监听器相关的内部时钟或其它本地计时机制确定相应信号的本地计时信息,其中,接着可以同步针对相应信号所确定的本地计时信息。
例如,在一个实现中,可在操作360中发起使相应信号的本地计时信息同步。具体地,操作360通常可以包括向接收到与多模式交互相关的输入的每个设备监听器通知针对每个相应信号所确定的本地计时信息。例如,在一个实现中,每个设备监听器可以将相应信号的本地计时信息提供给虚拟路由器,然后虚拟路由器可以将所有信号的本地计时信息提供给每个设备监听器。因此,在一个实现中,操作360可以引起每个设备监听器接收到一通知,该通知包括与在操作330中所检测的自然语言多模式交互相关的每个不同信号的本地计时信息。可替选地(或另外),虚拟路由器可以从每个设备监听器收集每个不同信号的本地计时信息,并进一步同步不同信号的本地计时信息,以启动自然语言多模式交互的混合处理。
在一个实现中,任一具体的自然语言多模式交互可以至少包括自然语言发声,且还可以包括一个或多个与自然语言发声相关的附加设备交互。如上所述,通常可以在附加设备交互之前、同时或之后接收发声。因此,可以在操作370中同步不同信号的本地计时信息,以启动自然语言多模式交互的混合处理。具体地,操作370可以包括将自然语言发声所对应的一个或多个信号的本地计时信息和/或任何与自然语言发声相关的附加设备交互所对应的一个或多个信号的本地计时信息对齐。此外,操作370还可以包括将自然语言发声信号和附加设备交互所对应的信号的本地计时信息对齐。
因此,在匹配发声信号和非语音设备交互信号时,可以向参与自然语言多模式交互的混合处理的任何设备提供彼此对齐的语音成分和/或非语音成分。例如,在一个实现中,可以在虚拟路由器上执行操作370,然后虚拟路由器可将对齐的计时信息提供给在混合处理中可以被调用的任何其它设备。可替选地(或另外),参与混合处理的其它设备中的一个或多个可以本地地对齐计时信息(例如,响应于虚拟路由器调用混合处理中的语音支持服务器,可以利用与语音支持服务器相关的资源对齐计时信息并保留虚拟路由器上的通信带宽)。
此外,在一个实现中,在操作380中,虚拟路由器和/或混合处理环境中的其它设备可以分析自然语言发声所对应的信号,以选择最干净的样本来进一步处理。具体地,如上所述,虚拟路由器可以包括用以从一个或多个语音支持设备接收自然语言发声所对应的编码的音频样本的消息传送接口。例如,虚拟路由器所接收的音频样本可以包括以MPEG-1音频层3(MP3)格式或另一有损格式所编码的自然语言发声,以保留混合处理环境中的通信带宽。然而,显然,可替选地(或另外),响应于混合处理环境具有足够的通信带宽用于处理可提供更好的自然语言发声样本的无损音频,可以使用无损音频压缩编码(FLAC)格式或另一无损格式编码音频样本。
不管是以有损格式还是以无损格式编码音频样本,在操作380中,可以选择自然语言发声所对应的提供最干净的样本的信号。例如,一个语音支持设备可以在噪音环境中或与干扰产生干净的音频样本的条件相关,而另一语音支持设备可以包括麦克风阵列或配置为采用使编码的语音的保真度最大化的技术。因此,响应于在操作330中接收自然语言发声所对应的多个信号,在操作380中可以选择最干净的信号,然后在操作390中可以发起自然语言发声的混合处理。
因此,在图3中所示出的且在本申请中所描述的同步和初始化技术可以确保混合处理环境同步自然语言多模式交互所对应的每个信号且产生用以在操作390中进一步处理的输入,以最可能形成正确的目的确定。此外,在使信号同步和选择最干净的音频样本以在操作390中进一步处理时,在图3中所示出的且在本申请中所描述的技术可以确保混合处理环境中没有设备对自然语言多模式交互采取行动,直到识别出在操作390中所要使用的适当信号。因此,在操作390中可以发起自然语言多模式交互的混合处理,如本申请中更详细地描述的。
根据本发明的一个方面,图4示出在自然语言语音服务环境中的一个或多个客户端设备上执行混合处理的示例性方法的流程图。具体地,如下文将参照图5进行更详细的描述的,一个或多个客户端设备可以通过消息传送接口来与虚拟路由器合作执行混合处理,该消息传送接口通信地连接客户端设备和虚拟路由器。例如,在一个实现中,消息传送接口通常可以包括轻客户端(或瘦客户端),轻客户端为客户端设备提供用以向虚拟路由器发送与自然语言多模式交互相关的输入的机制,且轻客户端还为客户端设备提供用以从虚拟路由器接收与自然语言多模式交互相关的输出的机制。
例如,在一个实现中,响应于在操作410中客户端设备中的一个或多个接收自然语言多模式交互,可以发起客户端设备上的混合处理。具体地,自然语言多模式交互通常可以包括在连接至接收了自然语言多模式交互的客户端设备的麦克风或其它语音支持输入设备上所接收的自然语言发声,且还可以包括一个或多个与自然语言发声相关的其它附加输入形式(例如,文本选择、按钮按下、多点触控手势等)。因此,在操作410中所接收的自然语言多模式交互可以包括提供给客户端设备的一个或多个查询、命令或其它请求,其中,然后可以在操作420中发起自然语言多模式交互的混合处理。
如上文更详细地描述的,自然语言语音服务环境通常可以包括一个或多个语音支持客户端设备、一个或多个非语音设备、语音支持服务器和虚拟路由器,该虚拟路由器布置成与语音支持客户端设备、非语音设备和语音支持服务器中的每一个通信。在一个实现中,因此,虚拟路由器可以协调语音支持客户端设备、非语音设备和语音支持服务器之间的针对自然语言多模式交互的混合处理。因此,本申请所描述的混合处理技术通常可以指虚拟路由器以涉及在多个阶段中解析自然语言多模式交互的目的的方式,协调自然语言多模式交互的合作处理。
具体地,如上文参照图3的描述,可以初始化合作执行混合处理的各种设备,以启动自然语言多模式交互的合作处理。因此,在一个实现中,在操作420中,响应于初始化各种设备,接收与自然语言多模式交互相关的输入的每个客户端设备可以进行相应输入的初步处理。例如,在一个实现中,在操作420中,接收包括在多模式交互中的自然语言发声的客户端设备可以执行初始处理,该初始处理包括编码发声所对应的音频样本、部分或完全转译发声、确定发声的初步目的、或针对发声执行任何其它合适的初步处理。此外,也可以在接收与发声相关的附加输入形式中的一个或多个的客户端设备上执行操作420中的初始处理。例如,在操作420中为附加输入形式所执行的初始处理可以包括识别选择的文本、选择的焦点或关注的焦点,或者产生任何其它可用于进一步解释发声的合适的数据。在一个实现中,接着操作430可以包括确定混合处理环境是否已被配置为自动地将与自然语言多模式交互相关的输入路由至虚拟路由器。
例如,在一个实现中,响应于在操作410中多个客户端设备接收包括在多模式交互中的自然语言发声,操作430可以确定已配置发生自动路由。在本示例中,操作420中所执行的初始处理可以包括多个客户端设备编码发声所对应的相应的音频样本,其中,然后在操作460中,可将包括编码的音频样本的消息发送给虚拟路由器。然后虚拟路由器可以选择一个提供最干净的信号的编码的音频样本并协调随后的针对自然语言多模式交互的混合处理,如下文参照图5的更详细的描述。在另一示例中,响应于引起确定多模式交互涉及可能最适合于在语音支持服务器上处理的请求的初始处理(例如,该请求可以涉及基于位置的搜索查询或者另一命令或任务,该另一命令或任务要求语音支持服务器上所管理的资源、内容、应用程序、领域或存在于与接收请求的客户端设备不同的一个或多个设备上的其它信息等),操作430可以确定已配置发生自动路由。然而,显然,混合处理环境可以酌情配置为响应于其它条件和/或不管任何附加条件是否存在而自动路由。
在一个实现中,响应于虚拟路由器协调自然语言多模式交互的混合处理,在操作470中,虚拟路由器可将混合处理的结果提供给客户端设备。例如,在操作470中提供给客户端设备的结果可以包括自然语言多模式交互的最终目的确定、在交互中所请求的信息、响应于执行交互中所请求的命令或任务而产生的数据和/或使客户端设备在操作480中完成自然语言请求的处理的其它结果。例如,在一个实现中,操作480可以包括:客户端设备根据从虚拟路由器返回的最终目的确定来执行查询、命令、任务或其它请求,呈现从虚拟路由器返回的所请求的信息,确认已执行所请求的命令或任务,和/或执行任何附加处理以解析自然语言请求。
回顾操作430,响应于确定出未满足触发自动路由的条件或者未配置自动路由器,则在操作440中客户端设备可以进一步处理自然语言多模式交互。在一个实现中,在操作440中的进一步处理可以包括客户端设备使用本地自然语言处理性能试图确定自然语言多模式交互的目的。例如,客户端设备可将包括在多模式交互中的任何非语音输入形式合并在包括在多模式交互中的发声的转译中。接着客户端设备上的会话语言处理器可以利用与上下文、领域、共享知识、标准值或其它信息相关的本地信息确定多模式交互的目的。然后客户端设备可以产生发声的一个或多个解释,以确定多模式交互的目的(例如,识别会话类型、包含在交互中的一个或多个请求等)。
在一个实现中,操作440还可以包括确定在客户端设备上所产生的目的确定的可信度(例如,可以响应于客户端设备是否包括多通道语音识别引擎、发声是否包含任何有歧义的词或语句、目的是否随上下文而不同等,得出可信度)。在一个实现中,接着操作450可以根据在操作440中所确定的可信度确定是否调用板外处理。例如,操作450通常可以包括确定在操作440中所确定的目的是否满足特定阈值,该特定阈值指示对确定的目的采取行动的可接受的可信度。因此,响应于目的确定的可信度满足阈值,操作450可以确定不调用板外处理。特别地,可信度满足阈值可以指示客户端设备具有足够的信息对确定的目的采取行动,从而在操作480中,客户端设备可以处理一个或多个查询、命令、任务或其它请求以解析多模式交互。
可替选地,响应于目的确定的可信度不满足阈值,操作450可以调用板外处理,板外处理可以包括在操作460中发送一个或多个消息给虚拟路由器。该一个或多个消息可以使虚拟路由器以上述类似的方式调用多模式交互的附加混合处理,且本申请将参照图5进行更详细地描述。
根据本发明的一个方面,图5示出在自然语言语音服务环境中的虚拟路由器上执行混合处理的示例性方法的流程图。具体地,虚拟路由器可以协调在一个或多个客户端设备上所接收的自然语言多模式交互的混合处理。在一个实现中,在操作510中,虚拟路由器可以接收与在语音服务环境中的客户端设备中的一个或多个客户端设备上所接收的自然语言多模式交互相关的一个或多个消息。例如,虚拟路由器可以包括将虚拟路由器通信地连接到客户端设备和语音支持服务器的消息传送接口,其中,该消息传送接口通常可以包括轻客户端(或瘦客户端),轻客户端为虚拟路由器提供用以从一个或多个客户端设备和/或语音支持服务器接收输入、且还向一个或多个客户端设备和/或语音支持服务器发送输出的机制。在操作510中所接收的消息通常可以包括多模式交互的任何合适的处理结果,从而虚拟路由器可以以包括可发生在虚拟路由器、客户端设备中的一个或多个客户端设备、语音支持服务器或其任一合适组合上的多个处理阶段的方式协调混合处理。
在一个实现中,虚拟路由器可以分析在操作510中所接收的消息以确定是否调用点对点模式的混合处理。例如,消息中的一个或多个消息可以包括初步目的确定,虚拟路由器可以使用该初步目的确定来确定是否调用客户端设备中的一个或多个客户端设备、语音支持服务器或其各种组合,以便执行多模式交互的多个处理阶段中的一个或多个处理阶段。在另一示例中,消息中的一个或多个消息可以包括编码的音频样本,虚拟路由器将该编码的音频样本转发给混合处理环境中的各种设备中的一个或多个设备。因此,在一个实现中,虚拟路由器可以分析在操作510中所接收的消息以确定是否调用语音支持服务器来处理多模式交互(例如,消息可以包括初步目的确定,该初步目的确定指示多模式交互包括需要存在于服务器上的资源的基于位置的请求)。
响应于虚拟路由器确定调用语音支持服务器,在操作530中,虚拟路由器可以将消息转发给服务器。具体地,转发给服务器的消息通常可以包括自然语言发声所对应的编码的音频和涉及其它与发声有关的输入形式的任何附加信息。例如,如上文参照图2的更详细地描述,语音支持服务器可以包括可适当地确定多模式交互的目的的各种自然语言处理部件,从而发送给语音支持服务器的消息可以包括编码的音频,以便允许语音支持服务器独立于客户端设备上的可能不准确或不完全的任何初步处理来确定目的。响应于语音支持服务器处理从虚拟路由器所接收的消息,在操作570中,接着可将处理结果返回给虚拟路由器。例如,很明显,结果可以包括:自然语言多模式交互的目的确定,响应于确定的目的所执行的任何查询、命令、任务或其它请求的结果,或者任何其它合适的结果。
可替选地,响应于操作520中的虚拟路由器确定出调用点对点模式,虚拟路由器可以协调一个或多个客户端设备、语音支持服务器或其任一适当组合之间的混合处理。例如,在一个实现中,在操作540中,虚拟路由器可以确定自然语言多模式交互的上下文,且在操作550中,根据确定的上下文选择一个或多个对等设备。例如,客户端设备中的一个或多个可以配置为提供按照确定的上下文的内容或服务,从而在操作560中,虚拟路由器可以将一个或多个消息发送给这种设备,以便请求这种内容和/或服务。在另一示例中,多模式交互可以包括涉及不同设备上所支持的多个上下文的复合请求,从而在操作560中,虚拟路由器可以将消息转发给每个这种设备,以便请求按照不同的上下文的适当内容和/或服务。
在又一示例中,交互可以包括将要在语音支持服务器上处理的请求,然而该请求可要求存在于客户端设备中的一个或多个客户端设备上的内容和/或服务(例如,涉及客户端设备中的一个或多个客户端设备上的地址簿中的条目的基于位置的查询)。因此,在操作560中,虚拟路由器通常可以将各种消息转发给所选的对等设备,以管理本申请中所描述的混合处理技术中的多个阶段。例如,虚拟路由器可将消息发送给一个或多个具有按照特定上下文的目的确定性能的语音支持客户端设备、一个或多个使用内容、服务和/或处理多模式交互所需的其它资源的非语音客户端设备、或其任一适当组合。因此,虚拟路由器可在操作560中发送消息给客户端设备和/或语音支持服务器,且在操作570中以任一适当方式(例如,并行地、顺序地、迭代地等)从客户端设备和/或语音支持服务器接收响应消息。然后在操作580中,虚拟路由器可以校对在响应消息中所接收的结果,且将结果返回给客户端设备中的一个或多个,用以结果的任何最终处理和/或呈现。
可以以硬件、固件、软件或其各种组合的方式进行本发明的实现。也可将本发明实现为存储在机器可读介质上的可通过一个或多个处理器读取和执行的指令。机器可读介质可以包括用以存储或发送机器(例如,计算设备)可读形式的信息的各种机制。例如,机器可读存储介质可以包括只读存储器、随机存取存储器、磁盘存储介质、光学存储介质、闪存设备或其它存储介质,机器可读发送介质可以包括各种形式的传播信号,例如载波、红外信号、数字信号或其它发送介质。此外,可以在以上公开内容中就本发明的具体示例方面和实现方式以及执行某些动作这些方面来描述固件、软件、程序或指令。然而,显然,这些描述仅是为了方便,且这些动作实际上由执行所述固件、软件、程序或指令的计算设备、处理器、控制器或其他装置产生。
因此,本发明的方面和实现方式可以在本文中描述为包括具体的特征、结构或性质,但将明显的是,每一方面或实现方式可以或者可以不一定包括具体的特征、结构或性质。此外,当具体的特征、结构或者性质已结合一给定的方面或实现方式予以描述时,应当理解,无论是否明确描述,这样的特征、结构或性质也可以包括在其他的方面或实现方式中。因此,可以对以上描述进行各种改变或修改,而不脱离本发明的精神或范围,因此,本说明书和附图应当仅看作示例性的,本发明的范围仅由所附权利要求确定。
Claims (23)
1.一种用于自然语言语音服务环境中的混合处理的方法,所述方法是在计算机系统中实现的,所述计算机系统包括编程有计算机程序指令的一个或多个物理处理器,所述计算机程序指令在被所述一个或多个物理处理器执行时将所述计算机系统编程为执行所述方法,所述方法包括:
所述计算机系统接收多个音频编码,其中,所述多个音频编码至少包括第一音频编码和第二音频编码,所述第一音频编码对应于第一电子设备所捕获的用户的自然语言发声,所述第二音频编码对应于第二电子设备所捕获的所述用户的所述自然语言发声;
所述计算机系统确定所述第一音频编码的音频特性的第一值;
所述计算机系统确定所述第二音频编码的音频特性的第二值;
所述计算机系统基于所述第一值和所述第二值选择所述第一音频编码或所述第二音频编码;以及
所述计算机系统基于所述第一音频编码和所述第二音频编码中所选择的一者来获得所述自然语言发声的目的。
2.如权利要求1所述的方法,其中,获得所述自然语言发声的所述目的包括:
所述计算机系统提供所述第一音频编码和所述第二音频编码中所选择的一者作为语音识别引擎的输入;
响应于接收作为所述语音识别引擎的输出的、从所述自然语言发声识别的词语或短语,所述计算机系统提供所述词语或短语作为会话语言处理器的输入;以及
所述计算机系统基于作为所述会话语言处理器的输入的所述词语或短语的提供来解释所述自然语言发声,以确定所述自然语言发声的所述目的。
3.如权利要求1所述的方法,其中,获得所述自然语言发声的所述目的包括:
所述计算机系统向至少第一设备提供所述第一音频编码和所述第二音频编码中所选择的一者;以及
所述计算机系统从所述第一设备接收所述自然语言发声的所述目的的第一指示。
4.如权利要求3所述的方法,其中,向至少所述第一设备提供所述第一音频编码和所述第二音频编码中所选择的一者包括:
所述计算机系统确定所述第一设备具有自然语言处理能力;以及
所述计算机系统基于所述第一设备的所述自然语言处理能力来识别所述第一设备。
5.如权利要求4所述的方法,所述方法还包括:
所述计算机系统确定所述自然语言发声的语境,其中,所述第一设备还基于所述自然语言发声的所述语境来选择。
6.如权利要求1所述的方法,所述方法还包括:
所述计算机系统确定与第一目的确定相关联的第一可信度;以及
所述计算机系统确定所述第一可信度是否满足与目的预测精度有关的可信度阈值,其中,响应于确定出所述第一可信度不满足所述可信度阈值,向第二设备提供所述第一音频编码和所述第二音频编码中所选择的一者。
7.如权利要求6所述的方法,所述方法还包括:
所述计算机系统从所述第二设备接收基于所述第一音频编码和所述第二音频编码中所选择的一者对所述发声的第二目的确定;
所述计算机系统确定与所述第二目的确定相关联的第二可信度;以及
所述计算机系统基于所述第一可信度和所述第二可信度将一个或多个消息传送至一个或多个客户端设备,所述一个或多个消息包含与所述发声的所述第一目的确定或所述发声的所述第二目的确定有关的信息。
8.如权利要求1所述的方法,所述方法还包括:
所述计算机系统将一个或多个消息传送至一个或多个客户端设备,所述一个或多个消息包含与所述发声的第一目的确定有关的信息。
9.如权利要求8所述的方法,所述方法还包括:
所述计算机系统确定所述自然语言发声的语境,其中,所述一个或多个消息还包括所述自然语言发声的所述语境。
10.如权利要求1所述的方法,所述方法还包括:
所述计算机系统从多个电子设备接收所述发声的一个或多个初步目的确定,其中,基于所述一个或多个初步目的确定将所述第一音频编码和所述第二音频编码中所选择的一者提供给第一设备。
11.如权利要求8所述的方法,所述方法还包括:
所述计算机系统从多个电子设备接收所述发声的一个或多个初步目的确定,其中,还基于所述一个或多个初步目的确定将所述一个或多个消息传送至所述一个或多个客户端设备。
12.一种用于自然语言语音服务环境中的混合处理的系统,所述自然语言语音服务环境包括一个或多个电子设备,所述系统包括:
编程有计算机程序指令的一个或多个物理处理器,所述计算机程序指令在被所述一个或多个物理处理器执行时将所述一个或多个物理处理器编程为:
接收多个音频编码,其中,所述多个音频编码至少包括第一音频编码和第二音频编码,所述第一音频编码对应于第一电子设备所捕获的用户的自然语言发声,所述第二音频编码对应于第二电子设备所捕获的所述用户的所述自然语言发声;
确定所述第一音频编码的音频特性的第一值;
确定所述第二音频编码的音频特性的第二值;
基于所述第一值和所述第二值选择所述第一音频编码或所述第二音频编码;以及
基于所述第一音频编码和所述第二音频编码中所选择的一者来获得所述自然语言发声的目的。
13.如权利要求12所述的系统,其中,获得所述自然语言发声的所述目的还被编程为:
提供所述第一音频编码和所述第二音频编码中所选择的一者作为语音识别引擎的输入;
响应于接收作为所述语音识别引擎的输出的、从所述自然语言发声识别的词语或短语,提供所述词语或短语作为会话语言处理器的输入;以及
基于作为所述会话语言处理器的输入的所述词语或短语的提供来解释所述自然语言发声,以确定所述自然语言发声的所述目的。
14.如权利要求12所述的系统,其中,获得所述自然语言发声的所述目的还被编程为:
向至少第一设备提供所述第一音频编码和所述第二音频编码中所选择的一者;以及
从所述第一设备接收所述自然语言发声的所述目的的第一指示。
15.如权利要求14所述的系统,其中,向至少所述第一设备提供所述第一音频编码和所述第二音频编码中所选择的一者还被编程为:
确定所述第一设备具有自然语言处理能力;以及
基于所述第一设备的所述自然语言处理能力来识别所述第一设备。
16.如权利要求15所述的系统,其中,所述一个或多个物理处理器还被编程为:
确定所述自然语言发声的语境,其中,所述第一设备还基于所述自然语言发声的所述语境来选择。
17.如权利要求12所述的系统,其中,所述一个或多个物理处理器还被编程为:
确定与第一目的确定相关联的第一可信度;以及
确定所述第一可信度是否满足与目的预测精度有关的可信度阈值,其中,响应于确定出所述第一可信度不满足所述可信度阈值,向第二设备提供所述第一音频编码和所述第二音频编码中所选择的一者。
18.如权利要求17所述的系统,其中,所述一个或多个物理处理器还被编程为:
从所述第二设备接收基于所述第一音频编码和所述第二音频编码中所选择的一者对所述发声的第二目的确定;
确定与所述第二目的确定相关联的第二可信度;以及
基于所述第一可信度和所述第二可信度将一个或多个消息传送至一个或多个客户端设备,所述一个或多个消息包含与所述发声的所述第一目的确定或所述发声的所述第二目的确定有关的信息。
19.如权利要求12所述的系统,其中,所述一个或多个物理处理器还被编程为:
将一个或多个消息传送至一个或多个客户端设备,所述一个或多个消息包含与所述发声的第一目的确定有关的信息。
20.如权利要求19所述的系统,其中,所述一个或多个物理处理器还被编程为:
确定所述自然语言发声的语境,其中,所述一个或多个消息还包括所述自然语言发声的所述语境。
21.如权利要求12所述的系统,其中,所述一个或多个物理处理器还被编程为:
从所述多个电子设备接收所述发声的一个或多个初步目的确定,其中,基于所述一个或多个初步目的确定将所述第一音频编码和所述第二音频编码中所选择的一者提供给第一设备。
22.如权利要求19所述的系统,其中,所述一个或多个物理处理器还被编程为:
从所述多个电子设备接收所述发声的一个或多个初步目的确定,其中,还基于所述一个或多个初步目的确定将所述一个或多个消息传送至所述一个或多个客户端设备。
23.一种用于自然语言语音服务环境中的混合处理的方法,所述方法是在计算机系统中实现的,所述计算机系统包括编程有计算机程序指令的一个或多个物理处理器,所述计算机程序指令在被所述一个或多个物理处理器执行时将所述计算机系统编程为执行所述方法,所述方法包括:
所述计算机系统接收自然语言发声;
所述计算机系统执行语音识别以确定所述自然语言发声的一个或多个词语;
所述计算机系统基于所述一个或多个词语来确定对所述自然语言发声的目的的预测;
所述计算机系统将所述自然语言发声的音频编码和对所述自然语言发声的所述目的的所述预测发送至虚拟路由器;
所述计算机系统从所述虚拟路由器接收所述自然语言发声的最终目的确定;以及
所述计算机系统基于所述最终目的确定提供对于所述自然语言发声的响应。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US25982709P | 2009-11-10 | 2009-11-10 | |
US61/259,827 | 2009-11-10 | ||
CN201080061105.8A CN102792294B (zh) | 2009-11-10 | 2010-09-16 | 自然语言语音服务环境中的混合处理的系统及方法 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201080061105.8A Division CN102792294B (zh) | 2009-11-10 | 2010-09-16 | 自然语言语音服务环境中的混合处理的系统及方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105989840A true CN105989840A (zh) | 2016-10-05 |
Family
ID=43974838
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610183358.1A Pending CN105989840A (zh) | 2009-11-10 | 2010-09-16 | 自然语言语音服务环境中的混合处理的系统及方法 |
CN201080061105.8A Active CN102792294B (zh) | 2009-11-10 | 2010-09-16 | 自然语言语音服务环境中的混合处理的系统及方法 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201080061105.8A Active CN102792294B (zh) | 2009-11-10 | 2010-09-16 | 自然语言语音服务环境中的混合处理的系统及方法 |
Country Status (6)
Country | Link |
---|---|
US (2) | US9171541B2 (zh) |
EP (2) | EP2499582A4 (zh) |
JP (1) | JP5663031B2 (zh) |
KR (3) | KR20120120163A (zh) |
CN (2) | CN105989840A (zh) |
WO (1) | WO2011059562A1 (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108597520A (zh) * | 2018-04-23 | 2018-09-28 | 四川斐讯信息技术有限公司 | 一种智能插排和智能插排的控制方法 |
CN111368549A (zh) * | 2018-12-25 | 2020-07-03 | 深圳市优必选科技有限公司 | 一种支持多种服务的自然语言处理方法、装置及系统 |
CN111539217A (zh) * | 2017-05-19 | 2020-08-14 | 北京蓦然认知科技有限公司 | 一种用于自然语言内容标题消歧的方法、设备和系统 |
Families Citing this family (308)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU6630800A (en) | 1999-08-13 | 2001-03-13 | Pixo, Inc. | Methods and apparatuses for display and traversing of links in page character array |
US8645137B2 (en) | 2000-03-16 | 2014-02-04 | Apple Inc. | Fast, language-independent method for user authentication by voice |
ITFI20010199A1 (it) | 2001-10-22 | 2003-04-22 | Riccardo Vieri | Sistema e metodo per trasformare in voce comunicazioni testuali ed inviarle con una connessione internet a qualsiasi apparato telefonico |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
US7669134B1 (en) | 2003-05-02 | 2010-02-23 | Apple Inc. | Method and apparatus for displaying information during an instant messaging session |
US20060271520A1 (en) * | 2005-05-27 | 2006-11-30 | Ragan Gene Z | Content-based implicit search query |
US7640160B2 (en) * | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
WO2007027989A2 (en) | 2005-08-31 | 2007-03-08 | Voicebox Technologies, Inc. | Dynamic speech sharpening |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
US7633076B2 (en) | 2005-09-30 | 2009-12-15 | Apple Inc. | Automated response to and sensing of user activity in portable devices |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US8977255B2 (en) | 2007-04-03 | 2015-03-10 | Apple Inc. | Method and system for operating a multi-function portable electronic device using voice-activation |
ITFI20070177A1 (it) | 2007-07-26 | 2009-01-27 | Riccardo Vieri | Sistema per la creazione e impostazione di una campagna pubblicitaria derivante dall'inserimento di messaggi pubblicitari all'interno di uno scambio di messaggi e metodo per il suo funzionamento. |
US9053089B2 (en) | 2007-10-02 | 2015-06-09 | Apple Inc. | Part-of-speech tagging using latent analogy |
US8364694B2 (en) | 2007-10-26 | 2013-01-29 | Apple Inc. | Search assistant for digital media assets |
US8620662B2 (en) * | 2007-11-20 | 2013-12-31 | Apple Inc. | Context-aware unit selection |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8327272B2 (en) | 2008-01-06 | 2012-12-04 | Apple Inc. | Portable multifunction device, method, and graphical user interface for viewing and managing electronic calendars |
US8065143B2 (en) | 2008-02-22 | 2011-11-22 | Apple Inc. | Providing text input using speech data and non-speech data |
US8289283B2 (en) | 2008-03-04 | 2012-10-16 | Apple Inc. | Language input interface on a device |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US10496753B2 (en) | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US9305548B2 (en) | 2008-05-27 | 2016-04-05 | Voicebox Technologies Corporation | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8464150B2 (en) | 2008-06-07 | 2013-06-11 | Apple Inc. | Automatic language identification for dynamic text processing |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8768702B2 (en) | 2008-09-05 | 2014-07-01 | Apple Inc. | Multi-tiered voice feedback in an electronic device |
US8898568B2 (en) | 2008-09-09 | 2014-11-25 | Apple Inc. | Audio user interface |
US8712776B2 (en) | 2008-09-29 | 2014-04-29 | Apple Inc. | Systems and methods for selective text to speech synthesis |
US8583418B2 (en) * | 2008-09-29 | 2013-11-12 | Apple Inc. | Systems and methods of detecting language and natural language strings for text to speech synthesis |
US8355919B2 (en) | 2008-09-29 | 2013-01-15 | Apple Inc. | Systems and methods for text normalization for text to speech synthesis |
US8352268B2 (en) | 2008-09-29 | 2013-01-08 | Apple Inc. | Systems and methods for selective rate of speech and speech preferences for text to speech synthesis |
US8352272B2 (en) | 2008-09-29 | 2013-01-08 | Apple Inc. | Systems and methods for text to speech synthesis |
US8396714B2 (en) | 2008-09-29 | 2013-03-12 | Apple Inc. | Systems and methods for concatenation of words in text to speech synthesis |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
US9959870B2 (en) | 2008-12-11 | 2018-05-01 | Apple Inc. | Speech recognition involving a mobile device |
US8862252B2 (en) | 2009-01-30 | 2014-10-14 | Apple Inc. | Audio user interface for displayless electronic device |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8380507B2 (en) | 2009-03-09 | 2013-02-19 | Apple Inc. | Systems and methods for determining the language to use for speech generated by a text to speech engine |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US10540976B2 (en) | 2009-06-05 | 2020-01-21 | Apple Inc. | Contextual voice commands |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
JP5471106B2 (ja) * | 2009-07-16 | 2014-04-16 | 独立行政法人情報通信研究機構 | 音声翻訳システム、辞書サーバ装置、およびプログラム |
US8909683B1 (en) | 2009-07-17 | 2014-12-09 | Open Invention Network, Llc | Method and system for communicating with internet resources to identify and supply content for webpage construction |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US20110110534A1 (en) * | 2009-11-12 | 2011-05-12 | Apple Inc. | Adjustable voice output based on device status |
US8682649B2 (en) | 2009-11-12 | 2014-03-25 | Apple Inc. | Sentiment prediction from textual data |
US8914401B2 (en) * | 2009-12-30 | 2014-12-16 | At&T Intellectual Property I, L.P. | System and method for an N-best list interface |
US8600743B2 (en) | 2010-01-06 | 2013-12-03 | Apple Inc. | Noise profile determination for voice-related feature |
US8381107B2 (en) | 2010-01-13 | 2013-02-19 | Apple Inc. | Adaptive audio feedback system and method |
US8311838B2 (en) | 2010-01-13 | 2012-11-13 | Apple Inc. | Devices and methods for identifying a prompt corresponding to a voice input in a sequence of prompts |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US9645996B1 (en) * | 2010-03-25 | 2017-05-09 | Open Invention Network Llc | Method and device for automatically generating a tag from a conversation in a social networking website |
US8639516B2 (en) | 2010-06-04 | 2014-01-28 | Apple Inc. | User-specific noise suppression for voice quality improvements |
US8713021B2 (en) | 2010-07-07 | 2014-04-29 | Apple Inc. | Unsupervised document clustering using latent semantic density analysis |
US9104670B2 (en) | 2010-07-21 | 2015-08-11 | Apple Inc. | Customized search or acquisition of digital media assets |
US8719006B2 (en) | 2010-08-27 | 2014-05-06 | Apple Inc. | Combined statistical and rule-based part-of-speech tagging for text-to-speech synthesis |
US8719014B2 (en) | 2010-09-27 | 2014-05-06 | Apple Inc. | Electronic device with text error correction based on voice recognition data |
US10762293B2 (en) | 2010-12-22 | 2020-09-01 | Apple Inc. | Using parts-of-speech tagging and named entity recognition for spelling correction |
US10515147B2 (en) | 2010-12-22 | 2019-12-24 | Apple Inc. | Using statistical language models for contextual lookup |
US20130332170A1 (en) * | 2010-12-30 | 2013-12-12 | Gal Melamed | Method and system for processing content |
KR101828273B1 (ko) * | 2011-01-04 | 2018-02-14 | 삼성전자주식회사 | 결합기반의 음성명령 인식 장치 및 그 방법 |
US8781836B2 (en) | 2011-02-22 | 2014-07-15 | Apple Inc. | Hearing assistance system for providing consistent human speech |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US9171546B1 (en) * | 2011-03-29 | 2015-10-27 | Google Inc. | Performing functions based on commands in context of telephonic communication |
US20120310642A1 (en) | 2011-06-03 | 2012-12-06 | Apple Inc. | Automatically creating a mapping between text data and audio data |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
US8812294B2 (en) | 2011-06-21 | 2014-08-19 | Apple Inc. | Translating phrases from one language into another using an order-based set of declarative rules |
US8706472B2 (en) | 2011-08-11 | 2014-04-22 | Apple Inc. | Method for disambiguating multiple readings in language conversion |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US8762156B2 (en) | 2011-09-28 | 2014-06-24 | Apple Inc. | Speech recognition repair using contextual information |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9483461B2 (en) | 2012-03-06 | 2016-11-01 | Apple Inc. | Handling speech synthesis of content for multiple languages |
US20130238332A1 (en) * | 2012-03-06 | 2013-09-12 | Apple Inc. | Automatic input signal recognition using location based language modeling |
EP2839391A4 (en) | 2012-04-20 | 2016-01-27 | Maluuba Inc | CONVERSATION AGENT |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US8775442B2 (en) | 2012-05-15 | 2014-07-08 | Apple Inc. | Semantic search using a single-source semantic model |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US20130325459A1 (en) * | 2012-05-31 | 2013-12-05 | Royce A. Levien | Speech recognition adaptation systems based on adaptation data |
US10431235B2 (en) | 2012-05-31 | 2019-10-01 | Elwha Llc | Methods and systems for speech adaptation data |
US9899026B2 (en) | 2012-05-31 | 2018-02-20 | Elwha Llc | Speech recognition adaptation systems based on adaptation data |
US9899040B2 (en) | 2012-05-31 | 2018-02-20 | Elwha, Llc | Methods and systems for managing adaptation data |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US10019994B2 (en) | 2012-06-08 | 2018-07-10 | Apple Inc. | Systems and methods for recognizing textual identifiers within a plurality of words |
US9373338B1 (en) * | 2012-06-25 | 2016-06-21 | Amazon Technologies, Inc. | Acoustic echo cancellation processing based on feedback from speech recognizer |
US9495129B2 (en) | 2012-06-29 | 2016-11-15 | Apple Inc. | Device, method, and user interface for voice-activated navigation and browsing of a document |
US10026394B1 (en) | 2012-08-31 | 2018-07-17 | Amazon Technologies, Inc. | Managing dialogs on a speech recognition platform |
US9576574B2 (en) | 2012-09-10 | 2017-02-21 | Apple Inc. | Context-sensitive handling of interruptions by intelligent digital assistant |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US8935167B2 (en) | 2012-09-25 | 2015-01-13 | Apple Inc. | Exemplar-based latent perceptual modeling for automatic speech recognition |
US9355086B2 (en) * | 2012-10-09 | 2016-05-31 | Microsoft Technology Licensing, Llc | User interface elements for content selection and extended content selection |
EP2954514B1 (en) | 2013-02-07 | 2021-03-31 | Apple Inc. | Voice trigger for a digital assistant |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US9977779B2 (en) | 2013-03-14 | 2018-05-22 | Apple Inc. | Automatic supplementation of word correction dictionaries |
US10572476B2 (en) | 2013-03-14 | 2020-02-25 | Apple Inc. | Refining a search based on schedule items |
US9368114B2 (en) | 2013-03-14 | 2016-06-14 | Apple Inc. | Context-sensitive handling of interruptions |
US10642574B2 (en) | 2013-03-14 | 2020-05-05 | Apple Inc. | Device, method, and graphical user interface for outputting captions |
US9733821B2 (en) | 2013-03-14 | 2017-08-15 | Apple Inc. | Voice control to diagnose inadvertent activation of accessibility features |
WO2014144579A1 (en) | 2013-03-15 | 2014-09-18 | Apple Inc. | System and method for updating an adaptive speech recognition model |
US10078487B2 (en) | 2013-03-15 | 2018-09-18 | Apple Inc. | Context-sensitive handling of interruptions |
AU2014233517B2 (en) | 2013-03-15 | 2017-05-25 | Apple Inc. | Training an at least partial voice command system |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US11151899B2 (en) | 2013-03-15 | 2021-10-19 | Apple Inc. | User training by intelligent digital assistant |
US9875494B2 (en) * | 2013-04-16 | 2018-01-23 | Sri International | Using intents to analyze and personalize a user's dialog experience with a virtual personal assistant |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
WO2014200728A1 (en) | 2013-06-09 | 2014-12-18 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
AU2014278595B2 (en) | 2013-06-13 | 2017-04-06 | Apple Inc. | System and method for emergency calls initiated by voice command |
US9997160B2 (en) * | 2013-07-01 | 2018-06-12 | Toyota Motor Engineering & Manufacturing North America, Inc. | Systems and methods for dynamic download of embedded voice components |
KR101749009B1 (ko) | 2013-08-06 | 2017-06-19 | 애플 인크. | 원격 디바이스로부터의 활동에 기초한 스마트 응답의 자동 활성화 |
CN105793923A (zh) * | 2013-09-20 | 2016-07-20 | 亚马逊技术股份有限公司 | 本地和远程语音处理 |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
US20150317973A1 (en) * | 2014-04-30 | 2015-11-05 | GM Global Technology Operations LLC | Systems and methods for coordinating speech recognition |
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
US10592095B2 (en) | 2014-05-23 | 2020-03-17 | Apple Inc. | Instantaneous speaking of content on touch devices |
US9502031B2 (en) | 2014-05-27 | 2016-11-22 | Apple Inc. | Method for supporting dynamic grammars in WFST-based ASR |
US10289433B2 (en) | 2014-05-30 | 2019-05-14 | Apple Inc. | Domain specific language for encoding assistant dialog |
US9966065B2 (en) | 2014-05-30 | 2018-05-08 | Apple Inc. | Multi-command single utterance input method |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9785630B2 (en) | 2014-05-30 | 2017-10-10 | Apple Inc. | Text prediction using combined word N-gram and unigram language models |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9734193B2 (en) | 2014-05-30 | 2017-08-15 | Apple Inc. | Determining domain salience ranking from ambiguous words in natural speech |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
US9760559B2 (en) | 2014-05-30 | 2017-09-12 | Apple Inc. | Predictive text input |
US10078631B2 (en) | 2014-05-30 | 2018-09-18 | Apple Inc. | Entropy-guided text prediction using combined word and character n-gram language models |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US10659851B2 (en) | 2014-06-30 | 2020-05-19 | Apple Inc. | Real-time digital assistant knowledge updates |
US9548066B2 (en) * | 2014-08-11 | 2017-01-17 | Amazon Technologies, Inc. | Voice application architecture |
US10446141B2 (en) | 2014-08-28 | 2019-10-15 | Apple Inc. | Automatic speech recognition based on user feedback |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9626703B2 (en) * | 2014-09-16 | 2017-04-18 | Voicebox Technologies Corporation | Voice commerce |
US9898459B2 (en) | 2014-09-16 | 2018-02-20 | Voicebox Technologies Corporation | Integration of domain information into state transitions of a finite state transducer for natural language processing |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
EP3207467A4 (en) | 2014-10-15 | 2018-05-23 | VoiceBox Technologies Corporation | System and method for providing follow-up responses to prior natural language inputs of a user |
CN107077843A (zh) * | 2014-10-30 | 2017-08-18 | 三菱电机株式会社 | 对话控制装置和对话控制方法 |
US10431214B2 (en) | 2014-11-26 | 2019-10-01 | Voicebox Technologies Corporation | System and method of determining a domain and/or an action related to a natural language input |
US10614799B2 (en) | 2014-11-26 | 2020-04-07 | Voicebox Technologies Corporation | System and method of providing intent predictions for an utterance prior to a system detection of an end of the utterance |
US10552013B2 (en) | 2014-12-02 | 2020-02-04 | Apple Inc. | Data detection |
US9711141B2 (en) | 2014-12-09 | 2017-07-18 | Apple Inc. | Disambiguating heteronyms in speech synthesis |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9460713B1 (en) | 2015-03-30 | 2016-10-04 | Google Inc. | Language model biasing modulation |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US9870196B2 (en) * | 2015-05-27 | 2018-01-16 | Google Llc | Selective aborting of online processing of voice inputs in a voice-enabled electronic device |
US9966073B2 (en) * | 2015-05-27 | 2018-05-08 | Google Llc | Context-sensitive dynamic update of voice to text model in a voice-enabled electronic device |
US10083697B2 (en) * | 2015-05-27 | 2018-09-25 | Google Llc | Local persisting of data for selectively offline capable voice action in a voice-enabled electronic device |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
EP3384475B1 (en) | 2015-12-06 | 2021-12-22 | Cerence Operating Company | System and method of conversational adjustment based on user's cognitive state |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US9779735B2 (en) * | 2016-02-24 | 2017-10-03 | Google Inc. | Methods and systems for detecting and processing speech signals |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179309B1 (en) | 2016-06-09 | 2018-04-23 | Apple Inc | Intelligent automated assistant in a home environment |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
US10331784B2 (en) | 2016-07-29 | 2019-06-25 | Voicebox Technologies Corporation | System and method of disambiguating natural language processing requests |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10503767B2 (en) * | 2016-09-13 | 2019-12-10 | Microsoft Technology Licensing, Llc | Computerized natural language query intent dispatching |
US10540513B2 (en) | 2016-09-13 | 2020-01-21 | Microsoft Technology Licensing, Llc | Natural language processor extension transmission data protection |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10559309B2 (en) * | 2016-12-22 | 2020-02-11 | Google Llc | Collaborative voice controlled devices |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US10229680B1 (en) * | 2016-12-29 | 2019-03-12 | Amazon Technologies, Inc. | Contextual entity resolution |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
US20180270343A1 (en) * | 2017-03-20 | 2018-09-20 | Motorola Mobility Llc | Enabling event-driven voice trigger phrase on an electronic device |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
DK201770439A1 (en) * | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK201770428A1 (en) | 2017-05-12 | 2019-02-18 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10607606B2 (en) * | 2017-06-19 | 2020-03-31 | Lenovo (Singapore) Pte. Ltd. | Systems and methods for execution of digital assistant |
KR102399913B1 (ko) * | 2017-07-24 | 2022-05-18 | 엘지전자 주식회사 | 공기조화기 |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10453454B2 (en) * | 2017-10-26 | 2019-10-22 | Hitachi, Ltd. | Dialog system with self-learning natural language understanding |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
US10713007B2 (en) | 2017-12-12 | 2020-07-14 | Amazon Technologies, Inc. | Architecture for a hub configured to control a second device while a connection to a remote system is unavailable |
US20190197462A1 (en) * | 2017-12-21 | 2019-06-27 | United States Postal Service | Intelligent collection box |
US11348056B2 (en) | 2017-12-21 | 2022-05-31 | United States Postal Service | Digital stamps |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
JP2019144790A (ja) * | 2018-02-20 | 2019-08-29 | 富士ゼロックス株式会社 | 情報処理装置及びプログラム |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US11461779B1 (en) * | 2018-03-23 | 2022-10-04 | Amazon Technologies, Inc. | Multi-speechlet response |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
US10496705B1 (en) | 2018-06-03 | 2019-12-03 | Apple Inc. | Accelerated task performance |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
KR102204740B1 (ko) * | 2019-02-28 | 2021-01-19 | 네이버 주식회사 | 대화 시스템에서의 의도 불분명 질의를 처리하는 방법 및 시스템 |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
US11468890B2 (en) | 2019-06-01 | 2022-10-11 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
US11348601B1 (en) * | 2019-06-06 | 2022-05-31 | Amazon Technologies, Inc. | Natural language understanding using voice characteristics |
US11398238B2 (en) * | 2019-06-07 | 2022-07-26 | Lg Electronics Inc. | Speech recognition method in edge computing device |
CN114207711A (zh) | 2019-08-13 | 2022-03-18 | 三星电子株式会社 | 用于识别用户的语音的系统和方法 |
WO2021029643A1 (en) | 2019-08-13 | 2021-02-18 | Samsung Electronics Co., Ltd. | System and method for modifying speech recognition result |
CN114223029A (zh) | 2019-08-13 | 2022-03-22 | 三星电子株式会社 | 支持装置进行语音识别的服务器及服务器的操作方法 |
US11531451B2 (en) * | 2019-09-04 | 2022-12-20 | Brain Technologies, Inc. | Real-time morphing interface for display on a computer screen |
US11488406B2 (en) | 2019-09-25 | 2022-11-01 | Apple Inc. | Text detection using global geometry estimators |
CN110659361B (zh) * | 2019-10-11 | 2023-01-17 | 卢卡(北京)智能科技有限公司 | 一种对话方法、装置、设备及介质 |
US11568153B2 (en) | 2020-03-05 | 2023-01-31 | Bank Of America Corporation | Narrative evaluator |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11043220B1 (en) | 2020-05-11 | 2021-06-22 | Apple Inc. | Digital assistant hardware abstraction |
US11810578B2 (en) | 2020-05-11 | 2023-11-07 | Apple Inc. | Device arbitration for digital assistant-based intercom systems |
US11755276B2 (en) | 2020-05-12 | 2023-09-12 | Apple Inc. | Reducing description length based on confidence |
CN111968631B (zh) * | 2020-06-29 | 2023-10-10 | 百度在线网络技术(北京)有限公司 | 智能设备的交互方法、装置、设备及存储介质 |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
US11829720B2 (en) | 2020-09-01 | 2023-11-28 | Apple Inc. | Analysis and validation of language models |
US11496373B2 (en) * | 2021-01-26 | 2022-11-08 | Juniper Networks, Inc. | Enhanced conversation interface for network management |
US11763814B2 (en) | 2021-06-21 | 2023-09-19 | Logitech Europe S.A. | Hybrid voice command processing |
US11657805B2 (en) * | 2021-06-24 | 2023-05-23 | Amazon Technologies, Inc. | Dynamic context-based routing of speech processing |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143532A1 (en) * | 2001-04-02 | 2002-10-03 | Mclean James Gordon | Method and system for collaborative speech recognition for small-area network |
US20040102977A1 (en) * | 2002-11-22 | 2004-05-27 | Metzler Benjamin T. | Methods and apparatus for controlling an electronic device |
US20040166832A1 (en) * | 2001-10-03 | 2004-08-26 | Accenture Global Services Gmbh | Directory assistance with multi-modal messaging |
US20060041926A1 (en) * | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
US20070055525A1 (en) * | 2005-08-31 | 2007-03-08 | Kennewick Robert A | Dynamic speech sharpening |
CN101297355A (zh) * | 2005-08-05 | 2008-10-29 | 沃伊斯博克斯科技公司 | 响应自然语言语音口头表达的系统和方法 |
Family Cites Families (542)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4016A (en) * | 1845-04-26 | Improvement in the method of attaching a movable to the stationary keel of a vessel | ||
US4430669A (en) * | 1981-05-29 | 1984-02-07 | Payview Limited | Transmitting and receiving apparatus for permitting the transmission and reception of multi-tier subscription programs |
US4829423A (en) | 1983-01-28 | 1989-05-09 | Texas Instruments Incorporated | Menu-based natural language understanding system |
US5208748A (en) | 1985-11-18 | 1993-05-04 | Action Technologies, Inc. | Method and apparatus for structuring and managing human communications by explicitly defining the types of communications permitted between participants |
US4910784A (en) * | 1987-07-30 | 1990-03-20 | Texas Instruments Incorporated | Low cost speech recognition system and method |
CA1268228A (en) | 1987-09-14 | 1990-04-24 | Gary Lennartz | Voice interactive security system |
US5027406A (en) | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
SE466029B (sv) * | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem |
JPH03129469A (ja) | 1989-10-14 | 1991-06-03 | Canon Inc | 自然言語処理装置 |
JP3266246B2 (ja) | 1990-06-15 | 2002-03-18 | インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン | 自然言語解析装置及び方法並びに自然言語解析用知識ベース構築方法 |
US5164904A (en) | 1990-07-26 | 1992-11-17 | Farradyne Systems, Inc. | In-vehicle traffic congestion information system |
US5722084A (en) * | 1990-09-28 | 1998-02-24 | At&T Corp. | Cellular/PCS handset NAM download capability using a wide-area paging system |
WO1992010037A1 (en) | 1990-11-27 | 1992-06-11 | Jacobs Gordon M | Digital data converter |
US5274560A (en) | 1990-12-03 | 1993-12-28 | Audio Navigation Systems, Inc. | Sensor free vehicle navigation system utilizing a voice input/output interface for routing a driver from his source point to his destination point |
DE69232407T2 (de) | 1991-11-18 | 2002-09-12 | Toshiba Kawasaki Kk | Sprach-Dialog-System zur Erleichterung von Rechner-Mensch-Wechselwirkung |
WO1994020952A1 (en) | 1993-03-12 | 1994-09-15 | Sri International | Method and apparatus for voice-interactive language instruction |
US5608635A (en) * | 1992-04-14 | 1997-03-04 | Zexel Corporation | Navigation system for a vehicle with route recalculation between multiple locations |
CA2102077C (en) | 1992-12-21 | 1997-09-16 | Steven Lloyd Greenspan | Call billing and measurement methods for redirected calls |
US5465289A (en) | 1993-03-05 | 1995-11-07 | E-Systems, Inc. | Cellular based traffic sensor system |
US5471318A (en) | 1993-04-22 | 1995-11-28 | At&T Corp. | Multimedia communications network |
US5377350A (en) | 1993-04-30 | 1994-12-27 | International Business Machines Corporation | System for cooperative communication between local object managers to provide verification for the performance of remote calls by object messages |
US5537436A (en) | 1993-06-14 | 1996-07-16 | At&T Corp. | Simultaneous analog and digital communication applications |
US5983161A (en) * | 1993-08-11 | 1999-11-09 | Lemelson; Jerome H. | GPS vehicle collision avoidance warning and control system and method |
EP0645757B1 (en) * | 1993-09-23 | 2000-04-05 | Xerox Corporation | Semantic co-occurrence filtering for speech recognition and signal transcription applications |
US5475733A (en) | 1993-11-04 | 1995-12-12 | At&T Corp. | Language accommodated message relaying for hearing impaired callers |
CA2118278C (en) | 1993-12-21 | 1999-09-07 | J. David Garland | Multimedia system |
US5748841A (en) | 1994-02-25 | 1998-05-05 | Morin; Philippe | Supervised contextual language acquisition system |
US5533108A (en) | 1994-03-18 | 1996-07-02 | At&T Corp. | Method and system for routing phone calls based on voice and data transport capability |
US5488652A (en) * | 1994-04-14 | 1996-01-30 | Northern Telecom Limited | Method and apparatus for training speech recognition algorithms for directory assistance applications |
US5652570A (en) | 1994-05-19 | 1997-07-29 | Lepkofker; Robert | Individual location system |
US5752052A (en) | 1994-06-24 | 1998-05-12 | Microsoft Corporation | Method and system for bootstrapping statistical processing into a rule-based natural language parser |
JP2674521B2 (ja) | 1994-09-21 | 1997-11-12 | 日本電気株式会社 | 移動体誘導装置 |
US5539744A (en) | 1994-10-17 | 1996-07-23 | At&T Corp. | Hand-off management for cellular telephony |
US5696965A (en) | 1994-11-03 | 1997-12-09 | Intel Corporation | Electronic information appraisal agent |
JP2855409B2 (ja) | 1994-11-17 | 1999-02-10 | 日本アイ・ビー・エム株式会社 | 自然言語処理方法及びシステム |
US6571279B1 (en) | 1997-12-05 | 2003-05-27 | Pinpoint Incorporated | Location enhanced information delivery system |
US5499289A (en) * | 1994-12-06 | 1996-03-12 | At&T Corp. | Systems, methods and articles of manufacture for performing distributed telecommunications |
US5748974A (en) | 1994-12-13 | 1998-05-05 | International Business Machines Corporation | Multimodal natural language interface for cross-application tasks |
US5774859A (en) | 1995-01-03 | 1998-06-30 | Scientific-Atlanta, Inc. | Information system having a speech interface |
US5794050A (en) | 1995-01-04 | 1998-08-11 | Intelligent Text Processing, Inc. | Natural language understanding system |
US5892900A (en) | 1996-08-30 | 1999-04-06 | Intertrust Technologies Corp. | Systems and methods for secure transaction management and electronic rights protection |
US5918222A (en) | 1995-03-17 | 1999-06-29 | Kabushiki Kaisha Toshiba | Information disclosing apparatus and multi-modal information input/output system |
US6965864B1 (en) | 1995-04-10 | 2005-11-15 | Texas Instruments Incorporated | Voice activated hypermedia systems using grammatical metadata |
WO1996037881A2 (en) | 1995-05-26 | 1996-11-28 | Applied Language Technologies | Method and apparatus for dynamic adaptation of a large vocabulary speech recognition system and for use of constraints from a database in a large vocabulary speech recognition system |
JP3716870B2 (ja) | 1995-05-31 | 2005-11-16 | ソニー株式会社 | 音声認識装置および音声認識方法 |
US5708422A (en) * | 1995-05-31 | 1998-01-13 | At&T | Transaction authorization and alert system |
US20070061735A1 (en) | 1995-06-06 | 2007-03-15 | Hoffberg Steven M | Ergonomic man-machine interface incorporating adaptive pattern recognition based control system |
US5721938A (en) * | 1995-06-07 | 1998-02-24 | Stuckey; Barbara K. | Method and device for parsing and analyzing natural language sentences and text |
US5617407A (en) | 1995-06-21 | 1997-04-01 | Bareis; Monica M. | Optical disk having speech recognition templates for information access |
US5794196A (en) | 1995-06-30 | 1998-08-11 | Kurzweil Applied Intelligence, Inc. | Speech recognition system distinguishing dictation from commands by arbitration between continuous speech and isolated word modules |
US6292767B1 (en) | 1995-07-18 | 2001-09-18 | Nuance Communications | Method and system for building and running natural language understanding systems |
US5963940A (en) | 1995-08-16 | 1999-10-05 | Syracuse University | Natural language information retrieval system and method |
US5911120A (en) | 1995-09-08 | 1999-06-08 | At&T Wireless Services | Wireless communication system having mobile stations establish a communication link through the base station without using a landline or regional cellular network and without a call in progress |
US5855000A (en) | 1995-09-08 | 1998-12-29 | Carnegie Mellon University | Method and apparatus for correcting and repairing machine-transcribed input using independent or cross-modal secondary input |
US5675629A (en) | 1995-09-08 | 1997-10-07 | At&T | Cordless cellular system base station |
US6192110B1 (en) * | 1995-09-15 | 2001-02-20 | At&T Corp. | Method and apparatus for generating sematically consistent inputs to a dialog manager |
US5774841A (en) | 1995-09-20 | 1998-06-30 | The United States Of America As Represented By The Adminstrator Of The National Aeronautics And Space Administration | Real-time reconfigurable adaptive speech recognition command and control apparatus and method |
US5799276A (en) | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5960447A (en) | 1995-11-13 | 1999-09-28 | Holt; Douglas | Word tagging and editing system for speech recognition |
EP0809900B1 (en) | 1995-12-15 | 2004-03-24 | Koninklijke Philips Electronics N.V. | An adaptive noise cancelling arrangement, a noise reduction system and a transceiver |
US6567778B1 (en) | 1995-12-21 | 2003-05-20 | Nuance Communications | Natural language speech recognition using slot semantic confidence scores related to their word recognition confidence scores |
US5802510A (en) | 1995-12-29 | 1998-09-01 | At&T Corp | Universal directory service |
US5633922A (en) | 1995-12-29 | 1997-05-27 | At&T | Process and apparatus for restarting call routing in a telephone network |
US5832221A (en) | 1995-12-29 | 1998-11-03 | At&T Corp | Universal message storage system |
US5742763A (en) | 1995-12-29 | 1998-04-21 | At&T Corp. | Universal message delivery system for handles identifying network presences |
US5987404A (en) | 1996-01-29 | 1999-11-16 | International Business Machines Corporation | Statistical natural language understanding using hidden clumpings |
US6314420B1 (en) | 1996-04-04 | 2001-11-06 | Lycos, Inc. | Collaborative/adaptive search engine |
US5848396A (en) | 1996-04-26 | 1998-12-08 | Freedom Of Information, Inc. | Method and apparatus for determining behavioral profile of a computer user |
US5878386A (en) * | 1996-06-28 | 1999-03-02 | Microsoft Corporation | Natural language parser with dictionary-based part-of-speech probabilities |
US5953393A (en) | 1996-07-15 | 1999-09-14 | At&T Corp. | Personal telephone agent |
US6009382A (en) | 1996-08-19 | 1999-12-28 | International Business Machines Corporation | Word storage table for natural language determination |
US5867817A (en) * | 1996-08-19 | 1999-02-02 | Virtual Vision, Inc. | Speech recognition manager |
US6385646B1 (en) | 1996-08-23 | 2002-05-07 | At&T Corp. | Method and system for establishing voice communications in an internet environment |
US6470315B1 (en) | 1996-09-11 | 2002-10-22 | Texas Instruments Incorporated | Enrollment and modeling method and apparatus for robust speaker dependent speech models |
US5878385A (en) * | 1996-09-16 | 1999-03-02 | Ergo Linguistic Technologies | Method and apparatus for universal parsing of language |
US6085186A (en) | 1996-09-20 | 2000-07-04 | Netbot, Inc. | Method and system using information written in a wrapper description language to execute query on a network |
US6961700B2 (en) | 1996-09-24 | 2005-11-01 | Allvoice Computing Plc | Method and apparatus for processing the output of a speech recognition engine |
JP3212618B2 (ja) * | 1996-09-26 | 2001-09-25 | 三菱電機株式会社 | 対話処理装置 |
US5892813A (en) | 1996-09-30 | 1999-04-06 | Matsushita Electric Industrial Co., Ltd. | Multimodal voice dialing digital key telephone with dialog manager |
US5995928A (en) | 1996-10-02 | 1999-11-30 | Speechworks International, Inc. | Method and apparatus for continuous spelling speech recognition with early identification |
US5902347A (en) | 1996-11-19 | 1999-05-11 | American Navigation Systems, Inc. | Hand-held GPS-mapping device |
US5839107A (en) | 1996-11-29 | 1998-11-17 | Northern Telecom Limited | Method and apparatus for automatically generating a speech recognition vocabulary from a white pages listing |
US6154526A (en) | 1996-12-04 | 2000-11-28 | Intellivoice Communications, Inc. | Data acquisition and error correcting speech recognition system |
US5960399A (en) | 1996-12-24 | 1999-09-28 | Gte Internetworking Incorporated | Client/server speech processor/recognizer |
US6456974B1 (en) | 1997-01-06 | 2002-09-24 | Texas Instruments Incorporated | System and method for adding speech recognition capabilities to java |
US6009355A (en) * | 1997-01-28 | 1999-12-28 | American Calcar Inc. | Multimedia information and control system for automobiles |
US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
JPH10254486A (ja) | 1997-03-13 | 1998-09-25 | Canon Inc | 音声認識装置および方法 |
GB2323693B (en) | 1997-03-27 | 2001-09-26 | Forum Technology Ltd | Speech to text conversion |
US6167377A (en) | 1997-03-28 | 2000-12-26 | Dragon Systems, Inc. | Speech recognition language models |
FR2761837B1 (fr) | 1997-04-08 | 1999-06-11 | Sophie Sommelet | Dispositif d'aide a la navigation ayant une architecture distribuee basee sur internet |
US6014559A (en) * | 1997-04-10 | 2000-01-11 | At&T Wireless Services, Inc. | Method and system for delivering a voice mail notification to a private base station using cellular phone network |
US6078886A (en) | 1997-04-14 | 2000-06-20 | At&T Corporation | System and method for providing remote automatic speech recognition services via a packet network |
US6058187A (en) | 1997-04-17 | 2000-05-02 | At&T Corp. | Secure telecommunications data transmission |
US5895464A (en) | 1997-04-30 | 1999-04-20 | Eastman Kodak Company | Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects |
US6173266B1 (en) * | 1997-05-06 | 2001-01-09 | Speechworks International, Inc. | System and method for developing interactive speech applications |
US6128369A (en) | 1997-05-14 | 2000-10-03 | A.T.&T. Corp. | Employing customer premises equipment in communications network maintenance |
US5960397A (en) | 1997-05-27 | 1999-09-28 | At&T Corp | System and method of recognizing an acoustic environment to adapt a set of based recognition models to the current acoustic environment for subsequent speech recognition |
US5995119A (en) | 1997-06-06 | 1999-11-30 | At&T Corp. | Method for generating photo-realistic animated characters |
US6199043B1 (en) * | 1997-06-24 | 2001-03-06 | International Business Machines Corporation | Conversation management in speech recognition interfaces |
FI972723A0 (fi) | 1997-06-24 | 1997-06-24 | Nokia Mobile Phones Ltd | Mobila kommunikationsanordningar |
US6101241A (en) | 1997-07-16 | 2000-08-08 | At&T Corp. | Telephone-based speech recognition for data collection |
US5926784A (en) | 1997-07-17 | 1999-07-20 | Microsoft Corporation | Method and system for natural language parsing using podding |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US6275231B1 (en) | 1997-08-01 | 2001-08-14 | American Calcar Inc. | Centralized control and management system for automobiles |
US6044347A (en) * | 1997-08-05 | 2000-03-28 | Lucent Technologies Inc. | Methods and apparatus object-oriented rule-based dialogue management |
US6144667A (en) | 1997-08-07 | 2000-11-07 | At&T Corp. | Network-based method and apparatus for initiating and completing a telephone call via the internet |
US6192338B1 (en) * | 1997-08-12 | 2001-02-20 | At&T Corp. | Natural language knowledge servers as network resources |
US6360234B2 (en) | 1997-08-14 | 2002-03-19 | Virage, Inc. | Video cataloger system with synchronized encoders |
US5895466A (en) | 1997-08-19 | 1999-04-20 | At&T Corp | Automated natural language understanding customer service system |
US6707421B1 (en) | 1997-08-19 | 2004-03-16 | Siemens Vdo Automotive Corporation | Driver information system |
US6081774A (en) | 1997-08-22 | 2000-06-27 | Novell, Inc. | Natural language information retrieval system and method |
US6018708A (en) * | 1997-08-26 | 2000-01-25 | Nortel Networks Corporation | Method and apparatus for performing speech recognition utilizing a supplementary lexicon of frequently used orthographies |
US6076059A (en) * | 1997-08-29 | 2000-06-13 | Digital Equipment Corporation | Method for aligning text with audio signals |
US6049602A (en) | 1997-09-18 | 2000-04-11 | At&T Corp | Virtual call center |
US6650747B1 (en) | 1997-09-18 | 2003-11-18 | At&T Corp. | Control of merchant application by system monitor in virtual contact center |
DE19742054A1 (de) | 1997-09-24 | 1999-04-01 | Philips Patentverwaltung | Eingabesystem wenigstens für Orts- und/oder Straßennamen |
US6134235A (en) | 1997-10-08 | 2000-10-17 | At&T Corp. | Pots/packet bridge |
US5897613A (en) | 1997-10-08 | 1999-04-27 | Lucent Technologies Inc. | Efficient transmission of voice silence intervals |
US6272455B1 (en) | 1997-10-22 | 2001-08-07 | Lucent Technologies, Inc. | Method and apparatus for understanding natural language |
JPH11126090A (ja) | 1997-10-23 | 1999-05-11 | Pioneer Electron Corp | 音声認識方法及び音声認識装置並びに音声認識装置を動作させるためのプログラムが記録された記録媒体 |
US6021384A (en) * | 1997-10-29 | 2000-02-01 | At&T Corp. | Automatic generation of superwords |
US6498797B1 (en) | 1997-11-14 | 2002-12-24 | At&T Corp. | Method and apparatus for communication services on a network |
US6188982B1 (en) * | 1997-12-01 | 2001-02-13 | Industrial Technology Research Institute | On-line background noise adaptation of parallel model combination HMM with discriminative learning using weighted HMM for noisy speech recognition |
US5970412A (en) | 1997-12-02 | 1999-10-19 | Maxemchuk; Nicholas Frank | Overload control in a packet-switching cellular environment |
US6614773B1 (en) | 1997-12-02 | 2003-09-02 | At&T Corp. | Packet transmissions over cellular radio |
US6219346B1 (en) | 1997-12-02 | 2001-04-17 | At&T Corp. | Packet switching architecture in cellular radio |
US6195634B1 (en) * | 1997-12-24 | 2001-02-27 | Nortel Networks Corporation | Selection of decoys for non-vocabulary utterances rejection |
US6301560B1 (en) | 1998-01-05 | 2001-10-09 | Microsoft Corporation | Discrete speech recognition system with ballooning active grammar |
US6278377B1 (en) | 1999-08-25 | 2001-08-21 | Donnelly Corporation | Indicator for vehicle accessory |
US5950167A (en) | 1998-01-26 | 1999-09-07 | Lucent Technologies Inc. | Screen-less remote voice or tone-controlled computer program operations via telephone set |
US6226612B1 (en) | 1998-01-30 | 2001-05-01 | Motorola, Inc. | Method of evaluating an utterance in a speech recognition system |
US6385596B1 (en) | 1998-02-06 | 2002-05-07 | Liquid Audio, Inc. | Secure online music distribution system |
US6160883A (en) | 1998-03-04 | 2000-12-12 | At&T Corporation | Telecommunications network system and method |
WO1999046763A1 (en) | 1998-03-09 | 1999-09-16 | Lernout & Hauspie Speech Products N.V. | Apparatus and method for simultaneous multimode dictation |
US6119087A (en) | 1998-03-13 | 2000-09-12 | Nuance Communications | System architecture for and method of voice processing |
US6233559B1 (en) | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
US6420975B1 (en) | 1999-08-25 | 2002-07-16 | Donnelly Corporation | Interior rearview mirror sound processing system |
US6173279B1 (en) * | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6144938A (en) | 1998-05-01 | 2000-11-07 | Sun Microsystems, Inc. | Voice user interface with personality |
US6574597B1 (en) | 1998-05-08 | 2003-06-03 | At&T Corp. | Fully expanded context-dependent networks for speech recognition |
US6236968B1 (en) | 1998-05-14 | 2001-05-22 | International Business Machines Corporation | Sleep prevention dialog based car system |
US20070094223A1 (en) | 1998-05-28 | 2007-04-26 | Lawrence Au | Method and system for using contextual meaning in voice to text conversion |
CN1311881A (zh) * | 1998-06-04 | 2001-09-05 | 松下电器产业株式会社 | 语言变换规则产生装置、语言变换装置及程序记录媒体 |
US6219643B1 (en) | 1998-06-26 | 2001-04-17 | Nuance Communications, Inc. | Method of analyzing dialogs in a natural language speech recognition system |
US6553372B1 (en) | 1998-07-13 | 2003-04-22 | Microsoft Corporation | Natural language information retrieval system |
US6175858B1 (en) | 1998-07-13 | 2001-01-16 | At&T Corp. | Intelligent network messaging agent and method |
US6393428B1 (en) | 1998-07-13 | 2002-05-21 | Microsoft Corporation | Natural language information retrieval system |
US6269336B1 (en) | 1998-07-24 | 2001-07-31 | Motorola, Inc. | Voice browser for interactive services and methods thereof |
AU5581599A (en) | 1998-08-24 | 2000-03-14 | Virtual Research Associates, Inc. | Natural language sentence parser |
US6208964B1 (en) * | 1998-08-31 | 2001-03-27 | Nortel Networks Limited | Method and apparatus for providing unsupervised adaptation of transcriptions |
US6499013B1 (en) | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
US6434524B1 (en) | 1998-09-09 | 2002-08-13 | One Voice Technologies, Inc. | Object interactive user interface using speech recognition and natural language processing |
US6049607A (en) | 1998-09-18 | 2000-04-11 | Lamar Signal Processing | Interference canceling method and apparatus |
US6405170B1 (en) | 1998-09-22 | 2002-06-11 | Speechworks International, Inc. | Method and system of reviewing the behavior of an interactive speech recognition application |
US6606598B1 (en) | 1998-09-22 | 2003-08-12 | Speechworks International, Inc. | Statistical computing and reporting for interactive speech applications |
US7003463B1 (en) * | 1998-10-02 | 2006-02-21 | International Business Machines Corporation | System and method for providing network coordinated conversational services |
EP1133734A4 (en) | 1998-10-02 | 2005-12-14 | Ibm | INTERACTIVE NAVIGATOR AND INTERACTIVE SYSTEMS |
EP1119845A1 (en) | 1998-10-05 | 2001-08-01 | Lernout & Hauspie Speech Products N.V. | Speech controlled computer user interface |
WO2000022549A1 (en) | 1998-10-09 | 2000-04-20 | Koninklijke Philips Electronics N.V. | Automatic inquiry method and system |
US6928614B1 (en) | 1998-10-13 | 2005-08-09 | Visteon Global Technologies, Inc. | Mobile office with speech recognition |
US6185535B1 (en) * | 1998-10-16 | 2001-02-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Voice control of a user interface to service applications |
AU1707600A (en) | 1998-10-21 | 2000-05-08 | American Calcar, Inc. | Positional camera and gps data interchange device |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US6028514A (en) * | 1998-10-30 | 2000-02-22 | Lemelson Jerome H. | Personal emergency, safety warning system and method |
US6477200B1 (en) | 1998-11-09 | 2002-11-05 | Broadcom Corporation | Multi-pair gigabit ethernet transceiver |
US8121891B2 (en) * | 1998-11-12 | 2012-02-21 | Accenture Global Services Gmbh | Personalized product report |
US6208972B1 (en) * | 1998-12-23 | 2001-03-27 | Richard Grant | Method for integrating computer processes with an interface controlled by voice actuated grammars |
US6195651B1 (en) * | 1998-11-19 | 2001-02-27 | Andersen Consulting Properties Bv | System, method and article of manufacture for a tuned user application experience |
US6246981B1 (en) | 1998-11-25 | 2001-06-12 | International Business Machines Corporation | Natural language task-oriented dialog manager and method |
US7881936B2 (en) * | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US6430285B1 (en) | 1998-12-15 | 2002-08-06 | At&T Corp. | Method and apparatus for an automated caller interaction system |
US6721001B1 (en) | 1998-12-16 | 2004-04-13 | International Business Machines Corporation | Digital camera with voice recognition annotation |
US6233556B1 (en) | 1998-12-16 | 2001-05-15 | Nuance Communications | Voice processing and verification system |
US6754485B1 (en) | 1998-12-23 | 2004-06-22 | American Calcar Inc. | Technique for effectively providing maintenance and information to vehicles |
US6570555B1 (en) | 1998-12-30 | 2003-05-27 | Fuji Xerox Co., Ltd. | Method and apparatus for embodied conversational characters with multimodal input/output in an interface device |
US6742021B1 (en) | 1999-01-05 | 2004-05-25 | Sri International, Inc. | Navigating network-based electronic information using spoken input with multimodal error feedback |
US6523061B1 (en) * | 1999-01-05 | 2003-02-18 | Sri International, Inc. | System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system |
US7036128B1 (en) | 1999-01-05 | 2006-04-25 | Sri International Offices | Using a community of distributed electronic agents to support a highly mobile, ambient computing environment |
US6757718B1 (en) | 1999-01-05 | 2004-06-29 | Sri International | Mobile navigation of network-based electronic information using spoken input |
US6851115B1 (en) * | 1999-01-05 | 2005-02-01 | Sri International | Software-based architecture for communication and cooperation among distributed electronic agents |
JP3822990B2 (ja) | 1999-01-07 | 2006-09-20 | 株式会社日立製作所 | 翻訳装置、記録媒体 |
US6429813B2 (en) | 1999-01-14 | 2002-08-06 | Navigation Technologies Corp. | Method and system for providing end-user preferences with a navigation system |
US6567797B1 (en) | 1999-01-26 | 2003-05-20 | Xerox Corporation | System and method for providing recommendations based on multi-modal user clusters |
WO2000045375A1 (en) | 1999-01-27 | 2000-08-03 | Kent Ridge Digital Labs | Method and apparatus for voice annotation and retrieval of multimedia data |
US6556970B1 (en) | 1999-01-28 | 2003-04-29 | Denso Corporation | Apparatus for determining appropriate series of words carrying information to be recognized |
US6278968B1 (en) | 1999-01-29 | 2001-08-21 | Sony Corporation | Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system |
US6873837B1 (en) | 1999-02-03 | 2005-03-29 | Matsushita Electric Industrial Co., Ltd. | Emergency reporting system and terminal apparatus therein |
US6430531B1 (en) | 1999-02-04 | 2002-08-06 | Soliloquy, Inc. | Bilateral speech system |
US6643620B1 (en) | 1999-03-15 | 2003-11-04 | Matsushita Electric Industrial Co., Ltd. | Voice activated controller for recording and retrieving audio/video programs |
JP4176228B2 (ja) | 1999-03-15 | 2008-11-05 | 株式会社東芝 | 自然言語対話装置及び自然言語対話方法 |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
US6233561B1 (en) | 1999-04-12 | 2001-05-15 | Matsushita Electric Industrial Co., Ltd. | Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue |
US6408272B1 (en) | 1999-04-12 | 2002-06-18 | General Magic, Inc. | Distributed voice user interface |
US6570964B1 (en) | 1999-04-16 | 2003-05-27 | Nuance Communications | Technique for recognizing telephone numbers and other spoken information embedded in voice messages stored in a voice messaging system |
US6314402B1 (en) | 1999-04-23 | 2001-11-06 | Nuance Communications | Method and apparatus for creating modifiable and combinable speech objects for acquiring information from a speaker in an interactive voice response system |
US6434523B1 (en) | 1999-04-23 | 2002-08-13 | Nuance Communications | Creating and editing grammars for speech recognition graphically |
US6356869B1 (en) | 1999-04-30 | 2002-03-12 | Nortel Networks Limited | Method and apparatus for discourse management |
US6804638B2 (en) | 1999-04-30 | 2004-10-12 | Recent Memory Incorporated | Device and method for selective recall and preservation of events prior to decision to record the events |
US6505155B1 (en) | 1999-05-06 | 2003-01-07 | International Business Machines Corporation | Method and system for automatically adjusting prompt feedback based on predicted recognition accuracy |
US6308151B1 (en) | 1999-05-14 | 2001-10-23 | International Business Machines Corp. | Method and system using a speech recognition system to dictate a body of text in response to an available body of text |
US6604075B1 (en) | 1999-05-20 | 2003-08-05 | Lucent Technologies Inc. | Web-based voice dialog interface |
US6584439B1 (en) | 1999-05-21 | 2003-06-24 | Winbond Electronics Corporation | Method and apparatus for controlling voice controlled devices |
GB9911971D0 (en) | 1999-05-21 | 1999-07-21 | Canon Kk | A system, a server for a system and a machine for use in a system |
US7787907B2 (en) | 1999-05-26 | 2010-08-31 | Johnson Controls Technology Company | System and method for using speech recognition with a vehicle control system |
US20020032564A1 (en) | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
US20020107694A1 (en) | 1999-06-07 | 2002-08-08 | Traptec Corporation | Voice-recognition safety system for aircraft and method of using the same |
US6374214B1 (en) | 1999-06-24 | 2002-04-16 | International Business Machines Corp. | Method and apparatus for excluding text phrases during re-dictation in a speech recognition system |
ATE320650T1 (de) | 1999-06-30 | 2006-04-15 | Ibm | Verfahren zur erweiterung des wortschatzes eines spracherkennungssystems |
US6321196B1 (en) | 1999-07-02 | 2001-11-20 | International Business Machines Corporation | Phonetic spelling for speech recognition |
US7069220B2 (en) | 1999-08-13 | 2006-06-27 | International Business Machines Corporation | Method for determining and maintaining dialog focus in a conversational speech system |
US6377913B1 (en) | 1999-08-13 | 2002-04-23 | International Business Machines Corporation | Method and system for multi-client access to a dialog system |
US6513006B2 (en) * | 1999-08-26 | 2003-01-28 | Matsushita Electronic Industrial Co., Ltd. | Automatic control of household activity using speech recognition and natural language |
US6415257B1 (en) | 1999-08-26 | 2002-07-02 | Matsushita Electric Industrial Co., Ltd. | System for identifying and adapting a TV-user profile by means of speech technology |
US6901366B1 (en) | 1999-08-26 | 2005-05-31 | Matsushita Electric Industrial Co., Ltd. | System and method for assessing TV-related information over the internet |
EP1083545A3 (en) | 1999-09-09 | 2001-09-26 | Xanavi Informatics Corporation | Voice recognition of proper names in a navigation apparatus |
US6658388B1 (en) | 1999-09-10 | 2003-12-02 | International Business Machines Corporation | Personality generator for conversational systems |
US6850603B1 (en) * | 1999-09-13 | 2005-02-01 | Microstrategy, Incorporated | System and method for the creation and automatic deployment of personalized dynamic and interactive voice services |
US7340040B1 (en) | 1999-09-13 | 2008-03-04 | Microstrategy, Incorporated | System and method for real-time, personalized, dynamic, interactive voice services for corporate-analysis related information |
US6631351B1 (en) | 1999-09-14 | 2003-10-07 | Aidentity Matrix | Smart toys |
US6601026B2 (en) | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US20020049535A1 (en) | 1999-09-20 | 2002-04-25 | Ralf Rigo | Wireless interactive voice-actuated mobile telematics system |
US6587858B1 (en) | 1999-09-30 | 2003-07-01 | Steven Paul Strazza | Systems and methods for the control of dynamic data and request criteria in a data repository |
US6963759B1 (en) | 1999-10-05 | 2005-11-08 | Fastmobile, Inc. | Speech recognition technique based on local interrupt detection |
US6937977B2 (en) | 1999-10-05 | 2005-08-30 | Fastmobile, Inc. | Method and apparatus for processing an input speech signal during presentation of an output audio signal |
US6868385B1 (en) | 1999-10-05 | 2005-03-15 | Yomobile, Inc. | Method and apparatus for the provision of information signals based upon speech recognition |
US6442522B1 (en) | 1999-10-12 | 2002-08-27 | International Business Machines Corporation | Bi-directional natural language system for interfacing with multiple back-end applications |
US6721697B1 (en) | 1999-10-18 | 2004-04-13 | Sony Corporation | Method and system for reducing lexical ambiguity |
AU8030300A (en) | 1999-10-19 | 2001-04-30 | Sony Electronics Inc. | Natural language interface control system |
US6581103B1 (en) | 1999-10-22 | 2003-06-17 | Dedicated Radio, Llc | Method for internet radio broadcasting including listener requests of audio and/or video files with input dedications |
US6594367B1 (en) | 1999-10-25 | 2003-07-15 | Andrea Electronics Corporation | Super directional beamforming design and implementation |
AU1039301A (en) | 1999-10-29 | 2001-05-08 | British Telecommunications Public Limited Company | Method and apparatus for processing queries |
US6622119B1 (en) | 1999-10-30 | 2003-09-16 | International Business Machines Corporation | Adaptive command predictor and method for a natural language dialog system |
US6526139B1 (en) * | 1999-11-03 | 2003-02-25 | Tellabs Operations, Inc. | Consolidated noise injection in a voice processing system |
US6681206B1 (en) * | 1999-11-05 | 2004-01-20 | At&T Corporation | Method for generating morphemes |
US8482535B2 (en) | 1999-11-08 | 2013-07-09 | Apple Inc. | Programmable tactile touch screen displays and man-machine interfaces for improved vehicle instrumentation and telematics |
US6615172B1 (en) | 1999-11-12 | 2003-09-02 | Phoenix Solutions, Inc. | Intelligent query engine for processing voice based queries |
US9076448B2 (en) | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7392185B2 (en) * | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US6418210B1 (en) | 1999-11-29 | 2002-07-09 | At&T Corp | Method and apparatus for providing information between a calling network and a called network |
US6751612B1 (en) | 1999-11-29 | 2004-06-15 | Xerox Corporation | User query generate search results that rank set of servers where ranking is based on comparing content on each server with user query, frequency at which content on each server is altered using web crawler in a search engine |
US6288319B1 (en) | 1999-12-02 | 2001-09-11 | Gary Catona | Electronic greeting card with a custom audio mix |
GB9928420D0 (en) | 1999-12-02 | 2000-01-26 | Ibm | Interactive voice response system |
US6591239B1 (en) | 1999-12-09 | 2003-07-08 | Steris Inc. | Voice controlled surgical suite |
US6598018B1 (en) | 1999-12-15 | 2003-07-22 | Matsushita Electric Industrial Co., Ltd. | Method for natural dialog interface to car devices |
US6976229B1 (en) | 1999-12-16 | 2005-12-13 | Ricoh Co., Ltd. | Method and apparatus for storytelling with digital photographs |
US6832230B1 (en) | 1999-12-22 | 2004-12-14 | Nokia Corporation | Apparatus and associated method for downloading an application with a variable lifetime to a mobile terminal |
US6920421B2 (en) | 1999-12-28 | 2005-07-19 | Sony Corporation | Model adaptive apparatus for performing adaptation of a model used in pattern recognition considering recentness of a received pattern data |
US6678680B1 (en) * | 2000-01-06 | 2004-01-13 | Mark Woo | Music search engine |
US6701294B1 (en) | 2000-01-19 | 2004-03-02 | Lucent Technologies, Inc. | User interface for translating natural language inquiries into database queries and data presentations |
US6829603B1 (en) | 2000-02-02 | 2004-12-07 | International Business Machines Corp. | System, method and program product for interactive natural dialog |
US6560590B1 (en) | 2000-02-14 | 2003-05-06 | Kana Software, Inc. | Method and apparatus for multiple tiered matching of natural language queries to positions in a text corpus |
US6434529B1 (en) | 2000-02-16 | 2002-08-13 | Sun Microsystems, Inc. | System and method for referencing object instances and invoking methods on those object instances from within a speech recognition grammar |
EP3367268A1 (en) | 2000-02-22 | 2018-08-29 | Nokia Technologies Oy | Spatially coding and displaying information |
US7110951B1 (en) | 2000-03-03 | 2006-09-19 | Dorothy Lemelson, legal representative | System and method for enhancing speech intelligibility for the hearing impaired |
US6466654B1 (en) | 2000-03-06 | 2002-10-15 | Avaya Technology Corp. | Personal virtual assistant with semantic tagging |
US7974875B1 (en) | 2000-03-21 | 2011-07-05 | Aol Inc. | System and method for using voice over a telephone to access, process, and carry out transactions over the internet |
US6510417B1 (en) * | 2000-03-21 | 2003-01-21 | America Online, Inc. | System and method for voice access to internet-based information |
EP1279165B1 (en) | 2000-03-24 | 2011-01-05 | Eliza Corporation | Speech recognition |
US6868380B2 (en) | 2000-03-24 | 2005-03-15 | Eliza Corporation | Speech recognition system and method for generating phonotic estimates |
WO2001075676A2 (en) | 2000-04-02 | 2001-10-11 | Tangis Corporation | Soliciting information based on a computer user's context |
US6980092B2 (en) | 2000-04-06 | 2005-12-27 | Gentex Corporation | Vehicle rearview mirror assembly incorporating a communication system |
CA2408584A1 (en) | 2000-04-06 | 2001-10-18 | One Voice Technologies, Inc. | Natural language and dialogue generation processing |
US7177798B2 (en) * | 2000-04-07 | 2007-02-13 | Rensselaer Polytechnic Institute | Natural language interface using constrained intermediate dictionary of results |
US7734287B2 (en) | 2000-04-10 | 2010-06-08 | I/O Controls Corporation | System for providing remote access to diagnostic information over a wide area network |
US6726636B2 (en) | 2000-04-12 | 2004-04-27 | Loran Technologies, Inc. | Breathalyzer with voice recognition |
US6578022B1 (en) | 2000-04-18 | 2003-06-10 | Icplanet Corporation | Interactive intelligent searching with executable suggestions |
US6556973B1 (en) | 2000-04-19 | 2003-04-29 | Voxi Ab | Conversion between data representation formats |
US7502672B1 (en) | 2000-04-24 | 2009-03-10 | Usa Technologies, Inc. | Wireless vehicle diagnostics with service and part determination capabilities |
US6560576B1 (en) | 2000-04-25 | 2003-05-06 | Nuance Communications | Method and apparatus for providing active help to a user of a voice-enabled application |
WO2001082031A2 (en) * | 2000-04-26 | 2001-11-01 | Portable Internet Inc. | Portable internet services |
US6912498B2 (en) | 2000-05-02 | 2005-06-28 | Scansoft, Inc. | Error correction in speech recognition by correcting text around selected area |
JP3542026B2 (ja) | 2000-05-02 | 2004-07-14 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声認識システム、音声認識方法およびコンピュータ可読な記録媒体 |
WO2001089183A1 (en) | 2000-05-16 | 2001-11-22 | John Taschereau | Method and system for providing geographically targeted information and advertising |
DE60129955D1 (de) * | 2000-05-26 | 2007-09-27 | Koninkl Philips Electronics Nv | Verfahren und gerät zur akustischen echounterdrückung mit adaptiver strahlbildung |
US6487495B1 (en) | 2000-06-02 | 2002-11-26 | Navigation Technologies Corporation | Navigation applications using related location-referenced keywords |
US7082469B2 (en) * | 2000-06-09 | 2006-07-25 | Gold Mustache Publishing, Inc. | Method and system for electronic song dedication |
WO2001097558A2 (en) * | 2000-06-13 | 2001-12-20 | Gn Resound Corporation | Fixed polar-pattern-based adaptive directionality systems |
CN1300677C (zh) * | 2000-06-22 | 2007-02-14 | 微软公司 | 分布式计算服务平台 |
JP3567864B2 (ja) | 2000-07-21 | 2004-09-22 | 株式会社デンソー | 音声認識装置及び記録媒体 |
US7143039B1 (en) | 2000-08-11 | 2006-11-28 | Tellme Networks, Inc. | Providing menu and other services for an information processing system using a telephone or other audio interface |
US6636790B1 (en) | 2000-07-25 | 2003-10-21 | Reynolds And Reynolds Holdings, Inc. | Wireless diagnostic system and method for monitoring vehicles |
DE10036851A1 (de) | 2000-07-28 | 2002-02-28 | Mannesmann Vdo Ag | Verfahren zur Auswahl einer Ortsbezeichnung in einem Navigationssystem durch Spracheingabe |
US6968311B2 (en) | 2000-07-28 | 2005-11-22 | Siemens Vdo Automotive Corporation | User interface for telematics systems |
US7092928B1 (en) | 2000-07-31 | 2006-08-15 | Quantum Leap Research, Inc. | Intelligent portal engine |
US7027975B1 (en) | 2000-08-08 | 2006-04-11 | Object Services And Consulting, Inc. | Guided natural language interface system and method |
US7653748B2 (en) | 2000-08-10 | 2010-01-26 | Simplexity, Llc | Systems, methods and computer program products for integrating advertising within web content |
US6574624B1 (en) | 2000-08-18 | 2003-06-03 | International Business Machines Corporation | Automatic topic identification and switch for natural language search of textual document collections |
AU2001283579A1 (en) | 2000-08-21 | 2002-03-04 | Yahoo, Inc. | Method and system of interpreting and presenting web content using a voice browser |
US8200485B1 (en) | 2000-08-29 | 2012-06-12 | A9.Com, Inc. | Voice interface and methods for improving recognition accuracy of voice search queries |
US7062488B1 (en) | 2000-08-30 | 2006-06-13 | Richard Reisman | Task/domain segmentation in applying feedback to command control |
CN1226717C (zh) * | 2000-08-30 | 2005-11-09 | 国际商业机器公司 | 自动新词提取方法和系统 |
US6813341B1 (en) | 2000-08-31 | 2004-11-02 | Ivoice, Inc. | Voice activated/voice responsive item locator |
EP1184841A1 (de) | 2000-08-31 | 2002-03-06 | Siemens Aktiengesellschaft | Sprachgesteuerte Anordnung und Verfahren zur Spracheingabe und -erkennung |
JP2004508636A (ja) * | 2000-09-07 | 2004-03-18 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | 情報提供システム及びその制御方法 |
US20040205671A1 (en) | 2000-09-13 | 2004-10-14 | Tatsuya Sukehiro | Natural-language processing system |
US6982635B2 (en) | 2000-09-21 | 2006-01-03 | American Calcar Inc. | Technique for assisting a vehicle user to make a turn |
US7085708B2 (en) | 2000-09-23 | 2006-08-01 | Ravenflow, Inc. | Computer system with natural language to machine language translator |
US6362748B1 (en) | 2000-09-27 | 2002-03-26 | Lite Vision Corporation | System for communicating among vehicles and a communication system control center |
US6704576B1 (en) | 2000-09-27 | 2004-03-09 | At&T Corp. | Method and system for communicating multimedia content in a unicast, multicast, simulcast or broadcast environment |
US7216080B2 (en) | 2000-09-29 | 2007-05-08 | Mindfabric Holdings Llc | Natural-language voice-activated personal assistant |
JP2003044708A (ja) | 2000-10-02 | 2003-02-14 | Omron Corp | 情報仲介システムとそれに用いられる情報仲介方法 |
US7219058B1 (en) | 2000-10-13 | 2007-05-15 | At&T Corp. | System and method for processing speech recognition results |
US6922670B2 (en) | 2000-10-24 | 2005-07-26 | Sanyo Electric Co., Ltd. | User support apparatus and system using agents |
US6721706B1 (en) | 2000-10-30 | 2004-04-13 | Koninklijke Philips Electronics N.V. | Environment-responsive user interface/entertainment device that simulates personal interaction |
US6795808B1 (en) | 2000-10-30 | 2004-09-21 | Koninklijke Philips Electronics N.V. | User interface/entertainment device that simulates personal interaction and charges external database with relevant data |
US6934756B2 (en) * | 2000-11-01 | 2005-08-23 | International Business Machines Corporation | Conversational networking via transport, coding and control conversational protocols |
GB0027178D0 (en) * | 2000-11-07 | 2000-12-27 | Canon Kk | Speech processing system |
US6941266B1 (en) | 2000-11-15 | 2005-09-06 | At&T Corp. | Method and system for predicting problematic dialog situations in a task classification system |
US6735592B1 (en) | 2000-11-16 | 2004-05-11 | Discern Communications | System, method, and computer program product for a network-based content exchange system |
US7013308B1 (en) | 2000-11-28 | 2006-03-14 | Semscript Ltd. | Knowledge storage and retrieval system and method |
US20020065568A1 (en) | 2000-11-30 | 2002-05-30 | Silfvast Robert Denton | Plug-in modules for digital signal processor functionalities |
US20020067839A1 (en) | 2000-12-04 | 2002-06-06 | Heinrich Timothy K. | The wireless voice activated and recogintion car system |
US6973429B2 (en) | 2000-12-04 | 2005-12-06 | A9.Com, Inc. | Grammar generation for voice-based searches |
US7016847B1 (en) | 2000-12-08 | 2006-03-21 | Ben Franklin Patent Holdings L.L.C. | Open architecture for a voice user interface |
US6456711B1 (en) | 2000-12-12 | 2002-09-24 | At&T Corp. | Method for placing a call intended for an enhanced network user on hold while the enhanced network user is unavailable to take the call using a distributed feature architecture |
US7170993B2 (en) * | 2000-12-19 | 2007-01-30 | Lucent Technologies Inc. | Methods and apparatus for automated monitoring and action taking based on decision support mechanism |
US20020082911A1 (en) | 2000-12-22 | 2002-06-27 | Dunn Charles L. | Online revenue sharing |
US6973427B2 (en) | 2000-12-26 | 2005-12-06 | Microsoft Corporation | Method for adding phonetic descriptions to a speech recognition lexicon |
US20020087312A1 (en) | 2000-12-29 | 2002-07-04 | Lee Victor Wai Leung | Computer-implemented conversation buffering method and system |
US20020087326A1 (en) | 2000-12-29 | 2002-07-04 | Lee Victor Wai Leung | Computer-implemented web page summarization method and system |
DE10101282A1 (de) | 2001-01-12 | 2002-07-18 | Siemens Ag | Notrufmeldung mittels mobiler Telekommunikationsgeräte |
US7085723B2 (en) | 2001-01-12 | 2006-08-01 | International Business Machines Corporation | System and method for determining utterance context in a multi-context speech application |
US6751591B1 (en) | 2001-01-22 | 2004-06-15 | At&T Corp. | Method and system for predicting understanding errors in a task classification system |
US7069207B2 (en) | 2001-01-26 | 2006-06-27 | Microsoft Corporation | Linguistically intelligent text compression |
US7487110B2 (en) * | 2001-01-30 | 2009-02-03 | International Business Machines Corporation | Automotive information communication exchange system, method, and program product |
US6964023B2 (en) | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
US20020107873A1 (en) | 2001-02-07 | 2002-08-08 | Bandag Licensing Corporation | System and method for data collection, reporting, and analysis of fleet vehicle information |
US7206418B2 (en) | 2001-02-12 | 2007-04-17 | Fortemedia, Inc. | Noise suppression for a wireless communication device |
EP1231788A1 (en) | 2001-02-12 | 2002-08-14 | Koninklijke Philips Electronics N.V. | Arrangement for distributing content, profiling center, receiving device and method |
US6549629B2 (en) | 2001-02-21 | 2003-04-15 | Digisonix Llc | DVE system with normalized selection |
GB2372864B (en) * | 2001-02-28 | 2005-09-07 | Vox Generation Ltd | Spoken language interface |
US6754627B2 (en) | 2001-03-01 | 2004-06-22 | International Business Machines Corporation | Detecting speech recognition errors in an embedded speech recognition system |
US7024364B2 (en) | 2001-03-09 | 2006-04-04 | Bevocal, Inc. | System, method and computer program product for looking up business addresses and directions based on a voice dial-up session |
US20020173961A1 (en) * | 2001-03-09 | 2002-11-21 | Guerra Lisa M. | System, method and computer program product for dynamic, robust and fault tolerant audio output in a speech recognition framework |
US20020133402A1 (en) | 2001-03-13 | 2002-09-19 | Scott Faber | Apparatus and method for recruiting, communicating with, and paying participants of interactive advertising |
US7574362B2 (en) | 2001-03-14 | 2009-08-11 | At&T Intellectual Property Ii, L.P. | Method for automated sentence planning in a task classification system |
WO2002073449A1 (en) | 2001-03-14 | 2002-09-19 | At & T Corp. | Automated sentence planning in a task classification system |
US7729918B2 (en) | 2001-03-14 | 2010-06-01 | At&T Intellectual Property Ii, Lp | Trainable sentence planning system |
US7380250B2 (en) * | 2001-03-16 | 2008-05-27 | Microsoft Corporation | Method and system for interacting with devices having different capabilities |
US6801897B2 (en) | 2001-03-28 | 2004-10-05 | International Business Machines Corporation | Method of providing concise forms of natural commands |
US7406421B2 (en) | 2001-10-26 | 2008-07-29 | Intellisist Inc. | Systems and methods for reviewing informational content in a vehicle |
US8175886B2 (en) | 2001-03-29 | 2012-05-08 | Intellisist, Inc. | Determination of signal-processing approach based on signal destination characteristics |
US7472075B2 (en) | 2001-03-29 | 2008-12-30 | Intellisist, Inc. | System and method to associate broadcast radio content with a transaction via an internet server |
US6487494B2 (en) | 2001-03-29 | 2002-11-26 | Wingcast, Llc | System and method for reducing the amount of repetitive data sent by a server to a client for vehicle navigation |
FR2822994B1 (fr) | 2001-03-30 | 2004-05-21 | Bouygues Telecom Sa | Assistance au conducteur d'un vehicule automobile |
JP2002358095A (ja) | 2001-03-30 | 2002-12-13 | Sony Corp | 音声処理装置および音声処理方法、並びにプログラムおよび記録媒体 |
AU2002251205A1 (en) * | 2001-03-30 | 2002-10-15 | British Telecommunications Public Limited Company | Multi-modal interface |
US6996531B2 (en) * | 2001-03-30 | 2006-02-07 | Comverse Ltd. | Automated database assistance using a telephone for a speech based or text based multimedia communication mode |
US6856990B2 (en) * | 2001-04-09 | 2005-02-15 | Intel Corporation | Network dedication system |
JP2002318132A (ja) | 2001-04-23 | 2002-10-31 | Hitachi Ltd | 音声対話型ナビゲーションシステムおよび移動端末装置および音声対話サーバ |
US7437295B2 (en) | 2001-04-27 | 2008-10-14 | Accenture Llp | Natural language processing for a location-based services system |
US7970648B2 (en) | 2001-04-27 | 2011-06-28 | Accenture Global Services Limited | Advertising campaign and business listing management for a location-based services system |
US6950821B2 (en) | 2001-05-04 | 2005-09-27 | Sun Microsystems, Inc. | System and method for resolving distributed network search queries to information providers |
US6804684B2 (en) | 2001-05-07 | 2004-10-12 | Eastman Kodak Company | Method for associating semantic information with multiple images in an image database environment |
US20020173333A1 (en) | 2001-05-18 | 2002-11-21 | Buchholz Dale R. | Method and apparatus for processing barge-in requests |
US6944594B2 (en) | 2001-05-30 | 2005-09-13 | Bellsouth Intellectual Property Corporation | Multi-context conversational environment system and method |
JP2003005897A (ja) * | 2001-06-20 | 2003-01-08 | Alpine Electronics Inc | 情報入力方法および装置 |
US6801604B2 (en) * | 2001-06-25 | 2004-10-05 | International Business Machines Corporation | Universal IP-based and scalable architectures across conversational applications using web services for speech and audio processing resources |
US20020198714A1 (en) | 2001-06-26 | 2002-12-26 | Guojun Zhou | Statistical spoken dialog system |
US20100029261A1 (en) * | 2001-06-27 | 2010-02-04 | John Mikkelsen | Virtual wireless data cable method, apparatus and system |
US20050234727A1 (en) * | 2001-07-03 | 2005-10-20 | Leo Chiu | Method and apparatus for adapting a voice extensible markup language-enabled voice system for natural speech recognition and system response |
US6983307B2 (en) | 2001-07-11 | 2006-01-03 | Kirusa, Inc. | Synchronization among plural browsers |
US7123727B2 (en) * | 2001-07-18 | 2006-10-17 | Agere Systems Inc. | Adaptive close-talking differential microphone array |
US7283951B2 (en) | 2001-08-14 | 2007-10-16 | Insightful Corporation | Method and system for enhanced data searching |
US6757544B2 (en) | 2001-08-15 | 2004-06-29 | Motorola, Inc. | System and method for determining a location relevant to a communication device and/or its associated user |
US7920682B2 (en) | 2001-08-21 | 2011-04-05 | Byrne William J | Dynamic interactive voice interface |
US7305381B1 (en) | 2001-09-14 | 2007-12-04 | Ricoh Co., Ltd | Asynchronous unconscious retrieval in a network of information appliances |
US6959276B2 (en) | 2001-09-27 | 2005-10-25 | Microsoft Corporation | Including the category of environmental noise when processing speech signals |
US6721633B2 (en) | 2001-09-28 | 2004-04-13 | Robert Bosch Gmbh | Method and device for interfacing a driver information system using a voice portal server |
US7289606B2 (en) | 2001-10-01 | 2007-10-30 | Sandeep Sibal | Mode-swapping in multi-modal telephonic applications |
JP3997459B2 (ja) | 2001-10-02 | 2007-10-24 | 株式会社日立製作所 | 音声入力システムおよび音声ポータルサーバおよび音声入力端末 |
US7254384B2 (en) | 2001-10-03 | 2007-08-07 | Accenture Global Services Gmbh | Multi-modal messaging |
US20030069734A1 (en) | 2001-10-05 | 2003-04-10 | Everhart Charles Allen | Technique for active voice recognition grammar adaptation for dynamic multimedia application |
JP4065936B2 (ja) | 2001-10-09 | 2008-03-26 | 独立行政法人情報通信研究機構 | 機械学習法を用いた言語解析処理システムおよび機械学習法を用いた言語省略解析処理システム |
US6501834B1 (en) | 2001-11-21 | 2002-12-31 | At&T Corp. | Message sender status monitor |
US20030101054A1 (en) | 2001-11-27 | 2003-05-29 | Ncc, Llc | Integrated system and method for electronic speech recognition and transcription |
US7174300B2 (en) * | 2001-12-11 | 2007-02-06 | Lockheed Martin Corporation | Dialog processing method and apparatus for uninhabited air vehicles |
US7165028B2 (en) * | 2001-12-12 | 2007-01-16 | Texas Instruments Incorporated | Method of speech recognition resistant to convolutive distortion and additive distortion |
GB2383247A (en) | 2001-12-13 | 2003-06-18 | Hewlett Packard Co | Multi-modal picture allowing verbal interaction between a user and the picture |
US7231343B1 (en) | 2001-12-20 | 2007-06-12 | Ianywhere Solutions, Inc. | Synonyms mechanism for natural language systems |
US20030120493A1 (en) | 2001-12-21 | 2003-06-26 | Gupta Sunil K. | Method and system for updating and customizing recognition vocabulary |
EP1324274A3 (en) | 2001-12-28 | 2005-11-02 | Matsushita Electric Industrial Co., Ltd. | Vehicle information recording system |
US7203644B2 (en) | 2001-12-31 | 2007-04-10 | Intel Corporation | Automating tuning of speech recognition systems |
US7493259B2 (en) * | 2002-01-04 | 2009-02-17 | Siebel Systems, Inc. | Method for accessing data via voice |
US7493559B1 (en) * | 2002-01-09 | 2009-02-17 | Ricoh Co., Ltd. | System and method for direct multi-modal annotation of objects |
US7117200B2 (en) | 2002-01-11 | 2006-10-03 | International Business Machines Corporation | Synthesizing information-bearing content from multiple channels |
US7111248B2 (en) | 2002-01-15 | 2006-09-19 | Openwave Systems Inc. | Alphanumeric information input method |
US7536297B2 (en) | 2002-01-22 | 2009-05-19 | International Business Machines Corporation | System and method for hybrid text mining for finding abbreviations and their definitions |
US7054817B2 (en) | 2002-01-25 | 2006-05-30 | Canon Europa N.V. | User interface for speech model generation and testing |
US20030144846A1 (en) | 2002-01-31 | 2003-07-31 | Denenberg Lawrence A. | Method and system for modifying the behavior of an application based upon the application's grammar |
US7130390B2 (en) * | 2002-02-01 | 2006-10-31 | Microsoft Corporation | Audio messaging system and method |
US7177814B2 (en) | 2002-02-07 | 2007-02-13 | Sap Aktiengesellschaft | Dynamic grammar for voice-enabled applications |
US7058890B2 (en) | 2002-02-13 | 2006-06-06 | Siebel Systems, Inc. | Method and system for enabling connectivity to a data system |
US8249880B2 (en) | 2002-02-14 | 2012-08-21 | Intellisist, Inc. | Real-time display of system instructions |
US7587317B2 (en) | 2002-02-15 | 2009-09-08 | Microsoft Corporation | Word training interface |
JP3974419B2 (ja) | 2002-02-18 | 2007-09-12 | 株式会社日立製作所 | 音声入力を用いた情報取得方法及び情報取得システム |
AU2003217769A1 (en) | 2002-02-27 | 2003-09-09 | Mary Beth Sater | System and method that facilitates customizing media |
US6704396B2 (en) * | 2002-02-27 | 2004-03-09 | Sbc Technology Resources, Inc. | Multi-modal communications method |
US7016849B2 (en) | 2002-03-25 | 2006-03-21 | Sri International | Method and apparatus for providing speech-driven routing between spoken language applications |
US7136875B2 (en) | 2002-09-24 | 2006-11-14 | Google, Inc. | Serving advertisements based on content |
US7072834B2 (en) | 2002-04-05 | 2006-07-04 | Intel Corporation | Adapting to adverse acoustic environment in speech processing using playback training data |
US7197460B1 (en) | 2002-04-23 | 2007-03-27 | At&T Corp. | System for handling frequently asked questions in a natural language dialog service |
US6877001B2 (en) | 2002-04-25 | 2005-04-05 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for retrieving documents with spoken queries |
US7167568B2 (en) | 2002-05-02 | 2007-01-23 | Microsoft Corporation | Microphone array signal enhancement |
US20030212558A1 (en) * | 2002-05-07 | 2003-11-13 | Matula Valentine C. | Method and apparatus for distributed interactive voice processing |
US6915126B2 (en) | 2002-05-08 | 2005-07-05 | General Motors Corporation | Method of activating a wireless communication system in a mobile vehicle |
US20030212550A1 (en) | 2002-05-10 | 2003-11-13 | Ubale Anil W. | Method, apparatus, and system for improving speech quality of voice-over-packets (VOP) systems |
US20030212562A1 (en) | 2002-05-13 | 2003-11-13 | General Motors Corporation | Manual barge-in for server-based in-vehicle voice recognition systems |
JP2003329477A (ja) | 2002-05-15 | 2003-11-19 | Pioneer Electronic Corp | ナビゲーション装置及び対話型情報提供プログラム |
US7107210B2 (en) | 2002-05-20 | 2006-09-12 | Microsoft Corporation | Method of noise reduction based on dynamic aspects of speech |
US7127400B2 (en) | 2002-05-22 | 2006-10-24 | Bellsouth Intellectual Property Corporation | Methods and systems for personal interactive voice response |
US7546382B2 (en) * | 2002-05-28 | 2009-06-09 | International Business Machines Corporation | Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms |
US20040140989A1 (en) * | 2002-05-28 | 2004-07-22 | John Papageorge | Content subscription and delivery service |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7512906B1 (en) * | 2002-06-04 | 2009-03-31 | Rockwell Automation Technologies, Inc. | System and methodology providing adaptive interface in an industrial controller environment |
US7143037B1 (en) | 2002-06-12 | 2006-11-28 | Cisco Technology, Inc. | Spelling words using an arbitrary phonetic alphabet |
US7548491B2 (en) | 2002-06-13 | 2009-06-16 | General Motors Corporation | Personalized key system for a mobile vehicle |
US7502737B2 (en) | 2002-06-24 | 2009-03-10 | Intel Corporation | Multi-pass recognition of spoken dialogue |
US20050021470A1 (en) * | 2002-06-25 | 2005-01-27 | Bose Corporation | Intelligent music track selection |
US7177816B2 (en) | 2002-07-05 | 2007-02-13 | At&T Corp. | System and method of handling problematic input during context-sensitive help for multi-modal dialog systems |
US7177815B2 (en) * | 2002-07-05 | 2007-02-13 | At&T Corp. | System and method of context-sensitive help for multi-modal dialog systems |
US20040010358A1 (en) * | 2002-07-12 | 2004-01-15 | General Motors Corporation | Vehicle personalization through web portal |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
EP1391830A1 (fr) | 2002-07-19 | 2004-02-25 | Albert Inc. S.A. | Système d'extraction d'informations dans un texte en langage naturel |
EP1394692A1 (en) * | 2002-08-05 | 2004-03-03 | Alcatel | Method, terminal, browser application, and mark-up language for multimodal interaction between a user and a terminal |
US7236923B1 (en) | 2002-08-07 | 2007-06-26 | Itt Manufacturing Enterprises, Inc. | Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text |
US6741931B1 (en) | 2002-09-05 | 2004-05-25 | Daimlerchrysler Corporation | Vehicle navigation system with off-board server |
WO2004025463A1 (ja) * | 2002-09-09 | 2004-03-25 | Catena Corporation | 要件定義方法、ソフトウェアの開発方法、及び、要件単語の変更方法並びに新規規定方法 |
US7184957B2 (en) * | 2002-09-25 | 2007-02-27 | Toyota Infotechnology Center Co., Ltd. | Multiple pass speech recognition method and system |
US7328155B2 (en) * | 2002-09-25 | 2008-02-05 | Toyota Infotechnology Center Co., Ltd. | Method and system for speech recognition using grammar weighted based upon location information |
US20030115062A1 (en) | 2002-10-29 | 2003-06-19 | Walker Marilyn A. | Method for automated sentence planning |
US8793127B2 (en) | 2002-10-31 | 2014-07-29 | Promptu Systems Corporation | Method and apparatus for automatically determining speaker characteristics for speech-directed advertising or other enhancement of speech-controlled devices or services |
US6739556B1 (en) | 2002-11-20 | 2004-05-25 | Raytheon Company | Method and apparatus for providing an aircraft emergency safety control system |
WO2004053836A1 (en) | 2002-12-10 | 2004-06-24 | Kirusa, Inc. | Techniques for disambiguating speech input using multimodal interfaces |
US6834265B2 (en) * | 2002-12-13 | 2004-12-21 | Motorola, Inc. | Method and apparatus for selective speech recognition |
US7890324B2 (en) * | 2002-12-19 | 2011-02-15 | At&T Intellectual Property Ii, L.P. | Context-sensitive interface widgets for multi-modal dialog systems |
US20040158555A1 (en) | 2003-02-11 | 2004-08-12 | Terradigtal Systems Llc. | Method for managing a collection of media objects |
DE10306022B3 (de) | 2003-02-13 | 2004-02-19 | Siemens Ag | Dreistufige Einzelworterkennung |
GB2398913B (en) * | 2003-02-27 | 2005-08-17 | Motorola Inc | Noise estimation in speech recognition |
JP4103639B2 (ja) | 2003-03-14 | 2008-06-18 | セイコーエプソン株式会社 | 音響モデル作成方法および音響モデル作成装置ならびに音声認識装置 |
US7146319B2 (en) | 2003-03-31 | 2006-12-05 | Novauris Technologies Ltd. | Phonetically based speech recognition system and method |
US20050021826A1 (en) * | 2003-04-21 | 2005-01-27 | Sunil Kumar | Gateway controller for a multimodal system that provides inter-communication among different data and voice servers through various mobile devices, and interface for that controller |
US7421393B1 (en) | 2004-03-01 | 2008-09-02 | At&T Corp. | System for developing a dialog manager using modular spoken-dialog components |
US20050015256A1 (en) * | 2003-05-29 | 2005-01-20 | Kargman James B. | Method and apparatus for ordering food items, and in particular, pizza |
JP2005003926A (ja) * | 2003-06-11 | 2005-01-06 | Sony Corp | 情報処理装置および方法、並びにプログラム |
JP2005010691A (ja) | 2003-06-20 | 2005-01-13 | P To Pa:Kk | 音声認識装置、音声認識方法、会話制御装置、会話制御方法及びこれらのためのプログラム |
KR100577387B1 (ko) * | 2003-08-06 | 2006-05-10 | 삼성전자주식회사 | 음성 대화 시스템에서의 음성 인식 오류 처리 방법 및 장치 |
US20050043940A1 (en) * | 2003-08-20 | 2005-02-24 | Marvin Elder | Preparing a data source for a natural language query |
US7428497B2 (en) | 2003-10-06 | 2008-09-23 | Utbk, Inc. | Methods and apparatuses for pay-per-call advertising in mobile/wireless applications |
US20070162296A1 (en) | 2003-10-06 | 2007-07-12 | Utbk, Inc. | Methods and apparatuses for audio advertisements |
GB0325497D0 (en) * | 2003-10-31 | 2003-12-03 | Vox Generation Ltd | Automated speech application creation deployment and management |
US7454608B2 (en) * | 2003-10-31 | 2008-11-18 | International Business Machines Corporation | Resource configuration in multi-modal distributed computing systems |
US7697673B2 (en) | 2003-11-17 | 2010-04-13 | Apptera Inc. | System for advertisement selection, placement and delivery within a multiple-tenant voice interaction service system |
JP2005157494A (ja) | 2003-11-20 | 2005-06-16 | Aruze Corp | 会話制御装置及び会話制御方法 |
JP4558308B2 (ja) | 2003-12-03 | 2010-10-06 | ニュアンス コミュニケーションズ,インコーポレイテッド | 音声認識システム、データ処理装置、そのデータ処理方法及びプログラム |
US20050137877A1 (en) | 2003-12-17 | 2005-06-23 | General Motors Corporation | Method and system for enabling a device function of a vehicle |
US7027586B2 (en) * | 2003-12-18 | 2006-04-11 | Sbc Knowledge Ventures, L.P. | Intelligently routing customer communications |
US20050137850A1 (en) | 2003-12-23 | 2005-06-23 | Intel Corporation | Method for automation of programmable interfaces |
US7386443B1 (en) | 2004-01-09 | 2008-06-10 | At&T Corp. | System and method for mobile automatic speech recognition |
JP3924583B2 (ja) | 2004-02-03 | 2007-06-06 | 松下電器産業株式会社 | ユーザ適応型装置およびその制御方法 |
US7542903B2 (en) | 2004-02-18 | 2009-06-02 | Fuji Xerox Co., Ltd. | Systems and methods for determining predictive models of discourse functions |
US20050216254A1 (en) | 2004-03-24 | 2005-09-29 | Gupta Anurag K | System-resource-based multi-modal input fusion |
US20050246174A1 (en) | 2004-04-28 | 2005-11-03 | Degolia Richard C | Method and system for presenting dynamic commercial content to clients interacting with a voice extensible markup language system |
US20050283752A1 (en) * | 2004-05-17 | 2005-12-22 | Renate Fruchter | DiVAS-a cross-media system for ubiquitous gesture-discourse-sketch knowledge capture and reuse |
US20060206310A1 (en) | 2004-06-29 | 2006-09-14 | Damaka, Inc. | System and method for natural language processing in a peer-to-peer hybrid communications network |
DE102004037858A1 (de) | 2004-08-04 | 2006-03-16 | Harman Becker Automotive Systems Gmbh | Navigationssystem mit sprachgesteuerter Angabe von Sonderzielen |
US7480618B2 (en) | 2004-09-02 | 2009-01-20 | Microsoft Corporation | Eliminating interference of noisy modality in a multimodal application |
US7587482B2 (en) * | 2004-09-08 | 2009-09-08 | Yahoo! Inc. | Multimodal interface for mobile messaging |
FR2875919A1 (fr) | 2004-09-27 | 2006-03-31 | France Telecom | Agent rationnel dialoguant, systeme de dialogue intelligent l'utilisant, procede de pilotage d'un dialogue intelligent, et programme pour sa mise en oeuvre |
US20060074660A1 (en) | 2004-09-29 | 2006-04-06 | France Telecom | Method and apparatus for enhancing speech recognition accuracy by using geographic data to filter a set of words |
US7925506B2 (en) | 2004-10-05 | 2011-04-12 | Inago Corporation | Speech recognition accuracy via concept to keyword mapping |
US7376645B2 (en) | 2004-11-29 | 2008-05-20 | The Intellection Group, Inc. | Multimodal natural language query system and architecture for processing voice and proximity-based queries |
US20070214182A1 (en) | 2005-01-15 | 2007-09-13 | Outland Research, Llc | Establishment-based media and messaging service |
US7873654B2 (en) * | 2005-01-24 | 2011-01-18 | The Intellection Group, Inc. | Multimodal natural language query system for processing and analyzing voice and proximity-based queries |
US7437297B2 (en) | 2005-01-27 | 2008-10-14 | International Business Machines Corporation | Systems and methods for predicting consequences of misinterpretation of user commands in automated systems |
KR100718147B1 (ko) | 2005-02-01 | 2007-05-14 | 삼성전자주식회사 | 음성인식용 문법망 생성장치 및 방법과 이를 이용한 대화체음성인식장치 및 방법 |
US7831433B1 (en) | 2005-02-03 | 2010-11-09 | Hrl Laboratories, Llc | System and method for using context in navigation dialog |
US7461059B2 (en) | 2005-02-23 | 2008-12-02 | Microsoft Corporation | Dynamically updated search results based upon continuously-evolving search query that is based at least in part upon phrase suggestion, search engine uses previous result sets performing additional search tasks |
US7283829B2 (en) * | 2005-03-25 | 2007-10-16 | Cisco Technology, Inc. | Management of call requests in multi-modal communication environments |
US7668716B2 (en) | 2005-05-05 | 2010-02-23 | Dictaphone Corporation | Incorporation of external knowledge in multimodal dialog systems |
US7813485B2 (en) * | 2005-05-26 | 2010-10-12 | International Business Machines Corporation | System and method for seamlessly integrating an interactive visual menu with an voice menu provided in an interactive voice response system |
US7917365B2 (en) | 2005-06-16 | 2011-03-29 | Nuance Communications, Inc. | Synchronizing visual and speech events in a multimodal application |
US7873523B2 (en) * | 2005-06-30 | 2011-01-18 | Microsoft Corporation | Computer implemented method of analyzing recognition results between a user and an interactive application utilizing inferred values instead of transcribed speech |
JP2007011928A (ja) * | 2005-07-04 | 2007-01-18 | Sony Corp | コンテンツ提供システム、コンテンツ提供装置、コンテンツ配信サーバ、コンテンツ受信端末およびコンテンツ提供方法 |
WO2007008798A2 (en) * | 2005-07-07 | 2007-01-18 | V-Enable, Inc. | System and method for searching for network-based content in a multi-modal system using spoken keywords |
WO2007008248A2 (en) | 2005-07-11 | 2007-01-18 | Voicedemand, Inc. | Voice control of a media player |
US7620549B2 (en) * | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US20070043569A1 (en) * | 2005-08-19 | 2007-02-22 | Intervoice Limited Partnership | System and method for inheritance of advertised functionality in a user interactive system |
US8027684B2 (en) * | 2005-08-22 | 2011-09-27 | Infosys Technologies, Ltd. | System for performing a task in a communication network and methods thereof |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
US9009046B1 (en) * | 2005-09-27 | 2015-04-14 | At&T Intellectual Property Ii, L.P. | System and method for disambiguating multiple intents in a natural language dialog system |
US7672852B2 (en) | 2005-09-29 | 2010-03-02 | Microsoft Corporation | Localization of prompts |
US8626588B2 (en) | 2005-09-30 | 2014-01-07 | Google Inc. | Advertising with audio content |
US20070078708A1 (en) | 2005-09-30 | 2007-04-05 | Hua Yu | Using speech recognition to determine advertisements relevant to audio content and/or audio content relevant to advertisements |
JP4849662B2 (ja) | 2005-10-21 | 2012-01-11 | 株式会社ユニバーサルエンターテインメント | 会話制御装置 |
US7477909B2 (en) * | 2005-10-31 | 2009-01-13 | Nuance Communications, Inc. | System and method for conducting a search using a wireless mobile device |
US7587308B2 (en) | 2005-11-21 | 2009-09-08 | Hewlett-Packard Development Company, L.P. | Word recognition using ontologies |
US20070135101A1 (en) * | 2005-12-08 | 2007-06-14 | Comverse, Ltd. | Enhanced visual IVR capabilities |
US8325398B2 (en) | 2005-12-22 | 2012-12-04 | Canon Kabushiki Kaisha | Image editing system, image management apparatus, and image editing program |
US20070186165A1 (en) | 2006-02-07 | 2007-08-09 | Pudding Ltd. | Method And Apparatus For Electronically Providing Advertisements |
DE102006006305A1 (de) * | 2006-02-10 | 2007-08-16 | Siemens Ag | Verfahren zur Aufbereitung von Informationen für ein Sprachdialogsystem |
JP4131978B2 (ja) * | 2006-02-24 | 2008-08-13 | 本田技研工業株式会社 | 音声認識機器制御装置 |
US20070203736A1 (en) | 2006-02-28 | 2007-08-30 | Commonwealth Intellectual Property Holdings, Inc. | Interactive 411 Directory Assistance |
EP2011017A4 (en) | 2006-03-30 | 2010-07-07 | Stanford Res Inst Int | METHOD AND APPARATUS FOR ANNOTATING MULTIMEDIA STREAMS |
US7533089B2 (en) | 2006-06-27 | 2009-05-12 | International Business Machines Corporation | Hybrid approach for query recommendation in conversation systems |
CN101489469B (zh) * | 2006-07-10 | 2012-12-12 | 埃森哲环球服务有限公司 | 用于提供反馈的移动个人服务平台 |
US7620551B2 (en) | 2006-07-20 | 2009-11-17 | Mspot, Inc. | Method and apparatus for providing search capability and targeted advertising for audio, image, and video content over the internet |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8145493B2 (en) | 2006-09-11 | 2012-03-27 | Nuance Communications, Inc. | Establishing a preferred mode of interaction between a user and a multimodal application |
US8086463B2 (en) | 2006-09-12 | 2011-12-27 | Nuance Communications, Inc. | Dynamically generating a vocal help prompt in a multimodal application |
WO2008032329A2 (en) | 2006-09-13 | 2008-03-20 | Alon Atsmon | Providing content responsive to multimedia signals |
US7788084B2 (en) | 2006-09-19 | 2010-08-31 | Xerox Corporation | Labeling of work of art titles in text for natural language processing |
US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
WO2008057268A2 (en) | 2006-10-26 | 2008-05-15 | Mobile Content Networks, Inc. | Techniques for determining relevant advertisements in response to queries |
WO2008056251A2 (en) | 2006-11-10 | 2008-05-15 | Audiogate Technologies Ltd. | System and method for providing advertisement based on speech recognition |
JP4833043B2 (ja) | 2006-11-30 | 2011-12-07 | ヤフー株式会社 | ブログ又はクエリ・クリックを利用したアフィニティ装置 |
US7640272B2 (en) | 2006-12-07 | 2009-12-29 | Microsoft Corporation | Using automated content analysis for audio/video content consumption |
TWI342010B (en) | 2006-12-13 | 2011-05-11 | Delta Electronics Inc | Speech recognition method and system with intelligent classification and adjustment |
US7630978B2 (en) * | 2006-12-14 | 2009-12-08 | Yahoo! Inc. | Query rewriting with spell correction suggestions using a generated set of query features |
US20080154604A1 (en) | 2006-12-22 | 2008-06-26 | Nokia Corporation | System and method for providing context-based dynamic speech grammar generation for use in search applications |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US8909532B2 (en) | 2007-03-23 | 2014-12-09 | Nuance Communications, Inc. | Supporting multi-lingual user interaction with a multimodal application |
US8060367B2 (en) | 2007-06-26 | 2011-11-15 | Targus Information Corporation | Spatially indexed grammar and methods of use |
US8219399B2 (en) | 2007-07-11 | 2012-07-10 | Garmin Switzerland Gmbh | Automated speech recognition (ASR) tiling |
US20090070109A1 (en) | 2007-09-12 | 2009-03-12 | Microsoft Corporation | Speech-to-Text Transcription for Personal Communication Devices |
DE102007044792B4 (de) * | 2007-09-19 | 2012-12-13 | Siemens Ag | Verfahren, Steuergerät und System zur Steuerung oder Bedienung |
DE102008051757A1 (de) | 2007-11-12 | 2009-05-14 | Volkswagen Ag | Multimodale Benutzerschnittstelle eines Fahrerassistenzsystems zur Eingabe und Präsentation von Informationen |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US8077975B2 (en) | 2008-02-26 | 2011-12-13 | Microsoft Corporation | Handwriting symbol recognition accuracy using speech input |
US8255224B2 (en) | 2008-03-07 | 2012-08-28 | Google Inc. | Voice recognition grammar selection based on context |
US20090259646A1 (en) | 2008-04-09 | 2009-10-15 | Yahoo!, Inc. | Method for Calculating Score for Search Query |
US20090276700A1 (en) | 2008-04-30 | 2009-11-05 | Nokia Corporation | Method, apparatus, and computer program product for determining user status indicators |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8171021B2 (en) * | 2008-06-23 | 2012-05-01 | Google Inc. | Query identification and association |
US8909810B2 (en) * | 2008-08-05 | 2014-12-09 | Isabella Products, Inc. | Systems and methods for multimedia content sharing |
US8224652B2 (en) * | 2008-09-26 | 2012-07-17 | Microsoft Corporation | Speech and text driven HMM-based body animation synthesis |
US20100094707A1 (en) | 2008-10-10 | 2010-04-15 | Carl Johan Freer | Method and platform for voice and location-based services for mobile advertising |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US8762156B2 (en) | 2011-09-28 | 2014-06-24 | Apple Inc. | Speech recognition repair using contextual information |
-
2010
- 2010-02-09 US US12/703,032 patent/US9171541B2/en active Active
- 2010-09-16 WO PCT/US2010/049077 patent/WO2011059562A1/en active Application Filing
- 2010-09-16 CN CN201610183358.1A patent/CN105989840A/zh active Pending
- 2010-09-16 KR KR1020127014872A patent/KR20120120163A/ko active Application Filing
- 2010-09-16 JP JP2012538814A patent/JP5663031B2/ja not_active Expired - Fee Related
- 2010-09-16 KR KR1020177021619A patent/KR101798828B1/ko active IP Right Grant
- 2010-09-16 EP EP10830368.6A patent/EP2499582A4/en not_active Ceased
- 2010-09-16 CN CN201080061105.8A patent/CN102792294B/zh active Active
- 2010-09-16 KR KR1020177032087A patent/KR101912058B1/ko active IP Right Grant
- 2010-09-16 EP EP20169496.5A patent/EP3809406A1/en not_active Withdrawn
-
2015
- 2015-10-26 US US14/922,921 patent/US20160049152A1/en not_active Abandoned
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020143532A1 (en) * | 2001-04-02 | 2002-10-03 | Mclean James Gordon | Method and system for collaborative speech recognition for small-area network |
US20040166832A1 (en) * | 2001-10-03 | 2004-08-26 | Accenture Global Services Gmbh | Directory assistance with multi-modal messaging |
US20040102977A1 (en) * | 2002-11-22 | 2004-05-27 | Metzler Benjamin T. | Methods and apparatus for controlling an electronic device |
US20060041926A1 (en) * | 2004-04-30 | 2006-02-23 | Vulcan Inc. | Voice control of multimedia content |
CN101297355A (zh) * | 2005-08-05 | 2008-10-29 | 沃伊斯博克斯科技公司 | 响应自然语言语音口头表达的系统和方法 |
US20070055525A1 (en) * | 2005-08-31 | 2007-03-08 | Kennewick Robert A | Dynamic speech sharpening |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111539217A (zh) * | 2017-05-19 | 2020-08-14 | 北京蓦然认知科技有限公司 | 一种用于自然语言内容标题消歧的方法、设备和系统 |
CN111539217B (zh) * | 2017-05-19 | 2024-01-12 | 无锡小飞玛信息技术有限公司 | 一种用于自然语言内容标题消歧的方法、设备和系统 |
CN108597520A (zh) * | 2018-04-23 | 2018-09-28 | 四川斐讯信息技术有限公司 | 一种智能插排和智能插排的控制方法 |
CN111368549A (zh) * | 2018-12-25 | 2020-07-03 | 深圳市优必选科技有限公司 | 一种支持多种服务的自然语言处理方法、装置及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN102792294B (zh) | 2016-04-27 |
US20110112827A1 (en) | 2011-05-12 |
KR20170091797A (ko) | 2017-08-09 |
KR20170126023A (ko) | 2017-11-15 |
US9171541B2 (en) | 2015-10-27 |
EP2499582A4 (en) | 2013-12-25 |
WO2011059562A1 (en) | 2011-05-19 |
US20160049152A1 (en) | 2016-02-18 |
KR101912058B1 (ko) | 2018-10-25 |
EP2499582A1 (en) | 2012-09-19 |
KR101798828B1 (ko) | 2017-11-16 |
JP5663031B2 (ja) | 2015-02-04 |
EP3809406A1 (en) | 2021-04-21 |
JP2013510341A (ja) | 2013-03-21 |
KR20120120163A (ko) | 2012-11-01 |
CN102792294A (zh) | 2012-11-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102792294B (zh) | 自然语言语音服务环境中的混合处理的系统及方法 | |
US20220221959A1 (en) | Annotations in software applications for invoking dialog system functions | |
US10553216B2 (en) | System and method for an integrated, multi-modal, multi-device natural language voice services environment | |
AU2015210460B2 (en) | Speech recognition repair using contextual information | |
US9502025B2 (en) | System and method for providing a natural language content dedication service | |
JP6588637B2 (ja) | 個別化されたエンティティ発音の学習 | |
US20190027147A1 (en) | Automatic integration of image capture and recognition in a voice-based query to understand intent | |
JP5598998B2 (ja) | 音声翻訳システム、第一端末装置、音声認識サーバ装置、翻訳サーバ装置、および音声合成サーバ装置 | |
EP2283431B1 (en) | System and method for an integrated, multi-device natural language voice services system | |
RU2349969C2 (ru) | Синхронное понимание семантических объектов, реализованное с помощью тэгов речевого приложения | |
JP2015122084A (ja) | 自然言語音声サービス環境においてマルチモーダル機器対話を処理するシステム及び方法 | |
CN117809641A (zh) | 一种终端设备及基于查询文本改写的语音交互方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20161005 |