CN102150128B - Audio user interface - Google Patents
- Publication number
- CN102150128B (application CN200980135356.3A)
- Authority
- CN
- China
- Prior art keywords
- audio
- audio prompt
- user
- media player
- user interface
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01C—MEASURING DISTANCES, LEVELS OR BEARINGS; SURVEYING; NAVIGATION; GYROSCOPIC INSTRUMENTS; PHOTOGRAMMETRY OR VIDEOGRAMMETRY
- G01C21/00—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00
- G01C21/26—Navigation; Navigational instruments not provided for in groups G01C1/00 - G01C19/00 specially adapted for navigation in a road network
- G01C21/34—Route searching; Route guidance
- G01C21/36—Input/output arrangements for on-board computers
- G01C21/3626—Details of the output of route guidance instructions
- G01C21/3629—Guidance using speech or audio output, e.g. text-to-speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M1/00—Substation equipment, e.g. for use by subscribers
- H04M1/72—Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
- H04M1/724—User interfaces specially adapted for cordless or mobile telephones
- H04M1/72475—User interfaces specially adapted for cordless or mobile telephones specially adapted for disabled users
- H04M1/72481—User interfaces specially adapted for cordless or mobile telephones specially adapted for disabled users for visually impaired users
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/39—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech synthesis
Abstract
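An audio user interface is disclosed that provides audio prompts that help a user interact with the user interface of an electronic device. The audio prompts can provide audio indications that allow the user to focus visual attention on other tasks, such as driving a car, exercising, or crossing a street, while still enabling the user to interact with the user interface. An intelligent path can provide access to different types of audio prompts from a variety of different sources. The different types of audio prompts can be presented based on the availability of a particular type of audio prompt. For example, the audio prompts can include pre-recorded voice audio, such as celebrity voices or cartoon characters, obtained from a dedicated voice server. When pre-recorded or synthesized audio data is unavailable, non-voice audio prompts can be provided.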
Description
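Cross-Reference to Related Applications
This application is related to U.S. Patent Application No. 10/981,993, filed November 4, 2004, entitled "Audio User Interface For Computing Devices", and to U.S. Patent Application No. 10/623,339, filed July 18, 2003, entitled "Voice Menu System", the entire contents of which are incorporated herein by reference.
Technical Field
The present invention relates generally to audio user interfaces, and more particularly to techniques for providing an audio user interface for a computing device.
Background
Electronic devices (such as portable media players, cellular phones, and personal digital assistants (PDAs)) are popular in today's market, as are the peripheral electronic devices (such as docking stations) that support their use. As competition in personal electronics intensifies, consumers are becoming more demanding about the functionality and use of these devices.
Users listen to, watch, or otherwise receive and consume content in a variety of settings. For example, people often listen to music while driving, riding public transportation, exercising, hiking, or doing housework. In addition to playing back content stored on a media player, users now more often use media players to receive radio, television, satellite broadcasts, global positioning, and other broadcast-based location services for navigation and entertainment.
Traditionally, a media player or portable media player has the ability to play media, such as audio (for example, songs) or video (for example, movies), for its user. When audio is playing, a media player that includes a display can present the song title, the artist, and other information related to the song. When video is playing, the display can be used to present the video.
To achieve portability, many handheld devices use user interfaces that present various display screens to the user for interaction that is primarily visual. Users interact with these user interfaces by manipulating a scroll wheel and/or a set of buttons to navigate the display screens and thereby access the functions of the handheld devices. However, these user interfaces are sometimes difficult to use for several reasons. One reason is that the display screens often have a small size and form factor and are therefore hard to see. Another reason is that the user may have poor reading vision or may otherwise be visually impaired. Even when the display screens can be perceived, a user will have difficulty navigating the user interface when the user cannot shift visual focus from an important activity to the user interface. Such activities include, for example, driving a car, exercising, and crossing a street.
Accordingly, improved methods and apparatus are needed to address some of the problems described above. Improved methods and apparatus are also needed to reduce some of the drawbacks described above.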
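Summary
In various embodiments, the experience of a user interacting with an electronic device (such as a media player or portable media device) can be enhanced by including an audio user interface that provides an intelligent path for determining whether a suitable audio dialog is available for the audio user interface. For example, depending on whether the electronic device has a broadband connection to a communications network (such as the Internet), a determination can be made to request that audio files of a first type or category (for example, high-quality voice recordings) be streamed from a voice server to the electronic device for output by the audio user interface. In another example, a determination can be made to use only audio files of a second type or category (for example, low-quality voice recordings) that are available on a media storage device accessible to the electronic device. In yet another example, when pre-recorded voice audio data is unavailable, a determination can be made to use one or more speech synthesis or text-to-speech techniques to create a third category of audio data for the audio prompts of the audio user interface.
In some embodiments, the user of an electronic device (such as a media player or portable media device) can determine the quality of the audio prompts to be presented (for example, played) for the audio user interface. The user can provide one or more user preferences indicating whether pre-recorded audio data should be used, whether audio prompts synthesized using one or more synthesis techniques should be used, or whether conventional beeps or other non-voice audio data should be used for the audio user interface. Accordingly, an electronic device (such as a media player or portable media device), with or without a display, can be enhanced by an audio user interface to facilitate user interaction based on whether a service is available or on other selection criteria.
In one embodiment, input is received that represents a user's interaction with a user interface associated with an electronic device (such as a media player or portable media device). The user can interact with the media player by pressing a button (for example, a play/pause button) or by selecting or highlighting a menu item of a graphical user interface. The electronic device can identify an audio prompt associated with audibilizing the user's interaction with the user interface. The electronic device can determine whether the category, among multiple categories of audio data, that corresponds to the audio prompt is available to the media player. For example, the electronic device can determine whether a pre-recorded celebrity-voice audio file is stored on an internal storage device, whether a speech synthesis module or text-to-speech engine can synthesize numbers, or whether a voice server can stream voice data to the electronic device for the audio user interface.
A portion of the first category of audio data can then be output or otherwise presented at the electronic device. In some embodiments, playback of a media file can be paused or stopped in response to outputting the portion of audio data from a first source. The playback volume of the media file can be reduced or muted in response to outputting the portion of audio data from the first source.
A further understanding of the nature, advantages, and improvements offered by the inventions disclosed in this application can be gained by reference to the remaining portions of this document and the accompanying drawings.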
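Brief Description of the Drawings
To better describe and illustrate embodiments and/or examples of any invention presented in this document, reference is made to one or more accompanying drawings. The additional details or examples used to describe the drawings should not be considered limitations on the scope of any of the disclosed inventions, on any of the presently described embodiments and/or examples, or on the best mode presently understood of any invention presented in this document.
FIG. 1 is a block diagram of a media player that may incorporate embodiments of the present invention;
FIG. 2 is a block diagram of a media player that can provide pre-recorded or synthesized audio prompts, in one embodiment according to the present invention;
FIG. 3 is a block diagram of an audio user interface management system that can provide pre-recorded or synthesized audio prompts, in one embodiment according to the present invention;
FIG. 4 is a block diagram of a streaming audio prompt system, in one embodiment according to the present invention;
FIG. 5 is a schematic illustration of a media player and its associated user input controls, in one embodiment according to the present invention;
FIG. 6 is a schematic illustration of a media player and its associated user input controls, in an alternative embodiment according to the present invention;
FIG. 7 is a simplified flowchart of a method for providing an audio user interface to a user of an electronic device, in one embodiment according to the present invention;
FIG. 8A and FIG. 8B are a flowchart of a method for providing an audio user interface for an electronic device, in one embodiment according to the present invention;
FIG. 9 is a flowchart of a method for streaming audio prompts for an audio user interface, in one embodiment according to the present invention;
FIG. 10 is a flowchart of a method for creating audio prompts at a host computer system using one or more speech or text-to-speech synthesis techniques, in one embodiment according to the present invention;
FIG. 11 is a flowchart of a method for creating audio prompts using one or more speech or text-to-speech synthesis techniques, in an alternative embodiment according to the present invention;
FIG. 12 is a block diagram of an electronic device that may incorporate embodiments of the present invention.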
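Detailed Description
Various embodiments are applicable to electronic devices with audio playback capability, such as media devices (for example, digital media players or portable MP3 players) or other portable multi-function devices (for example, mobile phones or personal digital assistants). For example, portable devices can often store and play digital media assets (media items), such as music (for example, songs), videos (for example, movies), audiobooks, podcasts, meeting recordings, and/or other multimedia recordings. Portable devices (such as portable media players or other portable multi-function devices) can also be small and highly portable. A portable device can also be a handheld device, such as a handheld media player or handheld multi-function device, that is easily held in one hand of the user. A portable device can also be pocket-sized, miniature, or wearable.
In various embodiments, the experience of a user interacting with an electronic device (such as a media player or portable media device) can be enhanced by including an audio user interface that provides an intelligent path for determining whether a suitable audio dialog is available for the audio user interface. For example, depending on whether the electronic device has a broadband connection to a communications network (such as the Internet), a determination can be made to request that audio files of high-quality voice recordings be streamed from a voice server to the electronic device for output by the audio user interface. In another example, a determination can be made to use only audio files of low-quality voice recordings that are available on a media storage device accessible to the electronic device. In yet another example, when pre-recorded voice audio data is unavailable, a determination can be made to use one or more speech synthesis or text-to-speech techniques to create the audio prompts for the audio user interface.
In some embodiments, the user of an electronic device (such as a media player or portable media device) can determine the quality of the audio prompts to be presented (for example, played) for the audio user interface. The user can provide one or more user preferences indicating whether pre-recorded audio data should be used, whether audio prompts synthesized using one or more synthesis techniques should be used, or whether conventional beeps or other non-voice audio data should be used for the audio user interface. Accordingly, an electronic device (such as a media player or portable media device), with or without a display, can be enhanced by an audio user interface to facilitate user interaction based on whether a service is available or on other selection criteria.
First, aspects of some environments in which the various examples and/or embodiments of the inventions in this application operate are described.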
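FIG. 1 is a block diagram of a media player 100 that may incorporate embodiments of the present invention. In general, a media player stores content and/or media assets, such as audio tracks, movies, or photos, that can be played or displayed on the media player. One example of media player 100 is the iPod media player, available from Apple, Inc. of Cupertino, CA. Another example of media player 100 is a personal computer, such as a laptop or desktop.
In this example, media player 100 includes a processor 110, a storage device 120, a user interface 130, and a communications interface 140. Processor 110 can control various functions associated with media player 100. Media player 100 can output audio content, video content, image content, and the like. Media player 100 can also output metadata or other information associated with content, such as track information and album artist.
Typically, a user can load or store content onto media player 100 using storage device 120. Storage device 120 can include read-only memory (ROM), random access memory (RAM), non-volatile memory, flash memory, a floppy disk, a hard disk, and the like. The user can interact with user interface 130 of media player 100 to view or consume content. Some examples of user interface 130 include buttons, click wheels, touch pads, displays, touch screens, and other input/output devices.
Media player 100 can include one or more connectors or ports that can be used to load content, retrieve content, interact with applications running on media player 100, interface with external devices, and so on. In this example, media player 100 includes communications interface 140. Some examples of communications interface 140 include a universal serial bus (USB) interface, an IEEE 1394 (FireWire/iLink) interface, a universal asynchronous receiver/transmitter (UART), wired and wireless network interfaces, transceivers, and the like. Communications interface 140 can be used to connect media player 100 to devices, accessories, and private and public communications networks (such as the Internet).
In one example, media player 100 can be coupled via a wired and/or wireless connector or port to output audio and/or other information to speakers 150. In another example, media player 100 can be coupled via a wired and/or wireless connector or port to output audio and/or other information to headphones 160. In yet another example, media player 100 can be coupled via a wired and/or wireless connector or port to interface with an accessory 170 or a host computer 180. The same connector or port can enable different connections at different times.
Media player 100 can be physically inserted into a docking system 190. Media player 100 can be coupled via a wired and/or wireless connector or port to interface with docking system 190. Docking system 190 can also enable one or more accessory devices 195 to be coupled, by wire or wirelessly, to interface with media player 100. Many different types and functions of accessory devices 170 and 195 can interconnect to or with media player 100. For example, an accessory can allow a remote control to wirelessly control media player 100. As another example, an automobile can include a connector into which media player 100 can be inserted so that the automobile's media system can interact with media player 100, allowing media content stored on media player 100 to be played in the automobile.
In various embodiments, media player 100 can receive content or other media assets from a computer system (for example, host computer 180). The computer system can enable a user to manage media assets stored on the computer system and/or on media player 100. For example, communications interface 140 can allow media player 100 to interface with host computer 180. Host computer 180 can execute a media management application to manage media assets, for example by loading songs, movies, photos, and the like onto media player 100. The media management application can also create playlists, record or rip content, schedule content for playback or recording, and so on. One example of a media management application is iTunes, produced by Apple, Inc. of Cupertino, California.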
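In various embodiments, media player 100 can include an audio user interface. When the user interacts with media player 100 (for example, when the user presses a button, touches a touch screen, or selects an item on a graphical user interface), embodiments of the audio user interface can present or otherwise output, for playback, an audio prompt selected from an audio dialog. The audio prompts can include audio indicators that allow the user to focus visual attention on other tasks (such as driving a car, exercising, or crossing a street) while still enabling the user to interact with user interface 130. As examples, an audio prompt can audibilize a spoken name or description of a hardware button that was pressed, a spoken activation of a virtual button or control, or a spoken version of the user interface (for example, the selected or highlighted menu item, or the selected function, of a displayed menu). The audio prompts can include pre-recorded voice data or can be produced by speech or voice generation techniques.
In one aspect, embodiments of media player 100 can include techniques for providing an audio user interface for an electronic device that effectively improve the availability of sources of audio prompts for the audio user interface. For example, media player 100 can selectively output audio prompts from different audio dialogs depending on whether a source of an audio dialog is available, whether a higher-quality source is available, and so on. In one example, before connecting to the Internet, the user of media player 100 may hear low-quality voice audio prompts or audio prompts synthesized by media player 100; once connected to the Internet, higher-quality, pre-recorded voice audio prompts can be downloaded or streamed to the audio user interface. Thus, in various embodiments, media player 100 can determine whether a source of audio prompts for the audio user interface is available and automatically switch from one source to another to selectively provide the user with the best available audio feedback.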
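FIG. 2 is a block diagram of a media player 200 that can provide pre-recorded or synthesized audio prompts, in one embodiment according to the present invention. In this example, media player 200 can be embodied as media player 100 and can include a portable computing device dedicated to processing content or other media assets (such as audio, video, or images). For example, media player 200 can be a music player (for example, an MP3 player), a game player, a video player, a video recorder, a camera, an image viewer, a mobile phone (for example, a cellular phone), a personal handheld device, and the like. These devices are generally battery-operated and highly portable, so that a user can listen to music, play games or video, record video, or take pictures wherever the user travels.
In one implementation, media player 200 can include a handheld device sized for placement in a pocket or hand of the user. By being handheld, media player 200 can be relatively small and easily handled and used by its user. By being pocket-sized, the user does not have to directly carry media player 200, so the device can be taken almost anywhere the user travels (for example, the user is not limited by carrying a large, bulky, and often heavy device, as with a portable computer). Furthermore, media player 200 can also be operated by the user's hands, with no need for a reference surface such as a desktop. In an alternative embodiment, media player 200 can be a computing device that is not specifically limited to playing media files. For example, media player 200 can also be a mobile phone or a personal digital assistant.
In this example, media player 200 can include a user interface control module 210, an audio prompt database 220, and a text-to-speech engine 230. User interface control module 210 can include hardware and/or software elements for managing a user interface that allows the user to interact with media player 200 (for example, to navigate or to initiate content playback). The user interface can, for example, allow the user of media player 200 to browse, sort, search, and play content or other media assets that reside on or are otherwise accessible to media player 200. The user interface can also allow the user of media player 200 to download (add) or delete (remove) media items from media player 200.
Interaction with the user interface of media player 200 can cause audio prompts for the audio user interface to be played back (for example, through headphones or speakers). Audio prompt database 220 can include hardware and/or software elements for storing audio files and audio data for audio prompts. In some embodiments, the audio files can include audio prompts that are pre-recorded and stored on media player 200. In other embodiments, the audio files can include audio files streamed from one or more computers and cached in audio prompt database 220 for later use. In various embodiments, the audio files can include audio prompts generated by media player 200 or by another device using one or more speech synthesis techniques. Audio prompt database 220 can also include other content or media assets.
Text-to-speech (TTS) engine 230 can include hardware and/or software elements for converting data (such as text) into playable audio data or audio files, to produce user interface audio prompts that audibilize the data (such as text strings), for example by verbalizing it in a human-like voice or in spoken form. Such a TTS engine can use a variety of techniques to create the audio data or audio files. For example, some algorithms break a word into fragments or syllables and then assign a particular sound to each fragment or syllable. The word can then be verbalized by combining the individual sounds. Where the media content involves music, the text strings can correspond, for example, to song titles, album names, artist names, contact names, addresses, phone numbers, and playlist names.
In one example of operation, media player 200 can selectively provide audio prompts for the audio user interface depending on whether the audio prompts are available from audio prompt database 220 and TTS engine 230. For example, when a pre-recorded audio prompt is available or otherwise stored in audio prompt database 220, media player 200 can selectively output the audio prompt from audio prompt database 220. Media player 200 can also select among audio prompts of various qualities, for example presenting an audio prompt of higher quality or bit rate rather than one of lower quality or bit rate. In another example, because pre-recorded audio prompts are absent from audio prompt database 220, or in response to a user preference for a particular simulated voice profile, media player 200 can instead present audio prompts or voice prompts synthesized by TTS engine 230. In various embodiments, media player 200 can dynamically output audio prompts from audio prompt database 220, from TTS engine 230, or from both.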
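The choice between audio prompt database 220 and TTS engine 230 described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the function name `select_prompt`, the dictionary standing in for the database, and the `prefer_tts` flag (modeling a user preference for a simulated voice profile) are all hypothetical.

```python
from typing import Callable, Dict, Optional


def select_prompt(prompt_id: str,
                  prompt_db: Dict[str, bytes],
                  tts: Optional[Callable[[str], bytes]] = None,
                  prefer_tts: bool = False) -> Optional[bytes]:
    """Return audio for prompt_id from the database or from a TTS callable.

    prompt_db stands in for audio prompt database 220 (hypothetical layout:
    prompt id -> audio bytes); tts stands in for TTS engine 230.
    """
    # A user preference for a synthesized voice overrides stored recordings.
    if prefer_tts and tts is not None:
        return tts(prompt_id)
    # Otherwise prefer a pre-recorded prompt when one is stored.
    if prompt_id in prompt_db:
        return prompt_db[prompt_id]
    # Fall back to synthesis when no recording exists.
    if tts is not None:
        return tts(prompt_id)
    # No source is available; the caller may play a generic beep instead.
    return None
```

Returning `None` rather than raising leaves the caller free to fall back to a non-voice sound, which matches the beep fallback mentioned elsewhere in the description.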
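In other embodiments, an electronic device (such as a media player or portable media device) can include an audio user interface provided by an audio user interface management system. The audio user interface management system can include a media playback device together with one or more of a host computer or a server computer system, in order to provide the audio user interface on the media playback device. For example, the host computer system can include a personal computer, and the media playback device can include an MP3 player. In some embodiments, the media playback device can permit multi-modal interaction with the user interface. For example, the user can interact with the user interface through both audio prompts and visual prompts.
FIG. 3 is a block diagram of an audio user interface management system 300 that can provide pre-recorded or synthesized audio prompts, in one embodiment according to the present invention. In this example, management system 300 can include a media player 310 and a personal computer (host computer) 340. Media player 310 can be embodied as media player 100 described above and can be linked or coupled to personal computer 340.
Media player 310 can be embodied as media player 100 of FIG. 1 and can include, for example, a battery-operated portable device. In one embodiment, media player 310 includes an MP3 player. Typically, media player 310 can store content or other media assets on one of multiple data storage devices (for example, a disk drive). Media player 310 can store the content or other media assets in media files.
Media player 310 can include a user interface control module 320 and an audio prompt database 330. User interface control module 320 can include hardware and/or software elements for managing a user interface that allows the user to interact with media player 310 (for example, to navigate or to initiate content playback). Interaction with the user interface of media player 310 can cause audio prompts for the audio user interface to be played back (for example, through headphones or speakers). Audio prompt database 330 can include hardware and/or software elements for storing audio files and audio data for audio prompts.
Personal computer 340 can include a media manager 350, an audio prompt database 360, and a text-to-speech (TTS) engine 370. Personal computer 340 can serve as a host computer system for media player 310. Personal computer 340 can also be any type of computer that acts as a server to media player 310 (as a client).
Media manager 350 can include hardware and/or software elements that enable the user of personal computer 340 to directly manage content or other media assets stored on personal computer 340. Media manager 350 can also be configured to manage, directly or indirectly, content or other media assets stored on media player 310. In one example, media player 310 and personal computer 340 can be coupled by a peripheral cable. Typically, the peripheral cable couples together data ports provided on media player 310 and on personal computer 340. In some embodiments, the data ports can be FIREWIRE ports and the peripheral cable can be a FIREWIRE cable. In another example, the data ports can be universal serial bus (USB) ports and the peripheral cable can be a USB cable. More generally, the peripheral cable serves as a data link. Media items can be transferred over the peripheral cable from media player 310 to personal computer 340, and vice versa.
In various embodiments, media manager 350 can also include a user interface that allows the user to browse, sort, search, play, make playlists from, and burn compact discs (CDs) of the content or other media assets residing on personal computer 340. The user interface can also allow the user of personal computer 340 to download (add) or delete (remove) media items from personal computer 340. In one embodiment, media manager 350 and its associated user interface are provided by iTunes™ from Apple, Inc. of Cupertino, California.
Audio prompt database 360 of personal computer 340 can include hardware and/or software elements for storing audio files or audio data for the audio prompts of the audio user interface associated with media player 310 or personal computer 340. Audio prompt database 360 can include audio prompts for audio dialogs that were downloaded from the Internet, ripped from a CD, recorded by the user, or generated by TTS engine 370. TTS engine 370 can include hardware and/or software elements for converting information or data into audio files or voice data that audibilize the information and can be played as audio prompts.
In one example, a synchronization operation can occur between personal computer 340 and media player 310 to upload audio prompts to audio prompt database 330 of media player 310, or to update the audio prompts stored in audio prompt database 330 with audio prompts stored in audio prompt database 360 or generated by TTS engine 370. In one example, when a comparison of the contents of the respective databases shows that a particular audio prompt resides on personal computer 340 but not on media player 310, that audio prompt can be transferred (downloaded) to media player 310, for example over a wireless link or through the peripheral cable. The synchronization operation between personal computer 340 and media player 310 can thus ensure that media player 310 contains the audio data or audio files suitable for presenting a usable audio user interface.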
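The comparison-and-transfer step of the synchronization operation can be illustrated with a small sketch. It assumes, purely for illustration, that each audio prompt database is a mapping from a prompt identifier to audio bytes; `sync_prompts` is a hypothetical name, and the sketch covers only the "missing on the player" case, not updates of prompts that already exist.

```python
from typing import Dict, List


def sync_prompts(host_db: Dict[str, bytes],
                 player_db: Dict[str, bytes]) -> List[str]:
    """Copy prompts that the host has and the player lacks.

    Returns the identifiers that were transferred, in sorted order.
    Prompts already present on the player are left untouched.
    """
    # Compare the two databases to find prompts missing on the player.
    missing = sorted(pid for pid in host_db if pid not in player_db)
    # "Transfer" each missing prompt (here simply a dictionary copy).
    for pid in missing:
        player_db[pid] = host_db[pid]
    return missing
```

In a real system the copy would go over the peripheral cable or a wireless link; the dictionary assignment here only marks where that transfer would happen.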
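The audio file data to be downloaded to media player 310 can depend on user settings for the audio user interface. For example, the user may wish to download audio files or other audio data stored in audio prompt database 360 to be associated with all or some of the options or features of the audio user interface on media player 310.
FIG. 4 is a block diagram of a streaming audio prompt system 400, in one embodiment according to the present invention. In this example, a media player 410 is linked to a communications network 420. Media player 410 can be embodied as media player 200 of FIG. 2 or as media player 310 of FIG. 3. A voice server 430 is also linked to communications network 420 and can communicate with media player 410.
In various embodiments, media player 410 can determine whether a connection to voice server 430 over communications network 420 exists. In one example of operation, media player 410 can choose to receive audio prompts from voice server 430 for presentation by the audio user interface of media player 410. Media player 410 can generate one or more requests for audio prompts, and voice server 430, on receiving a request, can stream the corresponding audio prompt to media player 410 for output to the user.
Voice server 430 can include an audio prompt database 440 and a TTS engine 450. Audio prompt database 440 of voice server 430 can include hardware and/or software elements for storing audio data or audio files for the audio prompts of the audio user interface associated with media player 410. Audio prompt database 440 can include audio prompts for audio dialogs that were pre-recorded by one or more content producers, provided by content distributors, or generated by TTS engine 450. TTS engine 450 can include hardware and/or software elements for converting information or data into audio files or voice data that audibilize the information and can be played as audio prompts.
Media player 410 can therefore select among sources of audio prompts for the audio user interface to provide audio voice feedback to the user. Media player 410 can receive audio prompts (for example, pre-recorded or synthesized) from voice server 430 until the connection is lost. At that point, media player 410 can automatically select audio prompts from other sources (for example, an internal audio prompt database or a speech synthesis module).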
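The switch from voice server 430 to a local source when the connection is lost might look like the following sketch. All names (`prompt_for`, `is_connected`, `fetch_from_server`, `local_source`) are hypothetical, and the use of Python's built-in `ConnectionError` to signal a dropped link is an assumption made for the example.

```python
from typing import Callable, Tuple


def prompt_for(interaction: str,
               is_connected: Callable[[], bool],
               fetch_from_server: Callable[[str], bytes],
               local_source: Callable[[str], bytes]) -> Tuple[str, bytes]:
    """Return (source_name, audio) for an interaction.

    The server is used while a connection exists; otherwise, or if the
    connection drops during the request, the local source is used.
    """
    if is_connected():
        try:
            return "server", fetch_from_server(interaction)
        except ConnectionError:
            # The link was lost mid-request: fall through to the local source.
            pass
    return "local", local_source(interaction)
```

The local source could be an internal prompt database or a speech synthesis module; the sketch does not distinguish between them.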
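FIG. 5 is a schematic illustration of a media player 500 and its associated user input controls, in one embodiment according to the present invention. Media player 500 can include any computing device for playing media files (such as song files). Media player 500 can contain a memory that stores a media database and a playback module for presenting or playing the content or other media assets stored in the media database. A set of nested menus 505 can present at least part of a user interface that allows the user to navigate to, select, and thereby listen to a desired song file. A given media file can be reached by different paths through the set of nested menus 505. The user interface can also allow the user to navigate to and select desired functions provided by media player 500.
FIG. 5 also illustrates user interface controls 510 of media player 500. According to one embodiment, user interface controls 510 include a "menu" button 515, a "next" button 520, a "play/pause" button 525, and a "previous" button 530. User interface controls 510 can include a scroll wheel implemented as a rotatable wheel device or as a touch pad device that interprets rotational user gestures. The user can press, rub, or otherwise interact with user interface controls 510 to navigate nested menus 505.
FIG. 6 is a schematic illustration of a media player 600 and its associated user input controls, in an alternative embodiment according to the present invention. Media player 600 can include a "previous" button 610, a "play/pause" button 620, and a "next" button 630. LEDs 640 and 650 can be used to convey information to the user, such as power state or media playback state. In this example, media player 600 may not include a display configured with a graphical user interface (such as nested menus 505 of FIG. 5). A user interface that audibly conveys information about the operation of media player 600 can therefore greatly enhance the user experience.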
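FIG. 7 is a simplified flowchart of a method for providing an audio user interface to a user of an electronic device, in one embodiment according to the present invention. The processing of method 700 shown in FIG. 7 can be performed by software (for example, instructions or code modules) when executed by a central processing unit (CPU or processor) of a logic machine (such as a computer system or information processing device), by hardware components or application-specific integrated circuits of an electronic device, or by a combination of software and hardware elements. FIG. 7 begins at step 710.
At step 720, information is received that represents a user's interaction with a user interface. The information can include a signal, a message, an interrupt, an input, and the like. The information can indicate that the user pressed or depressed a button, clicked a click wheel, touched a touch screen, made a gesture, highlighted or selected an element on a graphical user interface, and so on. The information can represent a single action of the user or a combination of multiple actions.
At step 730, an audio prompt corresponding to the user's interaction is identified. The audio prompt can include information identifying audio data that, by sounding, audibilizing, or otherwise, provides feedback to the user about the registered interaction. At step 740, a type or category of audio data is determined for the audio prompt. In various embodiments, an audio prompt can be represented by different types or categories of audio data. The types or categories of audio data can include, for example, audio data of different audible quality, voice versus non-voice, bit rate, compression, encoding, source, delivery mechanism, and the like. For example, synthesized audio data generated by a speech synthesis module can be used to provide audio prompts for numbers, dates, and the like. In another example, compressed pre-recorded audio data can be used to provide audio prompts for button interactions (for example, play, pause, next, back, fast forward, and rewind). In yet another example, CD-quality pre-recorded audio data can be used to provide a complete set of audio prompts for numbers, dates, button presses, menu selections, and any other user interaction that a given audio user interface may include.
At step 750, it is determined whether the type or category of audio data determined for the audio prompt is available. For example, a selection can be made to use a pre-recorded audio dialog (for example, a set of pre-recorded audio files) for the audio prompts of the audio user interface. The electronic device can check its internal storage to determine whether an audio file for the audio prompt exists. Alternatively, the electronic device can request the audio file for the audio prompt from a host computer or a streaming voice server. In another example, if pre-recorded audio prompts are not stored locally at the electronic device, a selection can be made to use pre-recorded audio data for some audio prompts and synthesized audio data for others.
At step 760, a portion of the audio data of the determined type or category is output from the available source. Various embodiments can therefore provide dynamic selection of different types or categories of audio data for the audio prompts of an audio user interface. In addition, as part of the audio user interface, some embodiments can provide a mechanism for placing the selected or identified type or category of audio data onto the electronic device for use. FIG. 7 ends at step 770.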
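FIG. 8A and FIG. 8B are a flowchart of a method 800 for providing an audio user interface for an electronic device, in one embodiment according to the present invention. Method 800 generally includes an intelligent decision path that determines whether a suitable audio dialog is available for the audio user interface and obtains the best available audio dialog for output to the user. FIG. 8A begins at step 805.
At step 810, input representing a button press is received. For example, the user can interface with user interface controls 510 of media player 500 of FIG. 5. Media player 500 can generate one or more analog or digital signals that represent a button press, touch, pressure, gesture, motion, and the like.
At step 815, it is determined whether to present an audio prompt for the button press. In some embodiments, a control selection is accompanied by an indication to output an audio prompt that confirms the selection to the user. For example, the user can be enabled to hear "play" as feedback that play/pause button 525 was actually pressed. These embodiments can involve repeated user actions to select a user interface control. For example, the user may "click" a user interface control more than once to make a selection. The first "click" can cause media player 500 to audibilize the selected user interface control. For example, when the user presses the play button, "play" can be audibilized. This first audio prompt can provide audio guidance about which button was pressed, which helps the user when visual attention is not directed at the handheld device.
A subsequent "click" can then cause media player 500 to perform the action corresponding to the user interface control. For example, a second press of the play button can cause a media file to be played. On the other hand, the audio prompt may have informed the user that an unintended selection is about to be made. The user can then try to select a different user interface control. For example, the user can then try pressing "next" button 520 instead of pressing play button 525 a second time.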
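The announce-then-confirm interaction described above (a first press speaks the control's name, a second press of the same control performs it) can be modeled as a tiny state machine. The class name `TwoStepControls` and its callbacks are hypothetical; the sketch assumes that pressing a different control simply announces that control instead.

```python
from typing import Callable, Optional


class TwoStepControls:
    """First press announces a control; a repeat press executes it."""

    def __init__(self,
                 speak: Callable[[str], None],
                 execute: Callable[[str], None]) -> None:
        self._speak = speak
        self._execute = execute
        # Name of the control that was most recently announced, if any.
        self._announced: Optional[str] = None

    def press(self, control: str) -> None:
        if self._announced == control:
            # Second press of the same control: perform its action.
            self._announced = None
            self._execute(control)
        else:
            # First press, or a switch to another control: announce only.
            self._announced = control
            self._speak(control)
```

With this model, pressing "play" and then "next" produces two announcements and no action, which lets the user back out of an unintended selection.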
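If it is determined at step 815 to present an audio prompt for the button press, processing proceeds along an intelligent decision path that determines whether a suitable dialog is available for the audio prompt and how to place that suitable audio dialog onto the electronic device. The intelligent decision path can include, for example, discovering or identifying types or categories of audio data and whether that audio data is available.
At step 820, a determination is made whether a high-quality source is available. Relative to a low-quality source, a high-quality source can include digital audio files or audio data that are sampled above a predetermined or accepted frequency, that are at a given bit rate, whose size exceeds a predetermined threshold or limit, and so on. The determination can be made based on whether there is a wireless or wired connection to a communications network through which the high-quality source is accessible. In one implementation, the determination can be made based on selection criteria or user preferences. For example, in one mode of operation, the user may wish to hear an audio prompt for every action and menu item the user selects. In another mode, the user may deactivate audio prompts for control selections (such as the "play" button) and hear audio prompts only for highlighted menu items. In another mode, audio prompts may be output only for top-level menu items.
If a high-quality source is determined to be available, then at step 825 the audio prompt corresponding to the button press is retrieved from the high-quality source. One example of a high-quality source is lossless or CD-quality pre-recorded audio data or audio files. The pre-recorded audio data or audio files can include recordings of professionally produced celebrity voices, cartoon characters, or excerpts from television programs or feature films.
Alternatively, if a high-quality source is determined to be unavailable, then at step 830 it is determined whether a low-quality source is available. If a low-quality source is determined to be available, then at step 835 the audio prompt corresponding to the button press is retrieved from the low-quality source. One example of a low-quality source is pre-recorded audio data or audio files compressed using one or more compression or encoding techniques (for example, MP3, WMA, or OGG). These pre-recorded audio data or audio files can include ordinary recordings of a human voice, or stored audio files or audio data generated using one or more speech or text synthesis techniques.
Referring now to FIG. 8B, if a low-quality source is determined to be unavailable, then at step 840 it is determined whether text-to-speech (TTS) or speech synthesis is available. If one or more synthesis sources are determined to be available, then at step 845 the audio prompt is synthesized or generated using speech synthesis or TTS synthesis.
If no source of audio prompts can be determined or selected for the audio user interface, then at step 850 one or more beeps or other generic sounds can be output in correspondence with the button press. Preferably, at step 855, the audio prompt corresponding to the button press is output, the audio prompt having been selectively obtained from the high-quality source at step 825, obtained from the low-quality source at step 835, or synthesized at step 845. In some embodiments, audio prompts can be played according to a selected audio interface mode. When the media player or portable media device is not playing an audio file, only the audio files corresponding to the user interface are played and audible to the user.
In various embodiments, when a media file is being played back, the audio interface mode can be set to mix the media file with audio prompt playback in different ways. Under one setting, the volume used to play back the media file can be dynamically reduced when an audio prompt is to be played. For example, the playback volume of a song or movie clip can be lowered while the audio prompt plays. Under another setting, playback of the media file is paused while the audio prompt plays and then resumes after the audio prompt has played. If the user makes multiple user control selections within a certain period, playback of the media file can be paused for a short time so that playback does not have to be paused and resumed repeatedly. This can avoid repeatedly interrupting song playback. For example, if the user makes at least three user control selections within 5 seconds, playback of the media file can be paused for five seconds. The length of time and the number of user control selections can be changed according to the user's preferences. Some audio interface modes can specify that audio prompts be played through the left, right, or both speaker or headphone channels.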
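The "at least three selections within 5 seconds pauses playback for five seconds" example can be expressed as a sliding-window counter. This is a sketch under assumptions: `PausePolicy` and its parameter names are hypothetical, timestamps are plain seconds supplied by the caller, and a return value of 0.0 is taken to mean that no extended pause is needed.

```python
from collections import deque
from typing import Deque


class PausePolicy:
    """Decide how long to pause media playback after a control selection."""

    def __init__(self, min_selections: int = 3,
                 window_s: float = 5.0, pause_s: float = 5.0) -> None:
        # All three values are user-adjustable preferences in this sketch.
        self.min_selections = min_selections
        self.window_s = window_s
        self.pause_s = pause_s
        self._times: Deque[float] = deque()

    def on_selection(self, now: float) -> float:
        """Record a selection at time `now`; return the extended pause length."""
        self._times.append(now)
        # Drop selections that fall outside the sliding window.
        while self._times and now - self._times[0] > self.window_s:
            self._times.popleft()
        if len(self._times) >= self.min_selections:
            return self.pause_s
        return 0.0
```

Keeping the thresholds as constructor arguments reflects the statement that the length of time and the number of selections can follow the user's preferences.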
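Accordingly, a determination is made whether a suitable audio dialog is available (for example, on the electronic device or on a host computer or server computer connected to the device) and whether the best available audio dialog can be obtained for output to the user. FIG. 8B ends at step 860.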
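The decision path of FIG. 8A and FIG. 8B (high-quality source, then low-quality source, then synthesis, then a generic beep) amounts to an ordered fallback chain. The sketch below is one possible reading, not the patent's implementation: `resolve_prompt`, the `(name, fetch)` pairs, and the single-byte beep placeholder are all assumptions made for illustration.

```python
from typing import Callable, Iterable, Optional, Tuple

# A source is a name plus a callable that returns audio bytes,
# or None when that source cannot provide the requested prompt.
Source = Tuple[str, Callable[[str], Optional[bytes]]]

BEEP = b"\x07"  # placeholder for a generic non-voice sound


def resolve_prompt(button: str, sources: Iterable[Source]) -> Tuple[str, bytes]:
    """Try each source in priority order and return (source_name, audio).

    Callers list sources best-first, for example high-quality,
    low-quality, then TTS. If every source returns None, a beep is used.
    """
    for name, fetch in sources:
        audio = fetch(button)
        if audio is not None:
            return name, audio
    return "beep", BEEP
```

Because the chain is just an ordered list, a user preference (for example, skipping pre-recorded audio entirely) can be modeled by reordering or filtering the list before the call.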
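FIG. 9 is a flowchart of a method 900 for streaming audio prompts for an audio user interface, in one embodiment according to the present invention. Method 900 generally includes streaming audio prompts to a media playback device based on a connection to a voice server. FIG. 9 begins at step 910.
At step 920, the media playback device (for example, media player 100) detects a broadband connection. For example, the media playback device may successfully associate with a wireless access point. In another example, the media playback device may recognize a wired connection to the Internet.
At step 930, the media playback device determines to use the voice server to obtain a voice dialog for the audio user interface. For example, a software program executed by the media playback device can initiate and complete a handshake with one or more applications hosted by the voice server. In another example, the media playback device can periodically poll the voice server to determine the availability of a connection.
At step 940, the media playback device generates a request for an audio prompt. The request can include information identifying the audio prompt, information identifying the user interaction that corresponds to the requested audio prompt, and so on. The request can include one or more of the following: headers, flags, fields, checksums, hashes, and the like. In one embodiment, the request can include Hypertext Transfer Protocol (HTTP) data or Real-time Transport Protocol (RTP) data.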
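As an illustration of a request that identifies both the audio prompt and the triggering interaction, the sketch below builds an HTTP GET URL with the standard library. The host `voice.example.com`, the path, and the query parameter names `prompt` and `interaction` are invented for the example; the source does not specify a request format.

```python
from urllib.parse import urlencode


def build_prompt_request(base_url: str, prompt_id: str, interaction: str) -> str:
    """Build a GET URL asking a voice server for one audio prompt.

    The parameter names are hypothetical; urlencode handles escaping
    (for example, spaces become '+').
    """
    query = urlencode({"prompt": prompt_id, "interaction": interaction})
    return f"{base_url}?{query}"
```

For instance, `build_prompt_request("http://voice.example.com/prompts", "play", "button press")` yields `http://voice.example.com/prompts?prompt=play&interaction=button+press`.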
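At step 950, the voice server streams the audio prompt to the media playback device. At step 960, the media playback device outputs the streamed audio prompt. The voice server can use one or more streaming protocols (for example, real-time or faster than real-time) so that the media playback device buffers a portion of the audio prompt before playback.
In various embodiments, the voice server can be made accessible on a per-item or per-subscription payment basis. The voice server can support streaming of uncompressed and compressed (for example, lossless or lossy) audio data. The voice server can also support delivery of information associated with the content or other media assets with which the user can interact (for example, navigate), such as title information, album information, artist information, genre information, metadata, and the like. FIG. 9 ends at step 970.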
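FIG. 10 is a flowchart of a method 1000 for creating audio prompts at a host computer system using one or more speech or text-to-speech synthesis techniques, in one embodiment according to the present invention. Method 1000 generally includes synthesizing audio prompts for an audio user interface and transferring the synthesized audio prompts to a media playback device. FIG. 10 begins at step 1010.
At step 1020, the media playback device (for example, media player 100 of FIG. 1) detects a connection to a host computer. For example, the media playback device can detect whether it is coupled to the host computer by a peripheral cable. In another example, the media playback device can detect the proximity of the host computer and establish a wireless connection, for example using a WiFi or Bluetooth module.
At step 1030, the media playback device determines to use the host computer to obtain a voice dialog for the audio user interface. For example, the media playback device can determine to use the host computer when its internal storage does not have enough space to store audio prompts in addition to content or other media assets. In another example, the media playback device can determine to use the host computer when the media playback device does not include a TTS engine.
At step 1040, the host computer synthesizes an audio prompt. The host computer can use one or more speech synthesis or text-to-speech synthesis techniques to generate the audio prompt. For example, the host computer can determine a profile associated with the media playback device. The profile can include textual descriptions, specific to a given electronic device, of events registered through button presses, menu selections, or other user interactions. The host computer can audibilize the textual descriptions in the profile by generating and recording a synthesized-voice reading. The host computer can generate one audio prompt for each textual description. The host computer can also generate a single audio prompt that contains the audio data for every textual description, together with information that indicates, within that single audio prompt, the audio data for a given textual description.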
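The last option, a single audio prompt holding the audio for every textual description plus information locating each one, could be realized with an offset index. The source does not say how that locating information is encoded; the `(offset, length)` pairs below, and the names `pack_prompts` and `unpack_prompt`, are assumptions for the sake of a concrete sketch.

```python
from typing import Dict, Tuple

Index = Dict[str, Tuple[int, int]]  # description -> (offset, length)


def pack_prompts(clips: Dict[str, bytes]) -> Tuple[bytes, Index]:
    """Concatenate per-description clips into one blob with an index."""
    blob = bytearray()
    index: Index = {}
    for description, audio in clips.items():
        # Record where this clip starts and how long it is.
        index[description] = (len(blob), len(audio))
        blob.extend(audio)
    return bytes(blob), index


def unpack_prompt(blob: bytes, index: Index, description: str) -> bytes:
    """Extract the clip for one textual description from the blob."""
    offset, length = index[description]
    return blob[offset:offset + length]
```

A single blob means one transfer to the media playback device instead of many small ones, at the cost of shipping the index alongside it.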
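At step 1050, the host computer transfers the audio prompt to the media playback device. In one implementation, the host computer generates multiple audio prompts of an audio dialog for the audio user interface. The host computer then transfers the entire audio dialog to the media playback device, for example while managing the content or other media assets on the device. In another example, the host computer can generate audio prompts and transfer them to the media playback device substantially in real time. At step 1060, the media playback device outputs the audio prompt. FIG. 10 ends at step 1060.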
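FIG. 11 is a flowchart of a method 1100 for creating audio prompts using one or more speech or text-to-speech synthesis techniques, in an alternative embodiment according to the present invention. Method 1100 generally includes creating or synthesizing audio data that represents a textual description of an event. FIG. 11 begins at step 1110.
At step 1120, an event is identified. An event can include any user interface interaction that can be performed with the electronic device. An event can be represented by a user's button press, click, scroll, touch, selection, highlight, and so on. At step 1130, a textual description of the identified event is determined. The textual description can include words, sentences, and the like that describe the event, the device, the user, a portion of content, and so on. The textual description can be generated by a user, a developer, or another third party.
At step 1140, voice audio is synthesized or otherwise generated from the textual description of the event. In one example, a computer system can retrieve configuration settings for the text-to-speech conversion process. The configuration settings can control various aspects of the speech synthesis or text-to-speech conversion process. For example, the configuration settings can determine which text strings are to be converted into audio files, the quality of the TTS conversion, the gender of the voice that verbalizes the text strings, the speed at which the audio prompts are audibilized (for example, the speaking rate can be increased as the user becomes more familiar with the audio prompts), and customized voices for different subtasks (for example, controls and functions can be audibilized in one voice, while data such as songs and contact names can be audibilized in another). In addition, the configuration settings can accommodate skilled manipulation of the user interface controls by playing only a portion of an audio prompt while the user navigates. For example, when contact names are browsed in dictionary order, only the letters (a, b, c, ...) are rendered until the user reaches the contact names that begin with the desired letter, for example j in the case of Jones. It should therefore be understood that the TTS configuration settings can include a variety of settings corresponding to the device, the configuration, or the user's wishes.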
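The letters-only browsing behavior can be sketched as follows: while scrolling an alphabetically sorted contact list, a letter is spoken only when the initial changes. The function name `letter_prompts` is hypothetical, and the sketch assumes the list is already sorted and that case is ignored.

```python
from typing import Iterable, List


def letter_prompts(sorted_names: Iterable[str]) -> List[str]:
    """Return the letters to audibilize while scrolling a sorted list.

    A letter is emitted once, at the first name that starts with it.
    """
    spoken: List[str] = []
    last = None
    for name in sorted_names:
        initial = name[:1].lower()
        # Speak only when the initial differs from the previous one.
        if initial and initial != last:
            spoken.append(initial)
            last = initial
    return spoken
```

For the list Adams, Allen, Baker, Jones, Joyce, this yields a, b, j, so a user looking for Jones hears three short prompts instead of five full names.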
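Various speech synthesizer rules and engines can be used to generate the audio files. One general example of a process for converting a word into an audio file works as follows. The process for converting the word "browse" begins by breaking the word into fragments or syllables that represent diphone units, for example "b", "r", "ow", and "s". Various techniques then generate an audio prompt for each component, and these audio prompts can then be combined to form an intelligible word or phrase. The audio file is typically given an extension that corresponds to the type of audio file created. For example, the audio file for "browse" can be identified by the file name browse.aiff, where the .aiff extension identifies an audio file.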
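The "browse" example can be mirrored in a toy concatenative sketch: each unit maps to a short piece of audio, the pieces are joined in order, and the result is named after the word. The unit table here is a stand-in with made-up bytes, not real speech data, and `synthesize` and `prompt_filename` are hypothetical helper names.

```python
from typing import Dict, Sequence


def synthesize(units: Sequence[str], unit_audio: Dict[str, bytes]) -> bytes:
    """Concatenate the audio for each unit, in order.

    Raises KeyError if a unit has no audio in the table.
    """
    return b"".join(unit_audio[unit] for unit in units)


def prompt_filename(word: str, extension: str = ".aiff") -> str:
    """Name the audio file after the word, with a type-specific extension."""
    return word + extension
```

A real synthesizer would also smooth the joins between units and apply prosody; the sketch covers only the split, generate, and combine outline given above.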
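At step 1150, the voice audio prompt is output. The voice audio prompt can be output in response to a user's interaction with a media playback device that has an audio user interface. In one embodiment, the audio user interface can include a pointer to the corresponding audio prompt or audio file. For example, a lookup table can be used to keep track of the pointers to the audio prompts. FIG. 11 ends at step 1160.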
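FIG. 12 is a simplified block diagram of a computer system 1200 that may incorporate embodiments of the present invention. FIG. 12 merely illustrates an embodiment incorporating the present invention and should not limit the scope of the invention as recited in the claims. One of ordinary skill in the art would recognize various changes, modifications, and alternatives.
In one embodiment, computer system 1200 includes processor(s) 1210, random access memory (RAM) 1220, disk drive 1230, input device(s) 1240, output device(s) 1250, display 1260, communications interface(s) 1270, and a system bus 1280 interconnecting the above components. Other components (such as file systems, storage disks, read-only memory (ROM), cache memory, and codecs) can also be present.
RAM 1220 and disk drive 1230 are examples of tangible media configured to store data (such as audio, image, and movie files), operating system code, and embodiments of the present invention, including executable computer code, human-readable code, and the like. Other types of tangible media include floppy disks, removable hard disks, optical storage media (such as CD-ROMs, DVDs, and bar codes), semiconductor memories (such as flash memory), read-only memory (ROM), battery-backed volatile memory, networked storage devices, and the like.
In various embodiments, input device 1240 is typically embodied as a computer mouse, trackball, track pad, joystick, wireless remote control, drawing tablet, voice command system, eye tracking system, multi-touch interface, scroll wheel, click wheel, touch screen, FM/TV tuner, audio/video input, and the like. Input device 1240 can allow the user to select objects, icons, text, and the like via a command such as a click of a button. In various embodiments, output device 1250 is typically embodied as a display, printer, force-feedback mechanism, audio output, video component output, and the like. Display 1260 can include a CRT display, an LCD display, a plasma display, and the like.
Embodiments of communications interface 1270 can include computer interfaces such as an Ethernet card, a modem (telephone, satellite, cable, ISDN), an (asynchronous) digital subscriber line (DSL) unit, a FireWire interface, a USB interface, and the like. For example, these computer interfaces can be coupled to a computer network 1290, to a FireWire bus, and so on. In other embodiments, these computer interfaces can be physically integrated on the system board or motherboard of computer system 1200, can be a software program, and so on.
In various embodiments, computer system 1200 can also include software that enables communication over a network, such as the HTTP, TCP/IP, and RTP/RTSP protocols. In alternative embodiments of the present invention, other communications software and transfer protocols, such as IPX and UDP, can also be used.
In various embodiments, computer system 1200 can also include an operating system, such as Microsoft Windows, Linux, Mac OS X, a real-time operating system (RTOS), or an open-source or proprietary OS.
FIG. 12 is representative of a media player and/or computer system capable of embodying the present invention. It will be readily apparent to one of ordinary skill in the art that many other hardware and software configurations are suitable for use with the present invention. For example, the media player can be a desktop, portable, rack-mounted, or tablet configuration. The media player can also be a series of networked computers. Moreover, the media player can be a mobile device, an embedded device, a personal digital assistant, a smartphone, and the like. In other embodiments, the techniques described above can be implemented on a chip or on an auxiliary processing board.
The present invention can be implemented in the form of control logic in hardware, in software, or in a combination of both. The control logic can be stored in an information storage medium as a plurality of instructions adapted to direct an information processing device to perform a set of steps disclosed in embodiments of the present invention. Based on the disclosure and teachings provided in this application, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the present invention.
The embodiments discussed in this application are illustrative of one or more examples of the present invention. As these embodiments are described with reference to illustrations, various modifications or adaptations of the methods and/or specific structures described may become apparent to those skilled in the art. All such modifications, adaptations, or variations that rely on the teachings of the present invention, and through which these teachings have advanced the art, are considered to be within the scope of the present invention. Hence, the descriptions and drawings should not be considered in a limiting sense, as it is understood that the present invention is in no way limited to only the embodiments illustrated.
The above description is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of the disclosure. The scope of the invention should therefore be determined not with reference to the above description, but with reference to the claims along with their full scope or equivalents.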
Claims (11)
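1. A method of providing audio prompts to a user through a media player, the method comprising:
receiving input representing an interaction of the user with a user interface associated with the media player;
determining whether an audio prompt that audibilizes the interaction is to be output; and
if an audio prompt is to be output:
determining whether a connection from the media player to a voice server over a communications network exists;
if the connection exists, receiving from the voice server a prepared audio prompt associated with the interaction;
if the connection does not exist, generating a new audio prompt; and
outputting at least a portion of the prepared audio prompt or of the new audio prompt.
2. The method of claim 1, wherein generating the new audio prompt comprises synthesizing the new audio prompt by the media player using a text-to-speech technique.
3. The method of claim 1, wherein the prepared audio prompt received from the voice server comprises a high-quality voice recording.
4. The method of claim 1, wherein the quality of the prepared audio prompt is higher than the quality of the new audio prompt.
5. The method of claim 1, wherein receiving the prepared audio prompt comprises:
receiving a streaming input from the voice server, the streaming input containing the prepared audio prompt.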
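6. A portable media playback device, comprising:
a media playback unit;
a user interface; and
a processor configured to:
receive input representing an interaction of a user with the user interface;
determine whether an audio prompt that audibilizes the interaction is to be output; and
if an audio prompt is to be output, the processor being further configured to:
determine whether a connection from the portable media playback device to a voice server over a communications network exists;
if the connection exists, receive from the voice server a prepared audio prompt associated with the interaction;
if the connection does not exist, generate a new audio prompt; and
output at least a portion of the prepared audio prompt or of the new audio prompt.
7. The portable media playback device of claim 6, wherein the prepared audio prompt received from the voice server comprises a high-quality voice recording.
8. The portable media playback device of claim 6, wherein the quality of the prepared audio prompt is higher than the quality of the new audio prompt.
9. The portable media playback device of claim 6, wherein receiving the prepared audio prompt comprises:
receiving a streaming input from the voice server, the streaming input containing the prepared audio prompt.
10. The portable media playback device of claim 6, wherein the new audio prompt is generated by the processor using a text-to-speech synthesis technique.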
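11. An apparatus for providing audio prompts to a user through a media player, the apparatus comprising:
means for receiving input representing an interaction of the user with a user interface associated with the media player;
means for determining whether an audio prompt that audibilizes the interaction is to be output; and
means for performing, if an audio prompt is to be output, operations comprising:
determining whether a connection from the media player to a voice server over a communications network exists;
if the connection exists, receiving from the voice server a prepared audio prompt associated with the interaction;
if the connection does not exist, generating a new audio prompt; and
outputting at least a portion of the prepared audio prompt or of the new audio prompt.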
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/207,314 | 2008-09-09 | ||
US12/207,314 US8898568B2 (en) | 2008-09-09 | 2008-09-09 | Audio user interface |
PCT/US2009/051954 WO2010030440A1 (en) | 2008-09-09 | 2009-07-28 | Audio user interface |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102150128A CN102150128A (zh) | 2011-08-10 |
CN102150128B true CN102150128B (zh) | 2015-02-25 |
Family
ID=41172235
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200980135356.3A Expired - Fee Related CN102150128B (zh) | 2008-09-09 | 2009-07-28 | Audio user interface |
Country Status (8)
Country | Link |
---|---|
US (1) | US8898568B2 (zh) |
EP (1) | EP2324416B1 (zh) |
JP (1) | JP5667978B2 (zh) |
KR (1) | KR20110038735A (zh) |
CN (1) | CN102150128B (zh) |
DE (1) | DE112009002183T5 (zh) |
HK (1) | HK1160957A1 (zh) |
WO (1) | WO2010030440A1 (zh) |
US9330720B2 (en) | 2008-01-03 | 2016-05-03 | Apple Inc. | Methods and apparatus for altering audio output signals |
US8996376B2 (en) | 2008-04-05 | 2015-03-31 | Apple Inc. | Intelligent text-to-speech conversion |
US10496753B2 (en) * | 2010-01-18 | 2019-12-03 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US20100030549A1 (en) | 2008-07-31 | 2010-02-04 | Lee Michael M | Mobile device having human language translation capability with positional feedback |
US8463053B1 (en) | 2008-08-08 | 2013-06-11 | The Research Foundation Of State University Of New York | Enhanced max margin learning on multimodal data mining in a multimedia database |
US8768702B2 (en) * | 2008-09-05 | 2014-07-01 | Apple Inc. | Multi-tiered voice feedback in an electronic device |
US8676904B2 (en) | 2008-10-02 | 2014-03-18 | Apple Inc. | Electronic devices with voice command and contextual data processing capabilities |
JP4623390B2 (ja) | 2008-10-03 | 2011-02-02 | ソニー株式会社 | Playback device, playback method, and playback program |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
US10241752B2 (en) | 2011-09-30 | 2019-03-26 | Apple Inc. | Interface for a virtual digital assistant |
US10241644B2 (en) | 2011-06-03 | 2019-03-26 | Apple Inc. | Actionable reminder entries |
US9431006B2 (en) | 2009-07-02 | 2016-08-30 | Apple Inc. | Methods and apparatuses for automatic speech recognition |
US10679605B2 (en) | 2010-01-18 | 2020-06-09 | Apple Inc. | Hands-free list-reading by intelligent automated assistant |
US10553209B2 (en) | 2010-01-18 | 2020-02-04 | Apple Inc. | Systems and methods for hands-free notification summaries |
US10705794B2 (en) | 2010-01-18 | 2020-07-07 | Apple Inc. | Automatically adapting user interfaces for hands-free interaction |
US10276170B2 (en) | 2010-01-18 | 2019-04-30 | Apple Inc. | Intelligent automated assistant |
US8682667B2 (en) | 2010-02-25 | 2014-03-25 | Apple Inc. | User profiling for selecting user specific voice input processing information |
US8600447B2 (en) * | 2010-03-30 | 2013-12-03 | Flextronics Ap, Llc | Menu icons with descriptive audio |
US9634855B2 (en) | 2010-05-13 | 2017-04-25 | Alexander Poltorak | Electronic personal interactive device that determines topics of interest using a conversational agent |
US8645141B2 (en) * | 2010-09-14 | 2014-02-04 | Sony Corporation | Method and system for text to speech conversion |
US9472181B2 (en) * | 2011-02-03 | 2016-10-18 | Panasonic Intellectual Property Management Co., Ltd. | Text-to-speech device, speech output device, speech output system, text-to-speech methods, and speech output method |
US9262612B2 (en) | 2011-03-21 | 2016-02-16 | Apple Inc. | Device access using voice authentication |
US8855797B2 (en) | 2011-03-23 | 2014-10-07 | Audible, Inc. | Managing playback of synchronized content |
US9760920B2 (en) | 2011-03-23 | 2017-09-12 | Audible, Inc. | Synchronizing digital content |
US9703781B2 (en) | 2011-03-23 | 2017-07-11 | Audible, Inc. | Managing related digital content |
EP2689346B1 (en) * | 2011-03-23 | 2019-07-10 | Audible, Inc. | Managing playback of synchronized content |
US8862255B2 (en) | 2011-03-23 | 2014-10-14 | Audible, Inc. | Managing playback of synchronized content |
US9706247B2 (en) | 2011-03-23 | 2017-07-11 | Audible, Inc. | Synchronized digital content samples |
US9734153B2 (en) | 2011-03-23 | 2017-08-15 | Audible, Inc. | Managing related digital content |
US9697871B2 (en) | 2011-03-23 | 2017-07-04 | Audible, Inc. | Synchronizing recorded audio content and companion content |
US8948892B2 (en) | 2011-03-23 | 2015-02-03 | Audible, Inc. | Managing playback of synchronized content |
CN102221922A (zh) * | 2011-03-25 | 2011-10-19 | 苏州瀚瑞微电子有限公司 | Touch control system supporting voice prompts and implementation method thereof |
US10057736B2 (en) | 2011-06-03 | 2018-08-21 | Apple Inc. | Active transport based notifications |
DE102011079034A1 (de) | 2011-07-12 | 2013-01-17 | Siemens Aktiengesellschaft | Control of a technical system |
US8994660B2 (en) | 2011-08-29 | 2015-03-31 | Apple Inc. | Text correction processing |
US20130131849A1 (en) * | 2011-11-21 | 2013-05-23 | Shadi Mere | System for adapting music and sound to digital text, for electronic devices |
TWI574254B (zh) * | 2012-01-20 | 2017-03-11 | 華碩電腦股份有限公司 | Speech synthesis method and apparatus for an electronic system |
US9557903B2 (en) * | 2012-02-13 | 2017-01-31 | Lg Electronics Inc. | Method for providing user interface on terminal |
US10134385B2 (en) | 2012-03-02 | 2018-11-20 | Apple Inc. | Systems and methods for name pronunciation |
US9037956B2 (en) | 2012-03-29 | 2015-05-19 | Audible, Inc. | Content customization |
US8849676B2 (en) | 2012-03-29 | 2014-09-30 | Audible, Inc. | Content customization |
US9075760B2 (en) | 2012-05-07 | 2015-07-07 | Audible, Inc. | Narration settings distribution for content customization |
US9280610B2 (en) | 2012-05-14 | 2016-03-08 | Apple Inc. | Crowd sourcing information to fulfill user requests |
US10417037B2 (en) | 2012-05-15 | 2019-09-17 | Apple Inc. | Systems and methods for integrating third party services with a digital assistant |
US9317500B2 (en) | 2012-05-30 | 2016-04-19 | Audible, Inc. | Synchronizing translated digital content |
US9721563B2 (en) | 2012-06-08 | 2017-08-01 | Apple Inc. | Name recognition system |
US9141257B1 (en) | 2012-06-18 | 2015-09-22 | Audible, Inc. | Selecting and conveying supplemental content |
US9824695B2 (en) * | 2012-06-18 | 2017-11-21 | International Business Machines Corporation | Enhancing comprehension in voice communications |
US8972265B1 (en) | 2012-06-18 | 2015-03-03 | Audible, Inc. | Multiple voices in audio content |
US9536439B1 (en) | 2012-06-27 | 2017-01-03 | Audible, Inc. | Conveying questions with content |
US9679608B2 (en) | 2012-06-28 | 2017-06-13 | Audible, Inc. | Pacing content |
US10109278B2 (en) | 2012-08-02 | 2018-10-23 | Audible, Inc. | Aligning body matter across content formats |
US9547647B2 (en) | 2012-09-19 | 2017-01-17 | Apple Inc. | Voice-based media searching |
US9367196B1 (en) | 2012-09-26 | 2016-06-14 | Audible, Inc. | Conveying branched content |
US9632647B1 (en) | 2012-10-09 | 2017-04-25 | Audible, Inc. | Selecting presentation positions in dynamic content |
US9223830B1 (en) | 2012-10-26 | 2015-12-29 | Audible, Inc. | Content presentation analysis |
CN103839548B (zh) * | 2012-11-26 | 2018-06-01 | 腾讯科技(北京)有限公司 | Voice interaction method, apparatus, system, and mobile terminal |
US9265458B2 (en) | 2012-12-04 | 2016-02-23 | Sync-Think, Inc. | Application of smooth pursuit cognitive testing paradigms to clinical drug development |
US9280906B2 (en) | 2013-02-04 | 2016-03-08 | Audible, Inc. | Prompting a user for input during a synchronous presentation of audio content and textual content |
US9472113B1 (en) | 2013-02-05 | 2016-10-18 | Audible, Inc. | Synchronizing playback of digital content with physical content |
KR20230137475A (ko) | 2013-02-07 | 2023-10-04 | Apple Inc. | Voice trigger for a digital assistant |
US9380976B2 (en) | 2013-03-11 | 2016-07-05 | Sync-Think, Inc. | Optical neuroinformatics |
US10652394B2 (en) | 2013-03-14 | 2020-05-12 | Apple Inc. | System and method for processing voicemail |
US10748529B1 (en) | 2013-03-15 | 2020-08-18 | Apple Inc. | Voice activated device for use with a voice-based digital assistant |
US9317486B1 (en) | 2013-06-07 | 2016-04-19 | Audible, Inc. | Synchronizing playback of digital content with captured physical content |
US9582608B2 (en) | 2013-06-07 | 2017-02-28 | Apple Inc. | Unified ranking with entropy-weighted information for phrase-based semantic auto-completion |
WO2014197334A2 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for user-specified pronunciation of words for speech synthesis and recognition |
WO2014197336A1 (en) | 2013-06-07 | 2014-12-11 | Apple Inc. | System and method for detecting errors in interactions with a voice-based digital assistant |
WO2014197335A1 (en) | 2013-06-08 | 2014-12-11 | Apple Inc. | Interpreting and acting upon commands that involve sharing information with remote devices |
US10176167B2 (en) | 2013-06-09 | 2019-01-08 | Apple Inc. | System and method for inferring user intent from speech inputs |
EP3937002A1 (en) | 2013-06-09 | 2022-01-12 | Apple Inc. | Device, method, and graphical user interface for enabling conversation persistence across two or more instances of a digital assistant |
US9565497B2 (en) | 2013-08-01 | 2017-02-07 | Caavo Inc. | Enhancing audio using a mobile device |
US9489360B2 (en) | 2013-09-05 | 2016-11-08 | Audible, Inc. | Identifying extra material in companion content |
US10296160B2 (en) | 2013-12-06 | 2019-05-21 | Apple Inc. | Method for extracting salient dialog usage from live data |
US9715875B2 (en) | 2014-05-30 | 2017-07-25 | Apple Inc. | Reducing the need for manual start/end-pointing and trigger phrases |
US9430463B2 (en) | 2014-05-30 | 2016-08-30 | Apple Inc. | Exemplar-based natural language processing |
US9633004B2 (en) | 2014-05-30 | 2017-04-25 | Apple Inc. | Better resolution when referencing to concepts |
AU2015266863B2 (en) | 2014-05-30 | 2018-03-15 | Apple Inc. | Multi-command single utterance input method |
US9842101B2 (en) | 2014-05-30 | 2017-12-12 | Apple Inc. | Predictive conversion of language input |
US10170123B2 (en) | 2014-05-30 | 2019-01-01 | Apple Inc. | Intelligent assistant for home automation |
US9632748B2 (en) * | 2014-06-24 | 2017-04-25 | Google Inc. | Device designation for audio input monitoring |
US9338493B2 (en) | 2014-06-30 | 2016-05-10 | Apple Inc. | Intelligent automated assistant for TV user interactions |
US9558736B2 (en) * | 2014-07-02 | 2017-01-31 | Bose Corporation | Voice prompt generation combining native and remotely-generated speech data |
US9818400B2 (en) | 2014-09-11 | 2017-11-14 | Apple Inc. | Method and apparatus for discovering trending terms in speech requests |
US10789041B2 (en) | 2014-09-12 | 2020-09-29 | Apple Inc. | Dynamic thresholds for always listening speech trigger |
US9668121B2 (en) | 2014-09-30 | 2017-05-30 | Apple Inc. | Social reminders |
US10074360B2 (en) | 2014-09-30 | 2018-09-11 | Apple Inc. | Providing an indication of the suitability of speech recognition |
US9886432B2 (en) | 2014-09-30 | 2018-02-06 | Apple Inc. | Parsimonious handling of word inflection via categorical stem + suffix N-gram language models |
US10127911B2 (en) | 2014-09-30 | 2018-11-13 | Apple Inc. | Speaker identification and unsupervised speaker adaptation techniques |
US20160092159A1 (en) * | 2014-09-30 | 2016-03-31 | Google Inc. | Conversational music agent |
US9646609B2 (en) | 2014-09-30 | 2017-05-09 | Apple Inc. | Caching apparatus for serving phonetic pronunciations |
US9865280B2 (en) | 2015-03-06 | 2018-01-09 | Apple Inc. | Structured dictation using intelligent automated assistants |
US10152299B2 (en) | 2015-03-06 | 2018-12-11 | Apple Inc. | Reducing response latency of intelligent automated assistants |
US9886953B2 (en) | 2015-03-08 | 2018-02-06 | Apple Inc. | Virtual assistant activation |
US9721566B2 (en) | 2015-03-08 | 2017-08-01 | Apple Inc. | Competing devices responding to voice triggers |
US10567477B2 (en) | 2015-03-08 | 2020-02-18 | Apple Inc. | Virtual assistant continuity |
US9899019B2 (en) | 2015-03-18 | 2018-02-20 | Apple Inc. | Systems and methods for structured stem and suffix language models |
US9842105B2 (en) | 2015-04-16 | 2017-12-12 | Apple Inc. | Parsimonious continuous-space phrase representations for natural language processing |
US10460227B2 (en) | 2015-05-15 | 2019-10-29 | Apple Inc. | Virtual assistant in a communication session |
US10083688B2 (en) | 2015-05-27 | 2018-09-25 | Apple Inc. | Device voice control for selecting a displayed affordance |
US10200824B2 (en) | 2015-05-27 | 2019-02-05 | Apple Inc. | Systems and methods for proactively identifying and surfacing relevant content on a touch-sensitive device |
US10127220B2 (en) | 2015-06-04 | 2018-11-13 | Apple Inc. | Language identification from short strings |
US9578173B2 (en) | 2015-06-05 | 2017-02-21 | Apple Inc. | Virtual assistant aided communication with 3rd party service in a communication session |
US10101822B2 (en) | 2015-06-05 | 2018-10-16 | Apple Inc. | Language input correction |
US10186254B2 (en) | 2015-06-07 | 2019-01-22 | Apple Inc. | Context-based endpoint detection |
US10255907B2 (en) | 2015-06-07 | 2019-04-09 | Apple Inc. | Automatic accent detection using acoustic models |
US11025565B2 (en) | 2015-06-07 | 2021-06-01 | Apple Inc. | Personalized prediction of responses for instant messaging |
US20160378747A1 (en) | 2015-06-29 | 2016-12-29 | Apple Inc. | Virtual assistant for media playback |
US10747498B2 (en) | 2015-09-08 | 2020-08-18 | Apple Inc. | Zero latency digital assistant |
US10331312B2 (en) | 2015-09-08 | 2019-06-25 | Apple Inc. | Intelligent automated assistant in a media environment |
US10671428B2 (en) | 2015-09-08 | 2020-06-02 | Apple Inc. | Distributed personal assistant |
US10740384B2 (en) | 2015-09-08 | 2020-08-11 | Apple Inc. | Intelligent automated assistant for media search and playback |
PL3382693T3 (pl) * | 2015-09-22 | 2021-05-31 | Vorwerk & Co. Interholding Gmbh | Method for producing a voice message |
US9697820B2 (en) | 2015-09-24 | 2017-07-04 | Apple Inc. | Unit-selection text-to-speech synthesis using concatenation-sensitive neural networks |
US11010550B2 (en) | 2015-09-29 | 2021-05-18 | Apple Inc. | Unified language modeling framework for word prediction, auto-completion and auto-correction |
US10366158B2 (en) | 2015-09-29 | 2019-07-30 | Apple Inc. | Efficient word encoding for recurrent neural network language models |
US11587559B2 (en) | 2015-09-30 | 2023-02-21 | Apple Inc. | Intelligent device identification |
US10691473B2 (en) | 2015-11-06 | 2020-06-23 | Apple Inc. | Intelligent automated assistant in a messaging environment |
US10956666B2 (en) | 2015-11-09 | 2021-03-23 | Apple Inc. | Unconventional virtual assistant interactions |
US10049668B2 (en) | 2015-12-02 | 2018-08-14 | Apple Inc. | Applying neural network language models to weighted finite state transducers for automatic speech recognition |
US10223066B2 (en) | 2015-12-23 | 2019-03-05 | Apple Inc. | Proactive assistance based on dialog communication between devices |
US9898250B1 (en) * | 2016-02-12 | 2018-02-20 | Amazon Technologies, Inc. | Controlling distributed audio outputs to enable voice output |
US9858927B2 (en) * | 2016-02-12 | 2018-01-02 | Amazon Technologies, Inc | Processing spoken commands to control distributed audio outputs |
US10446143B2 (en) | 2016-03-14 | 2019-10-15 | Apple Inc. | Identification of voice inputs providing credentials |
JP2019523918A (ja) | 2016-05-10 | 2019-08-29 | Google LLC | Implementation of a voice assistant on a device |
KR102114003B1 (ko) | 2016-05-13 | 2020-05-25 | Google LLC | LED design language for visual affordance of voice user interfaces |
US10175941B2 (en) * | 2016-05-24 | 2019-01-08 | Oracle International Corporation | Audio feedback for continuous scrolled content |
US9934775B2 (en) | 2016-05-26 | 2018-04-03 | Apple Inc. | Unit-selection text-to-speech synthesis based on predicted concatenation parameters |
US9972304B2 (en) | 2016-06-03 | 2018-05-15 | Apple Inc. | Privacy preserving distributed evaluation framework for embedded personalized systems |
US10249300B2 (en) | 2016-06-06 | 2019-04-02 | Apple Inc. | Intelligent list reading |
US11227589B2 (en) | 2016-06-06 | 2022-01-18 | Apple Inc. | Intelligent list reading |
US10049663B2 (en) | 2016-06-08 | 2018-08-14 | Apple, Inc. | Intelligent automated assistant for media exploration |
DK179588B1 (en) | 2016-06-09 | 2019-02-22 | Apple Inc. | INTELLIGENT AUTOMATED ASSISTANT IN A HOME ENVIRONMENT |
US10509862B2 (en) | 2016-06-10 | 2019-12-17 | Apple Inc. | Dynamic phrase expansion of language input |
US10067938B2 (en) | 2016-06-10 | 2018-09-04 | Apple Inc. | Multilingual word prediction |
US10490187B2 (en) | 2016-06-10 | 2019-11-26 | Apple Inc. | Digital assistant providing automated status report |
US10192552B2 (en) | 2016-06-10 | 2019-01-29 | Apple Inc. | Digital assistant providing whispered speech |
US10586535B2 (en) | 2016-06-10 | 2020-03-10 | Apple Inc. | Intelligent digital assistant in a multi-tasking environment |
DK179343B1 (en) | 2016-06-11 | 2018-05-14 | Apple Inc | Intelligent task discovery |
DK201670540A1 (en) | 2016-06-11 | 2018-01-08 | Apple Inc | Application integration with a digital assistant |
DK179049B1 (en) | 2016-06-11 | 2017-09-18 | Apple Inc | Data driven natural language event detection and classification |
DK179415B1 (en) | 2016-06-11 | 2018-06-14 | Apple Inc | Intelligent device arbitration and control |
US10474753B2 (en) | 2016-09-07 | 2019-11-12 | Apple Inc. | Language identification using recurrent neural networks |
US10043516B2 (en) | 2016-09-23 | 2018-08-07 | Apple Inc. | Intelligent automated assistant |
US11281993B2 (en) | 2016-12-05 | 2022-03-22 | Apple Inc. | Model and ensemble compression for metric learning |
US10593346B2 (en) | 2016-12-22 | 2020-03-17 | Apple Inc. | Rank-reduced token representation for automatic speech recognition |
US11204787B2 (en) | 2017-01-09 | 2021-12-21 | Apple Inc. | Application integration with a digital assistant |
DK201770383A1 (en) | 2017-05-09 | 2018-12-14 | Apple Inc. | USER INTERFACE FOR CORRECTING RECOGNITION ERRORS |
US10417266B2 (en) | 2017-05-09 | 2019-09-17 | Apple Inc. | Context-aware ranking of intelligent response suggestions |
DK201770439A1 (en) | 2017-05-11 | 2018-12-13 | Apple Inc. | Offline personal assistant |
DK180048B1 (en) | 2017-05-11 | 2020-02-04 | Apple Inc. | MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION |
US10395654B2 (en) | 2017-05-11 | 2019-08-27 | Apple Inc. | Text normalization based on a data-driven learning network |
US10726832B2 (en) | 2017-05-11 | 2020-07-28 | Apple Inc. | Maintaining privacy of personal information |
DK201770429A1 (en) * | 2017-05-12 | 2018-12-14 | Apple Inc. | LOW-LATENCY INTELLIGENT AUTOMATED ASSISTANT |
US11301477B2 (en) | 2017-05-12 | 2022-04-12 | Apple Inc. | Feedback analysis of a digital assistant |
DK179496B1 (en) | 2017-05-12 | 2019-01-15 | Apple Inc. | USER-SPECIFIC Acoustic Models |
DK179745B1 (en) | 2017-05-12 | 2019-05-01 | Apple Inc. | SYNCHRONIZATION AND TASK DELEGATION OF A DIGITAL ASSISTANT |
DK201770432A1 (en) | 2017-05-15 | 2018-12-21 | Apple Inc. | Hierarchical belief states for digital assistants |
DK201770431A1 (en) | 2017-05-15 | 2018-12-20 | Apple Inc. | Optimizing dialogue policy decisions for digital assistants using implicit feedback |
US10303715B2 (en) | 2017-05-16 | 2019-05-28 | Apple Inc. | Intelligent automated assistant for media exploration |
US20180336892A1 (en) | 2017-05-16 | 2018-11-22 | Apple Inc. | Detecting a trigger of a digital assistant |
DK179560B1 (en) | 2017-05-16 | 2019-02-18 | Apple Inc. | FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES |
US10403278B2 (en) | 2017-05-16 | 2019-09-03 | Apple Inc. | Methods and systems for phonetic matching in digital assistant services |
US10311144B2 (en) | 2017-05-16 | 2019-06-04 | Apple Inc. | Emoji word sense disambiguation |
US10657328B2 (en) | 2017-06-02 | 2020-05-19 | Apple Inc. | Multi-task recurrent neural network architecture for efficient morphology handling in neural language modeling |
US10242557B2 (en) | 2017-06-20 | 2019-03-26 | Erik Ward | User-responsive medical trauma treatment device |
CN107564532A (zh) | 2017-07-05 | 2018-01-09 | 百度在线网络技术(北京)有限公司 | Wake-up method, apparatus, and device for an electronic device, and computer-readable storage medium |
US10445429B2 (en) | 2017-09-21 | 2019-10-15 | Apple Inc. | Natural language understanding using vocabularies with compressed serialized tries |
US10733987B1 (en) * | 2017-09-26 | 2020-08-04 | Amazon Technologies, Inc. | System and methods for providing unplayed content |
US10755051B2 (en) | 2017-09-29 | 2020-08-25 | Apple Inc. | Rule-based natural language processing |
US10636424B2 (en) | 2017-11-30 | 2020-04-28 | Apple Inc. | Multi-turn canned dialog |
US10733982B2 (en) | 2018-01-08 | 2020-08-04 | Apple Inc. | Multi-directional dialog |
US10733375B2 (en) | 2018-01-31 | 2020-08-04 | Apple Inc. | Knowledge-based framework for improving natural language understanding |
US10789959B2 (en) | 2018-03-02 | 2020-09-29 | Apple Inc. | Training speaker recognition models for digital assistants |
US10592604B2 (en) | 2018-03-12 | 2020-03-17 | Apple Inc. | Inverse text normalization for automatic speech recognition |
US10818288B2 (en) | 2018-03-26 | 2020-10-27 | Apple Inc. | Natural assistant interaction |
US10909331B2 (en) | 2018-03-30 | 2021-02-02 | Apple Inc. | Implicit identification of translation payload with neural machine translation |
US10928918B2 (en) | 2018-05-07 | 2021-02-23 | Apple Inc. | Raise to speak |
US10908873B2 (en) | 2018-05-07 | 2021-02-02 | Spotify Ab | Command confirmation for a media playback device |
US11145294B2 (en) | 2018-05-07 | 2021-10-12 | Apple Inc. | Intelligent automated assistant for delivering content from user experiences |
US10984780B2 (en) | 2018-05-21 | 2021-04-20 | Apple Inc. | Global semantic word embeddings using bi-directional recurrent neural networks |
DK180639B1 (en) | 2018-06-01 | 2021-11-04 | Apple Inc | DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT |
DK179822B1 (da) | 2018-06-01 | 2019-07-12 | Apple Inc. | Voice interaction at a primary device to access call functionality of a companion device |
US10892996B2 (en) | 2018-06-01 | 2021-01-12 | Apple Inc. | Variable latency device coordination |
US11386266B2 (en) | 2018-06-01 | 2022-07-12 | Apple Inc. | Text correction |
DK201870355A1 (en) | 2018-06-01 | 2019-12-16 | Apple Inc. | VIRTUAL ASSISTANT OPERATION IN MULTI-DEVICE ENVIRONMENTS |
US10504518B1 (en) | 2018-06-03 | 2019-12-10 | Apple Inc. | Accelerated task performance |
CN108877767A (zh) * | 2018-06-12 | 2018-11-23 | 浙江吉利控股集团有限公司 | Intelligent voice prompt system and method |
EP3598295A1 (en) | 2018-07-18 | 2020-01-22 | Spotify AB | Human-machine interfaces for utterance-based playlist selection |
CN109151565B (zh) * | 2018-09-04 | 2019-12-20 | 北京达佳互联信息技术有限公司 | Method and apparatus for playing speech, electronic device, and storage medium |
US11010561B2 (en) | 2018-09-27 | 2021-05-18 | Apple Inc. | Sentiment prediction from textual data |
US10839159B2 (en) | 2018-09-28 | 2020-11-17 | Apple Inc. | Named entity normalization in a spoken dialog system |
US11462215B2 (en) | 2018-09-28 | 2022-10-04 | Apple Inc. | Multi-modal inputs for voice commands |
US11170166B2 (en) | 2018-09-28 | 2021-11-09 | Apple Inc. | Neural typographical error modeling via generative adversarial networks |
US11475898B2 (en) | 2018-10-26 | 2022-10-18 | Apple Inc. | Low-latency multi-speaker speech recognition |
US11638059B2 (en) | 2019-01-04 | 2023-04-25 | Apple Inc. | Content playback on multiple devices |
US11348573B2 (en) | 2019-03-18 | 2022-05-31 | Apple Inc. | Multimodality in digital assistant systems |
US11265308B2 (en) * | 2019-03-29 | 2022-03-01 | Vmware, Inc. | Workflow service back end integration |
US11475884B2 (en) | 2019-05-06 | 2022-10-18 | Apple Inc. | Reducing digital assistant latency when a language is incorrectly determined |
US11307752B2 (en) | 2019-05-06 | 2022-04-19 | Apple Inc. | User configurable task triggers |
US11423908B2 (en) | 2019-05-06 | 2022-08-23 | Apple Inc. | Interpreting spoken requests |
DK201970509A1 (en) | 2019-05-06 | 2021-01-15 | Apple Inc | Spoken notifications |
US11140099B2 (en) | 2019-05-21 | 2021-10-05 | Apple Inc. | Providing message response suggestions |
DK201970511A1 (en) | 2019-05-31 | 2021-02-15 | Apple Inc | Voice identification in digital assistant systems |
US11496600B2 (en) | 2019-05-31 | 2022-11-08 | Apple Inc. | Remote execution of machine-learned models |
US11289073B2 (en) | 2019-05-31 | 2022-03-29 | Apple Inc. | Device text to speech |
DK180129B1 (en) | 2019-05-31 | 2020-06-02 | Apple Inc. | USER ACTIVITY SHORTCUT SUGGESTIONS |
US11227599B2 (en) | 2019-06-01 | 2022-01-18 | Apple Inc. | Methods and user interfaces for voice-based control of electronic devices |
US11360641B2 (en) | 2019-06-01 | 2022-06-14 | Apple Inc. | Increasing the relevance of new available information |
WO2021056255A1 (en) | 2019-09-25 | 2021-04-01 | Apple Inc. | Text detection using global geometry estimators |
US11038934B1 (en) | 2020-05-11 | 2021-06-15 | Apple Inc. | Digital assistant hardware abstraction |
US11061543B1 (en) | 2020-05-11 | 2021-07-13 | Apple Inc. | Providing relevant data items based on context |
US11490204B2 (en) | 2020-07-20 | 2022-11-01 | Apple Inc. | Multi-device audio adjustment coordination |
US11438683B2 (en) | 2020-07-21 | 2022-09-06 | Apple Inc. | User identification using headphones |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020169605A1 (en) * | 2001-03-09 | 2002-11-14 | Damiba Bertrand A. | System, method and computer program product for self-verifying file content in a speech recognition framework |
CN101051823A (zh) * | 2005-12-07 | 2007-10-10 | Apple Computer, Inc. | Portable audio device providing automatic control of audio volume parameters for hearing protection |
Family Cites Families (673)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6009A (en) * | 1849-01-09 | Improvement in machines for gathering pea-vines | ||
US3828132A (en) | 1970-10-30 | 1974-08-06 | Bell Telephone Labor Inc | Speech synthesis by concatenation of formant encoded words |
US3704345A (en) | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
US3979557A (en) | 1974-07-03 | 1976-09-07 | International Telephone And Telegraph Corporation | Speech processor system for pitch period extraction using prediction filters |
BG24190A1 (en) | 1976-09-08 | 1978-01-10 | Antonov | Method of synthesis of speech and device for effecting same |
JPS597120B2 (ja) | 1978-11-24 | 1984-02-16 | 日本電気株式会社 | Speech analysis device |
US4310721A (en) * | 1980-01-23 | 1982-01-12 | The United States Of America As Represented By The Secretary Of The Army | Half duplex integral vocoder modem system |
US4348553A (en) | 1980-07-02 | 1982-09-07 | International Business Machines Corporation | Parallel pattern verifier with dynamic time warping |
DE3382796T2 (de) | 1982-06-11 | 1996-03-28 | Mitsubishi Electric Corp | Device for interframe coding |
US4688195A (en) | 1983-01-28 | 1987-08-18 | Texas Instruments Incorporated | Natural-language interface generating system |
JPS603056A (ja) | 1983-06-21 | 1985-01-09 | Toshiba Corp | Information organizing device |
DE3335358A1 (de) | 1983-09-29 | 1985-04-11 | Siemens AG, 1000 Berlin und 8000 München | Method for determining speech spectra for automatic speech recognition and speech coding |
US5164900A (en) | 1983-11-14 | 1992-11-17 | Colman Bernath | Method and device for phonetically encoding Chinese textual data for data processing entry |
JPS60116072A (ja) | 1983-11-29 | 1985-06-22 | N K B:Kk | Information providing system |
US4726065A (en) * | 1984-01-26 | 1988-02-16 | Horst Froessl | Image manipulation by speech signals |
US4955047A (en) | 1984-03-26 | 1990-09-04 | Dytel Corporation | Automated attendant with direct inward system access |
US4811243A (en) | 1984-04-06 | 1989-03-07 | Racine Marsh V | Computer aided coordinate digitizing system |
US4692941A (en) | 1984-04-10 | 1987-09-08 | First Byte | Real-time text-to-speech conversion system |
US4783807A (en) | 1984-08-27 | 1988-11-08 | John Marley | System and method for sound recognition with feature selection synchronized to voice pitch |
US4718094A (en) * | 1984-11-19 | 1988-01-05 | International Business Machines Corp. | Speech recognition system |
US5165007A (en) | 1985-02-01 | 1992-11-17 | International Business Machines Corporation | Feneme-based Markov models for words |
US4944013A (en) | 1985-04-03 | 1990-07-24 | British Telecommunications Public Limited Company | Multi-pulse speech coder |
US4833712A (en) | 1985-05-29 | 1989-05-23 | International Business Machines Corporation | Automatic generation of simple Markov model stunted baseforms for words in a vocabulary |
US4819271A (en) | 1985-05-29 | 1989-04-04 | International Business Machines Corporation | Constructing Markov model word baseforms from multiple utterances by concatenating model sequences for word segments |
EP0218859A3 (en) | 1985-10-11 | 1989-09-06 | International Business Machines Corporation | Signal processor communication interface |
US4776016A (en) | 1985-11-21 | 1988-10-04 | Position Orientation Systems, Inc. | Voice control system |
JPH0833744B2 (ja) | 1986-01-09 | 1996-03-29 | 株式会社東芝 | Speech synthesis device |
US4724542A (en) | 1986-01-22 | 1988-02-09 | International Business Machines Corporation | Automatic reference adaptation during dynamic signature verification |
US5128752A (en) | 1986-03-10 | 1992-07-07 | Kohorn H Von | System and method for generating and redeeming tokens |
US5759101A (en) | 1986-03-10 | 1998-06-02 | Response Reward Systems L.C. | Central and remote evaluation of responses of participatory broadcast audience with automatic crediting and couponing |
US5032989A (en) | 1986-03-19 | 1991-07-16 | Realpro, Ltd. | Real estate search and location system and method |
EP0241170B1 (en) | 1986-03-28 | 1992-05-27 | AT&T Corp. | Adaptive speech feature signal generation arrangement |
US4903305A (en) * | 1986-05-12 | 1990-02-20 | Dragon Systems, Inc. | Method for representing word models for use in speech recognition |
EP0262938B1 (en) | 1986-10-03 | 1993-12-15 | BRITISH TELECOMMUNICATIONS public limited company | Language translation system |
USRE34562E (en) | 1986-10-16 | 1994-03-15 | Mitsubishi Denki Kabushiki Kaisha | Amplitude-adaptive vector quantization system |
US4829576A (en) | 1986-10-21 | 1989-05-09 | Dragon Systems, Inc. | Voice recognition system |
US4852168A (en) | 1986-11-18 | 1989-07-25 | Sprague Richard P | Compression of stored waveforms for artificial speech |
US4727354A (en) * | 1987-01-07 | 1988-02-23 | Unisys Corporation | System for selecting best fit vector code in vector quantization encoding |
US4827520A (en) | 1987-01-16 | 1989-05-02 | Prince Corporation | Voice actuated control system for use in a vehicle |
JPH0619965B2 (ja) | 1987-02-13 | 1994-03-16 | 日本電子株式会社 | Sample exchange device for scanning electron microscopes and the like |
US4965763A (en) | 1987-03-03 | 1990-10-23 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
US5644727A (en) | 1987-04-15 | 1997-07-01 | Proprietary Financial Products, Inc. | System for the operation and management of one or more financial accounts through the use of a digital communication and computation system for exchange, investment and borrowing |
CA1295064C (en) | 1987-05-29 | 1992-01-28 | Kuniyoshi Marui | Voice recognition system used in telephone apparatus |
DE3723078A1 (de) | 1987-07-11 | 1989-01-19 | Philips Patentverwaltung | Verfahren zur erkennung von zusammenhaengend gesprochenen woertern |
CA1288516C (en) | 1987-07-31 | 1991-09-03 | Leendert M. Bijnagte | Apparatus and method for communicating textual and image information between a host computer and a remote display terminal |
US4974191A (en) | 1987-07-31 | 1990-11-27 | Syntellect Software Inc. | Adaptive natural language computer interface system |
US5022081A (en) | 1987-10-01 | 1991-06-04 | Sharp Kabushiki Kaisha | Information recognition system |
US4852173A (en) | 1987-10-29 | 1989-07-25 | International Business Machines Corporation | Design and construction of a binary-tree system for language modelling |
EP0314908B1 (en) | 1987-10-30 | 1992-12-02 | International Business Machines Corporation | Automatic determination of labels and markov word models in a speech recognition system |
US5072452A (en) | 1987-10-30 | 1991-12-10 | International Business Machines Corporation | Automatic determination of labels and Markov word models in a speech recognition system |
US4914586A (en) | 1987-11-06 | 1990-04-03 | Xerox Corporation | Garbage collector for hypermedia systems |
US4992972A (en) * | 1987-11-18 | 1991-02-12 | International Business Machines Corporation | Flexible context searchable on-line information system with help files and modules for on-line computer system documentation |
US5220657A (en) | 1987-12-02 | 1993-06-15 | Xerox Corporation | Updating local copy of shared data in a collaborative system |
US4984177A (en) | 1988-02-05 | 1991-01-08 | Advanced Products And Technologies, Inc. | Voice language translator |
CA1333420C (en) | 1988-02-29 | 1994-12-06 | Tokumichi Murakami | Vector quantizer |
US4914590A (en) | 1988-05-18 | 1990-04-03 | Emhart Industries, Inc. | Natural language understanding system |
FR2636163B1 (fr) | 1988-09-02 | 1991-07-05 | Hamon Christian | Procede et dispositif de synthese de la parole par addition-recouvrement de formes d'onde |
US4839853A (en) | 1988-09-15 | 1989-06-13 | Bell Communications Research, Inc. | Computer information retrieval using latent semantic structure |
JPH0293597A (ja) | 1988-09-30 | 1990-04-04 | Nippon I B M Kk | 音声認識装置 |
US4905163A (en) * | 1988-10-03 | 1990-02-27 | Minnesota Mining & Manufacturing Company | Intelligent optical navigator dynamic information presentation and navigation system |
US5282265A (en) * | 1988-10-04 | 1994-01-25 | Canon Kabushiki Kaisha | Knowledge information processing system |
DE3837590A1 (de) | 1988-11-05 | 1990-05-10 | Ant Nachrichtentech | Verfahren zum reduzieren der datenrate von digitalen bilddaten |
EP0372734B1 (en) | 1988-11-23 | 1994-03-09 | Digital Equipment Corporation | Name pronunciation by synthesizer |
US5027406A (en) | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
US5127055A (en) | 1988-12-30 | 1992-06-30 | Kurzweil Applied Intelligence, Inc. | Speech recognition apparatus & method having dynamic reference pattern adaptation |
US5293448A (en) | 1989-10-02 | 1994-03-08 | Nippon Telegraph And Telephone Corporation | Speech analysis-synthesis method and apparatus therefor |
US5047614A (en) | 1989-01-23 | 1991-09-10 | Bianco James S | Method and apparatus for computer-aided shopping |
SE466029B (sv) * | 1989-03-06 | 1991-12-02 | Ibm Svenska Ab | Anordning och foerfarande foer analys av naturligt spraak i ett datorbaserat informationsbehandlingssystem |
JPH0782544B2 (ja) | 1989-03-24 | 1995-09-06 | インターナショナル・ビジネス・マシーンズ・コーポレーション | マルチテンプレートを用いるdpマツチング方法及び装置 |
US4977598A (en) | 1989-04-13 | 1990-12-11 | Texas Instruments Incorporated | Efficient pruning algorithm for hidden markov model speech recognition |
US5197005A (en) | 1989-05-01 | 1993-03-23 | Intelligent Business Systems | Database retrieval system having a natural language interface |
US5010574A (en) | 1989-06-13 | 1991-04-23 | At&T Bell Laboratories | Vector quantizer search arrangement |
JP2940005B2 (ja) | 1989-07-20 | 1999-08-25 | 日本電気株式会社 | 音声符号化装置 |
US5091945A (en) * | 1989-09-28 | 1992-02-25 | At&T Bell Laboratories | Source dependent channel coding with error protection |
CA2027705C (en) | 1989-10-17 | 1994-02-15 | Masami Akamine | Speech coding system utilizing a recursive computation technique for improvement in processing speed |
US5020112A (en) | 1989-10-31 | 1991-05-28 | At&T Bell Laboratories | Image recognition method using two-dimensional stochastic grammars |
US5220639A (en) | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
US5021971A (en) | 1989-12-07 | 1991-06-04 | Unisys Corporation | Reflective binary encoder for vector quantization |
US5179652A (en) * | 1989-12-13 | 1993-01-12 | Anthony I. Rozmanith | Method and apparatus for storing, transmitting and retrieving graphical and tabular data |
AT394262B (de) | 1989-12-15 | 1992-02-25 | Vaillant Gmbh | Einrichtung fuer die einstellung und ueberwachung einer heizungsanlage |
DE69133296T2 (de) | 1990-02-22 | 2004-01-29 | Nec Corp | Sprachcodierer |
US5301109A (en) | 1990-06-11 | 1994-04-05 | Bell Communications Research, Inc. | Computerized cross-language document retrieval using latent semantic indexing |
JP3266246B2 (ja) | 1990-06-15 | 2002-03-18 | インターナシヨナル・ビジネス・マシーンズ・コーポレーシヨン | 自然言語解析装置及び方法並びに自然言語解析用知識ベース構築方法 |
US5202952A (en) | 1990-06-22 | 1993-04-13 | Dragon Systems, Inc. | Large-vocabulary continuous speech prefiltering and processing system |
GB9017600D0 (en) | 1990-08-10 | 1990-09-26 | British Aerospace | An assembly and method for binary tree-searched vector quanisation data compression processing |
US5309359A (en) | 1990-08-16 | 1994-05-03 | Boris Katz | Method and apparatus for generating and utlizing annotations to facilitate computer text retrieval |
US5404295A (en) | 1990-08-16 | 1995-04-04 | Katz; Boris | Method and apparatus for utilizing annotations to facilitate computer retrieval of database material |
US5297170A (en) | 1990-08-21 | 1994-03-22 | Codex Corporation | Lattice and trellis-coded quantization |
US5400434A (en) | 1990-09-04 | 1995-03-21 | Matsushita Electric Industrial Co., Ltd. | Voice source for synthetic speech system |
US5216747A (en) | 1990-09-20 | 1993-06-01 | Digital Voice Systems, Inc. | Voiced/unvoiced estimation of an acoustic signal |
US5128672A (en) | 1990-10-30 | 1992-07-07 | Apple Computer, Inc. | Dynamic predictive keyboard |
US5317507A (en) | 1990-11-07 | 1994-05-31 | Gallant Stephen I | Method for document retrieval and for word sense disambiguation using neural networks |
US5325298A (en) | 1990-11-07 | 1994-06-28 | Hnc, Inc. | Methods for generating or revising context vectors for a plurality of word stems |
US5247579A (en) * | 1990-12-05 | 1993-09-21 | Digital Voice Systems, Inc. | Methods for speech transmission |
US5345536A (en) | 1990-12-21 | 1994-09-06 | Matsushita Electric Industrial Co., Ltd. | Method of speech recognition |
US5127053A (en) | 1990-12-24 | 1992-06-30 | General Electric Company | Low-complexity method for improving the performance of autocorrelation-based pitch detectors |
US5133011A (en) | 1990-12-26 | 1992-07-21 | International Business Machines Corporation | Method and apparatus for linear vocal control of cursor position |
US5268990A (en) | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
GB9105367D0 (en) | 1991-03-13 | 1991-04-24 | Univ Strathclyde | Computerised information-retrieval database systems |
US5303406A (en) | 1991-04-29 | 1994-04-12 | Motorola, Inc. | Noise squelch circuit with adaptive noise shaping |
US5475587A (en) | 1991-06-28 | 1995-12-12 | Digital Equipment Corporation | Method and apparatus for efficient morphological text analysis using a high-level language for compact specification of inflectional paradigms |
US5293452A (en) | 1991-07-01 | 1994-03-08 | Texas Instruments Incorporated | Voice log-in using spoken name input |
US5687077A (en) | 1991-07-31 | 1997-11-11 | Universal Dynamics Limited | Method and apparatus for adaptive control |
US5199077A (en) | 1991-09-19 | 1993-03-30 | Xerox Corporation | Wordspotting for voice editing and indexing |
JP2662120B2 (ja) | 1991-10-01 | 1997-10-08 | インターナショナル・ビジネス・マシーンズ・コーポレイション | 音声認識装置および音声認識用処理ユニット |
JPH05108065A (ja) | 1991-10-15 | 1993-04-30 | Kawai Musical Instr Mfg Co Ltd | 自動演奏装置 |
US5222146A (en) | 1991-10-23 | 1993-06-22 | International Business Machines Corporation | Speech recognition apparatus having a speech coder outputting acoustic prototype ranks |
KR940002854B1 (ko) | 1991-11-06 | 1994-04-04 | 한국전기통신공사 | 음성 합성시스팀의 음성단편 코딩 및 그의 피치조절 방법과 그의 유성음 합성장치 |
US5386494A (en) * | 1991-12-06 | 1995-01-31 | Apple Computer, Inc. | Method and apparatus for controlling a speech recognition function using a cursor control device |
US5903454A (en) | 1991-12-23 | 1999-05-11 | Hoffberg; Linda Irene | Human-factored interface corporating adaptive pattern recognition based controller apparatus |
US6081750A (en) | 1991-12-23 | 2000-06-27 | Hoffberg; Steven Mark | Ergonomic man-machine interface incorporating adaptive pattern recognition based control system |
US5502790A (en) | 1991-12-24 | 1996-03-26 | Oki Electric Industry Co., Ltd. | Speech recognition method and system using triphones, diphones, and phonemes |
US5349645A (en) | 1991-12-31 | 1994-09-20 | Matsushita Electric Industrial Co., Ltd. | Word hypothesizer for continuous speech decoding using stressed-vowel centered bidirectional tree searches |
US5267345A (en) | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
EP0559349B1 (en) | 1992-03-02 | 1999-01-07 | AT&T Corp. | Training method and apparatus for speech recognition |
US6055514A (en) | 1992-03-20 | 2000-04-25 | Wren; Stephen Corey | System for marketing foods and services utilizing computerized centraland remote facilities |
US5317647A (en) | 1992-04-07 | 1994-05-31 | Apple Computer, Inc. | Constrained attribute grammars for syntactic pattern recognition |
US5412804A (en) | 1992-04-30 | 1995-05-02 | Oracle Corporation | Extending the semantics of the outer join operator for un-nesting queries to a data base |
US5293584A (en) | 1992-05-21 | 1994-03-08 | International Business Machines Corporation | Speech recognition system for natural language translation |
US5434777A (en) | 1992-05-27 | 1995-07-18 | Apple Computer, Inc. | Method and apparatus for processing natural language |
US5390281A (en) | 1992-05-27 | 1995-02-14 | Apple Computer, Inc. | Method and apparatus for deducing user intent and providing computer implemented services |
US5734789A (en) * | 1992-06-01 | 1998-03-31 | Hughes Electronics | Voiced, unvoiced or noise modes in a CELP vocoder |
US5333275A (en) | 1992-06-23 | 1994-07-26 | Wheatley Barbara J | System and method for time aligning speech |
US5325297A (en) | 1992-06-25 | 1994-06-28 | System Of Multiple-Colored Images For Internationally Listed Estates, Inc. | Computer implemented method and system for storing and retrieving textual data and compressed image data |
US5999908A (en) | 1992-08-06 | 1999-12-07 | Abelow; Daniel H. | Customer-based product design module |
GB9220404D0 (en) | 1992-08-20 | 1992-11-11 | Nat Security Agency | Method of identifying,retrieving and sorting documents |
US5412806A (en) | 1992-08-20 | 1995-05-02 | Hewlett-Packard Company | Calibration of logical cost formulae for queries in a heterogeneous DBMS using synthetic database |
US5333236A (en) | 1992-09-10 | 1994-07-26 | International Business Machines Corporation | Speech recognizer having a speech coder for an acoustic match based on context-dependent speech-transition acoustic models |
US5384893A (en) * | 1992-09-23 | 1995-01-24 | Emerson & Stern Associates, Inc. | Method and apparatus for speech synthesis based on prosodic analysis |
FR2696036B1 (fr) | 1992-09-24 | 1994-10-14 | France Telecom | Procédé de mesure de ressemblance entre échantillons sonores et dispositif de mise en œuvre de ce procédé. |
JPH0772840B2 (ja) | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | 音声モデルの構成方法、音声認識方法、音声認識装置及び音声モデルの訓練方法 |
US5758313A (en) | 1992-10-16 | 1998-05-26 | Mobile Information Systems, Inc. | Method and apparatus for tracking vehicle location |
US5455888A (en) | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
US5533182A (en) | 1992-12-22 | 1996-07-02 | International Business Machines Corporation | Aural position indicating mechanism for viewable objects |
US5412756A (en) | 1992-12-22 | 1995-05-02 | Mitsubishi Denki Kabushiki Kaisha | Artificial intelligence software shell for plant operation simulation |
US5384892A (en) * | 1992-12-31 | 1995-01-24 | Apple Computer, Inc. | Dynamic language model for speech recognition |
US5390279A (en) * | 1992-12-31 | 1995-02-14 | Apple Computer, Inc. | Partitioning speech rules by context for speech recognition |
US5613036A (en) | 1992-12-31 | 1997-03-18 | Apple Computer, Inc. | Dynamic categories for a speech recognition system |
US5734791A (en) | 1992-12-31 | 1998-03-31 | Apple Computer, Inc. | Rapid tree-based method for vector quantization |
US6122616A (en) | 1993-01-21 | 2000-09-19 | Apple Computer, Inc. | Method and apparatus for diphone aliasing |
US5890122A (en) | 1993-02-08 | 1999-03-30 | Microsoft Corporation | Voice-controlled computer simulateously displaying application menu and list of available commands |
US5864844A (en) | 1993-02-18 | 1999-01-26 | Apple Computer, Inc. | System and method for enhancing a user interface with a computer based training tool |
CA2091658A1 (en) | 1993-03-15 | 1994-09-16 | Matthew Lennig | Method and apparatus for automation of directory assistance using speech recognition |
US6055531A (en) | 1993-03-24 | 2000-04-25 | Engate Incorporated | Down-line transcription system having context sensitive searching capability |
US5536902A (en) | 1993-04-14 | 1996-07-16 | Yamaha Corporation | Method of and apparatus for analyzing and synthesizing a sound by extracting and controlling a sound parameter |
US5444823A (en) | 1993-04-16 | 1995-08-22 | Compaq Computer Corporation | Intelligent search engine for associated on-line documentation having questionless case-based knowledge base |
US5574823A (en) | 1993-06-23 | 1996-11-12 | Her Majesty The Queen In Right Of Canada As Represented By The Minister Of Communications | Frequency selective harmonic coding |
US5515475A (en) | 1993-06-24 | 1996-05-07 | Northern Telecom Limited | Speech recognition method using a two-pass search |
JPH0756933A (ja) | 1993-06-24 | 1995-03-03 | Xerox Corp | 文書検索方法 |
JP3685812B2 (ja) | 1993-06-29 | 2005-08-24 | ソニー株式会社 | 音声信号送受信装置 |
US5794207A (en) | 1996-09-04 | 1998-08-11 | Walker Asset Management Limited Partnership | Method and apparatus for a cryptographically assisted commercial network system designed to facilitate buyer-driven conditional purchase offers |
WO1995002221A1 (en) | 1993-07-07 | 1995-01-19 | Inference Corporation | Case-based organizing and querying of a database |
US5495604A (en) | 1993-08-25 | 1996-02-27 | Asymetrix Corporation | Method and apparatus for the modeling and query of database structures using natural language-like constructs |
US5619694A (en) | 1993-08-26 | 1997-04-08 | Nec Corporation | Case database storage/retrieval system |
US5940811A (en) | 1993-08-27 | 1999-08-17 | Affinity Technology Group, Inc. | Closed loop financial transaction method and apparatus |
US5377258A (en) | 1993-08-30 | 1994-12-27 | National Medical Research Council | Method and apparatus for an automated and interactive behavioral guidance system |
US5873056A (en) * | 1993-10-12 | 1999-02-16 | The Syracuse University | Natural language processing system for semantic vector representation which accounts for lexical ambiguity |
US5578808A (en) | 1993-12-22 | 1996-11-26 | Datamark Services, Inc. | Data card that can be used for transactions involving separate card issuers |
EP0736203A1 (en) | 1993-12-23 | 1996-10-09 | Diacom Technologies, Inc. | Method and apparatus for implementing user feedback |
US5621859A (en) | 1994-01-19 | 1997-04-15 | Bbn Corporation | Single tree method for grammar directed, very large vocabulary speech recognizer |
US5584024A (en) | 1994-03-24 | 1996-12-10 | Software Ag | Interactive database query system and method for prohibiting the selection of semantically incorrect query parameters |
US5642519A (en) | 1994-04-29 | 1997-06-24 | Sun Microsystems, Inc. | Speech interpreter with a unified grammer compiler |
EP0684607B1 (en) | 1994-05-25 | 2001-03-14 | Victor Company Of Japan, Limited | Variable transfer rate data reproduction apparatus |
US5493677A (en) | 1994-06-08 | 1996-02-20 | Systems Research & Applications Corporation | Generation, archiving, and retrieval of digital images with evoked suggestion-set captions and natural language interface |
US5675819A (en) | 1994-06-16 | 1997-10-07 | Xerox Corporation | Document information retrieval using global word co-occurrence patterns |
JPH0869470A (ja) | 1994-06-21 | 1996-03-12 | Canon Inc | 自然言語処理装置及びその方法 |
US5948040A (en) | 1994-06-24 | 1999-09-07 | Delorme Publishing Co. | Travel reservation information and planning system |
US5682539A (en) | 1994-09-29 | 1997-10-28 | Conrad; Donovan | Anticipated meaning natural language interface |
US5715468A (en) | 1994-09-30 | 1998-02-03 | Budzinski; Robert Lucius | Memory system for storing and retrieving experience and knowledge with natural language |
GB2293667B (en) | 1994-09-30 | 1998-05-27 | Intermation Limited | Database management system |
US5661787A (en) | 1994-10-27 | 1997-08-26 | Pocock; Michael H. | System for on-demand remote access to a self-generating audio recording, storage, indexing and transaction system |
US5845255A (en) | 1994-10-28 | 1998-12-01 | Advanced Health Med-E-Systems Corporation | Prescription management system |
US5577241A (en) | 1994-12-07 | 1996-11-19 | Excite, Inc. | Information retrieval system and method with implementation extensible query architecture |
US5748974A (en) | 1994-12-13 | 1998-05-05 | International Business Machines Corporation | Multimodal natural language interface for cross-application tasks |
US5794050A (en) | 1995-01-04 | 1998-08-11 | Intelligent Text Processing, Inc. | Natural language understanding system |
EP1643340B1 (en) | 1995-02-13 | 2013-08-14 | Intertrust Technologies Corp. | Secure transaction management |
US5701400A (en) | 1995-03-08 | 1997-12-23 | Amado; Carlos Armando | Method and apparatus for applying if-then-else rules to data sets in a relational data base and generating from the results of application of said rules a database of diagnostics linked to said data sets to aid executive analysis of financial data |
US5749081A (en) | 1995-04-06 | 1998-05-05 | Firefly Network, Inc. | System and method for recommending items to a user |
US5642464A (en) | 1995-05-03 | 1997-06-24 | Northern Telecom Limited | Methods and apparatus for noise conditioning in digital speech compression systems using linear predictive coding |
US5664055A (en) | 1995-06-07 | 1997-09-02 | Lucent Technologies Inc. | CS-ACELP speech compression system with adaptive pitch prediction filter gain based on a measure of periodicity |
US5710886A (en) | 1995-06-16 | 1998-01-20 | Sellectsoft, L.C. | Electric couponing method and apparatus |
JP3284832B2 (ja) | 1995-06-22 | 2002-05-20 | セイコーエプソン株式会社 | 音声認識対話処理方法および音声認識対話装置 |
US6038533A (en) | 1995-07-07 | 2000-03-14 | Lucent Technologies Inc. | System and method for selecting training text |
US5999895A (en) | 1995-07-24 | 1999-12-07 | Forest; Donald K. | Sound operated menu method and apparatus |
US6026388A (en) | 1995-08-16 | 2000-02-15 | Textwise, Llc | User interface and other enhancements for natural language information retrieval system and method |
JP3697748B2 (ja) | 1995-08-21 | 2005-09-21 | セイコーエプソン株式会社 | 端末、音声認識装置 |
US5712957A (en) * | 1995-09-08 | 1998-01-27 | Carnegie Mellon University | Locating and correcting erroneously recognized portions of utterances by rescoring based on two n-best lists |
US5790978A (en) | 1995-09-15 | 1998-08-04 | Lucent Technologies, Inc. | System and method for determining pitch contours |
US6173261B1 (en) * | 1998-09-30 | 2001-01-09 | At&T Corp | Grammar fragment acquisition using syntactic and semantic clustering |
US5737734A (en) | 1995-09-15 | 1998-04-07 | Infonautics Corporation | Query word relevance adjustment in a search of an information retrieval system |
US5884323A (en) | 1995-10-13 | 1999-03-16 | 3Com Corporation | Extendible method and apparatus for synchronizing files on two different computer systems |
US5799276A (en) | 1995-11-07 | 1998-08-25 | Accent Incorporated | Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals |
US5794237A (en) | 1995-11-13 | 1998-08-11 | International Business Machines Corporation | System and method for improving problem source identification in computer systems employing relevance feedback and statistical source ranking |
US5802526A (en) | 1995-11-15 | 1998-09-01 | Microsoft Corporation | System and method for graphically displaying and navigating through an interactive voice response menu |
US5706442A (en) | 1995-12-20 | 1998-01-06 | Block Financial Corporation | System for on-line financial services using distributed objects |
WO1997026612A1 (en) | 1996-01-17 | 1997-07-24 | Personal Agents, Inc. | Intelligent agents for electronic commerce |
US6119101A (en) | 1996-01-17 | 2000-09-12 | Personal Agents, Inc. | Intelligent agents for electronic commerce |
US6125356A (en) | 1996-01-18 | 2000-09-26 | Rosefaire Development, Ltd. | Portable sales presentation system with selective scripted seller prompts |
US5987404A (en) | 1996-01-29 | 1999-11-16 | International Business Machines Corporation | Statistical natural language understanding using hidden clumpings |
US5729694A (en) | 1996-02-06 | 1998-03-17 | The Regents Of The University Of California | Speech coding, reconstruction and recognition using acoustics and electromagnetic waves |
US6076088A (en) | 1996-02-09 | 2000-06-13 | Paik; Woojin | Information extraction system and method using concept relation concept (CRC) triples |
US5835893A (en) | 1996-02-15 | 1998-11-10 | Atr Interpreting Telecommunications Research Labs | Class-based word clustering for speech recognition using a three-level balanced hierarchical similarity |
US5901287A (en) | 1996-04-01 | 1999-05-04 | The Sabre Group Inc. | Information aggregation and synthesization system |
US5867799A (en) * | 1996-04-04 | 1999-02-02 | Lang; Andrew K. | Information system and method for filtering a massive flow of information entities to meet user information classification needs |
US5963924A (en) | 1996-04-26 | 1999-10-05 | Verifone, Inc. | System, method and article of manufacture for the use of payment instrument holders and payment instruments in network electronic commerce |
US5987140A (en) | 1996-04-26 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for secure network electronic payment and credit collection |
US5913193A (en) | 1996-04-30 | 1999-06-15 | Microsoft Corporation | Method and system of runtime acoustic unit selection for speech synthesis |
US5857184A (en) | 1996-05-03 | 1999-01-05 | Walden Media, Inc. | Language and method for creating, organizing, and retrieving data from a database |
US5828999A (en) | 1996-05-06 | 1998-10-27 | Apple Computer, Inc. | Method and system for deriving a large-span semantic language model for large-vocabulary recognition systems |
FR2748342B1 (fr) * | 1996-05-06 | 1998-07-17 | France Telecom | Procede et dispositif de filtrage par egalisation d'un signal de parole, mettant en oeuvre un modele statistique de ce signal |
US5826261A (en) | 1996-05-10 | 1998-10-20 | Spencer; Graham | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query |
US6366883B1 (en) | 1996-05-15 | 2002-04-02 | Atr Interpreting Telecommunications | Concatenation of speech segments by use of a speech synthesizer |
US5727950A (en) | 1996-05-22 | 1998-03-17 | Netsage Corporation | Agent based instruction system and method |
US5966533A (en) * | 1996-06-11 | 1999-10-12 | Excite, Inc. | Method and system for dynamically synthesizing a computer program by differentially resolving atoms based on user context data |
US5915249A (en) | 1996-06-14 | 1999-06-22 | Excite, Inc. | System and method for accelerated query evaluation of very large full-text databases |
US5987132A (en) | 1996-06-17 | 1999-11-16 | Verifone, Inc. | System, method and article of manufacture for conditionally accepting a payment method utilizing an extensible, flexible architecture |
US5912952A (en) | 1996-06-27 | 1999-06-15 | At&T Corp | Voice response unit with a visual menu interface |
US5825881A (en) | 1996-06-28 | 1998-10-20 | Allsoft Distributing Inc. | Public network merchandising system |
US6070147A (en) | 1996-07-02 | 2000-05-30 | Tecmark Services, Inc. | Customer identification and marketing analysis systems |
WO1998003927A2 (en) | 1996-07-22 | 1998-01-29 | Cyva Research Corp | Personal information security and exchange tool |
US5862223A (en) | 1996-07-24 | 1999-01-19 | Walker Asset Management Limited Partnership | Method and apparatus for a cryptographically-assisted commercial network system designed to facilitate and support expert-based commerce |
US5950123A (en) | 1996-08-26 | 1999-09-07 | Telefonaktiebolaget L M | Cellular telephone network support of audible information delivery to visually impaired subscribers |
EP0829811A1 (en) | 1996-09-11 | 1998-03-18 | Nippon Telegraph And Telephone Corporation | Method and system for information retrieval |
US6181935B1 (en) | 1996-09-27 | 2001-01-30 | Software.Com, Inc. | Mobility extended telephone application programming interface and method of use |
US5794182A (en) | 1996-09-30 | 1998-08-11 | Apple Computer, Inc. | Linear predictive speech encoding systems with efficient combination pitch coefficients computation |
US5721827A (en) * | 1996-10-02 | 1998-02-24 | James Logan | System for electrically distributing personalized information |
US6199076B1 (en) * | 1996-10-02 | 2001-03-06 | James Logan | Audio program player including a dynamic program selection controller |
US5913203A (en) | 1996-10-03 | 1999-06-15 | Jaesent Inc. | System and method for pseudo cash transactions |
US5930769A (en) | 1996-10-07 | 1999-07-27 | Rose; Andrea | System and method for fashion shopping |
US5836771A (en) | 1996-12-02 | 1998-11-17 | Ho; Chi Fai | Learning method and system based on questioning |
US6665639B2 (en) | 1996-12-06 | 2003-12-16 | Sensory, Inc. | Speech recognition in consumer electronic products |
US6078914A (en) | 1996-12-09 | 2000-06-20 | Open Text Corporation | Natural language meta-search system and method |
US5839106A (en) | 1996-12-17 | 1998-11-17 | Apple Computer, Inc. | Large-vocabulary speech recognition using an integrated syntactic and semantic statistical language model |
US5966126A (en) | 1996-12-23 | 1999-10-12 | Szabo; Andrew J. | Graphic user interface for database system |
US5932869A (en) | 1996-12-27 | 1999-08-03 | Graphic Technology, Inc. | Promotional system with magnetic stripe and visual thermo-reversible print surfaced medium |
JP3579204B2 (ja) | 1997-01-17 | 2004-10-20 | 富士通株式会社 | 文書要約装置およびその方法 |
US5941944A (en) | 1997-03-03 | 1999-08-24 | Microsoft Corporation | Method for providing a substitute for a requested inaccessible object by identifying substantially similar objects using weights corresponding to object features |
US5930801A (en) | 1997-03-07 | 1999-07-27 | Xerox Corporation | Shared-data environment in which each file has independent security properties |
US6076051A (en) | 1997-03-07 | 2000-06-13 | Microsoft Corporation | Information retrieval utilizing semantic representation of text |
JPH10320169A (ja) * | 1997-03-14 | 1998-12-04 | Fujitsu Ltd | 情報電子装置 |
AU6566598A (en) | 1997-03-20 | 1998-10-12 | Schlumberger Technologies, Inc. | System and method of transactional taxation using secure stored data devices |
US5822743A (en) | 1997-04-08 | 1998-10-13 | 1215627 Ontario Inc. | Knowledge-based information retrieval system |
JP3704925B2 (ja) | 1997-04-22 | 2005-10-12 | トヨタ自動車株式会社 | 移動端末装置及びその音声出力プログラムを記録した媒体 |
US5970474A (en) | 1997-04-24 | 1999-10-19 | Sears, Roebuck And Co. | Registry information system for shoppers |
US5895464A (en) | 1997-04-30 | 1999-04-20 | Eastman Kodak Company | Computer program product and a method for using natural language for the description, search and retrieval of multi-media objects |
WO1999001834A1 (en) | 1997-07-02 | 1999-01-14 | Coueignoux, Philippe, J., M. | System and method for the secure discovery, exploitation and publication of information |
US5860063A (en) * | 1997-07-11 | 1999-01-12 | At&T Corp | Automated meaningful phrase clustering |
US5933822A (en) | 1997-07-22 | 1999-08-03 | Microsoft Corporation | Apparatus and methods for an information retrieval system that employs natural language processing of search results to improve overall precision |
US5974146A (en) | 1997-07-30 | 1999-10-26 | Huntington Bancshares Incorporated | Real time bank-centric universal payment system |
US6016476A (en) | 1997-08-11 | 2000-01-18 | International Business Machines Corporation | Portable information and transaction processing system and method utilizing biometric authorization and digital certificate security |
US5895466A (en) | 1997-08-19 | 1999-04-20 | At&T Corp | Automated natural language understanding customer service system |
US6081774A (en) | 1997-08-22 | 2000-06-27 | Novell, Inc. | Natural language information retrieval system and method |
US6404876B1 (en) | 1997-09-25 | 2002-06-11 | Gte Intelligent Network Services Incorporated | System and method for voice activated dialing and routing under open access network control |
US6023684A (en) | 1997-10-01 | 2000-02-08 | Security First Technologies, Inc. | Three tier financial transaction system with cache memory |
US6035336A (en) | 1997-10-17 | 2000-03-07 | International Business Machines Corporation | Audio ticker system and method for presenting push information including pre-recorded audio |
EP0911808B1 (en) | 1997-10-23 | 2002-05-08 | Sony International (Europe) GmbH | Speech interface in a home network environment |
US6108627A (en) | 1997-10-31 | 2000-08-22 | Nortel Networks Corporation | Automatic transcription tool |
US5943670A (en) | 1997-11-21 | 1999-08-24 | International Business Machines Corporation | System and method for categorizing objects in combined categories |
US5960422A (en) | 1997-11-26 | 1999-09-28 | International Business Machines Corporation | System and method for optimized source selection in an information retrieval system |
US6026375A (en) | 1997-12-05 | 2000-02-15 | Nortel Networks Corporation | Method and apparatus for processing orders from customers in a mobile environment |
US6064960A (en) | 1997-12-18 | 2000-05-16 | Apple Computer, Inc. | Method and apparatus for improved duration modeling of phonemes |
US6094649A (en) | 1997-12-22 | 2000-07-25 | Partnet, Inc. | Keyword searches of structured databases |
US20020002039A1 (en) * | 1998-06-12 | 2002-01-03 | Safi Qureshey | Network-enabled audio device |
US20020080163A1 (en) | 1998-02-23 | 2002-06-27 | Morey Dale D. | Information retrieval system |
US6345250B1 (en) * | 1998-02-24 | 2002-02-05 | International Business Machines Corp. | Developing voice response applications from pre-recorded voice and stored text-to-speech prompts |
US6173287B1 (en) | 1998-03-11 | 2001-01-09 | Digital Equipment Corporation | Technique for ranking multimedia annotations of interest |
US6195641B1 (en) * | 1998-03-27 | 2001-02-27 | International Business Machines Corp. | Network universal spoken language vocabulary |
US6026393A (en) | 1998-03-31 | 2000-02-15 | Casebank Technologies Inc. | Configuration knowledge as an aid to case retrieval |
US6233559B1 (en) | 1998-04-01 | 2001-05-15 | Motorola, Inc. | Speech control of multiple applications using applets |
US6173279B1 (en) | 1998-04-09 | 2001-01-09 | At&T Corp. | Method of using a natural language interface to retrieve information from one or more data resources |
US6088731A (en) | 1998-04-24 | 2000-07-11 | Associative Computing, Inc. | Intelligent assistant for use with a local computer and with the internet |
AU3717099A (en) | 1998-04-27 | 1999-11-16 | British Telecommunications Public Limited Company | Database access tool |
US6016471A (en) * | 1998-04-29 | 2000-01-18 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus using decision trees to generate and score multiple pronunciations for a spelled word |
US6029132A (en) * | 1998-04-30 | 2000-02-22 | Matsushita Electric Industrial Co. | Method for letter-to-sound in text-to-speech synthesis |
US6285786B1 (en) | 1998-04-30 | 2001-09-04 | Motorola, Inc. | Text recognizer and method using non-cumulative character scoring in a forward search |
US6144938A (en) | 1998-05-01 | 2000-11-07 | Sun Microsystems, Inc. | Voice user interface with personality |
US6778970B2 (en) | 1998-05-28 | 2004-08-17 | Lawrence Au | Topological methods to organize semantic network data flows for conversational applications |
US20070094224A1 (en) | 1998-05-28 | 2007-04-26 | Lawrence Au | Method and system for determining contextual meaning for network search applications |
US7711672B2 (en) | 1998-05-28 | 2010-05-04 | Lawrence Au | Semantic network methods to disambiguate natural language meaning |
US6563769B1 (en) | 1998-06-11 | 2003-05-13 | Koninklijke Philips Electronics N.V. | Virtual jukebox |
US6144958A (en) | 1998-07-15 | 2000-11-07 | Amazon.Com, Inc. | System and method for correcting spelling errors in search queries |
US6105865A (en) | 1998-07-17 | 2000-08-22 | Hardesty; Laurence Daniel | Financial transaction system with retirement saving benefit |
US6493428B1 (en) | 1998-08-18 | 2002-12-10 | Siemens Information & Communication Networks, Inc | Text-enhanced voice menu system |
US6499013B1 (en) | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
US6434524B1 (en) | 1998-09-09 | 2002-08-13 | One Voice Technologies, Inc. | Object interactive user interface using speech recognition and natural language processing |
US6792082B1 (en) | 1998-09-11 | 2004-09-14 | Comverse Ltd. | Voice mail system with personal assistant provisioning |
US6266637B1 (en) | 1998-09-11 | 2001-07-24 | International Business Machines Corporation | Phrase splicing and variable substitution using a trainable speech synthesizer |
DE19841541B4 (de) | 1998-09-11 | 2007-12-06 | Püllen, Rainer | Subscriber unit for a multimedia service |
US6317831B1 (en) | 1998-09-21 | 2001-11-13 | Openwave Systems Inc. | Method and apparatus for establishing a secure connection over a one-way data path |
US6275824B1 (en) | 1998-10-02 | 2001-08-14 | Ncr Corporation | System and method for managing data privacy in a database management system |
CN1160700C (zh) | 1998-10-02 | 2004-08-04 | 国际商业机器公司 | System and method for providing network coordinated conversational services |
US7003463B1 (en) * | 1998-10-02 | 2006-02-21 | International Business Machines Corporation | System and method for providing network coordinated conversational services |
US6360237B1 (en) | 1998-10-05 | 2002-03-19 | Lernout & Hauspie Speech Products N.V. | Method and system for performing text edits during audio recording playback |
GB9821969D0 (en) | 1998-10-08 | 1998-12-02 | Canon Kk | Apparatus and method for processing natural language |
US6928614B1 (en) | 1998-10-13 | 2005-08-09 | Visteon Global Technologies, Inc. | Mobile office with speech recognition |
US6453292B2 (en) | 1998-10-28 | 2002-09-17 | International Business Machines Corporation | Command boundary identifier for conversational natural language |
US6208971B1 (en) | 1998-10-30 | 2001-03-27 | Apple Computer, Inc. | Method and apparatus for command recognition using data-driven semantic inference |
US6321092B1 (en) | 1998-11-03 | 2001-11-20 | Signal Soft Corporation | Multiple input data management for wireless location-based applications |
US6446076B1 (en) | 1998-11-12 | 2002-09-03 | Accenture Llp. | Voice interactive web-based agent system responsive to a user location for prioritizing and formatting information |
US6606599B2 (en) | 1998-12-23 | 2003-08-12 | Interactive Speech Technologies, Llc | Method for integrating computing processes with an interface controlled by voice actuated grammars |
JP2002530703A (ja) | 1998-11-13 | 2002-09-17 | ルノー・アンド・オスピー・スピーチ・プロダクツ・ナームローゼ・ベンノートシャープ | Speech synthesis using concatenation of speech waveforms |
US6246981B1 (en) | 1998-11-25 | 2001-06-12 | International Business Machines Corporation | Natural language task-oriented dialog manager and method |
US7082397B2 (en) | 1998-12-01 | 2006-07-25 | Nuance Communications, Inc. | System for and method of creating and browsing a voice web |
US6260024B1 (en) | 1998-12-02 | 2001-07-10 | Gary Shkedy | Method and apparatus for facilitating buyer-driven purchase orders on a commercial network system |
US7881936B2 (en) | 1998-12-04 | 2011-02-01 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US6317707B1 (en) | 1998-12-07 | 2001-11-13 | At&T Corp. | Automatic clustering of tokens from a corpus for grammar acquisition |
US6308149B1 (en) | 1998-12-16 | 2001-10-23 | Xerox Corporation | Grouping words with equivalent substrings by automatic clustering based on suffix relationships |
US6523172B1 (en) | 1998-12-17 | 2003-02-18 | Evolutionary Technologies International, Inc. | Parser translator system and method |
US6460029B1 (en) | 1998-12-23 | 2002-10-01 | Microsoft Corporation | System for improving search text |
US6851115B1 (en) | 1999-01-05 | 2005-02-01 | Sri International | Software-based architecture for communication and cooperation among distributed electronic agents |
US6523061B1 (en) * | 1999-01-05 | 2003-02-18 | Sri International, Inc. | System, method, and article of manufacture for agent-based navigation in a speech-based data navigation system |
US7036128B1 (en) | 1999-01-05 | 2006-04-25 | Sri International Offices | Using a community of distributed electronic agents to support a highly mobile, ambient computing environment |
US6742021B1 (en) | 1999-01-05 | 2004-05-25 | Sri International, Inc. | Navigating network-based electronic information using spoken input with multimodal error feedback |
US6757718B1 (en) | 1999-01-05 | 2004-06-29 | Sri International | Mobile navigation of network-based electronic information using spoken input |
US6513063B1 (en) * | 1999-01-05 | 2003-01-28 | Sri International | Accessing network-based electronic information through scripted online interfaces using spoken input |
US7152070B1 (en) | 1999-01-08 | 2006-12-19 | The Regents Of The University Of California | System and method for integrating and accessing multiple data sources within a data warehouse architecture |
US6505183B1 (en) | 1999-02-04 | 2003-01-07 | Authoria, Inc. | Human resource knowledge modeling and delivery system |
JP3629384B2 (ja) * | 1999-06-29 | 2005-03-16 | シャープ株式会社 | Information selection apparatus and recording medium |
US6983251B1 (en) * | 1999-02-15 | 2006-01-03 | Sharp Kabushiki Kaisha | Information selection apparatus selecting desired information from plurality of audio information by mainly using audio |
US6317718B1 (en) | 1999-02-26 | 2001-11-13 | Accenture Properties (2) B.V. | System, method and article of manufacture for location-based filtering for shopping agent in the physical world |
GB9904662D0 (en) | 1999-03-01 | 1999-04-21 | Canon Kk | Natural language search method and apparatus |
US20020013852A1 (en) * | 2000-03-03 | 2002-01-31 | Craig Janik | System for providing content, management, and interactivity for thin client devices |
US6356905B1 (en) | 1999-03-05 | 2002-03-12 | Accenture Llp | System, method and article of manufacture for mobile communication utilizing an interface support framework |
US6928404B1 (en) | 1999-03-17 | 2005-08-09 | International Business Machines Corporation | System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies |
US6584464B1 (en) | 1999-03-19 | 2003-06-24 | Ask Jeeves, Inc. | Grammar template query system |
WO2000058946A1 (en) | 1999-03-26 | 2000-10-05 | Koninklijke Philips Electronics N.V. | Client-server speech recognition |
US6356854B1 (en) | 1999-04-05 | 2002-03-12 | Delphi Technologies, Inc. | Holographic object position and type sensing system and method |
US6631346B1 (en) | 1999-04-07 | 2003-10-07 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for natural language parsing using multiple passes and tags |
WO2000060435A2 (en) | 1999-04-07 | 2000-10-12 | Rensselaer Polytechnic Institute | System and method for accessing personal information |
US6647260B2 (en) | 1999-04-09 | 2003-11-11 | Openwave Systems Inc. | Method and system facilitating web based provisioning of two-way mobile communications devices |
US6924828B1 (en) | 1999-04-27 | 2005-08-02 | Surfnotes | Method and apparatus for improved information representation |
US6697780B1 (en) * | 1999-04-30 | 2004-02-24 | At&T Corp. | Method and apparatus for rapid acoustic unit selection from a large speech corpus |
US6741264B1 (en) | 1999-05-11 | 2004-05-25 | Gific Corporation | Method of generating an audible indication of data stored in a database |
US20020032564A1 (en) | 2000-04-19 | 2002-03-14 | Farzad Ehsani | Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface |
AU5451800A (en) | 1999-05-28 | 2000-12-18 | Sehda, Inc. | Phrase-based dialogue modeling with particular application to creating recognition grammars for voice-controlled user interfaces |
US6728675B1 (en) | 1999-06-03 | 2004-04-27 | International Business Machines Corporation | Data processor controlled display system with audio identifiers for overlapping windows in an interactive graphical user interface |
US6931384B1 (en) | 1999-06-04 | 2005-08-16 | Microsoft Corporation | System and method providing utility-based decision making about clarification dialog given communicative uncertainty |
US6598039B1 (en) | 1999-06-08 | 2003-07-22 | Albert-Inc. S.A. | Natural language interface for searching database |
US8065155B1 (en) | 1999-06-10 | 2011-11-22 | Gazdzinski Robert F | Adaptive advertising apparatus and methods |
US7093693B1 (en) | 1999-06-10 | 2006-08-22 | Gazdzinski Robert F | Elevator access control system and method |
US7711565B1 (en) | 1999-06-10 | 2010-05-04 | Gazdzinski Robert F | “Smart” elevator system and method |
US6615175B1 (en) * | 1999-06-10 | 2003-09-02 | Robert F. Gazdzinski | “Smart” elevator system and method |
US6711585B1 (en) | 1999-06-15 | 2004-03-23 | Kanisa Inc. | System and method for implementing a knowledge management system |
JP3361291B2 (ja) | 1999-07-23 | 2003-01-07 | コナミ株式会社 | Speech synthesis method, speech synthesis apparatus, and computer-readable medium recording a speech synthesis program |
US6421672B1 (en) | 1999-07-27 | 2002-07-16 | Verizon Services Corp. | Apparatus for and method of disambiguation of directory listing searches utilizing multiple selectable secondary search keys |
JP2001056233A (ja) * | 1999-08-17 | 2001-02-27 | Arex:Kk | In-vehicle voice information service device and voice information service system using the device |
EP1079387A3 (en) | 1999-08-26 | 2003-07-09 | Matsushita Electric Industrial Co., Ltd. | Mechanism for storing information about recorded television broadcasts |
US6601234B1 (en) | 1999-08-31 | 2003-07-29 | Accenture Llp | Attribute dictionary in a business logic services environment |
US6697824B1 (en) | 1999-08-31 | 2004-02-24 | Accenture Llp | Relationship management in an E-commerce application framework |
US6912499B1 (en) | 1999-08-31 | 2005-06-28 | Nortel Networks Limited | Method and apparatus for training a multilingual speech model set |
GB2353927B (en) * | 1999-09-06 | 2004-02-11 | Nokia Mobile Phones Ltd | User interface for text to speech conversion |
US7127403B1 (en) | 1999-09-13 | 2006-10-24 | Microstrategy, Inc. | System and method for personalizing an interactive voice broadcast of a voice service based on particulars of a request |
US6601026B2 (en) | 1999-09-17 | 2003-07-29 | Discern Communications, Inc. | Information retrieval by natural language querying |
US6505175B1 (en) | 1999-10-06 | 2003-01-07 | Goldman, Sachs & Co. | Order centric tracking system |
US6625583B1 (en) | 1999-10-06 | 2003-09-23 | Goldman, Sachs & Co. | Handheld trading system interface |
US7020685B1 (en) | 1999-10-08 | 2006-03-28 | Openwave Systems Inc. | Method and apparatus for providing internet content to SMS-based wireless devices |
US7219123B1 (en) | 1999-10-08 | 2007-05-15 | At Road, Inc. | Portable browser device with adaptive personalization capability |
CA2748396A1 (en) | 1999-10-19 | 2001-04-26 | Sony Electronics Inc. | Natural language interface control system |
US6807574B1 (en) | 1999-10-22 | 2004-10-19 | Tellme Networks, Inc. | Method and apparatus for content personalization over a telephone interface |
JP2001125896A (ja) | 1999-10-26 | 2001-05-11 | Victor Co Of Japan Ltd | Natural language dialogue system |
US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US6615172B1 (en) | 1999-11-12 | 2003-09-02 | Phoenix Solutions, Inc. | Intelligent query engine for processing voice based queries |
US9076448B2 (en) * | 1999-11-12 | 2015-07-07 | Nuance Communications, Inc. | Distributed real time speech recognition system |
US7050977B1 (en) | 1999-11-12 | 2006-05-23 | Phoenix Solutions, Inc. | Speech-enabled server for internet website and method |
US6665640B1 (en) | 1999-11-12 | 2003-12-16 | Phoenix Solutions, Inc. | Interactive speech based learning/training system formulating search queries based on natural language parsing of recognized user queries |
US7725307B2 (en) | 1999-11-12 | 2010-05-25 | Phoenix Solutions, Inc. | Query engine for processing voice based queries including semantic decoding |
US6633846B1 (en) | 1999-11-12 | 2003-10-14 | Phoenix Solutions, Inc. | Distributed realtime speech recognition system |
US6532446B1 (en) | 1999-11-24 | 2003-03-11 | Openwave Systems Inc. | Server based speech recognition user interface for wireless devices |
US6526382B1 (en) | 1999-12-07 | 2003-02-25 | Comverse, Inc. | Language-oriented user interfaces for voice activated services |
US20010030660A1 (en) | 1999-12-10 | 2001-10-18 | Roustem Zainoulline | Interactive graphical user interface and method for previewing media products |
US6978127B1 (en) | 1999-12-16 | 2005-12-20 | Koninklijke Philips Electronics N.V. | Hand-ear user interface for hand-held device |
US6526395B1 (en) * | 1999-12-31 | 2003-02-25 | Intel Corporation | Application of personality models and interaction with synthetic characters in a computing system |
US6556983B1 (en) | 2000-01-12 | 2003-04-29 | Microsoft Corporation | Methods and apparatus for finding semantic information, such as usage logs, similar to a query using a pattern lattice data space |
US6546388B1 (en) | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6701294B1 (en) | 2000-01-19 | 2004-03-02 | Lucent Technologies, Inc. | User interface for translating natural language inquiries into database queries and data presentations |
US6829603B1 (en) | 2000-02-02 | 2004-12-07 | International Business Machines Corp. | System, method and program product for interactive natural dialog |
US6895558B1 (en) | 2000-02-11 | 2005-05-17 | Microsoft Corporation | Multi-access mode electronic personal assistant |
US6640098B1 (en) | 2000-02-14 | 2003-10-28 | Action Engine Corporation | System for obtaining service-related information for local interactive wireless devices |
US6760754B1 (en) | 2000-02-22 | 2004-07-06 | At&T Corp. | System, method and apparatus for communicating via sound messages and personal sound identifiers |
WO2001063382A2 (en) | 2000-02-25 | 2001-08-30 | Synquiry Technologies, Ltd. | Conceptual factoring and unification of graphs representing semantic models |
US6720980B1 (en) | 2000-03-01 | 2004-04-13 | Microsoft Corporation | Method and system for embedding voice notes |
US6519566B1 (en) * | 2000-03-01 | 2003-02-11 | International Business Machines Corporation | Method for hands-free operation of a pointer |
US6449620B1 (en) | 2000-03-02 | 2002-09-10 | Nimble Technology, Inc. | Method and apparatus for generating information pages using semi-structured data stored in a structured manner |
US6895380B2 (en) | 2000-03-02 | 2005-05-17 | Electro Standards Laboratories | Voice actuation with contextual learning for intelligent machine control |
US6466654B1 (en) | 2000-03-06 | 2002-10-15 | Avaya Technology Corp. | Personal virtual assistant with semantic tagging |
US6757362B1 (en) | 2000-03-06 | 2004-06-29 | Avaya Technology Corp. | Personal virtual assistant |
EP1275042A2 (en) | 2000-03-06 | 2003-01-15 | Kanisa Inc. | A system and method for providing an intelligent multi-step dialog with a user |
US6477488B1 (en) | 2000-03-10 | 2002-11-05 | Apple Computer, Inc. | Method for dynamic context scope selection in hybrid n-gram+LSA language modeling |
US6615220B1 (en) | 2000-03-14 | 2003-09-02 | Oracle International Corporation | Method and mechanism for data consolidation |
US6510417B1 (en) | 2000-03-21 | 2003-01-21 | America Online, Inc. | System and method for voice access to internet-based information |
GB2366009B (en) | 2000-03-22 | 2004-07-21 | Canon Kk | Natural language machine interface |
JP3728172B2 (ja) | 2000-03-31 | 2005-12-21 | キヤノン株式会社 | Speech synthesis method and apparatus |
NL1014847C1 (nl) | 2000-04-05 | 2001-10-08 | Minos B V I O | Data transfer |
US7177798B2 (en) * | 2000-04-07 | 2007-02-13 | Rensselaer Polytechnic Institute | Natural language interface using constrained intermediate dictionary of results |
US6917373B2 (en) | 2000-12-28 | 2005-07-12 | Microsoft Corporation | Context sensitive labels for an electronic device |
US6810379B1 (en) | 2000-04-24 | 2004-10-26 | Sensory, Inc. | Client/server architecture for text-to-speech synthesis |
US7818691B2 (en) | 2000-05-11 | 2010-10-19 | Nes Stewart Irvine | Zeroclick |
KR100867760B1 (ko) | 2000-05-15 | 2008-11-10 | 소니 가부시끼 가이샤 | Reproducing apparatus, reproducing method, and recording medium |
US6754504B1 (en) | 2000-06-10 | 2004-06-22 | Motorola, Inc. | Method and apparatus for controlling environmental conditions using a personal area network |
US6684187B1 (en) * | 2000-06-30 | 2004-01-27 | At&T Corp. | Method and system for preselection of suitable units for concatenative speech |
US6691111B2 (en) * | 2000-06-30 | 2004-02-10 | Research In Motion Limited | System and method for implementing a natural language user interface |
US6505158B1 (en) * | 2000-07-05 | 2003-01-07 | At&T Corp. | Synthesis-based pre-selection of suitable units for concatenative speech |
JP3949356B2 (ja) | 2000-07-12 | 2007-07-25 | 三菱電機株式会社 | Spoken dialogue system |
US7139709B2 (en) | 2000-07-20 | 2006-11-21 | Microsoft Corporation | Middleware layer between speech related applications and engines |
JP2002041276A (ja) | 2000-07-24 | 2002-02-08 | Sony Corp | Interactive operation support system, interactive operation support method, and storage medium |
US20060143007A1 (en) | 2000-07-24 | 2006-06-29 | Koh V E | User interaction with voice information services |
KR20020009276A (ko) * | 2000-07-25 | 2002-02-01 | 구자홍 | Mobile communication terminal with music playback function and method for providing music files to a mobile communication terminal |
US7092928B1 (en) | 2000-07-31 | 2006-08-15 | Quantum Leap Research, Inc. | Intelligent portal engine |
US6778951B1 (en) | 2000-08-09 | 2004-08-17 | Concerto Software, Inc. | Information retrieval method with natural language interface |
US20020052747A1 (en) * | 2000-08-21 | 2002-05-02 | Sarukkai Ramesh R. | Method and system of interpreting and presenting web content using a voice browser |
US6766320B1 (en) | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
DE10042944C2 (de) | 2000-08-31 | 2003-03-13 | Siemens Ag | Grapheme-phoneme conversion |
US6556971B1 (en) | 2000-09-01 | 2003-04-29 | Snap-On Technologies, Inc. | Computer-implemented speech recognition system training |
US7058569B2 (en) | 2000-09-15 | 2006-06-06 | Nuance Communications, Inc. | Fast waveform synchronization for concentration and time-scale modification of speech |
US7216080B2 (en) | 2000-09-29 | 2007-05-08 | Mindfabric Holdings Llc | Natural-language voice-activated personal assistant |
US6947728B2 (en) | 2000-10-13 | 2005-09-20 | Matsushita Electric Industrial Co., Ltd. | Mobile phone with music reproduction function, music data reproduction method by mobile phone with music reproduction function, and the program thereof |
US20020046315A1 (en) | 2000-10-13 | 2002-04-18 | Interactive Objects, Inc. | System and method for mapping interface functionality to codec functionality in a portable audio device |
US6832194B1 (en) | 2000-10-26 | 2004-12-14 | Sensory, Incorporated | Audio recognition peripheral system |
US7027974B1 (en) | 2000-10-27 | 2006-04-11 | Science Applications International Corporation | Ontology-based parser for natural language processing |
US7006969B2 (en) | 2000-11-02 | 2006-02-28 | At&T Corp. | System and method of pattern recognition in very high-dimensional space |
EP1346344A1 (en) | 2000-12-18 | 2003-09-24 | Koninklijke Philips Electronics N.V. | Store speech, select vocabulary to recognize word |
CN1537300A (zh) | 2000-12-22 | 2004-10-13 | Communication system | |
US6937986B2 (en) | 2000-12-28 | 2005-08-30 | Comverse, Inc. | Automatic dynamic speech recognition vocabulary based on external sources of information |
CA2400366C (en) | 2000-12-29 | 2008-10-07 | General Electric Company | Method and system for identifying repeatedly malfunctioning equipment |
US6731312B2 (en) | 2001-01-08 | 2004-05-04 | Apple Computer, Inc. | Media player interface |
US7257537B2 (en) * | 2001-01-12 | 2007-08-14 | International Business Machines Corporation | Method and apparatus for performing dialog management in a computer conversational interface |
US7149319B2 (en) | 2001-01-23 | 2006-12-12 | Phonak Ag | Telecommunication system, speech recognizer, and terminal, and method for adjusting capacity for vocal commanding |
GB2374772B (en) | 2001-01-29 | 2004-12-29 | Hewlett Packard Co | Audio user interface |
US6964023B2 (en) | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
US7290039B1 (en) | 2001-02-27 | 2007-10-30 | Microsoft Corporation | Intent based processing |
US6721728B2 (en) | 2001-03-02 | 2004-04-13 | The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration | System, method and apparatus for discovering phrases in a database |
US7000189B2 (en) * | 2001-03-08 | 2006-02-14 | International Business Machines Corporation | Dynamic data generation suitable for talking browser |
AU2002237495A1 (en) | 2001-03-13 | 2002-09-24 | Intelligate Ltd. | Dynamic natural language understanding |
US6448485B1 (en) | 2001-03-16 | 2002-09-10 | Intel Corporation | Method and system for embedding audio titles |
US7058889B2 (en) | 2001-03-23 | 2006-06-06 | Koninklijke Philips Electronics N.V. | Synchronizing text/visual information with audio playback |
US6738743B2 (en) * | 2001-03-28 | 2004-05-18 | Intel Corporation | Unified client-server distributed architectures for spoken dialogue systems |
US6834264B2 (en) | 2001-03-29 | 2004-12-21 | Provox Technologies Corporation | Method and apparatus for voice dictation and document production |
US6996531B2 (en) * | 2001-03-30 | 2006-02-07 | Comverse Ltd. | Automated database assistance using a telephone for a speech based or text based multimedia communication mode |
US6654740B2 (en) | 2001-05-08 | 2003-11-25 | Sunflare Co., Ltd. | Probabilistic information retrieval based on differential latent semantic space |
US7085722B2 (en) | 2001-05-14 | 2006-08-01 | Sony Computer Entertainment America Inc. | System and method for menu-driven voice control of characters in a game environment |
US6944594B2 (en) | 2001-05-30 | 2005-09-13 | Bellsouth Intellectual Property Corporation | Multi-context conversational environment system and method |
US20020194003A1 (en) * | 2001-06-05 | 2002-12-19 | Mozer Todd F. | Client-server security system and method |
US20020198714A1 (en) | 2001-06-26 | 2002-12-26 | Guojun Zhou | Statistical spoken dialog system |
US7139722B2 (en) | 2001-06-27 | 2006-11-21 | Bellsouth Intellectual Property Corporation | Location and time sensitive wireless calendaring |
US7752546B2 (en) * | 2001-06-29 | 2010-07-06 | Thomson Licensing | Method and system for providing an acoustic interface |
US20030020760A1 (en) * | 2001-07-06 | 2003-01-30 | Kazunori Takatsu | Method for setting a function and a setting item by selectively specifying a position in a tree-structured menu |
US6604059B2 (en) | 2001-07-10 | 2003-08-05 | Koninklijke Philips Electronics N.V. | Predictive calendar |
US7987151B2 (en) | 2001-08-10 | 2011-07-26 | General Dynamics Advanced Info Systems, Inc. | Apparatus and method for problem solving using intelligent agents |
US6813491B1 (en) | 2001-08-31 | 2004-11-02 | Openwave Systems Inc. | Method and apparatus for adapting settings of wireless communication devices in accordance with user proximity |
US6892083B2 (en) | 2001-09-05 | 2005-05-10 | Vocera Communications Inc. | Voice-controlled wireless communications system and method |
US7010581B2 (en) | 2001-09-24 | 2006-03-07 | International Business Machines Corporation | Method and system for providing browser functions on a web page for client-specific accessibility |
US7403938B2 (en) | 2001-09-24 | 2008-07-22 | Iac Search & Media, Inc. | Natural language query processing |
US20050196732A1 (en) | 2001-09-26 | 2005-09-08 | Scientific Learning Corporation | Method and apparatus for automated training of language learning skills |
US6985865B1 (en) * | 2001-09-26 | 2006-01-10 | Sprint Spectrum L.P. | Method and system for enhanced response to voice commands in a voice command platform |
US6650735B2 (en) | 2001-09-27 | 2003-11-18 | Microsoft Corporation | Integrated voice access to a variety of personal information services |
US7324947B2 (en) | 2001-10-03 | 2008-01-29 | Promptu Systems Corporation | Global speech user interface |
US7027990B2 (en) | 2001-10-12 | 2006-04-11 | Lester Sussman | System and method for integrating the visual display of text menus for interactive voice response systems |
US7167832B2 (en) | 2001-10-15 | 2007-01-23 | At&T Corp. | Method for dialog management |
US20030167318A1 (en) | 2001-10-22 | 2003-09-04 | Apple Computer, Inc. | Intelligent synchronization of media player with host computer |
GB2387001B (en) | 2001-10-22 | 2005-02-02 | Apple Computer | Intelligent interaction between media player and host computer |
GB2381409B (en) | 2001-10-27 | 2004-04-28 | Hewlett Packard Ltd | Asynchronous access to synchronous voice services |
EP1311102A1 (en) | 2001-11-08 | 2003-05-14 | Hewlett-Packard Company | Streaming audio under voice control |
NO316480B1 (no) | 2001-11-15 | 2004-01-26 | Forinnova As | Method and system for textual examination and discovery |
US6996777B2 (en) | 2001-11-29 | 2006-02-07 | Nokia Corporation | Method and apparatus for presenting auditory icons in a mobile terminal |
TW541517B (en) | 2001-12-25 | 2003-07-11 | Univ Nat Cheng Kung | Speech recognition system |
US20030144846A1 (en) * | 2002-01-31 | 2003-07-31 | Denenberg Lawrence A. | Method and system for modifying the behavior of an application based upon the application's grammar |
US20030158737A1 (en) | 2002-02-15 | 2003-08-21 | Csicsatka Tibor George | Method and apparatus for incorporating additional audio information into audio data file identifying information |
US20030167335A1 (en) * | 2002-03-04 | 2003-09-04 | Vigilos, Inc. | System and method for network-based communication |
JP4039086B2 (ja) * | 2002-03-05 | 2008-01-30 | ソニー株式会社 | Information processing apparatus, information processing method, information processing system, recording medium, and program |
US7197460B1 (en) | 2002-04-23 | 2007-03-27 | At&T Corp. | System for handling frequently asked questions in a natural language dialog service |
US6847966B1 (en) * | 2002-04-24 | 2005-01-25 | Engenium Corporation | Method and system for optimally searching a document database using a representative semantic space |
US7546382B2 (en) | 2002-05-28 | 2009-06-09 | International Business Machines Corporation | Methods and systems for authoring of mixed-initiative multi-modal interactions and related browsing mechanisms |
JP4013949B2 (ja) * | 2002-05-31 | 2007-11-28 | オンキヨー株式会社 | Network-type content playback system |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US6999066B2 (en) * | 2002-06-24 | 2006-02-14 | Xerox Corporation | System for audible feedback for touch screen displays |
US7233790B2 (en) | 2002-06-28 | 2007-06-19 | Openwave Systems, Inc. | Device capability based discovery, packaging and provisioning of content for wireless mobile devices |
US7299033B2 (en) | 2002-06-28 | 2007-11-20 | Openwave Systems Inc. | Domain-based management of distribution of digital content from multiple suppliers to multiple wireless services subscribers |
US7693720B2 (en) | 2002-07-15 | 2010-04-06 | Voicebox Technologies, Inc. | Mobile systems and methods for responding to natural language speech utterance |
AU2003274902A1 (en) | 2002-07-25 | 2004-02-16 | Sharp Laboratories Of America, Inc. | Aural user interface |
US7166791B2 (en) * | 2002-07-30 | 2007-01-23 | Apple Computer, Inc. | Graphical user interface and methods of use thereof in a multimedia player |
US7103157B2 (en) * | 2002-09-17 | 2006-09-05 | International Business Machines Corporation | Audio quality when streaming audio to non-streaming telephony devices |
JP2004117905A (ja) * | 2002-09-26 | 2004-04-15 | Fujitsu Ltd | Information access device and method using voice |
US20040061717A1 (en) | 2002-09-30 | 2004-04-01 | Menon Rama R. | Mechanism for voice-enabling legacy internet content for use with multi-modal browsers |
US7467087B1 (en) | 2002-10-10 | 2008-12-16 | Gillick Laurence S | Training and using pronunciation guessers in speech recognition |
US7054888B2 (en) | 2002-10-16 | 2006-05-30 | Microsoft Corporation | Optimizing media player memory during rendering |
US20040218451A1 (en) | 2002-11-05 | 2004-11-04 | Said Joe P. | Accessible user interface and navigation system and method |
AU2003293071A1 (en) | 2002-11-22 | 2004-06-18 | Roy Rosser | Autonomous response engine |
US7684985B2 (en) | 2002-12-10 | 2010-03-23 | Richard Dominach | Techniques for disambiguating speech input using multimodal interfaces |
US7386449B2 (en) | 2002-12-11 | 2008-06-10 | Voice Enabling Systems Technology Inc. | Knowledge-based flexible natural speech dialogue system |
GB2396927A (en) | 2002-12-30 | 2004-07-07 | Digital Fidelity Ltd | Media file distribution system |
US7956766B2 (en) | 2003-01-06 | 2011-06-07 | Panasonic Corporation | Apparatus operating system |
US7529671B2 (en) | 2003-03-04 | 2009-05-05 | Microsoft Corporation | Block synchronous decoding |
US6980949B2 (en) | 2003-03-14 | 2005-12-27 | Sonum Technologies, Inc. | Natural language processor |
US7496498B2 (en) * | 2003-03-24 | 2009-02-24 | Microsoft Corporation | Front-end architecture for a multi-lingual text-to-speech system |
EP1465047A1 (en) * | 2003-04-03 | 2004-10-06 | Deutsche Thomson-Brandt Gmbh | Method for presenting menu buttons |
US6728729B1 (en) | 2003-04-25 | 2004-04-27 | Apple Computer, Inc. | Accessing media across networks |
US7421393B1 (en) | 2004-03-01 | 2008-09-02 | At&T Corp. | System for developing a dialog manager using modular spoken-dialog components |
US20050045373A1 (en) | 2003-05-27 | 2005-03-03 | Joseph Born | Portable media device with audio prompt menu |
US7200559B2 (en) | 2003-05-29 | 2007-04-03 | Microsoft Corporation | Semantic object synchronous understanding implemented with speech application language tags |
US7720683B1 (en) | 2003-06-13 | 2010-05-18 | Sensory, Inc. | Method and apparatus of specifying and performing speech recognition operations |
US20060277058A1 (en) | 2003-07-07 | 2006-12-07 | J Maev Jack I | Method and apparatus for providing aftermarket service for a product |
US7757173B2 (en) * | 2003-07-18 | 2010-07-13 | Apple Inc. | Voice menu system |
EP1653361A4 (en) * | 2003-08-08 | 2006-12-13 | Onkyo Kk | NETWORK AV SYSTEM |
US7475010B2 (en) * | 2003-09-03 | 2009-01-06 | Lingospot, Inc. | Adaptive and scalable method for resolving natural language ambiguities |
US7418392B1 (en) | 2003-09-25 | 2008-08-26 | Sensory, Inc. | System and method for controlling the operation of a device by voice commands |
US7155706B2 (en) | 2003-10-24 | 2006-12-26 | Microsoft Corporation | Administrative tool environment |
US20050102625A1 (en) | 2003-11-07 | 2005-05-12 | Lee Yong C. | Audio tag retrieval system and method |
US7584092B2 (en) | 2004-11-15 | 2009-09-01 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7412385B2 (en) | 2003-11-12 | 2008-08-12 | Microsoft Corporation | System for identifying paraphrases using machine translation |
US8055713B2 (en) | 2003-11-17 | 2011-11-08 | Hewlett-Packard Development Company, L.P. | Email application with user voice interface |
US7447630B2 (en) | 2003-11-26 | 2008-11-04 | Microsoft Corporation | Method and apparatus for multi-sensory speech enhancement |
JP4533845B2 (ja) | 2003-12-05 | 2010-09-01 | 株式会社ケンウッド | Audio device control device, audio device control method, and program |
ES2312851T3 (es) | 2003-12-16 | 2009-03-01 | Loquendo Spa | Text-to-speech method and system, and the associated computer program |
US7427024B1 (en) | 2003-12-17 | 2008-09-23 | Gazdzinski Mark J | Chattel management apparatus and methods |
US7552055B2 (en) | 2004-01-10 | 2009-06-23 | Microsoft Corporation | Dialog component re-use in recognition systems |
EP1704558B8 (en) | 2004-01-16 | 2011-09-21 | Nuance Communications, Inc. | Corpus-based speech synthesis based on segment recombination |
US20050165607A1 (en) | 2004-01-22 | 2005-07-28 | At&T Corp. | System and method to disambiguate and clarify user intention in a spoken dialog system |
ATE415684T1 (de) | 2004-01-29 | 2008-12-15 | Harman Becker Automotive Sys | Method and system for a speech dialogue interface |
KR100462292B1 (ko) | 2004-02-26 | 2004-12-17 | 엔에이치엔(주) | Method and system for providing a search result list reflecting importance information |
US7693715B2 (en) | 2004-03-10 | 2010-04-06 | Microsoft Corporation | Generating large units of graphonemes with mutual information criterion for letter to sound conversion |
US7409337B1 (en) | 2004-03-30 | 2008-08-05 | Microsoft Corporation | Natural language processing interface |
US7496512B2 (en) * | 2004-04-13 | 2009-02-24 | Microsoft Corporation | Refining of segmental boundaries in speech waveforms using contextual-dependent models |
JP2005311864A (ja) * | 2004-04-23 | 2005-11-04 | Toshiba Corp | Home appliance, adapter device, and home appliance system |
US8095364B2 (en) * | 2004-06-02 | 2012-01-10 | Tegic Communications, Inc. | Multimodal disambiguation of speech recognition |
US7720674B2 (en) | 2004-06-29 | 2010-05-18 | Sap Ag | Systems and methods for processing natural language queries |
TWI252049B (en) * | 2004-07-23 | 2006-03-21 | Inventec Corp | Sound control system and method |
US7725318B2 (en) | 2004-07-30 | 2010-05-25 | Nice Systems Inc. | System and method for improving the accuracy of audio searching |
US7853574B2 (en) | 2004-08-26 | 2010-12-14 | International Business Machines Corporation | Method of generating a context-inferenced search query and of sorting a result of the query |
US7716056B2 (en) | 2004-09-27 | 2010-05-11 | Robert Bosch Corporation | Method and system for interactive conversational dialogue for cognitively overloaded device users |
US8107401B2 (en) * | 2004-09-30 | 2012-01-31 | Avaya Inc. | Method and apparatus for providing a virtual assistant to a communication participant |
US7362312B2 (en) | 2004-11-01 | 2008-04-22 | Nokia Corporation | Mobile communication terminal and method |
US7735012B2 (en) * | 2004-11-04 | 2010-06-08 | Apple Inc. | Audio user interface for computing devices |
US7552046B2 (en) | 2004-11-15 | 2009-06-23 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7546235B2 (en) | 2004-11-15 | 2009-06-09 | Microsoft Corporation | Unsupervised learning of paraphrase/translation alternations and selective application thereof |
US7702500B2 (en) | 2004-11-24 | 2010-04-20 | Blaedow Karen R | Method and apparatus for determining the meaning of natural language |
CN1609859A (zh) | 2004-11-26 | 2005-04-27 | 孙斌 | Method for clustering search results |
US7376645B2 (en) | 2004-11-29 | 2008-05-20 | The Intellection Group, Inc. | Multimodal natural language query system and architecture for processing voice and proximity-based queries |
US20060122834A1 (en) | 2004-12-03 | 2006-06-08 | Bennett Ian M | Emotion detection device & method for use in distributed systems |
US8214214B2 (en) * | 2004-12-03 | 2012-07-03 | Phoenix Solutions, Inc. | Emotion detection device and method for use in distributed systems |
US8024194B2 (en) | 2004-12-08 | 2011-09-20 | Nuance Communications, Inc. | Dynamic switching between local and remote speech rendering |
US7636657B2 (en) | 2004-12-09 | 2009-12-22 | Microsoft Corporation | Method and apparatus for automatic grammar generation from data entries |
US7873654B2 (en) * | 2005-01-24 | 2011-01-18 | The Intellection Group, Inc. | Multimodal natural language query system for processing and analyzing voice and proximity-based queries |
US7508373B2 (en) | 2005-01-28 | 2009-03-24 | Microsoft Corporation | Form factor and input method for language input |
GB0502259D0 (en) | 2005-02-03 | 2005-03-09 | British Telecomm | Document searching tool and method |
US7676026B1 (en) | 2005-03-08 | 2010-03-09 | Baxtech Asia Pte Ltd | Desktop telephony system |
US7925525B2 (en) | 2005-03-25 | 2011-04-12 | Microsoft Corporation | Smart reminders |
WO2006129967A1 (en) | 2005-05-30 | 2006-12-07 | Daumsoft, Inc. | Conversation system and method using conversational agent |
US8041570B2 (en) | 2005-05-31 | 2011-10-18 | Robert Bosch Corporation | Dialogue management using scripts |
US8024195B2 (en) | 2005-06-27 | 2011-09-20 | Sensory, Inc. | Systems and methods of performing speech recognition using historical information |
US7826945B2 (en) | 2005-07-01 | 2010-11-02 | You Zhang | Automobile speech-recognition interface |
US20070058832A1 (en) | 2005-08-05 | 2007-03-15 | Realnetworks, Inc. | Personal media device |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) * | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US8126716B2 (en) * | 2005-08-19 | 2012-02-28 | Nuance Communications, Inc. | Method and system for collecting audio prompts in a dynamically generated voice application |
US7949529B2 (en) | 2005-08-29 | 2011-05-24 | Voicebox Technologies, Inc. | Mobile systems and methods of supporting natural language human-machine interactions |
US8265939B2 (en) | 2005-08-31 | 2012-09-11 | Nuance Communications, Inc. | Hierarchical methods and apparatus for extracting user intent from spoken utterances |
EP1934971A4 (en) | 2005-08-31 | 2010-10-27 | Voicebox Technologies Inc | DYNAMIC LANGUAGE SCRIPTURE |
US8677377B2 (en) | 2005-09-08 | 2014-03-18 | Apple Inc. | Method and apparatus for building an intelligent automated assistant |
JP4908094B2 (ja) | 2005-09-30 | 2012-04-04 | 株式会社リコー | Information processing system, information processing method, and information processing program |
US7930168B2 (en) | 2005-10-04 | 2011-04-19 | Robert Bosch Gmbh | Natural language processing of disfluent sentences |
US8620667B2 (en) | 2005-10-17 | 2013-12-31 | Microsoft Corporation | Flexible speech-activated command and control |
US7707032B2 (en) | 2005-10-20 | 2010-04-27 | National Cheng Kung University | Method and system for matching speech data |
US20070106674A1 (en) | 2005-11-10 | 2007-05-10 | Purusharth Agrawal | Field sales process facilitation systems and methods |
US20070185926A1 (en) | 2005-11-28 | 2007-08-09 | Anand Prahlad | Systems and methods for classifying and transferring information in a storage network |
KR20070057496A (ko) | 2005-12-02 | 2007-06-07 | 삼성전자주식회사 | Liquid crystal display device |
KR100810500B1 (ko) | 2005-12-08 | 2008-03-07 | 한국전자통신연구원 | Method for enhancing user convenience in an interactive voice interface system |
DE102005061365A1 (de) | 2005-12-21 | 2007-06-28 | Siemens Ag | Method for controlling at least a first and a second background application via a universal speech dialogue system |
US7996228B2 (en) | 2005-12-22 | 2011-08-09 | Microsoft Corporation | Voice initiated network operations |
US7599918B2 (en) | 2005-12-29 | 2009-10-06 | Microsoft Corporation | Dynamic search with implicit user intention mining |
JP2007183864A (ja) | 2006-01-10 | 2007-07-19 | Fujitsu Ltd | File search method and system therefor |
US20070174188A1 (en) | 2006-01-25 | 2007-07-26 | Fish Robert D | Electronic marketplace that facilitates transactions between consolidated buyers and/or sellers |
IL174107A0 (en) * | 2006-02-01 | 2006-08-01 | Grois Dan | Method and system for advertising by means of a search engine over a data network |
KR100764174B1 (ko) | 2006-03-03 | 2007-10-08 | 삼성전자주식회사 | Voice dialogue service apparatus and method |
US7752152B2 (en) | 2006-03-17 | 2010-07-06 | Microsoft Corporation | Using predictive user models for language modeling on a personal device with user behavior models based on statistical modeling |
JP4734155B2 (ja) | 2006-03-24 | 2011-07-27 | 株式会社東芝 | Speech recognition device, speech recognition method, and speech recognition program |
US7707027B2 (en) | 2006-04-13 | 2010-04-27 | Nuance Communications, Inc. | Identification and rejection of meaningless input during natural language classification |
BRPI0711317B8 (pt) * | 2006-05-10 | 2021-06-22 | Koninklijke Philips Nv | Method for providing audible information from a defibrillator, and automatic external defibrillator |
EP1858005A1 (en) * | 2006-05-19 | 2007-11-21 | Texthelp Systems Limited | Streaming speech with synchronized highlighting generated by a server |
US8423347B2 (en) | 2006-06-06 | 2013-04-16 | Microsoft Corporation | Natural language personal information management |
US7483894B2 (en) * | 2006-06-07 | 2009-01-27 | Platformation Technologies, Inc | Methods and apparatus for entity search |
US7523108B2 (en) | 2006-06-07 | 2009-04-21 | Platformation, Inc. | Methods and apparatus for searching with awareness of geography and languages |
US20100257160A1 (en) | 2006-06-07 | 2010-10-07 | Yu Cao | Methods & apparatus for searching with awareness of different types of information |
KR100776800B1 (ko) | 2006-06-16 | 2007-11-19 | 한국전자통신연구원 | Method and system for providing customized services using intelligent gadgets |
US7548895B2 (en) | 2006-06-30 | 2009-06-16 | Microsoft Corporation | Communication-prompted user assistance |
EP2044804A4 (en) * | 2006-07-08 | 2013-12-18 | Personics Holdings Inc | PERSONAL HEARING AID AND METHOD |
US20080042970A1 (en) * | 2006-07-24 | 2008-02-21 | Yih-Shiuan Liang | Associating a region on a surface with a sound or with another region |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
US8073681B2 (en) | 2006-10-16 | 2011-12-06 | Voicebox Technologies, Inc. | System and method for a cooperative conversational voice user interface |
US20080129520A1 (en) | 2006-12-01 | 2008-06-05 | Apple Computer, Inc. | Electronic device with enhanced audio feedback |
WO2008085742A2 (en) | 2007-01-07 | 2008-07-17 | Apple Inc. | Portable multifunction device, method and graphical user interface for interacting with user input elements in displayed content |
KR100883657B1 (ko) | 2007-01-26 | 2009-02-18 | 삼성전자주식회사 | Method and apparatus for music search based on speech recognition |
US7818176B2 (en) | 2007-02-06 | 2010-10-19 | Voicebox Technologies, Inc. | System and method for selecting and presenting advertisements based on natural language processing of voice-based input |
US7801728B2 (en) | 2007-02-26 | 2010-09-21 | Nuance Communications, Inc. | Document session replay for multimodal applications |
US7822608B2 (en) | 2007-02-27 | 2010-10-26 | Nuance Communications, Inc. | Disambiguating a speech recognition grammar in a multimodal application |
US20080221899A1 (en) | 2007-03-07 | 2008-09-11 | Cerra Joseph P | Mobile messaging environment speech processing facility |
US7801729B2 (en) | 2007-03-13 | 2010-09-21 | Sensory, Inc. | Using multiple attributes to create a voice search playlist |
US8219406B2 (en) | 2007-03-15 | 2012-07-10 | Microsoft Corporation | Speech-centric multimodal user interface design in mobile technology |
US7809610B2 (en) | 2007-04-09 | 2010-10-05 | Platformation, Inc. | Methods and apparatus for freshness and completeness of information |
US7983915B2 (en) | 2007-04-30 | 2011-07-19 | Sonic Foundry, Inc. | Audio content search engine |
US8055708B2 (en) | 2007-06-01 | 2011-11-08 | Microsoft Corporation | Multimedia spaces |
US8204238B2 (en) | 2007-06-08 | 2012-06-19 | Sensory, Inc | Systems and methods of sonic communication |
US8190627B2 (en) * | 2007-06-28 | 2012-05-29 | Microsoft Corporation | Machine assisted query formulation |
US8019606B2 (en) * | 2007-06-29 | 2011-09-13 | Microsoft Corporation | Identification and selection of a software application via speech |
JP4424382B2 (ja) * | 2007-07-04 | 2010-03-03 | ソニー株式会社 | Content playback device and automatic content reception method |
JP2009036999A (ja) | 2007-08-01 | 2009-02-19 | Infocom Corp | Computer-based dialogue method, dialogue system, computer program, and computer-readable storage medium |
KR101359715B1 (ko) | 2007-08-24 | 2014-02-10 | 삼성전자주식회사 | Method and apparatus for providing mobile voice web |
WO2009029910A2 (en) | 2007-08-31 | 2009-03-05 | Proxpro, Inc. | Situation-aware personal information management for a mobile device |
US20090058823A1 (en) | 2007-09-04 | 2009-03-05 | Apple Inc. | Virtual Keyboards in Multi-Language Environment |
US9734465B2 (en) | 2007-09-14 | 2017-08-15 | Ricoh Co., Ltd | Distributed workflow-enabled system |
KR100920267B1 (ko) | 2007-09-17 | 2009-10-05 | 한국전자통신연구원 | Voice dialogue analysis system and method thereof |
US8706476B2 (en) | 2007-09-18 | 2014-04-22 | Ariadne Genomics, Inc. | Natural language processing method by analyzing primitive sentences, logical clauses, clause types and verbal blocks |
US8165886B1 (en) | 2007-10-04 | 2012-04-24 | Great Northern Research LLC | Speech interface system and method for control and interaction with applications on a computing system |
US8036901B2 (en) | 2007-10-05 | 2011-10-11 | Sensory, Incorporated | Systems and methods of performing speech recognition using sensory inputs of human position |
US20090112677A1 (en) | 2007-10-24 | 2009-04-30 | Rhett Randolph L | Method for automatically developing suggested optimal work schedules from unsorted group and individual task lists |
US7840447B2 (en) | 2007-10-30 | 2010-11-23 | Leonard Kleinrock | Pricing and auctioning of bundled items among multiple sellers and buyers |
US7983997B2 (en) | 2007-11-02 | 2011-07-19 | Florida Institute For Human And Machine Cognition, Inc. | Interactive complex task teaching system that allows for natural language input, recognizes a user's intent, and automatically performs tasks in document object model (DOM) nodes |
US8112280B2 (en) | 2007-11-19 | 2012-02-07 | Sensory, Inc. | Systems and methods of performing speech recognition with barge-in for use in a bluetooth system |
US8140335B2 (en) | 2007-12-11 | 2012-03-20 | Voicebox Technologies, Inc. | System and method for providing a natural language voice user interface in an integrated voice navigation services environment |
US10002189B2 (en) | 2007-12-20 | 2018-06-19 | Apple Inc. | Method and apparatus for searching using an active ontology |
US8219407B1 (en) | 2007-12-27 | 2012-07-10 | Great Northern Research, LLC | Method for processing the output of a speech recognizer |
US8099289B2 (en) * | 2008-02-13 | 2012-01-17 | Sensory, Inc. | Voice interface and search for electronic devices including bluetooth headsets and remote systems |
US8958848B2 (en) | 2008-04-08 | 2015-02-17 | Lg Electronics Inc. | Mobile terminal and menu control method thereof |
US8666824B2 (en) | 2008-04-23 | 2014-03-04 | Dell Products L.P. | Digital media content location and purchasing system |
US8285344B2 (en) | 2008-05-21 | 2012-10-09 | DP Technologies, Inc. | Method and apparatus for adjusting audio for a user environment |
US8589161B2 (en) | 2008-05-27 | 2013-11-19 | Voicebox Technologies, Inc. | System and method for an integrated, multi-modal, multi-device natural language voice services environment |
US8694355B2 (en) | 2008-05-30 | 2014-04-08 | Sri International | Method and apparatus for automated assistance with task management |
US8423288B2 (en) | 2009-11-30 | 2013-04-16 | Apple Inc. | Dynamic alerts for calendar events |
US8166019B1 (en) | 2008-07-21 | 2012-04-24 | Sprint Communications Company L.P. | Providing suggested actions in response to textual communications |
KR101005074B1 (ko) | 2008-09-18 | 2010-12-30 | 주식회사 수현테크 | Synthetic resin pipe connection fixture |
US9200913B2 (en) | 2008-10-07 | 2015-12-01 | Telecommunication Systems, Inc. | User interface for predictive traffic |
US8140328B2 (en) | 2008-12-01 | 2012-03-20 | At&T Intellectual Property I, L.P. | User intention based on N-best list of recognition hypotheses for utterances in a dialog |
US8326637B2 (en) | 2009-02-20 | 2012-12-04 | Voicebox Technologies, Inc. | System and method for processing multi-modal device interactions in a natural language voice services environment |
US8805823B2 (en) | 2009-04-14 | 2014-08-12 | Sri International | Content processing systems and methods |
US8606735B2 (en) | 2009-04-30 | 2013-12-10 | Samsung Electronics Co., Ltd. | Apparatus and method for predicting user's intention based on multimodal information |
KR101581883B1 (ko) | 2009-04-30 | 2016-01-11 | 삼성전자주식회사 | Apparatus and method for detecting voice using motion information |
US9858925B2 (en) | 2009-06-05 | 2018-01-02 | Apple Inc. | Using context information to facilitate processing of commands in a virtual assistant |
US10540976B2 (en) | 2009-06-05 | 2020-01-21 | Apple Inc. | Contextual voice commands |
US10255566B2 (en) | 2011-06-03 | 2019-04-09 | Apple Inc. | Generating and processing task items that represent tasks to perform |
KR101562792B1 (ko) | 2009-06-10 | 2015-10-23 | 삼성전자주식회사 | Apparatus and method for providing a goal prediction interface |
US8527278B2 (en) | 2009-06-29 | 2013-09-03 | Abraham Ben David | Intelligent home automation |
US20110047072A1 (en) | 2009-08-07 | 2011-02-24 | Visa U.S.A. Inc. | Systems and Methods for Propensity Analysis and Validation |
US8768313B2 (en) | 2009-08-17 | 2014-07-01 | Digimarc Corporation | Methods and systems for image or audio recognition processing |
WO2011028842A2 (en) | 2009-09-02 | 2011-03-10 | Sri International | Method and apparatus for exploiting human feedback in an intelligent automated assistant |
US8321527B2 (en) | 2009-09-10 | 2012-11-27 | Tribal Brands | System and method for tracking user location and associated activity and responsively providing mobile device updates |
KR20110036385A (ko) | 2009-10-01 | 2011-04-07 | 삼성전자주식회사 | Apparatus and method for analyzing user intention |
US20110099507A1 (en) * | 2009-10-28 | 2011-04-28 | Google Inc. | Displaying a collection of interactive elements that trigger actions directed to an item |
US9197736B2 (en) | 2009-12-31 | 2015-11-24 | Digimarc Corporation | Intuitive computing methods and systems |
US20120137367A1 (en) | 2009-11-06 | 2012-05-31 | Cataphora, Inc. | Continuous anomaly detection based on behavior modeling and heterogeneous information analysis |
US9171541B2 (en) | 2009-11-10 | 2015-10-27 | Voicebox Technologies Corporation | System and method for hybrid processing in a natural language voice services environment |
US9502025B2 (en) | 2009-11-10 | 2016-11-22 | Voicebox Technologies Corporation | System and method for providing a natural language content dedication service |
US8712759B2 (en) | 2009-11-13 | 2014-04-29 | Clausal Computing Oy | Specializing disambiguation of a natural language expression |
KR101960835B1 (ko) | 2009-11-24 | 2019-03-21 | 삼성전자주식회사 | Schedule management system using a conversational robot and method thereof |
US8396888B2 (en) * | 2009-12-04 | 2013-03-12 | Google Inc. | Location-based searching using a search area that corresponds to a geographical location of a computing device |
KR101622111B1 (ko) | 2009-12-11 | 2016-05-18 | 삼성전자 주식회사 | Dialogue system and dialogue method thereof |
US20110161309A1 (en) | 2009-12-29 | 2011-06-30 | Lx1 Technology Limited | Method Of Sorting The Result Set Of A Search Engine |
US8494852B2 (en) * | 2010-01-05 | 2013-07-23 | Google Inc. | Word-level correction of speech input |
US8334842B2 (en) | 2010-01-15 | 2012-12-18 | Microsoft Corporation | Recognizing user intent in motion capture system |
US8626511B2 (en) | 2010-01-22 | 2014-01-07 | Google Inc. | Multi-dimensional disambiguation of voice commands |
US20110218855A1 (en) | 2010-03-03 | 2011-09-08 | Platformation, Inc. | Offering Promotions Based on Query Analysis |
KR101369810B1 (ko) | 2010-04-09 | 2014-03-05 | 이초강 | Computer-readable recording medium storing a program that executes an empirical context-awareness method for a robot |
US8265928B2 (en) * | 2010-04-14 | 2012-09-11 | Google Inc. | Geotagged environmental audio for enhanced speech recognition accuracy |
US20110279368A1 (en) | 2010-05-12 | 2011-11-17 | Microsoft Corporation | Inferring user intent to engage a motion capture system |
US8694313B2 (en) * | 2010-05-19 | 2014-04-08 | Google Inc. | Disambiguation of contact information using historical data |
US8522283B2 (en) | 2010-05-20 | 2013-08-27 | Google Inc. | Television remote control data transfer |
US8468012B2 (en) * | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
US20110306426A1 (en) | 2010-06-10 | 2011-12-15 | Microsoft Corporation | Activity Participation Based On User Intent |
US8234111B2 (en) * | 2010-06-14 | 2012-07-31 | Google Inc. | Speech and noise models for speech recognition |
US8411874B2 (en) * | 2010-06-30 | 2013-04-02 | Google Inc. | Removing noise from audio |
US8775156B2 (en) | 2010-08-05 | 2014-07-08 | Google Inc. | Translating languages in response to device motion |
US8359020B2 (en) | 2010-08-06 | 2013-01-22 | Google Inc. | Automatically monitoring for voice input based on context |
US8473289B2 (en) | 2010-08-06 | 2013-06-25 | Google Inc. | Disambiguating input based on context |
CN103688279A (zh) | 2011-04-25 | 2014-03-26 | 韦韦欧股份有限公司 | System and method for an intelligent personal timetable assistant |
Date | Country | Application number | Publication | Status |
---|---|---|---|---|
2008-09-09 | US | US12/207,314 | US8898568B2 (en) | Not active: Expired - Fee Related |
2009-07-28 | KR | KR1020117005433A | KR20110038735A (ko) | Not active: Application Discontinuation |
2009-07-28 | WO | PCT/US2009/051954 | WO2010030440A1 (en) | Active: Application Filing |
2009-07-28 | EP | EP09790882.6A | EP2324416B1 (en) | Not active: Not-in-force |
2009-07-28 | CN | CN200980135356.3A | CN102150128B (zh) | Not active: Expired - Fee Related |
2009-07-28 | DE | DE112009002183T | DE112009002183T5 (de) | Not active: Withdrawn |
2009-07-28 | JP | JP2011525045A | JP5667978B2 (ja) | Not active: Expired - Fee Related |
2012-02-09 | HK | HK12101271.4A | HK1160957A1 (zh) | Not active: IP Right Cessation |
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20020169605A1 (en) * | 2001-03-09 | 2002-11-14 | Damiba Bertrand A. | System, method and computer program product for self-verifying file content in a speech recognition framework |
CN101051823A (zh) * | 2005-12-07 | 2007-10-10 | 苹果电脑有限公司 | Portable audio device providing automatic control of audio volume parameters for hearing protection |
Also Published As
Publication number | Publication date |
---|---|
US8898568B2 (en) | 2014-11-25 |
KR20110038735A (ko) | 2011-04-14 |
EP2324416A1 (en) | 2011-05-25 |
CN102150128A (zh) | 2011-08-10 |
EP2324416B1 (en) | 2016-01-13 |
DE112009002183T5 (de) | 2011-12-22 |
JP2012501035A (ja) | 2012-01-12 |
WO2010030440A1 (en) | 2010-03-18 |
US20100064218A1 (en) | 2010-03-11 |
JP5667978B2 (ja) | 2015-02-12 |
HK1160957A1 (zh) | 2012-08-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102150128B (zh) | Audio user interface | |
US7779357B2 (en) | Audio user interface for computing devices | |
US8108462B2 (en) | Information processing apparatus, information processing method, information processing program and recording medium for storing the program | |
US9824150B2 (en) | Systems and methods for providing information discovery and retrieval | |
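CN103957227B (zh) | Method and apparatus for transferring digital content from a personal computer to a mobile handset | |
CN100385371C (zh) | Reproduction device and reproduction control method | |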
US8015261B2 (en) | Information processing apparatus with first and second sending/receiving units | |
US8438485B2 (en) | System, method, and apparatus for generating, customizing, distributing, and presenting an interactive audio publication | |
US20070166683A1 (en) | Dynamic lyrics display for portable media devices | |
JP4621637B2 (ja) | Mobile terminal with a jog dial and control method thereof | |
US20070168262A1 (en) | Information processing system, information processing apparatus, information processing method, information processing program and recording medium for storing the program | |
US7870222B2 (en) | Systems and methods for transmitting content being reproduced | |
CN101796516A (zh) | Navigation system and method | |
WO2003024012A2 (en) | Dynamic content delivery responsive to user requests | |
US20070188519A1 (en) | Information processing apparatus, information processing method, information processing program and recording medium | |
US8340797B2 (en) | Method and system for generating and processing digital content based on text-to-speech conversion | |
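JP2008505536A (ja) | Method of data transmission using a portable multimedia device | |
WO2005031700A1 (ja) | Communication device, communication method, and communication program | |
CN105373585B (zh) | Method and device for adding songs to favorites | |
JP4462324B2 (ja) | Information processing device, information processing method, and program | |
KR20070066022A (ko) | Method for voice output of file information in a portable music player |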
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1160957; Country of ref document: HK |
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
REG | Reference to a national code | Ref country code: HK; Ref legal event code: GR; Ref document number: 1160957; Country of ref document: HK |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20150225; Termination date: 20210728 |
CF01 | Termination of patent right due to non-payment of annual fee |