US20080140418A1 - Method and device to distinguish between voice conversation and automated speech recognition - Google Patents
- Publication number
- US20080140418A1 (application Ser. No. 11/929,301)
- Authority
- US
- United States
- Prior art keywords
- destination
- phonation
- signal
- address
- speech recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Abandoned
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Definitions
- Voice transmission over a digital wireless network involves capturing sound waves using a microphone and converting them to electrical signals and then binary data.
- the process comprises sampling, digitizing, and other digital signal processes at the receiver unit (e.g., telematics module or cell phone.)
- the present invention provides a method and computer-based device for performing preprocessing on voice transmissions depending upon the intended transmission destination.
- the device includes a receiving component configured to receive a voice signal from a source over a network. Also included are a processing component configured to determine a destination address associated with the received signal, determine a signal processing algorithm from a plurality of signal processing algorithms based on the destination address, and process the voice signal according to the specified algorithm.
- the device further includes a delivery component configured to send the processed signal to the associated address.
- the device also includes memory configured to store addresses with an associated signal processing algorithm, wherein the processing component finds in memory a signal processing algorithm that is associated with the determined destination address.
- the device includes an alert component configured to alert the recipient that the voice signal is from a computer-based system, if the source is a computer-based system.
- the computer-based device includes four additional components used to facilitate the present invention: a first component configured to select an address for a voice transmission; a second component configured to receive a phonation inputted for the voice transmission; a third component configured to process the received phonation according to an algorithm associated with a speech recognition device (if the selected address is associated with a speech recognition device) and send the processed phonation to the selected destination; and a fourth component configured to send the received phonation to the selected destination according to a delivery method associated with human recipients (if the selected address is not associated with a speech recognition device).
- a computer-based device includes four additional components used to facilitate the present invention: a first component configured to process a phonation at a source for reception by a human recipient; a second component configured to send the processed phonation to a destination according to an address associated with the phonation; a third component configured to receive a change signal from the destination; and a fourth component configured to process a next phonation for reception by a speech recognition server according to a received change signal, and send the newly processed phonation to the destination.
- the invention provides a method and device for improving voice transmissions by performing some preprocessing on voice transmissions depending upon the intended destination, and for providing recipients with caller identification information if the transmission is computer generated.
- FIG. 1 is a diagram illustrating an exemplary system for receiving and processing voice transmission signals over a wireless network in accordance with the present invention
- FIG. 2 is a flow chart illustrating operation of the present invention
- FIG. 3 is a flow chart illustrating an alternate aspect of the present invention.
- FIGS. 4-5 are flow charts illustrating alternative aspects of the present invention.
- the present invention provides a method and device to determine the destination of a voice transmission signal and apply an appropriate data signal processing algorithm based on the determined destination.
- the same system also provides a method to distinguish between incoming calls sent from a server or from a human in order to notify the person receiving the call that an unsolicited call is being sent.
- the present invention includes a system 10 that includes a transmitter 12 and a distribution gateway 16 .
- Transmitter 12 includes a cellular or landline telephone, network phone, other communication device or a voice generation computer that generates a voice sound signal for transmission to end units (users 24 or voice recognition servers 26 ) over a network, such as a wireless network or a primarily non-wireless network (e.g., Internet).
- Distribution gateway 16 includes a processor 17 , a receiver 18 , a transmitter 19 , and a database 20 .
- Receiver 18 in distribution gateway 16 is preferably a wireless communication module capable of receiving voice and data via a wireless communication link.
- Transmitter 19 in distribution gateway 16 is preferably a wireless communication module capable of sending voice and data via a wireless communication link.
- Distribution gateway 16 is in communication with one or more user end units 24 and one or more automated speech recognition (ASR) servers 26 , either directly or over a network (not shown).
- ASR: automated speech recognition
- Processor 17 compares an address included in the voice transmission signal from transmitter 12 , such as an Internet Protocol (IP) address, a telephone number, or other method of identifying an incoming call, to a lookup table stored in database 20 .
- IP: Internet Protocol
- the processor applies one of a number of signal processing algorithms depending upon the results of the comparison. Because there is a fundamental difference between the way humans process auditory input and the way ASR servers process voice input, different algorithms are applied to the voice transmission signal to optimize the benefit for the determined destination. For example, if the destination is an ASR server 26 , the algorithm converts the transmission to digital form (if not already in digital form) and performs other digital signal processing that benefits the process the ASR server will perform.
- distribution gateway 16 receives a voice transmission signal.
- the voice transmission signal includes a human voice or voice generated by a computer.
- processor 17 looks up information in database 20 corresponding to a destination address or phone number included in the voice transmission signal.
- the database includes a table of phone numbers or a table of IP addresses of destinations (user end units 24 and servers 26 ) that are associated with each algorithm.
- An IP address is specified by the Internet Protocol and uniquely identifies a computer on the Internet.
- Processor 17 determines which algorithm (identified as the associated algorithm) to use for optimization depending upon the destination IP address of the voice transmission received. Therefore, when a call is placed to one of the numbers associated with an ASR server, the processor chooses the algorithm optimized for an ASR server. Otherwise, the processor chooses the algorithm for voice conversation at an end user unit 24 .
- processor 17 determines whether the associated destination number is an ASR server 26 . If, at block 36 , the processor determines that the associated destination number is an ASR server 26 , then, at block 38 , the processor processes the voice transmission according to the optimization algorithm for an ASR server and sends the processed voice transmission signal to the determined destination. If, at block 36 , the processor determines that the associated destination number is not an ASR server 26 , the processor processes the voice transmission signal for human auditory means according to an optimization algorithm used for producing a result best suited for a human recipient. At block 40 , the processor sends the processed voice transmission signal to the determined destination.
- FIG. 3 is a flowchart illustrating an operation of sending a signal to a human recipient at block 40 of FIG. 2 .
- processor 17 looks up, in database 20 , caller identification information related to the origin of the signal (transmitter 12 ).
- the processor determines the origin of the voice transmission signal by comparing the sending address included in the voice transmission signal to a table of stored IP addresses or phone numbers in database 20 . Thus, the processor distinguishes between incoming calls from an ASR server or other calls. If, at block 56 , the processor determines that the voice transmission signal originated from an ASR server, the logic proceeds to block 58 , where the user unit is informed that the incoming call is an ASR server generated voice transmission signal (i.e. a solicitation). If, at block 56 , the processor determines the origin is not from the ASR server, then the logic proceeds to block 60 , where caller identification information is presented to the user.
- the functions performed by distribution gateway 16 are performed at a user origination unit or transmitter 12 or at user end unit 24 .
- the user origination unit or transmitter 12 is preferably a mobile device that is implemented in a vehicle.
- the user origination unit can be a device similar to user end unit 24 .
- FIGS. 4 and 5 illustrate flowcharts that present two embodiments for performing some signal processing at the user origination unit or transmitter 12 .
- a processor at the user origination unit or transmitter 12 is defaulted to process an outbound voice signal with an algorithm optimized for delivery to an ASR server.
- the processor at the user origination unit or transmitter 12 receives voice input from the user and an address associated with the voice input. The associated address can be entered by the user or automatically generated by the processor at the user origination unit or transmitter 12 .
- the processor at the user origination unit or transmitter 12 compares the address associated with the inputted voice signal to addresses (phone numbers, IP addresses or other types of addresses) associated with ASR servers that are stored within a database at the user unit.
- when the processor determines that the associated address corresponds to a stored address, the inputted voice is processed according to the default algorithm (automated speech recognition algorithm) at block 76 . Otherwise, the logic proceeds to block 78 , where the processor processes the inputted voice using a voice communication algorithm for human recipients.
- the processor at the user origination unit or transmitter 12 is defaulted to process an outbound voice signal with an algorithm optimized for voice conversation (human recipient) (block 90 ).
- the processed voice signal is sent to the addressee associated with the voice signal.
- the ASR server sends a signal back to the user origination unit instructing the processor of the user origination unit to switch to an algorithm optimized for an ASR server.
- the unit processes the entered voice signals using an algorithm for an ASR server (block 96 ).
- the signal sent by the ASR server is preferably sent as Dual-Tone Multi-Frequency (DTMF) tones, also known as Touch-Tone, but can also be sent in other formats.
- When the voice signal is processed at the user origination unit or transmitter 12 , there may be times when the destination switches from an ASR server to a human recipient or from a human recipient to an ASR server.
- An operator system (human recipient) or ASR server informs the user origination unit when a switch has occurred. Once the user origination unit has been informed of the switch, the unit begins processing according to the new recipient.
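The origin check described above for FIG. 3 (blocks 54 to 60) can be illustrated with a short Python sketch. This is only a sketch under stated assumptions: the table contents, the function name `notify_recipient`, and the returned message strings are hypothetical and do not come from the patent, which says only that the sending address is compared to stored IP addresses or phone numbers.

```python
from typing import Dict, Set

# Hypothetical contents of database 20: addresses known to belong to
# computer-based (ASR or voice generation) systems.
KNOWN_AUTOMATED_SOURCES: Set[str] = {"192.0.2.10", "+1-800-555-0100"}

# Hypothetical caller identification records for other callers.
CALLER_ID: Dict[str, str] = {"+1-206-555-0123": "Alice Example"}


def notify_recipient(sending_address: str) -> str:
    """Return the notice shown to the human recipient.

    Mirrors decision block 56: an automated source produces an alert
    (block 58); any other source produces caller identification (block 60).
    """
    if sending_address in KNOWN_AUTOMATED_SOURCES:
        return "Automated call"
    return CALLER_ID.get(sending_address, "Unknown caller")
```

A set lookup is used here because the text only requires a membership test on the sending address; a real deployment would presumably query whatever store backs database 20.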
Abstract
A method and device for performing some preprocessing on voice transmissions depending upon the intended destination of the transmission. The device includes a receiving component configured to receive a voice signal from a source over a network. The device also includes a processing component configured to determine a destination address associated with the received signal, determine a signal processing algorithm from a plurality of signal processing algorithms based on the determined address, and process the voice signal according to the specified algorithm. The device further includes a delivery component configured to send the processed signal to the associated address.
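As an illustration only, the receive, determine, process, and deliver steps summarized in the abstract could be sketched in Python as follows. The names `VoiceSignal`, `ALGORITHM_BY_ADDRESS`, `route`, and both processing functions are hypothetical, as are the example addresses; the patent does not say what the individual algorithms do, so each is replaced here by a stand-in that merely tags the payload.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

Algorithm = Callable[[bytes], bytes]


def process_for_asr(samples: bytes) -> bytes:
    # Stand-in for processing tuned to a speech recognition server.
    return b"ASR:" + samples


def process_for_human(samples: bytes) -> bytes:
    # Stand-in for processing tuned to a human listener.
    return b"HUMAN:" + samples


@dataclass(frozen=True)
class VoiceSignal:
    source: str
    destination: str
    samples: bytes


# Hypothetical memory contents: destination address -> associated algorithm.
ALGORITHM_BY_ADDRESS: Dict[str, Algorithm] = {
    "192.0.2.10": process_for_asr,
    "+1-800-555-0100": process_for_asr,
}


def route(signal: VoiceSignal) -> Tuple[str, bytes]:
    """Select an algorithm from the destination address and apply it.

    Destinations not listed as ASR servers fall back to the algorithm
    for voice conversation. Returns (address, processed payload).
    """
    algorithm = ALGORITHM_BY_ADDRESS.get(signal.destination, process_for_human)
    return signal.destination, algorithm(signal.samples)
```

For example, `route(VoiceSignal("caller", "192.0.2.10", b"hello"))` selects the ASR stand-in, while any unlisted destination is treated as a human recipient.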
Description
- The following application is incorporated by reference as if fully set forth herein: U.S. application Ser. No. 09/884,902 filed Jun. 18, 2001
- Voice transmission over a digital wireless network involves capturing sound waves using a microphone and converting them to electrical signals and then binary data. The process comprises sampling, digitizing, and other digital signal processes at the receiver unit (e.g., telematics module or cell phone.)
- There is a fundamental difference between the way humans process auditory input and the way automated speech recognition (ASR) servers process voice input. Thus, different algorithms for signal processing should be used. In current applications, however, a single, compromise process is used, with resultant inefficiencies.
- The present invention provides a method and computer-based device for performing preprocessing on voice transmissions depending upon the intended transmission destination. The device includes a receiving component configured to receive a voice signal from a source over a network. Also included are a processing component configured to determine a destination address associated with the received signal, determine a signal processing algorithm from a plurality of signal processing algorithms based on the destination address, and process the voice signal according to the specified algorithm. The device further includes a delivery component configured to send the processed signal to the associated address.
- In accordance with other aspects of the invention, the device also includes memory configured to store addresses with an associated signal processing algorithm, wherein the processing component finds in memory a signal processing algorithm that is associated with the determined destination address.
- In accordance with yet other aspects of the invention, the device includes an alert component configured to alert the recipient that the voice signal is from a computer-based system, if the source is a computer-based system.
- In accordance with still another aspect of the invention, the computer-based device includes four additional components used to facilitate the present invention: a first component configured to select an address for a voice transmission; a second component configured to receive a phonation inputted for the voice transmission; a third component configured to process the received phonation according to an algorithm associated with a speech recognition device (if the selected address is associated with a speech recognition device) and send the processed phonation to the selected destination; and a fourth component configured to send the received phonation to the selected destination according to a delivery method associated with human recipients (if the selected address is not associated with a speech recognition device).
- In accordance with still further aspects of the invention, a computer-based device includes four additional components used to facilitate the present invention: a first component configured to process a phonation at a source for reception by a human recipient; a second component configured to send the processed phonation to a destination according to an address associated with the phonation; a third component configured to receive a change signal from the destination; and a fourth component configured to process a next phonation for reception by a speech recognition server according to a received change signal, and send the newly processed phonation to the destination.
- As will be readily appreciated from the foregoing summary, the invention provides a method and device for improving voice transmissions by performing some preprocessing on voice transmissions depending upon the intended destination, and for providing recipients with caller identification information if the transmission is computer generated.
- The preferred and alternative embodiments of the present invention are described in detail below with reference to the following drawings.
- FIG. 1 is a diagram illustrating an exemplary system for receiving and processing voice transmission signals over a wireless network in accordance with the present invention;
- FIG. 2 is a flow chart illustrating operation of the present invention;
- FIG. 3 is a flow chart illustrating an alternate aspect of the present invention; and
- FIGS. 4-5 are flow charts illustrating alternative aspects of the present invention.
- The present invention provides a method and device to determine the destination of a voice transmission signal and apply an appropriate data signal processing algorithm based on the determined destination. The same system also provides a method to distinguish between incoming calls sent from a server or from a human in order to notify the person receiving the call that an unsolicited call is being sent. By way of overview and with reference to FIG. 1, the present invention includes a system 10 that includes a transmitter 12 and a distribution gateway 16. Transmitter 12 includes a cellular or landline telephone, network phone, other communication device or a voice generation computer that generates a voice sound signal for transmission to end units (users 24 or voice recognition servers 26) over a network, such as a wireless network or a primarily non-wireless network (e.g., Internet). Distribution gateway 16 includes a processor 17, a receiver 18, a transmitter 19, and a database 20. Receiver 18 in distribution gateway 16 is preferably a wireless communication module capable of receiving voice and data via a wireless communication link. Transmitter 19 in distribution gateway 16 is preferably a wireless communication module capable of sending voice and data via a wireless communication link. Distribution gateway 16 is in communication with one or more user end units 24 and one or more automated speech recognition (ASR) servers 26, either directly or over a network (not shown).
- Processor 17 compares an address included in the voice transmission signal from transmitter 12, such as an Internet Protocol (IP) address, a telephone number, or other method of identifying an incoming call, to a lookup table stored in database 20. The processor applies one of a number of signal processing algorithms depending upon the results of the comparison. Because there is a fundamental difference between the way humans process auditory input and the way ASR servers process voice input, different algorithms are applied to the voice transmission signal to optimize the benefit for the determined destination. For example, if the destination is an ASR server 26, the algorithm converts the transmission to digital form (if not already in digital form) and performs other digital signal processing that benefits the process the ASR server will perform.
- Referring now to FIG. 2, an illustrative routine 30 for operation of the present invention will be described. At block 32, distribution gateway 16 receives a voice transmission signal. Typically, the voice transmission signal includes a human voice or voice generated by a computer. At block 34, processor 17 looks up information in database 20 corresponding to a destination address or phone number included in the voice transmission signal. The database includes a table of phone numbers or a table of IP addresses of destinations (user end units 24 and servers 26) that are associated with each algorithm. An IP address is specified by the Internet Protocol and uniquely identifies a computer on the Internet. Processor 17 determines which algorithm (identified as the associated algorithm) to use for optimization depending upon the destination IP address of the voice transmission received. Therefore, when a call is placed to one of the numbers associated with an ASR server, the processor chooses the algorithm optimized for an ASR server. Otherwise, the processor chooses the algorithm for voice conversation at an end user unit 24.
- At decision block 36, processor 17 determines whether the associated destination number is an ASR server 26. If, at block 36, the processor determines that the associated destination number is an ASR server 26, then, at block 38, the processor processes the voice transmission according to the optimization algorithm for an ASR server and sends the processed voice transmission signal to the determined destination. If, at block 36, the processor determines that the associated destination number is not an ASR server 26, the processor processes the voice transmission signal for human auditory means according to an optimization algorithm used for producing a result best suited for a human recipient. At block 40, the processor sends the processed voice transmission signal to the determined destination.
- FIG. 3 is a flowchart illustrating an operation of sending a signal to a human recipient at block 40 of FIG. 2. In this situation, at block 54, processor 17 looks up, in database 20, caller identification information related to the origin of the signal (transmitter 12). At decision block 56, the processor determines the origin of the voice transmission signal by comparing the sending address included in the voice transmission signal to a table of stored IP addresses or phone numbers in database 20. Thus, the processor distinguishes between incoming calls from an ASR server or other calls. If, at block 56, the processor determines that the voice transmission signal originated from an ASR server, the logic proceeds to block 58, where the user unit is informed that the incoming call is an ASR server generated voice transmission signal (i.e., a solicitation). If, at block 56, the processor determines the origin is not from the ASR server, then the logic proceeds to block 60, where caller identification information is presented to the user.
- In an alternate embodiment, the functions performed by distribution gateway 16 are performed at a user origination unit or transmitter 12 or at user end unit 24. The user origination unit or transmitter 12 is preferably a mobile device that is implemented in a vehicle. The user origination unit can be a device similar to user end unit 24. FIGS. 4 and 5 illustrate flowcharts that present two embodiments for performing some signal processing at the user origination unit or transmitter 12.
- In a first embodiment (FIG. 4) where distribution gateway functions are performed at user origination unit or transmitter 12, a processor at the user origination unit or transmitter 12 is defaulted to process an outbound voice signal with an algorithm optimized for delivery to an ASR server. At block 72, the processor at the user origination unit or transmitter 12 receives voice input from the user and an address associated with the voice input. The associated address can be entered by the user or automatically generated by the processor at the user origination unit or transmitter 12. At decision block 74, the processor at the user origination unit or transmitter 12 compares the address associated with the inputted voice signal to addresses (phone numbers, IP addresses or other types of addresses) associated with ASR servers that are stored within a database at the user unit. When the processor determines that the associated address corresponds to a stored address, the inputted voice is processed according to the default algorithm (automated speech recognition algorithm) at block 76. Otherwise, the logic proceeds to block 78, where the processor processes the inputted voice using a voice communication algorithm for human recipients.
- In a second embodiment (FIG. 5) where server functions are performed at the user origination unit or transmitter 12, the processor at the user origination unit or transmitter 12 is defaulted to process an outbound voice signal with an algorithm optimized for voice conversation (human recipient) (block 90). At block 92, the processed voice signal is sent to the addressee associated with the voice signal. At block 94, if the outbound voice signal goes to an ASR server, the ASR server sends a signal back to the user origination unit instructing the processor of the user origination unit to switch to an algorithm optimized for an ASR server. When the user origination unit receives a signal to switch, the unit processes the entered voice signals using an algorithm for an ASR server (block 96). The signal sent by the ASR server is preferably sent as Dual-Tone Multi-Frequency (DTMF) tones, also known as Touch-Tone, but can also be sent in other formats.
- When the voice signal is processed at the user origination unit or transmitter 12, there may be times when the destination switches from an ASR server to a human recipient or from a human recipient to an ASR server. An operator system (human recipient) or ASR server informs the user origination unit when a switch has occurred. Once the user origination unit has been informed of the switch, the unit begins processing according to the new recipient.
- While the preferred embodiment of the invention has been illustrated and described, as noted above, many changes can be made without departing from the spirit and scope of the invention. Accordingly, the scope of the invention is not limited by the disclosure of the preferred embodiment. Instead, the invention should be determined entirely by reference to the claims that follow.
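The two origination-unit embodiments (FIGS. 4 and 5) and the mid-call switching described above can be sketched as a small state holder in Python. This is a minimal sketch, not the patented implementation: the class and function names are hypothetical, the processing step is a stand-in that only tags the payload, and the specific DTMF digits used as change signals ("#" and "*") are assumptions, since the text says only that the signal is preferably sent as DTMF.

```python
from enum import Enum
from typing import AbstractSet


class Mode(Enum):
    HUMAN = "human"
    ASR = "asr"


def initial_mode(address: str, asr_addresses: AbstractSet[str]) -> Mode:
    """FIG. 4 style: choose the mode up front from a locally stored
    list of ASR server addresses (decision block 74)."""
    return Mode.ASR if address in asr_addresses else Mode.HUMAN


class OriginationUnit:
    """FIG. 5 style: start in one mode and switch on a change signal."""

    SWITCH_TO_ASR = "#"    # assumed DTMF digit, not specified in the text
    SWITCH_TO_HUMAN = "*"  # assumed DTMF digit, not specified in the text

    def __init__(self, mode: Mode = Mode.HUMAN) -> None:
        self.mode = mode

    def on_change_signal(self, dtmf_digit: str) -> None:
        # Unrecognized digits are ignored and leave the mode unchanged.
        if dtmf_digit == self.SWITCH_TO_ASR:
            self.mode = Mode.ASR
        elif dtmf_digit == self.SWITCH_TO_HUMAN:
            self.mode = Mode.HUMAN

    def process(self, phonation: bytes) -> bytes:
        # Stand-in for the real algorithms: tag the payload with the mode.
        prefix = b"ASR:" if self.mode is Mode.ASR else b"HUMAN:"
        return prefix + phonation
```

Keeping the mode as explicit state means each new phonation is processed for whichever recipient was most recently signalled, which matches the description of switching in either direction during a call.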
Claims (15)
1. A method comprising:
receiving a voice signal from a source over a network;
determining a destination associated with the received signal;
determining a signal processing algorithm from a plurality of signal processing algorithms based on the determined address;
processing the voice signal according to the determined algorithm; and
sending the processed signal to the associated address.
2. The method of claim 1 , wherein determining the processing algorithm comprises finding in memory a signal processing algorithm that is associated with the determined destination address.
3. The method of claim 1 , further comprising:
determining the originator of the voice signal, if the determined destination is a human recipient; and
if the determined originator is a computer-based system, alerting the recipient that the voice signal is from a computer-based system.
4. A method comprising:
selecting an address for a voice transmission;
receiving at a user input unit phonation inputted for the voice transmission;
if the selected address is associated with a speech recognition device, processing the received phonation according to an algorithm associated with the speech recognition device and sending the processed phonation to the selected destination; and
if the selected address is not associated with a speech recognition device, sending the received phonation to the selected destination according to a delivery method associated with human recipients.
5. The method of claim 4 , further comprising:
switching the destination from a destination associated with a human recipient to a destination associated with a speech recognition device;
sending a switch signal to the user input unit based on the switched destination; and
sending the received phonation to the selected destination according to a delivery method associated with human recipients.
6. The method of claim 4 , further comprising:
switching the destination from a destination associated with a speech recognition device to a destination associated with a human recipient;
sending a switch signal to the user input unit based on the switched destination; and
processing the received phonation according to an algorithm associated with the speech recognition device and sending the processed phonation to the selected destination.
7. A method comprising:
sending a signal from a source to a destination according to an address associated with a to be generated phonation; and
if the destination is a speech recognition server, sending a change signal from the destination to the source, generating a phonation for reception by a speech recognition server, and sending the newly processed phonation, otherwise generating a phonation at the source for reception by a human recipient.
8. A computer-based device comprising:
a receiving component configured to receive a voice signal from a source over a network;
a processing component configured to determine a destination address associated with the received signal, determine a signal processing algorithm from a plurality of signal processing algorithms based on the determined address, and process the voice signal according to the determined algorithm; and
a delivery component configured to send the processed signal to the associated address.
9. The device of claim 8 , further comprising memory configured to store addresses with an associated signal processing algorithm, wherein the processing component finds in memory a signal processing algorithm that is associated with the determined destination address.
10. The device of claim 8 , further comprising an alert component configured to alert the recipient that the voice signal is from a computer-based system, if the source is a computer-based system.
11. A computer-based device comprising:
a first component configured to select an address for a voice transmission;
a second component configured to receive a phonation inputted for the voice transmission;
a third component configured to process the received phonation according to an algorithm associated with a speech recognition device, if the selected address is associated with a speech recognition device, and send the processed phonation to the selected destination; and
a fourth component configured to send the received phonation to the selected destination according to a delivery method associated with human recipients, if the selected address is not associated with a speech recognition device.
12. A computer-based device comprising:
a first component configured to process a phonation at a source for reception by a human recipient;
a second component configured to send the processed phonation to a destination according to an address associated with the phonation;
a third component configured to receive a change signal from the destination; and
a fourth component configured to process a next phonation for reception by a speech recognition server according to a received change signal, and send the newly processed phonation to the destination.
13. An apparatus comprising:
means for receiving a voice signal from a source over a network;
means for determining a destination associated with the received signal;
means for determining a signal processing algorithm from a plurality of signal processing algorithms based on the determined address;
means for processing the voice signal according to the determined algorithm; and
means for sending the processed signal to the associated address.
14. An apparatus comprising:
means for selecting an address for a voice transmission;
means for receiving phonation inputted for the voice transmission;
if the selected address is associated with a speech recognition device, means for processing the received phonation according to an algorithm associated with the speech recognition device and means for sending the processed phonation to the selected destination; and
if the selected address is not associated with a speech recognition device, means for sending the received phonation to the selected destination according to a delivery method associated with human recipients.
15. An apparatus comprising:
means for processing a phonation at a source for reception by a human recipient;
means for sending the processed phonation to a destination according to an address associated with the phonation; and
if the destination is a speech recognition server, means for sending a change signal from the destination to the source, means for processing a next phonation for reception by a speech recognition server, and means for sending the newly processed phonation.
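The two mechanisms recited in the claims above can be illustrated with a minimal sketch. It is not taken from the specification: the names `Router`, `Source`, `for_human` and `for_recognizer`, the example addresses, and the use of peak normalisation as the recognizer-oriented algorithm are all assumptions made for illustration, since the claims do not name any particular signal processing algorithm. `Router` follows claims 8, 9 and 13 (an algorithm is chosen from a stored table keyed by destination address); `Source` follows claims 7, 12 and 15 (processing starts in a mode intended for a human recipient and switches when a change signal arrives from the destination).

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

Samples = List[float]
Algorithm = Callable[[Samples], Samples]


def for_human(samples: Samples) -> Samples:
    """Hypothetical processing for a human recipient: pass-through copy."""
    return list(samples)


def for_recognizer(samples: Samples) -> Samples:
    """Hypothetical processing for a recognizer: scale so the peak is 1.0."""
    peak = max((abs(s) for s in samples), default=0.0)
    return [s / peak for s in samples] if peak else list(samples)


@dataclass
class Router:
    """Claims 8, 9, 13: look up an algorithm by destination address."""
    # Stored address-to-algorithm associations (the "memory" of claim 9).
    table: Dict[str, Algorithm] = field(default_factory=dict)
    # Assumed fallback when an address has no stored association.
    default: Algorithm = for_human

    def process(self, address: str, samples: Samples) -> Tuple[str, Samples]:
        algorithm = self.table.get(address, self.default)
        # Return the address with the processed signal, ready for delivery.
        return address, algorithm(samples)


@dataclass
class Source:
    """Claims 7, 12, 15: switch processing after a change signal."""
    algorithm: Algorithm = for_human

    def on_change_signal(self) -> None:
        # The destination has identified itself as a speech recognition server.
        self.algorithm = for_recognizer

    def send(self, samples: Samples) -> Samples:
        return self.algorithm(samples)


if __name__ == "__main__":
    router = Router({"sip:asr@example.invalid": for_recognizer})
    print(router.process("sip:asr@example.invalid", [0.5, -0.25]))
    print(router.process("tel:+15550100", [0.5, -0.25]))

    source = Source()
    print(source.send([0.5]))   # processed for a human recipient
    source.on_change_signal()
    print(source.send([0.5]))   # next phonation, processed for a recognizer
```

In this sketch the lookup table and the change signal are alternatives: the first decides before transmission from what is already known about the address, while the second lets the destination announce itself after the first phonation has been sent.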
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/929,301 US20080140418A1 (en) | 2001-03-29 | 2007-10-30 | Method and device to distinguish between voice conversation and automated speech recognition |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US28037701P | 2001-03-29 | 2001-03-29 | |
US27845401P | 2001-04-02 | 2001-04-02 | |
US09/884,902 US7392191B2 (en) | 2001-03-29 | 2001-06-18 | Method and device to distinguish between voice conversation and automated speech recognition |
US11/929,301 US20080140418A1 (en) | 2001-03-29 | 2007-10-30 | Method and device to distinguish between voice conversation and automated speech recognition |
Related Parent Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/884,902 Continuation US7392191B2 (en) | 2001-03-29 | 2001-06-18 | Method and device to distinguish between voice conversation and automated speech recognition |
Publications (1)
Publication Number | Publication Date |
---|---|
US20080140418A1 (en) | 2008-06-12 |
Family ID: 27402992
Family Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/884,902 Expired - Lifetime US7392191B2 (en) | 2001-03-29 | 2001-06-18 | Method and device to distinguish between voice conversation and automated speech recognition |
US11/929,301 Abandoned US20080140418A1 (en) | 2001-03-29 | 2007-10-30 | Method and device to distinguish between voice conversation and automated speech recognition |
Family Applications Before (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/884,902 Expired - Lifetime US7392191B2 (en) | 2001-03-29 | 2001-06-18 | Method and device to distinguish between voice conversation and automated speech recognition |
Country Status (5)
Country | Link |
---|---|
US (2) | US7392191B2 (en) |
EP (1) | EP1386309A4 (en) |
CA (1) | CA2378540C (en) |
MX (1) | MXPA02002812A (en) |
WO (1) | WO2002080145A1 (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7406421B2 (en) | 2001-10-26 | 2008-07-29 | Intellisist Inc. | Systems and methods for reviewing informational content in a vehicle |
US8175886B2 (en) * | 2001-03-29 | 2012-05-08 | Intellisist, Inc. | Determination of signal-processing approach based on signal destination characteristics |
US20120203557A1 (en) * | 2001-03-29 | 2012-08-09 | Gilad Odinak | Comprehensive multiple feature telematics system |
US20050065779A1 (en) * | 2001-03-29 | 2005-03-24 | Gilad Odinak | Comprehensive multiple feature telematics system |
US7392191B2 (en) * | 2001-03-29 | 2008-06-24 | Intellisist, Inc. | Method and device to distinguish between voice conversation and automated speech recognition |
US7164922B2 (en) * | 2001-10-05 | 2007-01-16 | Intellisist, Inc. | Local phone number lookup and cache |
US7162414B2 (en) * | 2001-12-07 | 2007-01-09 | Intel Corporation | Method and apparatus to perform speech recognition over a data channel |
US8249880B2 (en) * | 2002-02-14 | 2012-08-21 | Intellisist, Inc. | Real-time display of system instructions |
JP2004349802A (en) * | 2003-05-20 | 2004-12-09 | Sharp Corp | Ip telephone device |
JP3625212B1 (en) * | 2003-09-16 | 2005-03-02 | 独立行政法人科学技術振興機構 | Three-dimensional virtual space simulator, three-dimensional virtual space simulation program, and computer-readable recording medium recording the same |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5566229A (en) * | 1992-08-24 | 1996-10-15 | At&T | Voice directed communications system employing shared subscriber identifiers |
US20010033643A1 (en) * | 2000-02-17 | 2001-10-25 | Mulvey Kevin C.W. | Telephone privacy protection system |
Family Application Timeline
Date | Country | Application Number | Publication | Status |
---|---|---|---|---|
2001-06-18 | US | US09/884,902 | US7392191B2 (en) | Not active, Expired - Lifetime |
2002-03-14 | MX | MXPA02002812A | MXPA02002812A (en) | Active, IP Right Grant |
2002-03-22 | CA | CA2378540A | CA2378540C (en) | Not active, Expired - Lifetime |
2002-03-27 | EP | EP02725379A | EP1386309A4 (en) | Not active, Withdrawn |
2002-03-27 | WO | PCT/US2002/009485 | WO2002080145A1 (en) | Active, Application Discontinuation |
2007-10-30 | US | US11/929,301 | US20080140418A1 (en) | Not active, Abandoned |
Patent Citations (54)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4644107A (en) * | 1984-10-26 | 1987-02-17 | Ttc | Voice-controlled telephone using visual display |
US4856066A (en) * | 1986-11-06 | 1989-08-08 | Lemelson Jerome H | Speech communication system and method |
US4827500A (en) * | 1987-01-30 | 1989-05-02 | American Telephone And Telegraph Company, At&T Bell Laboratories | Automatic speech recognition to select among call destinations |
US4928302A (en) * | 1987-11-06 | 1990-05-22 | Ricoh Company, Ltd. | Voice actuated dialing apparatus |
US5396542A (en) * | 1993-08-31 | 1995-03-07 | At&T Corp. | Method for use by a telecommunications system in enabling improved attendant services |
US5440627A (en) * | 1994-04-21 | 1995-08-08 | Puri; Rajendra P. | Computer phone dialing system |
US5556229A (en) * | 1995-01-04 | 1996-09-17 | Bishop; Robert J. | Shoreline erosion control system |
US6161085A (en) * | 1995-11-02 | 2000-12-12 | Nokia Telecommunications Oy | Method and arrangement for adding a new speech encoding method to an existing telecommunication system |
US5764644A (en) * | 1996-01-02 | 1998-06-09 | Lucent Technologies Inc. | Method and apparatus for transport of communication signals over a public network |
US5719921A (en) * | 1996-02-29 | 1998-02-17 | Nynex Science & Technology | Methods and apparatus for activating telephone services in response to speech |
US6175856B1 (en) * | 1996-09-30 | 2001-01-16 | Apple Computer, Inc. | Method and apparatus for dynamic selection of compression processing during teleconference call initiation |
US5855003A (en) * | 1996-10-11 | 1998-12-29 | Motorola, Inc. | Method and apparatus for establishing a link in a wireless communication system |
US6122613A (en) * | 1997-01-30 | 2000-09-19 | Dragon Systems, Inc. | Speech recognition using multiple recognizers (selectively) applied to the same input sample |
US5937040A (en) * | 1997-04-18 | 1999-08-10 | Siemens Information And Communication Networks, Inc. | Method and apparatus for using a D-channel for displaying user data |
US6404876B1 (en) * | 1997-09-25 | 2002-06-11 | Gte Intelligent Network Services Incorporated | System and method for voice activated dialing and routing under open access network control |
US7203288B1 (en) * | 1997-11-21 | 2007-04-10 | Dictaphone Corporation | Intelligent routing of voice files in voice data management system |
US6154662A (en) * | 1998-02-27 | 2000-11-28 | Lucent Technologies Inc. | Providing voice dialing service for wireless roamers |
US6505159B1 (en) * | 1998-03-03 | 2003-01-07 | Microsoft Corporation | Apparatus and method for providing speech input to a speech recognition system |
US20020164000A1 (en) * | 1998-12-01 | 2002-11-07 | Michael H. Cohen | System for and method of creating and browsing a voice web |
US6587871B1 (en) * | 1998-12-22 | 2003-07-01 | Ericsson Inc. | System, method, and apparatus for converting voice mail to text and transmitting as an email or facsimile |
US6370237B1 (en) * | 1998-12-29 | 2002-04-09 | Alcatel Usa Sourcing, Lp | Voice activated dialing with reduced storage requirements |
US6398105B2 (en) * | 1999-01-29 | 2002-06-04 | Intermec Ip Corporation | Automatic data collection device that intelligently switches data based on data type |
US6195636B1 (en) * | 1999-02-19 | 2001-02-27 | Texas Instruments Incorporated | Speech recognition over packet networks |
US6434528B1 (en) * | 1999-03-31 | 2002-08-13 | International Business Machines Corporation | Computer network telephony |
US6363349B1 (en) * | 1999-05-28 | 2002-03-26 | Motorola, Inc. | Method and apparatus for performing distributed speech processing in a communication system |
US6766291B2 (en) * | 1999-06-18 | 2004-07-20 | Nortel Networks Limited | Method and apparatus for controlling the transition of an audio signal converter between two operative modes based on a certain characteristic of the audio input signal |
US6765995B1 (en) * | 1999-07-09 | 2004-07-20 | Nec Infrontia Corporation | Telephone system and telephone method |
US6917917B1 (en) * | 1999-08-30 | 2005-07-12 | Samsung Electronics Co., Ltd | Apparatus and method for voice recognition and displaying of characters in mobile telecommunication system |
US6370506B1 (en) * | 1999-10-04 | 2002-04-09 | Ericsson Inc. | Communication devices, methods, and computer program products for transmitting information using voice activated signaling to perform in-call functions |
US6446042B1 (en) * | 1999-11-15 | 2002-09-03 | Sharp Laboratories Of America, Inc. | Method and apparatus for encoding speech in a communications network |
US6556627B2 (en) * | 1999-12-03 | 2003-04-29 | Sony Corporation | Information processing apparatus, information processing method and recording medium |
US6754232B1 (en) * | 2000-01-12 | 2004-06-22 | Cisco Technology, Inc. | Dynamic codec speed selection and bandwidth preallocation in a voice packet network method and apparatus |
US20010037197A1 (en) * | 2000-03-24 | 2001-11-01 | Oleg Boulanov | Remote server object architecture for speech recognition |
US6539354B1 (en) * | 2000-03-24 | 2003-03-25 | Fluent Speech Technologies, Inc. | Methods and devices for producing and using synthetic visual speech based on natural coarticulation |
US20020123889A1 (en) * | 2000-06-30 | 2002-09-05 | Jurgen Sienel | Telecommunication system, and switch, and server, and method |
US20050030973A1 (en) * | 2000-08-29 | 2005-02-10 | Takuya Kitamura | Media convert service providing method |
US6895017B2 (en) * | 2000-08-29 | 2005-05-17 | Fujitsu Limited | Media convert service providing method |
US20020029142A1 (en) * | 2000-09-05 | 2002-03-07 | Yeon-Sang Koo | Vocoding method in all internet protocol (IP) network |
US7215684B1 (en) * | 2000-09-20 | 2007-05-08 | Qualcomm Incorporated | Method and apparatus for reducing transmission overhead in a communication system |
US7047182B2 (en) * | 2000-12-20 | 2006-05-16 | Fuji Xerox Co., Ltd. | Multilingual document retrieval system |
US20030194074A1 (en) * | 2000-12-21 | 2003-10-16 | Jayson Newlin | Methods and systems for communicating and controlling facsimile information |
US20020085690A1 (en) * | 2000-12-28 | 2002-07-04 | Davidson Jason Alan | Method and system for providing textual content along with voice messages |
US20020126813A1 (en) * | 2001-01-05 | 2002-09-12 | Hadi Partovi | Phone based rewards programs method and apparatus prepared by tellme networks, Inc |
US20020101978A1 (en) * | 2001-01-29 | 2002-08-01 | William Lo | System and method for virtual interactive response unit |
US20020103639A1 (en) * | 2001-01-31 | 2002-08-01 | Chienchung Chang | Distributed voice recognition system using acoustic feature vector modification |
US7444156B2 (en) * | 2001-02-20 | 2008-10-28 | Microsoft Corporation | User-tagging of cellular telephone locations |
US20020122541A1 (en) * | 2001-03-01 | 2002-09-05 | Michael Metcalf | Voice-activated interactive multimedia information processing system |
US20020169611A1 (en) * | 2001-03-09 | 2002-11-14 | Guerra Lisa M. | System, method and computer program product for looking up business addresses and directions based on a voice dial-up session |
US20050208954A1 (en) * | 2001-03-26 | 2005-09-22 | Microsoft Corporation | User-tagging of cellular telephone locations |
US20020143551A1 (en) * | 2001-03-28 | 2002-10-03 | Sharma Sangita R. | Unified client-server distributed architectures for spoken dialogue systems |
US20120203557A1 (en) * | 2001-03-29 | 2012-08-09 | Gilad Odinak | Comprehensive multiple feature telematics system |
US7392191B2 (en) * | 2001-03-29 | 2008-06-24 | Intellisist, Inc. | Method and device to distinguish between voice conversation and automated speech recognition |
US8175886B2 (en) * | 2001-03-29 | 2012-05-08 | Intellisist, Inc. | Determination of signal-processing approach based on signal destination characteristics |
US20050065779A1 (en) * | 2001-03-29 | 2005-03-24 | Gilad Odinak | Comprehensive multiple feature telematics system |
Also Published As
Publication number | Publication date |
---|---|
CA2378540A1 (en) | 2002-09-29 |
EP1386309A1 (en) | 2004-02-04 |
US7392191B2 (en) | 2008-06-24 |
CA2378540C (en) | 2011-06-14 |
MXPA02002812A (en) | 2005-10-05 |
WO2002080145A1 (en) | 2002-10-10 |
US20020143552A1 (en) | 2002-10-03 |
EP1386309A4 (en) | 2005-10-12 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US20080140418A1 (en) | Method and device to distinguish between voice conversation and automated speech recognition | |
US8374328B2 (en) | Method and system for adding a caller in a blocked list | |
US20050261034A1 (en) | Communication mechanism for calls in which speaking is not possible | |
US20060094472A1 (en) | Intelligent codec selection to optimize audio transmission in wireless communications | |
CA2192402C (en) | Method and apparatus for transport of communication signals over a public network | |
US6307845B1 (en) | Processing packet-switched voice calls directed to a mobile unit operating in circuit-switched mode | |
JPH10322439A (en) | Electric communication equipment, electric communication system and electric communication method | |
US9042526B2 (en) | Method and apparatus for enabling a calling party to leave a voice message for a called party in response to a command provided by the calling party | |
US7092701B2 (en) | Methods, networks, and computer program products for selectively routing communications based on routing rules | |
US8085906B2 (en) | Method, system and apparatus for providing alternative multimedia ring back tone substitute service by using intelligent network | |
GB2378849A (en) | Method and system of using a single telephone number for data and voice calls | |
EP1710992B1 (en) | Method for call origination using short key in mobile communication terminal and mobile communication terminal implementing the same | |
US20040087301A1 (en) | Communication means | |
US7301937B2 (en) | System for automatically selecting voice data transmission and reception system for IP network, method thereof, and IP terminal | |
US7065189B2 (en) | Voice mail apparatus and method of processing voice mail | |
JP2004040541A (en) | VoIP ADAPTER DEVICE, AND INCOMING CALL INFORMATION METHOD FOR VoIP ADAPTER DEVICE | |
US7890142B2 (en) | Portable telephone sound reproduction by determined use of CODEC via base station | |
CA2705961C (en) | Method and apparatus for enabling a calling party to leave a voice message for a called party | |
KR20000072754A (en) | Method and system for transmitting communication signals through the internet and a public switched telephone network | |
KR20020060382A (en) | Method and system of dual SMS number service | |
US20030123430A1 (en) | Voice over packet network phone | |
US11303778B2 (en) | Fax board incompatibility detection and circumvention | |
JP4154184B2 (en) | Voice terminal and voice communication method | |
CN1656783A (en) | System and method for mobile electronic messaging | |
JP2001345804A (en) | Telephone system, terminal adapter device and telephone set |
Legal Events
Code | Title | Description | Date |
---|---|---|---|
STCB | Information on status: application discontinuation | Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION | |