Suche Bilder Maps Play YouTube News Gmail Drive Mehr »
Erweiterte Patentsuche | Webprotokoll | Anmelden

Patente

The present invention is a synthetic speech encoding device that produces a synthetic speech signal which closely matches an actual speech signal. The actual speech signal is digitized, and excitation pulses are selected by minimizing the error between the actual and synthetic speech signals. The preferred pattern of excitation pulses needed to produce the synthetic speech signal is obtained by using an excitation pattern containing a multiplicity of weighted pulses at timed positions. The selection of the location and amplitude of each excitation pulse is obtained by minimizing an error criterion between the synthetic speech signal and the actual speech signal. The error criterion function incorporates a perceptual weighting filter which shapes the error spectrum.

ErfinderDaniel Lin, Brian M. McCarthy
Ursprünglich BevollmächtigterInterDigital Technology Corporation
Erster Prüfer: Susan McFadden
Rechtsanwalt: Volpe and Koenig, P.C.
Aktuelle US-Klassifikation704/219

Patent beim USPTO abrufen
In Assignment Database des USPTO suchen

Zitate

Zitiertes PatentEingetragenAusgestelltUrsprünglich Bevollmächtigter Titel
US361763622. Sept. 19692. Nov. 1971PITCH DETECTION APPARATUS
US40586767. Juli 197515. Nov. 1977International Communication SciencesSpeech analysis and synthesis system
US461898223. Sept. 198221. Okt. 1986Gretag AktiengesellschaftDigital speech processing system having reduced encoding bit requirements
US46691202. Juli 198426. Mai 1987NEC CorporationLow bit-rate speech coding with decision of a location of each exciting pulse of a train concurrently with optimum amplitudes of pulses
US473184613. Apr. 198315. März 1988Texas Instruments IncorporatedVoice messaging system with pitch tracking based on adaptively filtered LPC residual signal
US47760155. Dez. 19854. Okt. 1988Hitachi, Ltd.Speech analysis-synthesis apparatus and method
US479792526. Sept. 198610. Jan. 1989Bell Communications Research, Inc.Method for coding speech at low bit rates
US48151348. Sept. 198721. März 1989Texas Instruments IncorporatedVery low rate speech encoder and decoder
US484575318. Dez. 19864. Juli 1989NEC CorporationPitch detecting device
US48688676. Apr. 198719. Sept. 1989Voicecraft Inc.Vector excitation speech or audio coder for transmission or storage
US48903273. Juni 198726. Dez. 1989ITT CorporationMulti-rate digital voice coder apparatus
US498091626. Okt. 198925. Dez. 1990General Electric CompanyMethod for improving speech quality in code excited linear predictive speech coding
US499121326. Mai 19885. Febr. 1991Pacific Communication Sciences, Inc.Speech specific adaptive transform coder
US500175927. Sept. 198919. März 1991NEC CorporationMethod and apparatus for speech coding
US502740515. Dez. 198925. Juni 1991NEC CorporationCommunication system capable of improving a speech quality by a pair of pulse producing units
US512705324. Dez. 199030. Juni 1992General Electric CompanyLow-complexity method for improving the performance of autocorrelation-based pitch detectors
US52356703. Okt. 199010. Aug. 1993InterDigital Patents CorporationMultiple impulse excitation speech encoder and decoder
US526516719. Nov. 199223. Nov. 1993Kabushiki Kaisha ToshibaSpeech coding and decoding apparatus
US530744129. Nov. 198926. Apr. 1994Comsat CorporationWear-toll quality 4.8 kbps speech codec
US53275204. Juni 19925. Juli 1994AT&T Bell LaboratoriesMethod of use of voice message coder/decoder
US556851227. Juli 199422. Okt. 1996Micron Communications, Inc.Communication system having transmitter frequency control
US56757028. März 19967. Okt. 1997Motorola, Inc.Multi-segment vector quantizer for a speech coder suitable for use in a radiotelephone
US599989920. Okt. 19977. Dez. 1999SoftSound LimitedLow bit rate audio coder and decoder operating in a transform domain using vector quantization
US601462226. Sept. 199611. Jan. 2000Rockwell Semiconductor Systems, Inc.Low bit rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization
US614828229. Dez. 199714. Nov. 2000Texas Instruments IncorporatedMultimodal code-excited linear prediction (CELP) coder and method using peakiness measure
US624367211. Sept. 19975. Juni 2001Sony CorporationSpeech encoding/decoding method and apparatus using a pitch reliability measure
US624697923. März 200012. Juni 2001Grundig AGMethod for voice signal coding and/or decoding by means of a long term prediction and a multipulse excitation signal
US63452482. Nov. 19995. Febr. 2002Conexant Systems, Inc.Low bit-rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization
US65912347. Jan. 20008. Juli 2003Tellabs Operations, Inc.Method and apparatus for adaptively suppressing noise
US66338392. Febr. 200114. Okt. 2003Motorola, Inc.Method and apparatus for speech reconstruction in a distributed speech recognition system
US725453317. Okt. 20037. Aug. 2007Dilithium Networks Pty Ltd.Method and apparatus for a thin CELP voice codec

Ansprüche

1. A speech encoder comprising:

a sampler to generate samples from a speech signal;

a linear predictive coding (LPC) device to produce a first set of linear predication (LP) coefficients based on the samples, and to produce spectral representations from the first set of LP coefficients;

an interpolator to interpolate the spectral representations to generate interpolated spectral representations;

a spectral device to convert the interpolated spectral representations to a second set of LP coefficients;
a pitch analyzer to perform open-loop pitch analysis with the second set of LP coefficients; and
a bit packing device to transmit encoded speech and a codebook index.

2. The speech encoder of claim 1, wherein a residual signal is associated with the pitch analyzer.

3. The speech encoder of claim 2, wherein the codebook index is based on the residual signal.

4. The speech encoder of claim 1, wherein the sampler is samples the speech signal at a sampling rate of 8 kHz.

5. A method for encoding speech, the method comprising:

sampling a speech signal to generate samples;

producing spectral representations from the samples;

interpolating the spectral representations to generate interpolated spectral representations;

performing open-loop pitch analysis based on the interpolated spectral representation; and
transmitting encoded speech and a codebook index.

6. The method of claim 5, wherein a residual signal is associated with the open-loop pitch analysis.

7. The method of claim 6, wherein the codebook index is based on the residual signal.

8. The method of claim 5, wherein a sampling rate of the speech signal is 8 kHz.

9. A method for encoding speech, the method comprising:

sampling a speech signal to generate samples;

producing a first set of linear predication (LP) coefficients based on the samples;

producing spectral representations from the first set of LP coefficients;

interpolating the spectral representations to generate interpolated spectral representations;
converting the interpolated spectral representations to a second set of LP coefficients;
performing open-loop pitch analysis with the second set of LP coefficients; and
transmitting encoded speech and a codebook index.

10. The method of claim 9, wherein a sampling rate of the speech signal is 8 kHz.

11. The method of claim 9, wherein a residual signal is associated with the open-loop pitch analysis.

12. The method of claim 11, wherein the codebook index is based on the residual signal.

13. A speech encoder comprising:

a sampler to generate samples from a speech signal;

a linear predictive coding (LPC) device to produce spectral representations from the samples;

an interpolator to interpolate the spectral representations to generate interpolated spectral representations;

a pitch analyzer to perform open-loop pitch analysis based on the interpolated spectral representations; and
a bit packing device to transmit encoded speech and a codebook index.

14. The speech encoder of claim 13, wherein a residual signal is associated with the pitch analyzer.

15. The speech encoder of claim 14, wherein the codebook index is based on the residual signal.

16. The speech encoder of claim 13, wherein the sampler is to sample the speech signal at a sampling rate of 8 kHz.