US3786195A - Variable delay line signal processor for sound reproduction - Google Patents

Variable delay line signal processor for sound reproduction Download PDF

Info

Publication number
US3786195A
US3786195A US00171571A US3786195DA US3786195A US 3786195 A US3786195 A US 3786195A US 00171571 A US00171571 A US 00171571A US 3786195D A US3786195D A US 3786195DA US 3786195 A US3786195 A US 3786195A
Authority
US
United States
Prior art keywords
delay line
delay
signals
line means
variation
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US00171571A
Inventor
M Schiffman
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VSC Co WESTPORT CT A OF LP
Cambridge Research and Development Group
Variable Speech Control Co Vsc
DC DT Liquidating Partnership
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Application granted granted Critical
Publication of US3786195A publication Critical patent/US3786195A/en
Assigned to VSC COMPANY THE, WESTPORT, CT A LIMITED PARTNERSHIP OF reassignment VSC COMPANY THE, WESTPORT, CT A LIMITED PARTNERSHIP OF ASSIGNMENT OF ASSIGNORS INTEREST. Assignors: FLAKS, MARVIN, ATTORNEY-IN FACT, VSC COMPANY THE
Assigned to VARIABLE SPEECH CONTROL COMPANY reassignment VARIABLE SPEECH CONTROL COMPANY CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: USC COMPANY THE
Assigned to CAMBRIDGE RESEARCH AND DEVELOPMENT GROUP, A LIMITED PARTNERSHIP OF CT reassignment CAMBRIDGE RESEARCH AND DEVELOPMENT GROUP, A LIMITED PARTNERSHIP OF CT SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: VARIABLE SPEECH CONTROL COMPANY THE, A LIMITED PARTNERSHIP OF CT
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B3/00Line transmission systems
    • H04B3/02Details
    • H04B3/04Control of transmission; Equalising
    • H04B3/10Control of transmission; Equalising by pilot signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/04Time compression or expansion
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B21/00Head arrangements not specific to the method of recording or reproducing
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B5/00Recording by magnetisation or demagnetisation of a record carrier; Reproducing by magnetic means; Record carriers therefor
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03HIMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
    • H03H11/00Networks using active elements
    • H03H11/02Multiple-port networks
    • H03H11/26Time-delay networks
    • H03H11/265Time-delay networks with adjustable delay
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03HIMPEDANCE NETWORKS, e.g. RESONANT CIRCUITS; RESONATORS
    • H03H7/00Multiple-port networks comprising only passive electrical elements as network components
    • H03H7/30Time-delay networks
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03KPULSE TECHNIQUE
    • H03K4/00Generating pulses having essentially a finite slope or stepped portions
    • H03K4/06Generating pulses having essentially a finite slope or stepped portions having triangular shape
    • H03K4/08Generating pulses having essentially a finite slope or stepped portions having triangular shape having sawtooth shape
    • H03K4/48Generating pulses having essentially a finite slope or stepped portions having triangular shape having sawtooth shape using as active elements semiconductor devices
    • H03K4/50Generating pulses having essentially a finite slope or stepped portions having triangular shape having sawtooth shape using as active elements semiconductor devices in which a sawtooth voltage is produced across a capacitor
    • H03K4/501Generating pulses having essentially a finite slope or stepped portions having triangular shape having sawtooth shape using as active elements semiconductor devices in which a sawtooth voltage is produced across a capacitor the starting point of the flyback period being determined by the amplitude of the voltage across the capacitor, e.g. by a comparator
    • H03K4/502Generating pulses having essentially a finite slope or stepped portions having triangular shape having sawtooth shape using as active elements semiconductor devices in which a sawtooth voltage is produced across a capacitor the starting point of the flyback period being determined by the amplitude of the voltage across the capacitor, e.g. by a comparator the capacitor being charged from a constant-current source
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03KPULSE TECHNIQUE
    • H03K7/00Modulating pulses with a continuously-variable modulating signal
    • H03K7/08Duration or width modulation ; Duty cycle modulation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/66Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B1/00Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
    • H04B1/66Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
    • H04B1/662Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using a time/frequency relationship, e.g. time compression or expansion

Definitions

  • ABSTRACT An electrical delay line whichhas variable delay controlled by a signal input thereto is connected in a sound signal channel for signals such as human speech, to compress or expand the sound signal waveform depending on whether the time delay in the line is increased or decreased.
  • variable delay line By periodically sweeping the delay line from minimum to maximum time delay or vice-versa, repeated segments of a continuous sound signal waveform are processed so that an output audio signal can be obtained having the original frequency components of the signal and occupying a time duration which is equal to or smaller or larger than the original sound sequence: with the successive segments of the signal processed by the variable delay line assembled with regard both to the significant parameters of human speech or other coding and the electrical conditions imposed by the system to produce a composite audio signal which is an intelligible replica of the original and substantially free of annoying aberrations introduced by the delay line processor.
  • Variable delay using analog or digital signal storage is also provided.
  • the field of this invention is the processing of human speech signals or similar signals for ultimate comprehension by the human listener at substantially the natural or normal frequency component distribution but at time interval durations which are usually different from the original time duration of the speech utterance.
  • Sound compression and expansion systems which utilize relative motion between a magnetic tape record medium and the air gap of the pick-up head which senses the recorded signal on the magnetic medium are well known as exemplified by the patent to Schiiller 2,352,023.
  • Devices of this type suffer from the usual operational, cost and weight limitations involved in equipment which utilizes substantial mechanical motion components.
  • a delay line version of time compression and expansion for real time signals is also well known as shown, for example, in the patent to French et al.
  • 3,347,997 shows a closed loop servo which adjusts playback speed to compensate for the relatively low frequency wow component whereas imperfections of a high frequency nature (i.e., flutter") are compensated by a variable delay line.
  • Such compensation systems depend for their operation on the repetitive character of the error signal and thus by The systems described in the ⁇ patents to Schuller,
  • French et al. and Greenberg et al. when used to reduce the frequency of a speech signal, while compressing the time in which a given segment of speech is reproduced, inevitably involve discarding a portion of the original speech wave.
  • the ratio of the speech signal discarded to that which is used is directly related to the compression ratio and the discard loss is inherently and fundamentally related to this process of reducing the frequency and compressing the time for the processing of a given passage of speech. Since the portion of the speech which is reproduced alternates with portions which are discarded the problem of merging to reproduce sections in continuous time slots presents some problem and various solutions have been offered.
  • the patent to French at al. suggests a number of alternatives including the use of' two spaced transducers rotating in unison with respect to the delay medium such that the message reproduced in compressed time in one transducer has superposed thereon from the other transducer the message which would ordinarily BRIEF SUMMARY OF THE INVENTION
  • the 1 present invention provides compressionexpansion systems for speech or other coded signals in which the active frequency-time conversion element is a signal responsive delay line which is directly interposed in the path between the signal source and the ultimate reproducer or utilization device which receives the converted speech wave. Such a system does not avoid the inherent problem in time compression occasioned by the discontinuity resulting from discarding alternate portions of the speech wave.
  • the discard portion of the speech wave may be stored or cancelled in the delay line ordiverted from entering the input terminal of the delay line which is directly in the signal channel.
  • this signal and transients produced by line switching must be discarded as the variable delay line is repetitively controlled by the delay signal between its minimum and maximum delay values.
  • Line switching and discard occur coincidentally with the requirement for making contiguous two originally spaced portions of the speech wave and hence both of these functions are accomplished by a number of alternative embodiments herein disclosed which function in a manner consistent with requirements imposed by the parameters of the speech signal itself.
  • a speech compression-expansion system which utilizes a signal controlled variable delay line located directly in the signal channel between the signal source and sound reproducer which delay line is repeatedly sequenced betweenmaximum and minimum delay values to modify the frequency-time characteristic of the sound reproduced from the original signal.
  • a further feature of the invention provides maximum discard intervals as determined by the maximum delay line length which in relation to the compression ratio limits the actual original speech message discard to a value which minimizes the loss of significant cues or transitions in the speech code thereby minimizing the loss of information content transferred to the listener.
  • a still further feature of the present invention is to provide, in a system utilizing variable delay line speech compression or expansion, for signal processing at the point of juncture of two reproduced speech portions to suppress distracting noise components and also to avoid the introduction of false cues which could modify the information conveyed in the subsequent speech segment.
  • the transition between successive reproduced speech samples can be modified by simple transfer function selection or control, orthe transition can be eased by the introduction of synthetic or speechderived signal portions to approximate a smooth transition within a time interval which does not lose actual cues and under such conditions that do not introduce false cues.
  • FIGS. 4(a) to 4(f) show waveforms useful in describing forms of processing of a transition between adjacent reproduced speech samples.
  • FIGS. 5(a) to 5(d) show a set of curves representing active processing of the transition between adjacent samples.
  • FIGS. 6(a) to 6(e) show waveforms useful in describing the use of two delay lines to effect transition between adjacent speech samples.
  • FIG. 7 shows a block diagram of a speech compressor-expander system in accordance with the invention.
  • FIGSTSKa) 6823 show waveforms useful in describing the operation of the system of FIG. 7.
  • FIG. 9 shows a block diagram of a dual delay line system in accordance with the invention.
  • FIGS. 10(a) to 10(d) show waveforms useful in describing the operation of the system of FIG. 9 for compression.
  • FIGS. l1(a) to 11(d) show waveforms useful in describing the operation of the system of FIG. 9 for expansion.
  • FIG. 12 is a partial block diagram of a modification.
  • I IYG STIKGTEE ISYE show waveforms useful in describing the operation of circuit of FIG. 12 for compression.
  • FIGS. 14(a) to l4(c) show waveforms useful in describing the operation of the modification of FIG. 12 for expansion.
  • FIG. 15 is a partial diagram of a dual delay line binaural system.
  • FIGS. 16(a) to 16(f) show waveforms useful in describing the operation of the system of FIG. 15.
  • FIG. 17 is a partial block diagram of a speech processor in accordance with the invention using an analog shift register at the variable delay element.
  • FIG. 18 is a block diagram showing gap filling with signal continuity in a system similar to FIG. 17.
  • FIG. 19 is a partial showing of an embodiment of the invention with variable delay provided by an r-bit parallel digital shift register.
  • FIG. 20 shows an embodiment of the invention with variable delay provided by a serial digital shift register.
  • FIG. 21 shows an embodiment using an analog storage memory matrix for variable delay.
  • FIG. 22 shows an embodiment using an r-bit digital random access memory.
  • FIG. 23 shows a logic diagram of a directional zero signal level gating control in a dual line system. 7
  • FIG. 24 shows waveforms useful in describing the operation of the circuit of FIG. 23.
  • FIG. 25 shows graphically the clock frequency and maximum signal frequency for the system of FIG. 17.
  • the method and apparatus are usable however in expansion mode and the considerations involved for use of reproduced signals which occupy a greater length of time than the original speech utterance will be described separately hereinafterq
  • the system is also capable of frequency transformation without a corresponding time change to achieve a desired frequency signal, such as may be involved in generating speech in amedium having a propagation velocity different than air.
  • a magnetic tape 21 has the speech signal recorded thereon of which the lowest frequency component at 333 l-Iz'is depicted by sine wave 22 with the tape being drawn past a pick-up transducer as it is wound on a take-up reel 24 at speed S.
  • the electrical signal produced by the transducer 23 passes through a compression processor 25 and is reproduced as an audible signal from speaker 26.
  • the system 23 26 shown on line (a) of FIG. 1 just described reproduces the recorded signal on the tape 21 without frequency or time change if the speed of takeup reel 24 draws the tape past the transducer 23 at the speed of recording S and for this condition processor 25 would introduce a fixed constant delay time of any value.
  • This discard includes cycles 5, 6, 7 and 8 of the original wave 22 and represents the gap in the information content between successive chunks which are reproduced as audible signals.
  • This audible output is represented on line (d) where the chunk is depicted as a piece of tape 31 played at speed 2S and containing cycles 14 which afterprocessing is effectively stretched into a piece of tape 32 occupying the original l2ms of recorded time and containing the cycles 1-4 at their original recorded frequency.
  • the next cycle reproduced is cycle number 9 of the original wave after cycles 5-8 inclusive have been discarded.
  • the representation in line (d) of a smooth transition between the end of cycle 4 and the start of cycle 9 should not be taken as representative of real signal conditions as would be obvious from a consideration of an actual signal as opposed to the idealized signals presented in FIG. 1.
  • Lines (f), (g) and (h) of FIG. 1 illustrate the situation which prevails when the compression ratio is equal to five with the tape speed drawn past transducer 23 at five times the recorded speed- S.
  • this compression ratio results in a chunk length of 1.5 containing 2 a cycles of the 333 Hz wave 25 of line (a) and again a discard interval of 6 ms equal to the final signal delay and corresponding to a delay line length of 10rns at the end of the sample.
  • the information gap however, has increased to the point where the last half of cycle number 3 and first half of cycle number 13 and all the intervening information in the original recorded wave have been lost in the discard and this gap in the message represents 30ms of the originally recorded speech utterance.
  • V Esincmt i where c is called the compression ratio. If c l the time is compressed for any given speech passage and if c l time is expanded by the factor e where e 1/0.
  • the delay is ct flo restore) (C such that c d/2 (c l)
  • the average delay rate of the line 4 which is derived as one half the sum of the final and initial delay values of the delay line, which when multiplied by time, t, thus yields the total delay, ct;
  • FIG. 2(a) shows a plot for a given signal sample of signal output time, r vs. the corresponding input time, t
  • a line with a slope of 4 represents signal of four times the original frequency or speed of presentation, and one-fourth the periodicity while a line with a slope of 1 represents the resultant restored, or unchanged signal.
  • the input signal, ct it is necessary to increasingly delay the input signal, ct by an amount ct [or (cl)t] as shown at line III.
  • T 24 has an ordinate which intersects III at the ordinate value c'T and this value when added to the time abscissa value at the point cT on I, delays the signal to T on line II.
  • the delay dt introduced by the delay line is shown by line IV.
  • Such a delay line has the effect of delaying the instantaneous signal, t through a linearly increasing amount d-t for the interval from t to t as shown by line IV.
  • the restoration may be achieved by cumulatively delaying an input singal, t,,,, by an amount from which we obtain (4)
  • FIG. 2(b) the corresponding relationsfor signal expansion are shown.
  • Line l with a slope of one-fourth represents a signal of one-fourth the original frequency or speed of presentation.
  • the 333Hz signal is processed at sample periods approaching 3 ms would with its samples reassembled accordingly produce a compressed output of poor quality since the sampling would then be causing a disruptive discontinuity for nearly every cycle of the 333 Hz signal processed.
  • Sample periods less than 3ms would not permit completion of any one cycle so that the resultant reassembled output would not only contain the said disruption but would also begin to exhibit a basic change in its frequency characteristic in the form of waveform compression by truncation to produce false frequencies. While this condition does not represent a real condition for a speech wave due to the complexity of the waveforms, this principle is controlling and sample periods less than the period of the lowest frequency wave in the speech signal will not provide proper compression.
  • Sample periods greater than the period of the lowest frequency wave will produce compression and an interval of disruption exists from the region where the sample period is only slightly greater than the period of the lowest frequency wave as indicated on the time axis between 3 ms and 6 ms in FIG. 3(a).
  • the result obtained within this period of disruption is a distorted expanded wave in which the effect of disjunctions between samples becomes extremely severe as the single cycle point is approached and diminishes as the number of cycles in the sample increases.
  • two and one-half cycles per sample is indicated as the desired limit in FIG. 3(a) but in general the more cycles in the sample the less the disturbance factor.
  • the chunk length increases with the result that the actual time sample increases to greater than 7.5 ms and therefore more than the minimum number of cycles for the lowest frequency wave component will be present in the chunk. Thus it would be at the users option to operate the line over less than the 6ms indicated delay for AT to reduce the amount of discard.
  • the actual information loss is the compression ratio times 6ms so that with c 5 the actual information discarded for each sample is 30ms of recorded time. As shown on the time axis of FIG. 2 this is the interval from 7.5 ms to 37.5 ms and the relation of this loss of information to the intelligibility of the reproduced speech signal must be examined.
  • human speech represents an extremely complex coding of a relatively limited set of sounds called phonemes which taken in context with the various attributes of the speech code such as the voicedunvoiced components, pitch, formant frequencies and the continuum of sound pattern represented by sound energy (and the absence thereof) connected by the all important transititions between the temporal components thereof constitutes an acoustic stream of infinite variety and versatility.
  • the ability of the human ear to receive this acoustic message and the ear-brain system to decode the message is not altogether understood since it appears that the readily comprehended information rate far exceeds the mere acoustic response characteristics of the ear as a receiver.
  • the ability of the ear-brain system to comprehend the message which is conveyed by human speech signals is sufficiently good to permit large portions of the actual acoustic stream to be lost or discarded without significant loss in the perception and comprehension of the message information content of the acoustic signal. Since the comprehension of message content decreases more rapidly than the recogni tion of individual words as the message is presented to the listener at increasing rate,-the problem associated with the discard of a portion of the signal stream can be resolved in favor of comprehension and short of the point where intelligibility of individual words deteriorates.
  • the loss of intelligibility can be associated with discarding portions of the message containing significant cues or phonemes which components vary in length with the shortest being approximately l0ms to 20ms long. These short cues do not dominate speech but occur with sufficient regularity to make their systematic loss undesirable and hence a desirable upper limit for the discard period would be considered to be 30 ms and preferably closer to l5ms.
  • FIG. 3(a) indicates the recording time discard relation to compression ratio as the linear function cAT with the range from 15 ms to 30 ms designated the discard uncertainty range.
  • Smaller values of c result in smaller actual discard time and hence the intelligibility is improved especially for those cues which are at the lower end of the time scale, i.e., in the neighborhood of lms.
  • Table I and FIG. 3(a) represent parameters for a typical speech compression system having a final signal delay of 6ms and define the limits of operation within fairly narrow limits, it will be appreciated that the principles involved can be adapted for use over a wider range of operation.
  • the variation of the actual frequency band of the speech signal and the maximum length of the delay line are both important design factors which influence the selection of the chunk-todiscard ratio and sample period for a given range of the compression ratio c.
  • the actual frequency range of the signal has an important bearing on the design of the delay line which must accommodate the frequency spectrum present in the signal as to such quantitative and qualitative factors as the voice pitch, the .presence of all or only some of the format frequencies for an individual voice and the width of the signal spectrum over which linear phase-frequency properties must be preserved.
  • the ultimate system used however will embody design choices of the factors involved within the broad limits herein defined.
  • FIG. 3(b) is a plot of corresponding relations for signal expansion showing the initial gap, output chunk and maximum delay line length variation with expansion ratio e for a given input sample interval T
  • the output gap occurs at the start of each sample period and thereafter for the balance of the sample period the reduced frequency time-expanded output chunk appears.
  • the maximum delay d T required is also shown as a function of the expansion ratio e.
  • FIG. 4 a portion of the 333 Hz wave at the transition point illustrated in FIG. 1(d) has been reproduced in which cycle 4 and cycle 9 of the original recorded 333 Hz wave are shown as a smooth uninterrupted sine wave.
  • a further improvement is possible by using the gating signal as a gain control signal and tapering the off and perhaps the on transitions of the gate so that a gradual transition of the audio output from off to on is accomplished and a relatively smooth transition as indicated in FIG. 4(f) results.
  • the object is to mini mize the gap effect which in itself has an audio characteristic and can act like a cue. Tapering the trailing edge of the gate helps this considerably whereas an anticipating start (or'relative delay of the speech signal) would be preferable for gradual onset for the leading edge. With these relatively simple expedients the smooth transition between adjacent chunks which are disjunctively joined by the operation of the compression-discard process are achieved in a manner which is satisfactory for many applications.
  • FIG. 5(a) a disjunctive transition which is the norm to be expected represents a sharp discontinuity in the message signal and has superposed thereon the noise transient from switching the line as previously described.
  • FIG. 5(b) By introducing a gate signal FIG. 5(b) of sufficient width to encompass the line switching transient and conditioning the gate to coincide with a zero level and same direction of change for the adjacent signals being processed a zero level gating transition as shown in FIG. 5(c) can be achieved.
  • This transition which is free of line switching noise and essentially continues an existing zero amplitude signal level during the interval of the gate has been found to provide little or no disturbance to the average listener.
  • FIG. 6 represents a preferred form of gap filling where two signal controlled delay lines are used.
  • the speech signal is applied to both delay lines designated channel A and channel B in FIGS. 6(a) and 6(b) respectively and these two lines are signal controlled to have symmetrical complementary gain characteristics and overlapping variable delay characteristics as shown in FIGS. 6(c) and 6(d).
  • the delay control signals as shown in FIG. 6(d) are phased to overlap at least an amount corresponding to the transition portion of the gain control characteristics of FIG. 6(c).
  • the outputs of both delay channels A and B are combined to produce the combined output shown in FIG. 6(c).
  • the length of the delay lines used for channels A and B in FIG. 6 will employ one full length delay line and one relatively shorter length delay line for stor ing the signal used for gap filling purposes. This arrangement will reduce the cost of the equipment represented by the multiple section delay lines necessary to obtain the required maximum delay length for system performance requirements.
  • two equal full length variable delay lines can be employed and their control signals can be alternately applied so that the signal channel is through first one and then the other delay line thereby giving a full signal period for switching the inactive delay line back to minimum delay conplied as indicated in FIG. 6(c).
  • variable speed playback device 51 which is indicated to be a tape transport with a manual select speed control input 52.
  • the signal derived from transporting the tape past a magnetic transducer is applied to an AGC amplifier 53 which also passes the signal through a band pass filter having an adjustable low and high frequency cutoff.
  • the selection of the cutoff frequencies for the filter may be operated from manual control 52 in conjunction with the selection of playback speed for the playback device 51.
  • the manual control-52 also supplies an amplitude control signal to a fine voice pitch adjust control 54 which supplies on line 55 a signal to control the end amplitude of the linearly increasing waveform which controls the variable delay line as hereinafter described.
  • variable delay line 56 which can be signal controlled between minimum and maximum delay limits.
  • This control signal applied on line 57 is derived from a ramp level and amplitude charger 58 which receives as its input either a compression triangular waveform on line 59 or an expansion inverse of the waveform on line 59 which appears on line 61 after passing through an inverter 62.
  • One or the: other of the lines 59 and 61 is energized with a ramp waveform depending upon the setting of a switch 63 which supplies the basic ramp waveform from ramp pulse: train generator 64.
  • the repetition period of the ramp waveform is selectable by a manual control 65.
  • a pulse coincident with the reset of the linear portion of the ramp waveform appears on line 66 and is applied to a blanking pulse generator 67 to produce a blanking pulse output the width of which can be controlled by manual adjustment 68 and which is synchronized with the input pulse on line 66.
  • variable delay line 56 The output of the variable delay line 56 is applied to a blanking circuit and amplifier 71 which transmits or blocks the signal depending upon the blanking pulse B applied on line 72 from generator 67 and when the blanking pulse is not present F the delay signal is applied to a speech bandpass filter 73 the output of which is applied to an audio reproducer 74.
  • variable delay line 56 will generally be of any known type and in particular may be 360 RC filter stages where the shunt resistor is provided by a PET or other semiconductor device which varies resistance in response to a controlled voltage or current. Such delay lines generally perform best: with respect to distortion of the signal passing therethrough if the phase delay per stage is kept well below the maximum possi ble value of 90.
  • the line can be designed to operate with 45 to 60 phase delay per stage maximum and the number of stages is then determined as greater than the quantity: N (6 or 8) c(f,,,, AT
  • N (6 or 8) c(f,,, AT
  • the digits o and 8 represent the number of stages per electrical cycle of the highest frequency to be passed corresponding to a phase delay of 60 or 45, respectively, as themaximum phase shift per stage which is to be utilized
  • the quantity c is the compression ratio
  • the quantity f is the highest frequency being passed by the line
  • AT is the maximum signal delay desired as dictated by the maximum permissible discard interval previously specified.
  • Many other forms of delay line construct-ions which are capable of being signal controlled are known in the art and the present invention is not to be considered as limited to any particular form of delay line.
  • the sample period waveform 81 has an adjustable period set by control for producing an asymmetrical sawtooth waveform 82 which produces a relatively long negative. going linear voltage followed by a shorter positive going linear voltage.
  • This waveform is used directly on line 59 for speech compression while, after inversion in inverter 62, its inverse is used on line 61 for expansion.
  • the expansion waveform is indicated in dotted lines at 83 inFIG. 8(a).
  • the waveforms 82 and 83 have the proper sense for controlling the delay interval and the magnitude of the delay is determined by amplitude control 52 relative to a voltage level set by the level adjust 60.
  • the maximum delay AT in the line 56 is a fixed value as obtained by selecting line length according to the value d-T as given in FIG. 3(a) and Table I for the desired compression ratio. If the maximum delay to the signal is not maintained constant the discard period will change correspondingly as is evident from the description of FIG. 1 and corresponding adjustments in the amplitude of the wave will be required to give the slope (I required for a compression ratio 0. Similar considerations apply for the slope of curve 83 which must be set at its corresponding value d for an expansion ratio e.
  • blanking pulse generator 67 The operation of blanking pulse generator 67 is shown to produce a pulse 84 in FIG. 8(b) of predetermined width in response to the start pulse of the sample period signal 81 received on line 66.
  • This pulse may be applied in gain control fashion to the circuit 71 with modified trailing edge as previously described to reduce the transient signal and provide a gradual onset of voice sound signals which are passed to the transducer 74.
  • the width B of the blanking pulse is selected with control 68 and is normally made of sufficient duration to permit the short steep linear-portion of the ramp waveform to return the delay line 56 to its zero or minimum delay condition and dissipate the signal energy therein (or the transient caused by switching the line itself) prior to enabling the signal. channel which energizes the transducer 74 with the subsequent speech signal segments.
  • the blanking period B and enabled period B for expansion mode are shown in FIG. 8(c).
  • the expanded chunks with an initial output gap are shown in FIG. 8(d).
  • the system of FIG. 7 can also be used to substitute noise or pseudo signal gap filling signals corresponding to the system described in FIG. 5.
  • a source 75 of such signals is arranged to supply the input signal to filter 73 during the blanking interval.
  • a switch 76 this gap filling during the blanking interval can be made optional.
  • the gap filling signal 75 can also be derived from the message signal output of amplifier 53.
  • FIG. 9 a modified-form of the invention particularly suitable for accomplishing the various gap filling procedures for the speech compression systems previously described will be disclosed. Portions of FIG. 9 which are essentially the same as those described in FIG. 7 have corresponding reference numerals and accordingly only the additions and changes will be further described.
  • a second variable delay line 91 receives the signal wave from amplifier 53. The output of the delay lines 56 and 91 are applied respectively to complementary blanking circuits 92 and 93. Signals passed by these blanking circuits 92 and 93 are amplified and filtered in element 73 and passed to the acoustic reproducer 74 as heretofore described.
  • a pulse train generator 94 produces a pulse wave train as shown in FIG. 10(a) having a selectable pulse repetition rate determined by the setting of manual control thereby establishing the basic sample period.
  • the output pulse, from generator 94 is delayed in delay unit 95 and applied to a first ramp generator 96 and in undelayed form is applied to a second ramp generator 97.
  • the ramp generators 96 and 97 are subject to waveform level control from manual adjust element 60 and ramp linear wave amplitude control from the manual adjust element 52.
  • the fine pitch adjustment 54 may be provided for slightly modifying the ramp slope as a voice pitch adjustment by effectively altering the frequency conversion over a small range.
  • the blanking width interval of each generator is adjustable with controls 68 and 70 respectively.
  • the outputs of the ramp generators 96 and 97 are applied respectively to delay lines 56 and 91 to control the time delay of signals passing through the respective lines in accordance with the control signals applied.
  • c or e select controls the sense of the slope of the ramp waveforms can be selected for compression or expansion.
  • the level and amplitude controls for setting the respective ramp generators 96 and 97 are preferably relatively adjustable to permit selection of the relationship between the two ramp waveforms. By making the delay and phasing of the unit 95 adjustable any desired delay line overlap can also be achieved. It is also possible to rearrange the components to have the complementary gating at the inputs of the two delay lines 56 and 91 with the outputs switched to be combined in a common channel to amplifier 73. This alternative discards the portion of the speech signal that is not utilized by each line before it enters the line and thus eliminates the necessity for dissipating these portions when the lines are switched between active periods.
  • the pulse train generator 94 produces the timing waveform of FIG. 10(a). This pulse triggers the transition of waveform C2 in pulse ramp generator 97 which produces the blanking pulse indicated in FIG. 10(0) with the predetermined width of B and B being determined by the blankingpulse width control 68. After the delay indicated in FIG. 10(b) the pulse from generator 94 triggers the ramp generator 96 to produce the waveform Cl shown in FIG. 10(1)). With this arrangement the control wave CI for the delay line 56 is overlapped in time by waveform C2 having slope in the same sense and bridging the steep return slope waveform of ramp wave Cl. With the asymmetrical time intervals shown in FIG.
  • the arrangement for gap filling modes of operations shown in FIGS. 5 and 6 can be practiced.
  • the waveforms C1 and C2 have symmetrical rising and falling portions the arrangement is suitable for alternate switching of the lines 56 and 91 to provide alternate compressed (or expanded) chunks of the speech sample.
  • the choice of the relative lengths of sample through line 56 and 91 will generally be dictated by manufacturing costs for the delay line.
  • a relatively shorter line 91 used only for gap filling purposes will generally be more economical.
  • FIG. 1 The operation of the system of FIG. 9 for speech expansion, i. e., increasing the time duration for a given speech utterance and increasing the frequency components thereof from a reproducer running at a slower than recorded rate is shown in FIG. 1 1.
  • the ramp generators 96 and 97 have inverted outputs to produce the expansion waveforms E1 and E2 shown in FIGS. 11(a) and 11(c) respectively and the blanking waveform has been made symmetrical such that the delay lines 56 and 91 are used alternately for approximately equal periods.
  • a gap in the signal output will always occur since the lines are controlled to change from maximum delay at the start of the sample to minimum to zero delay at the end of the sample.
  • the system shown in FIG. 12 represents a simplification of the system of FIG. 9 where a fixed delay line 101 is used in place of the second variable delay line 91 of FIG. 9.
  • the control of blanking ciruits 92', 93' is simplified in that the variable width blanking gate B as derived from pulse train generator 94 correspondingly produces gaps in the output signals which have been delayed by passage through variable delay line 56.
  • the fixed delay of line 101 is selected to further delay some portion of the signal emerging from the delay line 56 by an amount sufficient to fill the gap caused by blanking pulse B thereby essentially repeating some portion of each message chunk while the variable delay line 56 is switched back to its minimum delay condition. Again, this repetition is not objectionable and may merely introduce a slight echo effect which is much less objectionable than the presence of the gap in the message signal.
  • This sequence of operation is shown in FIG. 13 where the variable chunk C and the fixed chunk C alternate in supplying the output.
  • FIG. 14 The expansion mode for operation of the circuit of FIG. 12 is shown in FIG. 14 where the ramp signals are inverted for the expansion waveform which controls the delay line 56 to vary from maximum delay to minimum delay over the linear ramp portion E shown in FIG. 14(a).
  • the blanking waveform B is selected to pass some portion of the signal chunk through the appropriate amount of delay to fill the gap between chunks in the output as indicated in'FIG. 14(c).
  • the output is composed of chunks E and E in alternation for continuous signal.
  • the system of FIG. 12 could be further simplified by eliminating delay line 101 and conditioning gate 93' to introduce in the gap interval'any pseudo or noise signal from a suitable source which would simulate the frequency content of the actual speech signal. While this version would be less desirable than using the actual speech signal for gap filling it would, nevertheless, be better than reproducing the speech signal with the message gaps present since the audible effect of gaps becomes detrimental to recognition of the message content, especially at high compression ratios. This modification gives a mode of operation similar to the optional noise gap filling described for FIG. 7.
  • FIG. 15 shows a modification of the invention for binaural processing.
  • the speech signal from the band pass filter 53 is applied to symmetrical variable delay lines VDLl and VDLZ controlled by waveform generator 102.
  • the output of VDLl is applied as an input to gates 103 and 105.
  • the output of VDL is applied as an input to gates 104 and 106.
  • the delay line VDL is con trolled for linear variation of delay according to the waveform of FIG. 16(c).
  • the delay line VDL is controlled for linear variation of delay according to the waveform of FIG. 16(d).
  • Each of these waveforms has its rapid return transition at the mid-point of the linear delay portion of the other waveform.
  • the gates 1t 3 and 106 are controlled by gating waveforms B and 13, shown in FIG. 16(e).
  • Gate 103 passes signal during B and is blocked during B
  • Gate 106 is blocked during B and passes signal during B
  • Amplifier 107 combines the outputs of gates 103 and 106 and applies the combined signal to an audio reproducer 108.
  • the system of FIG. 15 operates to reproduce the entire original signal (for compression ratio equal to two) since each delay line processes the portion which is the discard for the other line as is evident from FIGS. 16(a) and 16(b). For compression ratios greater than two some message discard occurs and for ratios less than two the overlap or message duplication increases in the output. By listening binaurally, however, the intelligibility is enhanced since the overall discard is eliminated (or greatly reduced for the higher compression ratios) and the overlap or repeat of message portions is not detrimental to word detection by the listener.
  • a binaural system without supplemental gap filling would be achieved by removing gates 105 and 106 in FIG. 15.
  • the lines VDL, and VDL, would supply the processed signal in alternation to the respective output transducers 108 and 110 for binaural output.
  • FIG. 17 shows the invention using a form of delay line capable of processing speech signals in a manner which greatly reduces the problems associated with dis carding stored information in the line.
  • The; system shown in FIG. 17 comprises an analog shift register having a plurality of stages ASR ASR ASR,, which has a speech signal input applied on line 111 and a compressed or expanded speech signal output on line 112. Alternate stages of the delay line are clocked by

Abstract

An electrical delay line which has variable delay controlled by a signal input thereto is connected in a sound signal channel for signals such as human speech, to compress or expand the sound signal waveform depending on whether the time delay in the line is increased or decreased. By periodically sweeping the delay line from minimum to maximum time delay or vice-versa, repeated segments of a continuous sound signal waveform are processed so that an output audio signal can be obtained having the original frequency components of the signal and occupying a time duration which is equal to or smaller or larger than the original sound sequence with the successive segments of the signal processed by the variable delay line assembled with regard both to the significant parameters of human speech or other coding and the electrical conditions imposed by the system to produce a composite audio signal which is an intelligible replica of the original and substantially free of annoying aberrations introduced by the delay line processor. Variable delay using analog or digital signal storage is also provided.

Description

United States Patent [191 Schiifman 1 1 VARIABLE DELAY LINE SIGNAL PROCESSOR FOR SOUND REPRODUCTION [75] Inventor: Murray M. Schiffman, Newton,
Mass.
[73] Assignees: Cambridge Research and Development Group, Westport, Conn.; Sanford D. Gr'eenberg, Washington, DC; D. T. Liquidating Partnership, New York, NY. 22 Filed: Aug. 13, 1971 [21] Appl. No.: 171,571
[52] US. Cl. 179/1555 T [51] Int. Cl. G101 1/06 [58] Field of Search 179/15.55 T, 15.55 R, 179/15 BW; 178/54 HE, 66 TC, DIG. 3
[56] References Cited UNITED STATES PATENTS 3,202,769 8/1965 Coleman ,l78/5.4 HE 3,093,796 6/1963 Westerfield.. 179/1555 R 3,278,907 10/1966 Barry 179/1555 T 3,409,736 111/1968 Hurst 178/6.6 TC
Primary Examiner-Kathleen H. Claffy Assistant Examiner-Jon Bradford Leaheey Anor'ney'Charls E. Pfund, Esq; and Chittick, Thompson & Pfund 1 Jan. 15, 1974 [57] ABSTRACT An electrical delay line whichhas variable delay controlled by a signal input thereto is connected in a sound signal channel for signals such as human speech, to compress or expand the sound signal waveform depending on whether the time delay in the line is increased or decreased. By periodically sweeping the delay line from minimum to maximum time delay or vice-versa, repeated segments of a continuous sound signal waveform are processed so that an output audio signal can be obtained having the original frequency components of the signal and occupying a time duration which is equal to or smaller or larger than the original sound sequence: with the successive segments of the signal processed by the variable delay line assembled with regard both to the significant parameters of human speech or other coding and the electrical conditions imposed by the system to produce a composite audio signal which is an intelligible replica of the original and substantially free of annoying aberrations introduced by the delay line processor. Variable delay using analog or digital signal storage is also provided.
65 Claims, 64 Drawing Figures Audio 53 N SpeechBand A f e Variable Blanking 335 5OOO Output 5| P ypok Input l l Dem 1 Circuit a BQndP j evlce A Te I Band-Pass 7| j k Filter 74 Filter 76 9 l 72 F" 73 I 57 Gap lElv 75.
c\ |B| E lBlonking Gate Rompevel Blanking 6? Pulse Amplitude Changer Generator I I i 61 59 U. u.
Comp Sample 63 J\. Period Ramp Pulse- Train Generator 64 66 PATENTEB JAN 15 1874 t for =4 Delay for 6:3, d :g
l t for e-4 Delay for c- 4 d SHEET 02 0F 10 T Input Chunk Oufput Sample OUT 7 m 11 W /Z ?or (2 1 [gno (Cor recred Signal) EL 7 H C-T 1 FIG. 2(a) SIGNAL COMPRESSION i mm.
' T Somple Period=cT T I 1 C m ou'r I e=%, T Input Chuhk=eT T foufput Chunk,
Sample Period t'=tT c'=c--1= ILLUSTRATION FOR 9 =4 1e 11 et=t for e =1 1 n Qa-T (Corrected Signal) 1 FIG. 2(b) SIGNAL 4'3 EXPANSION e z; me I g & ""m maummm rzclhjnj TOUT) INVENTOR 1 i MURRAY M. SCHIFFMAN our TO'UT BY et =t' Sample Period =eT =T ATTORNEY PATENTEUJAN 15 m4 SHEET DJUF 10 INVENTOR MURRAY M SCHIFFMAN QMeTGM ATTORNEY PATENTEDJAN 15 m4 SHEET 0% OF 10 5150 85m A: Emmi .rDnTrDO mmkfm ozikoozw AB .rDnCbO omEmwQ AS INVENTOR MURRAY M. SCHIFFMAN WSGM- ATTORNEY Pmmmmsmu 3.786.195
sum user 10 Avc At Speech Bond Audio 51 Playback Inpui Amplifier \lgriloble 5:1 5:9 333 -sooo 1)) eu r De e Vorroble Line 7l Amplifier BorEHPoss 74 Bond-Puss 5 er FiHer 7 r 2 r 57\ Gop lHer 75 C\ B E f Romp Level BOnking I 6K eei e'izr r Changer Inverter P62 Exp. Comp. Sample 7 63 J\\ Period Romp Samp Pulse- Period T ain 1 65 Cnil Generator \64 56 Romp Pulse Generator 84 B Speech 8 Sig no! r i b) W I I Chunk Lenqih Sample Period gup (c) B E |npuf EXPANSION CHUNK Output (d) J INVENTOR A v G p MURRAY M. SCHIFFMAN maew ATTORNEY mm 15 m4 3388.195
SHEET UYUF 10 53 Audio In Out FIG. l2
*1 B "-+1 F|G.|3 (b) BLANKING "COMPRESSION FIG. I4
EXPANSION K INVENTOR F v MURRAYMSCHIFFMAN ATTORNEY mimmm 15 m4 3786L195 sum use; 10
I FIG. l5 1 w 5 51 Fl F1 (e) B hf) I H H E n m INPUT SIGNAL 2 AsRi ASR2 ASRN OUTPUTHZIGNAL FIG. I? r 1 SHIFT n3 FREQUENCY ILJ'LF GENERATOR n4 IELFL T INVENTOR MURRAY M. SCHIFFMAN we". GM
ATTOR N EY VARIABLE DELAY LINE SIGNAL PROCESSOR FOR SOUND REPRODUCTION BACKGROUND OF THE INVENTION The field of this invention is the processing of human speech signals or similar signals for ultimate comprehension by the human listener at substantially the natural or normal frequency component distribution but at time interval durations which are usually different from the original time duration of the speech utterance.
Sound compression and expansion systems which utilize relative motion between a magnetic tape record medium and the air gap of the pick-up head which senses the recorded signal on the magnetic medium are well known as exemplified by the patent to Schiiller 2,352,023. Devices of this type suffer from the usual operational, cost and weight limitations involved in equipment which utilizes substantial mechanical motion components. A delay line version of time compression and expansion for real time signals is also well known as shown, for example, in the patent to French et al. 1,671,151, where a voice signal is propagated along a delay line and a movable pick-up repeatedly scans the delay line to sense the signal propagating therethrough with the relative velocity between the pick-up head and the propagation velocity of the wave in the medium giving bandwidth compression or expansion for the purpose of transmitting the signal over a narrow band telephone line. Later workers in the field eliminated the mechanical motion portions of systems such as French et al. by substituting electronic switching sequentially along taps on the electrical delay line thereby providing frequency compression or expansion of thesignal for transmission over a narrow band line. The application of the sequentially scanned tapped delay line for modifying the time duration of recorded speech signals without altering the frequency components thereof is disclosed by Greenberg et al. U.S. Pat. No. 3,480,737. By relating the speed of scan of the delay line to the velocity of propagation in the delay medium and the relative speed of the reproduction of a recorded message compared to the speech utterance from which it originated Greenberg et al. achieve time expansion or compression of a recorded speech signal without altering the frequency components thereof.
Another form of frequency-time transformation is known in the art using a signal controlled variable time delay line for error correction. Systems of this type detect an unwanted frequency effect due to time irregularities in the pulse'train of a repetitive signal or such variations in an audio system where the speed of the record medium past the pick-up head is subject to periodic variations which result in the production of the audible irregularity known as wow. In reproducing the original signal these systems eliminate speed errors by servo-control of the time delay of a delay line which is interposed in the signal channel. Audio systems which employ a reference or timing signal track with variable delay line compensation for playback speed are shown for example in FIG. 9 of Coleman, Jr. U.S. Pat. No. 3,202,769 who shows an open loop servo. Woodruff U.S. Pat. No. 3,347,997 shows a closed loop servo which adjusts playback speed to compensate for the relatively low frequency wow component whereas imperfections of a high frequency nature (i.e., flutter") are compensated by a variable delay line. Such compensation systems depend for their operation on the repetitive character of the error signal and thus by The systems described in the {patents to Schuller,
French et al. and Greenberg et al., when used to reduce the frequency of a speech signal, while compressing the time in which a given segment of speech is reproduced, inevitably involve discarding a portion of the original speech wave. The ratio of the speech signal discarded to that which is used is directly related to the compression ratio and the discard loss is inherently and fundamentally related to this process of reducing the frequency and compressing the time for the processing of a given passage of speech. Since the portion of the speech which is reproduced alternates with portions which are discarded the problem of merging to reproduce sections in continuous time slots presents some problem and various solutions have been offered.
Thus Schuller suggests a skewed air gap in his rotating magnetic pick-up head or skewed tape approach to the point of contact with the rotating air gap so that the arrival and departure of the magnetic tape record relative to the air gap will occur gradually as the skew provides a transition from zero to full air gap contact with the recording medium.
The patent to French at al. suggests a number of alternatives including the use of' two spaced transducers rotating in unison with respect to the delay medium such that the message reproduced in compressed time in one transducer has superposed thereon from the other transducer the message which would ordinarily BRIEF SUMMARY OF THE INVENTION The 1 present invention provides compressionexpansion systems for speech or other coded signals in which the active frequency-time conversion element is a signal responsive delay line which is directly interposed in the path between the signal source and the ultimate reproducer or utilization device which receives the converted speech wave. Such a system does not avoid the inherent problem in time compression occasioned by the discontinuity resulting from discarding alternate portions of the speech wave. The discard portion of the speech wave may be stored or cancelled in the delay line ordiverted from entering the input terminal of the delay line which is directly in the signal channel. In any case this signal and transients produced by line switching must be discarded as the variable delay line is repetitively controlled by the delay signal between its minimum and maximum delay values. Line switching and discard occur coincidentally with the requirement for making contiguous two originally spaced portions of the speech wave and hence both of these functions are accomplished by a number of alternative embodiments herein disclosed which function in a manner consistent with requirements imposed by the parameters of the speech signal itself.
Accordingly, it is the principal object of the present invention to provide a speech compression-expansion system which utilizes a signal controlled variable delay line located directly in the signal channel between the signal source and sound reproducer which delay line is repeatedly sequenced betweenmaximum and minimum delay values to modify the frequency-time characteristic of the sound reproduced from the original signal.
It is a feature of the invention to control the input and output bandwidth of the system for the speech frequencies which are required to be reproduced for intelligibility in relation to the compression ratio thereby to exclude those frequencies which would produce distortion and intermodulation due to insufficient sampling rate or excessive phase shift per delay stage for the high frequencies and inadequate output chunk length to attain frequency conversion for the lowest frequencies.
A further feature of the invention provides maximum discard intervals as determined by the maximum delay line length which in relation to the compression ratio limits the actual original speech message discard to a value which minimizes the loss of significant cues or transitions in the speech code thereby minimizing the loss of information content transferred to the listener.
, A still further feature of the present invention is to provide, in a system utilizing variable delay line speech compression or expansion, for signal processing at the point of juncture of two reproduced speech portions to suppress distracting noise components and also to avoid the introduction of false cues which could modify the information conveyed in the subsequent speech segment. To this end, if required to suppress such noise, the transition between successive reproduced speech samples can be modified by simple transfer function selection or control, orthe transition can be eased by the introduction of synthetic or speechderived signal portions to approximate a smooth transition within a time interval which does not lose actual cues and under such conditions that do not introduce false cues.
These and other features and advantages of the invention will be apparent from the following detailed description taken in conjunction with the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS FIGS. 4(a) to 4(f) show waveforms useful in describing forms of processing of a transition between adjacent reproduced speech samples.
FIGS. 5(a) to 5(d) show a set of curves representing active processing of the transition between adjacent samples.
FIGS. 6(a) to 6(e) show waveforms useful in describing the use of two delay lines to effect transition between adjacent speech samples.
FIG. 7 shows a block diagram of a speech compressor-expander system in accordance with the invention.
FIGSTSKa) 6823) show waveforms useful in describing the operation of the system of FIG. 7.
FIG. 9 shows a block diagram of a dual delay line system in accordance with the invention.
FIGS. 10(a) to 10(d) show waveforms useful in describing the operation of the system of FIG. 9 for compression.
FIGS. l1(a) to 11(d) show waveforms useful in describing the operation of the system of FIG. 9 for expansion.
FIG. 12 is a partial block diagram of a modification. I IYG STIKGTEE ISYE) show waveforms useful in describing the operation of circuit of FIG. 12 for compression.
FIGS. 14(a) to l4(c) show waveforms useful in describing the operation of the modification of FIG. 12 for expansion.
FIG. 15 is a partial diagram of a dual delay line binaural system.
FIGS. 16(a) to 16(f) show waveforms useful in describing the operation of the system of FIG. 15.
FIG. 17 is a partial block diagram of a speech processor in accordance with the invention using an analog shift register at the variable delay element.
FIG. 18 is a block diagram showing gap filling with signal continuity in a system similar to FIG. 17.
FIG. 19 is a partial showing of an embodiment of the invention with variable delay provided by an r-bit parallel digital shift register.
FIG. 20 shows an embodiment of the invention with variable delay provided by a serial digital shift register.
FIG. 21 shows an embodiment using an analog storage memory matrix for variable delay.
' FIG. 22 shows an embodiment using an r-bit digital random access memory.
FIG. 23 shows a logic diagram of a directional zero signal level gating control in a dual line system. 7
FIG. 24 shows waveforms useful in describing the operation of the circuit of FIG. 23.
FIG. 25 shows graphically the clock frequency and maximum signal frequency for the system of FIG. 17.
DESCRIPTION OF THE PREFERRED EMBODIMENTS The description of the preferred embodiments will be preceded by a discussion of the parameters of the speech signal particularly as they relate to speech compression for reproducing a given speech message in a shorter period of time. Because of the fundamental and unavoidable limitations involved in speech compression, the following discussion will proceed with primary attention directed to the method and apparatus used in compression mode. Compression mode operation in the time domain results in the discard of a fraction of the original information directly proportional to the compression factor which is also the factor by which the time to present a given speech sequence is decreased. The method and apparatus are usable however in expansion mode and the considerations involved for use of reproduced signals which occupy a greater length of time than the original speech utterance will be described separately hereinafterqThe system is also capable of frequency transformation without a corresponding time change to achieve a desired frequency signal, such as may be involved in generating speech in amedium having a propagation velocity different than air.
Referring tdF IGTi, a partiedar yaaauragqaaay line which provides maximum time delay of 6ms for the final portion of the sample is shown. Assuming that the speech signal to be processed is limited to frequency components between 333 and 5,000 Hz, certain parameters of the playback system for compression can be defined. A magnetic tape 21 has the speech signal recorded thereon of which the lowest frequency component at 333 l-Iz'is depicted by sine wave 22 with the tape being drawn past a pick-up transducer as it is wound on a take-up reel 24 at speed S. The electrical signal produced by the transducer 23 passes through a compression processor 25 and is reproduced as an audible signal from speaker 26.
The system 23 26 shown on line (a) of FIG. 1 just described reproduces the recorded signal on the tape 21 without frequency or time change if the speed of takeup reel 24 draws the tape past the transducer 23 at the speed of recording S and for this condition processor 25 would introduce a fixed constant delay time of any value. Thus in line (b) of FIG. 1 where c=1, the reproduction of the 333 Hz sinusoidal signal without change other than fixed phase delay (which has been ignored) is shown.
speed is discarded and on line (c) is labelled discard."
This discard includes cycles 5, 6, 7 and 8 of the original wave 22 and represents the gap in the information content between successive chunks which are reproduced as audible signals. This audible output is represented on line (d) where the chunk is depicted as a piece of tape 31 played at speed 2S and containing cycles 14 which afterprocessing is effectively stretched into a piece of tape 32 occupying the original l2ms of recorded time and containing the cycles 1-4 at their original recorded frequency. In line ((1) it will be noted that the next cycle reproduced is cycle number 9 of the original wave after cycles 5-8 inclusive have been discarded. The representation in line (d) of a smooth transition between the end of cycle 4 and the start of cycle 9 should not be taken as representative of real signal conditions as would be obvious from a consideration of an actual signal as opposed to the idealized signals presented in FIG. 1.
Lines (f), (g) and (h) of FIG. 1 illustrate the situation which prevails when the compression ratio is equal to five with the tape speed drawn past transducer 23 at five times the recorded speed- S. With a final signal delay of 6ms maximum this compression ratio results in a chunk length of 1.5 containing 2 a cycles of the 333 Hz wave 25 of line (a) and again a discard interval of 6 ms equal to the final signal delay and corresponding to a delay line length of 10rns at the end of the sample. The information gap, however, has increased to the point where the last half of cycle number 3 and first half of cycle number 13 and all the intervening information in the original recorded wave have been lost in the discard and this gap in the message represents 30ms of the originally recorded speech utterance.
The relations among the parameters of a speech compression system and those pertaining to the information content of the speech coding interr'elate in a manner to specify optimum conditions and place outside limits on the mode of operation of the systems of the invention for a given intelligibility factor. These parameters can be examined with respect to a particular system for various compression ratios and for this purpose the system parameters for a system using a delay line with a maximum final signal delay AT of 6ms are set forth in the following table. i
i TABLE 1 Typical Parameters for Speech Compressor Chunk/Discard Ratio Comp Line (Playback (Recording Sample Rep Cycles Ratio Length cTime) Time Period Rate Sample m u/ M, nIt nnz T U c d (ms) (ms) (ms) llP. Hz)
1.25 2/9 6 24/6 30/7.5 30 33.3 10 1.5 2/5 7 1/5 12/6 18/9 18 55.6 v 6 2 35 8 6/6 12/12 12 83.3 4 3 l 9 3/6 9/18 9 111 3 4 6/5 9 3/5 2/6 8/24 s 125 2 "a 5 4/3 10 1.5/6 7.5/30 7 A 113 2 pying 24ms of recorded time. This retained portion is designated a chunk and is shown in speeded form prior to processing on line (c) of FIG. 1 to include cycles number 1, 2, 3 and 4. By virtue of the compression process and since the maximum finalsignal delay is- The basis for the frequency-time transformation employed in the present invention can be derived as follows. Consider a sine wave V=E sin wt recorded with a tape recorder. If the tape is played back at 0 times the original recording rate, the result is V=Esincmt i where c is called the compression ratio. If c l the time is compressed for any given speech passage and if c l time is expanded by the factor e where e 1/0.
If the signal is then applied to a delay line in which the delay of the line is caused to increase linearly with time at the rate d, so as to cause the average delay of the signal to be c which represents the delay any point on the waveform will experience in passing through the line, then the signal (1) becomes V=E sin (c-c') wt The original signal is restored if the delay is ct flo restore) (C such that c d/2 (c l), the average delay rate of the line 4 which is derived as one half the sum of the final and initial delay values of the delay line, which when multiplied by time, t, thus yields the total delay, ct;
FIG. 2(a) shows a plot for a given signal sample of signal output time, r vs. the corresponding input time, t Thus a line with a slope of 4 represents signal of four times the original frequency or speed of presentation, and one-fourth the periodicity while a line with a slope of 1 represents the resultant restored, or unchanged signal. In order to convert such a signal (as represented by the line I of slope c 4) to one corresponding to line II with a slope of 1, and with a corresponding frequency decrease, it is necessary to increasingly delay the input signal, ct by an amount ct [or (cl)t] as shown at line III. Thus a signal chunk, T 24 has an ordinate which intersects III at the ordinate value c'T and this value when added to the time abscissa value at the point cT on I, delays the signal to T on line II. The delay dt introduced by the delay line is shown by line IV. Such a delay line has the effect of delaying the instantaneous signal, t through a linearly increasing amount d-t for the interval from t to t as shown by line IV. Thus as in the case of the end signal at time t=T one-half the sum of the initial delay, ([1],, and the final delay, dT, yields an average delay value on line IV of c'T the amount required for restoration.
In more general terms the restoration may be achieved by cumulatively delaying an input singal, t,,,, by an amount from which we obtain (4) In FIG. 2(b) the corresponding relationsfor signal expansion are shown. Line l with a slope of one-fourth represents a signal of one-fourth the original frequency or speed of presentation. In order to convert such a signal to one corresponding to line II with a slope of 1, and with a corresponding frequency increase, it is necessary to decreasingly delay the input signal, ct (=(1/e) t (1/4) t,-,,) by an amount ct l-e/e) t= -34!) from an initial delay of c'T This amount of delay, ct, at any point shifts the signal to the corresponding ordinate value on line II. The delay dt introduced by the delay line is shown by line IV Such a delay line has the effect of delaying the instantaneous signal by a linearly decreasing amount d't' for the interval from t to t as shown by line W Thus as in the case of the initial signal at time, t=0, one half the sum of the initial delay, d-T,,, and its final delay, dT yields an average delay value on line IV, of -c'T The process of linearly increasing time delay cannot continue indefinitely, and from time to time the delay line must be returned to its original length. If this process is repeated at periodic intervals, provided the interval is longer than the period of the lowest frequency component of the signal, chunks of the original signal will be played back at the angular frequency (c c')w and the rest discarded. When (3) and (4) are satisfied, the system operates as though sections were cut out of the original tape, pasted together, and played back at normal speed. The sections of signal are heard at the correctfrequency but the information is transmitted in a shorter time (if c l). The speech has been compressed to 1/0 of its original length.
The values set forth in Table I have been plotted in FIG. 3(a). For any given compression ratio the sample time is given by the curve T and the chunk length is shown by the curve T The difference between these two curves is the discard which is equal to the final delay to the signal at the end of the sample period (6ms in the example shown in FIG. (3a). Entering the curve at any compression ratio, such as c 5 in FIG. (3a), one obtains the chunk and discard times for the tape running at c times the recorded speed and these values projected to the time axis show the actual original recorded time for the respective chunk and discard portions. As indicated for c 5 the chunk is 1.5 ms long and the discard is 6 ms long representing respectively 7.5 ms of recorded and reproduced information and 30 ms of discarded information. This latter value is represented by the quantity c AT which is also plotted in FIG. 3(a) For a speech signal in which the lowest frequency 333I-Iz has a period of 3 ms, a chunk length of 1.5 ms at c 5 corresponding to 7.5 ms of recorded time will contain 2.5 cycles of the 333I-I signal. For any higher frequency components in the speech signal more cycles will be contained in the 1.5 ms chunk. The length of the chunk should exceed the period of the lowest frequency to be passed (i.e., should include at least a full cycle) otherwise satisfactory compression will not be obtained. As indicated in FIG. 3(a) below the time axis at 3ms, the 333Hz signal is processed at sample periods approaching 3 ms would with its samples reassembled accordingly produce a compressed output of poor quality since the sampling would then be causing a disruptive discontinuity for nearly every cycle of the 333 Hz signal processed. Sample periods less than 3ms would not permit completion of any one cycle so that the resultant reassembled output would not only contain the said disruption but would also begin to exhibit a basic change in its frequency characteristic in the form of waveform compression by truncation to produce false frequencies. While this condition does not represent a real condition for a speech wave due to the complexity of the waveforms, this principle is controlling and sample periods less than the period of the lowest frequency wave in the speech signal will not provide proper compression.
Sample periods greater than the period of the lowest frequency wave will produce compression and an interval of disruption exists from the region where the sample period is only slightly greater than the period of the lowest frequency wave as indicated on the time axis between 3 ms and 6 ms in FIG. 3(a). The result obtained within this period of disruption is a distorted expanded wave in which the effect of disjunctions between samples becomes extremely severe as the single cycle point is approached and diminishes as the number of cycles in the sample increases. As a practical matter two and one-half cycles per sample is indicated as the desired limit in FIG. 3(a) but in general the more cycles in the sample the less the disturbance factor.
In order. to avoid the extreme distortion produced by waves which have a longer wavelength than the sample period, these lower frequencies should be filtered out before the speech signals enters the delay line otherwise these disjointed and highly distorted waves will be propagated down the line and intermodulate with the desired signal and may severly degrade the system performance. v
For lower values of the compression ratio than c 5, and keeping AT 6ms, the chunk length increases with the result that the actual time sample increases to greater than 7.5 ms and therefore more than the minimum number of cycles for the lowest frequency wave component will be present in the chunk. Thus it would be at the users option to operate the line over less than the 6ms indicated delay for AT to reduce the amount of discard.
Considering the discard portion of the sample as a constant 6ms long at the compressed rate of playback, the actual information loss is the compression ratio times 6ms so that with c 5 the actual information discarded for each sample is 30ms of recorded time. As shown on the time axis of FIG. 2 this is the interval from 7.5 ms to 37.5 ms and the relation of this loss of information to the intelligibility of the reproduced speech signal must be examined.
In general, human speech represents an extremely complex coding of a relatively limited set of sounds called phonemes which taken in context with the various attributes of the speech code such as the voicedunvoiced components, pitch, formant frequencies and the continuum of sound pattern represented by sound energy (and the absence thereof) connected by the all important transititions between the temporal components thereof constitutes an acoustic stream of infinite variety and versatility. The ability of the human ear to receive this acoustic message and the ear-brain system to decode the message is not altogether understood since it appears that the readily comprehended information rate far exceeds the mere acoustic response characteristics of the ear as a receiver.
Fortunately, the ability of the ear-brain system to comprehend the message which is conveyed by human speech signals is sufficiently good to permit large portions of the actual acoustic stream to be lost or discarded without significant loss in the perception and comprehension of the message information content of the acoustic signal. Since the comprehension of message content decreases more rapidly than the recogni tion of individual words as the message is presented to the listener at increasing rate,-the problem associated with the discard of a portion of the signal stream can be resolved in favor of comprehension and short of the point where intelligibility of individual words deteriorates. This latter point is reached where the loss or alteration of transitions or other ones representing the connection between a consonant and vowel sound results effectively from the discard of much or all of a given cue or cues so as to alter the apparent information content of cotiguous concatenated chunks. Even before the point of absolute loss of intelligibility is reached the limit of tolerance due to discomfort for sustained listening occurs as a result of the unnatural sounds and the fatigue which develops in the intense concentration required in attempting to extract the information content in the presence of excessive time clipping.
For the purpose of speech compression the loss of intelligibility can be associated with discarding portions of the message containing significant cues or phonemes which components vary in length with the shortest being approximately l0ms to 20ms long. These short cues do not dominate speech but occur with sufficient regularity to make their systematic loss undesirable and hence a desirable upper limit for the discard period would be considered to be 30 ms and preferably closer to l5ms. With this limit set for intelligibility of the reproduced syllables and words the rate of presenting a given message can be increased to the comprehension limit for any given listener and degree of difficulty of the subject matter with minimum concern for-the limitation which would be imposed by permitting loss or distortion of the word content or the generation of false cues from the concatenated message chunks.- FIG. 3(a) indicates the recording time discard relation to compression ratio as the linear function cAT with the range from 15 ms to 30 ms designated the discard uncertainty range. Thus the 6ms discard at c 5 projects to include the real time recording interval from time t 7.5 to t= 37.5 which approaches the upper limit permitted for discard without undue loss of intelligibility as required not to contribute significantly to the loss of comprehension in the message perceived. Smaller values of c result in smaller actual discard time and hence the intelligibility is improved especially for those cues which are at the lower end of the time scale, i.e., in the neighborhood of lms.
While Table I and FIG. 3(a) represent parameters for a typical speech compression system having a final signal delay of 6ms and define the limits of operation within fairly narrow limits, it will be appreciated that the principles involved can be adapted for use over a wider range of operation. Thus the variation of the actual frequency band of the speech signal and the maximum length of the delay line are both important design factors which influence the selection of the chunk-todiscard ratio and sample period for a given range of the compression ratio c. On the other hand, the actual frequency range of the signal has an important bearing on the design of the delay line which must accommodate the frequency spectrum present in the signal as to such quantitative and qualitative factors as the voice pitch, the .presence of all or only some of the format frequencies for an individual voice and the width of the signal spectrum over which linear phase-frequency properties must be preserved. The ultimate system used however will embody design choices of the factors involved within the broad limits herein defined.
FIG. 3(b) is a plot of corresponding relations for signal expansion showing the initial gap, output chunk and maximum delay line length variation with expansion ratio e for a given input sample interval T The output gap occurs at the start of each sample period and thereafter for the balance of the sample period the reduced frequency time-expanded output chunk appears. The maximum delay d T required is also shown as a function of the expansion ratio e.
One aspect of the speech compression system described in connection with FIG. 1 has not been treated, namely, the audible output of the transducer 26 when the variable delay processing unit 25 is switched from maximum to minimum delay at the end of the sample period. Just prior to switching the delay line is loaded with the speech signal which is to be discarded and if the line is instantaneously switched to zero delay all of this information unless cancelled or predeleted, will be presented in highly condensed form in the output signal. As a practical matter with conventional delay lines utilizing R and L or C components there will be a time interval required for switching the line from maximum to minimum delay and it has been found that even if the line does not contain signal information this switching of a line has a significant minimum time constant associated with it which produces a disturbing transient audible in the output signal. The repetition rate of this transient is the reciprocal of the sample period. Because of the limitations imposed by the parameters of the system as previously set forth herein, this switching frequency and spectral components of the transient itself will always be within the audiorange and thus present as a highly disagreeable intermoduation component in the audio output of the device. The present invention provides a number of implementations for transient suppression and message gap bridging arrangements for the purpose of minimizing the disagreeable noise effects involved. In more elaborate systems the substitution of pseudo or real message components further improves the transition from one sample to the next and can be adapted to fill in a portion of what is discarded in the compression process.
Referring now to FIG. 4 a portion of the 333 Hz wave at the transition point illustrated in FIG. 1(d) has been reproduced in which cycle 4 and cycle 9 of the original recorded 333 Hz wave are shown as a smooth uninterrupted sine wave. The junction between the end of cycle 4 and the beginning of cycle 9 at point 41, although shown as a continuous portion of the sine wave, is in actuality, as previously stated, almost never so related in the non-selective periodic sampling of independent complex waveforms and thus instead of a smooth transition point 41 a disjunction between the end of one chunk and the beginning of the next chunk in successive samples is to be expected. This disjunction could undoubtedly be accommodated with no loss of intelligibility if the transient from switching the line (either loaded or unloaded) did not have to be dealt with at exactly this point in time. Since this transient is responsible for a highly annoying audible output from the system it must be eliminated and for this purpose a gating signal as indicated in FIG. 4( b) may be applied symmetrically with respect to the transition point 41 to produce the output signal shown in FIG. 4(c). By making the gate long enought to encompass the transient resulting from switching the line, the audible noise so generated is eliminated. The improvement obtained by this expedient, while significant, is not ideal since the introduction of the gate signal within the audio range is itself audible as a repetitive disjunctive gap which intermodulates with the audio signal. This effect can be reduced by using an output filter designed for the particular repetition rate and gate width to smooth the abrupt transition shown in FIG. 4(c) and this output response is indicated in FIG. 4(d).
A further improvement is possible by using the gating signal as a gain control signal and tapering the off and perhaps the on transitions of the gate so that a gradual transition of the audio output from off to on is accomplished and a relatively smooth transition as indicated in FIG. 4(f) results. The object is to mini mize the gap effect which in itself has an audio characteristic and can act like a cue. Tapering the trailing edge of the gate helps this considerably whereas an anticipating start (or'relative delay of the speech signal) would be preferable for gradual onset for the leading edge. With these relatively simple expedients the smooth transition between adjacent chunks which are disjunctively joined by the operation of the compression-discard process are achieved in a manner which is satisfactory for many applications.
Referring now to FIG. 5, the more elaborate arrangements for bridging the gap between adjacent samples will be described. As shown in FIG. 5(a) a disjunctive transition which is the norm to be expected represents a sharp discontinuity in the message signal and has superposed thereon the noise transient from switching the line as previously described. By introducing a gate signal FIG. 5(b) of sufficient width to encompass the line switching transient and conditioning the gate to coincide with a zero level and same direction of change for the adjacent signals being processed a zero level gating transition as shown in FIG. 5(c) can be achieved. This transition which is free of line switching noise and essentially continues an existing zero amplitude signal level during the interval of the gate has been found to provide little or no disturbance to the average listener. I
Because of the nature of the human hearing phenomenon, particularly the ability of the'ear to synthesize the message it is concentrating upon even in the presence of noise, it may be useful in certain circumstances to introduce a pseudo or real message component in the zero level interval indicated in FIG. (c). For this purpose suitably selected noise or signal components of approximately the same amplitude and frequency can be inserted in what is otherwise a quiet gap interval in the message stream and this arrangement of the invention is indicated in FIG. 5(d). Where the gap is to be filled with noise components, a suitable source and symmetrical switching to introduce noise from the source into the signal channel can be readily applied during the gating interval.
FIG. 6 represents a preferred form of gap filling where two signal controlled delay lines are used. The speech signal is applied to both delay lines designated channel A and channel B in FIGS. 6(a) and 6(b) respectively and these two lines are signal controlled to have symmetrical complementary gain characteristics and overlapping variable delay characteristics as shown in FIGS. 6(c) and 6(d). Here the delay control signals as shown in FIG. 6(d) are phased to overlap at least an amount corresponding to the transition portion of the gain control characteristics of FIG. 6(c). The outputs of both delay channels A and B are combined to produce the combined output shown in FIG. 6(c).
Generally the length of the delay lines used for channels A and B in FIG. 6 will employ one full length delay line and one relatively shorter length delay line for stor ing the signal used for gap filling purposes. This arrangement will reduce the cost of the equipment represented by the multiple section delay lines necessary to obtain the required maximum delay length for system performance requirements. On the other hand, for systems where cost is not a primary factor, two equal full length variable delay lines can be employed and their control signals can be alternately applied so that the signal channel is through first one and then the other delay line thereby giving a full signal period for switching the inactive delay line back to minimum delay conplied as indicated in FIG. 6(c).
Referring now to FIG. 7, a basic speech compressionexpansion system in accordance with the invention will be described. This system comprises a variable speed playback device 51 which is indicated to be a tape transport with a manual select speed control input 52. The signal derived from transporting the tape past a magnetic transducer is applied to an AGC amplifier 53 whichalso passes the signal through a band pass filter having an adjustable low and high frequency cutoff. The selection of the cutoff frequencies for the filter may be operated from manual control 52 in conjunction with the selection of playback speed for the playback device 51. The manual control-52 also supplies an amplitude control signal to a fine voice pitch adjust control 54 which supplies on line 55 a signal to control the end amplitude of the linearly increasing waveform which controls the variable delay line as hereinafter described.
The signal after passing through the amplifier and filter 53 enters a variable delay line 56 which can be signal controlled between minimum and maximum delay limits. This control signal applied on line 57 is derived from a ramp level and amplitude charger 58 which receives as its input either a compression triangular waveform on line 59 or an expansion inverse of the waveform on line 59 which appears on line 61 after passing through an inverter 62. One or the: other of the lines 59 and 61 is energized with a ramp waveform depending upon the setting of a switch 63 which supplies the basic ramp waveform from ramp pulse: train generator 64. The repetition period of the ramp waveform is selectable by a manual control 65. A pulse coincident with the reset of the linear portion of the ramp waveform appears on line 66 and is applied to a blanking pulse generator 67 to produce a blanking pulse output the width of which can be controlled by manual adjustment 68 and which is synchronized with the input pulse on line 66.
The output of the variable delay line 56 is applied to a blanking circuit and amplifier 71 which transmits or blocks the signal depending upon the blanking pulse B applied on line 72 from generator 67 and when the blanking pulse is not present F the delay signal is applied to a speech bandpass filter 73 the output of which is applied to an audio reproducer 74. y
In addition to the amplitude excursion established for the linear voltage ramp signal from generator 64 which is controlled by manual control 52 the absolute level of the voltage applied can be controlled by level adjust means 60. The variable delay line 56 will generally be of any known type and in particular may be 360 RC filter stages where the shunt resistor is provided by a PET or other semiconductor device which varies resistance in response to a controlled voltage or current. Such delay lines generally perform best: with respect to distortion of the signal passing therethrough if the phase delay per stage is kept well below the maximum possi ble value of 90. Accordingly, the line can be designed to operate with 45 to 60 phase delay per stage maximum and the number of stages is then determined as greater than the quantity: N (6 or 8) c(f,,, AT In the above inequality the digits o and 8 represent the number of stages per electrical cycle of the highest frequency to be passed corresponding to a phase delay of 60 or 45, respectively, as themaximum phase shift per stage which is to be utilized; the quantity c is the compression ratio; the quantity f is the highest frequency being passed by the line; and AT is the maximum signal delay desired as dictated by the maximum permissible discard interval previously specified. Many other forms of delay line construct-ions which are capable of being signal controlled are known in the art and the present invention is not to be considered as limited to any particular form of delay line.
Referring now to FIGS. 8(a) and 8(b) the operation of the system of FIG. 7 will be described. The sample period waveform 81 has an adjustable period set by control for producing an asymmetrical sawtooth waveform 82 which produces a relatively long negative. going linear voltage followed by a shorter positive going linear voltage. This waveform is used directly on line 59 for speech compression while, after inversion in inverter 62, its inverse is used on line 61 for expansion. The expansion waveform is indicated in dotted lines at 83 inFIG. 8(a). For a variable delay line 56 which increases delay as the control voltage becomes more negative, the waveforms 82 and 83 have the proper sense for controlling the delay interval and the magnitude of the delay is determined by amplitude control 52 relative to a voltage level set by the level adjust 60. Thus 'the operating point in the excursion of the waveform 82 is selected for a given compression ratio in conjunction with the sample period which will be a predetermined combination for any given compression ratio assuming the maximum delay AT in the line 56 is a fixed value as obtained by selecting line length according to the value d-T as given in FIG. 3(a) and Table I for the desired compression ratio. If the maximum delay to the signal is not maintained constant the discard period will change correspondingly as is evident from the description of FIG. 1 and corresponding adjustments in the amplitude of the wave will be required to give the slope (I required for a compression ratio 0. Similar considerations apply for the slope of curve 83 which must be set at its corresponding value d for an expansion ratio e.
The operation of blanking pulse generator 67 is shown to produce a pulse 84 in FIG. 8(b) of predetermined width in response to the start pulse of the sample period signal 81 received on line 66. This pulse may be applied in gain control fashion to the circuit 71 with modified trailing edge as previously described to reduce the transient signal and provide a gradual onset of voice sound signals which are passed to the transducer 74. The width B of the blanking pulse is selected with control 68 and is normally made of sufficient duration to permit the short steep linear-portion of the ramp waveform to return the delay line 56 to its zero or minimum delay condition and dissipate the signal energy therein (or the transient caused by switching the line itself) prior to enabling the signal. channel which energizes the transducer 74 with the subsequent speech signal segments.
The blanking period B and enabled period B for expansion mode are shown in FIG. 8(c). The expanded chunks with an initial output gap are shown in FIG. 8(d).
The system of FIG. 7 can also be used to substitute noise or pseudo signal gap filling signals corresponding to the system described in FIG. 5. For this purpose a source 75 of such signals is arranged to supply the input signal to filter 73 during the blanking interval. By means of a switch 76 this gap filling during the blanking interval can be made optional. The gap filling signal 75 can also be derived from the message signal output of amplifier 53.
Referring now to FIG. 9, a modified-form of the invention particularly suitable for accomplishing the various gap filling procedures for the speech compression systems previously described will be disclosed. Portions of FIG. 9 which are essentially the same as those described in FIG. 7 have corresponding reference numerals and accordingly only the additions and changes will be further described. In addition to the variable delay line 56 a second variable delay line 91 receives the signal wave from amplifier 53. The output of the delay lines 56 and 91 are applied respectively to complementary blanking circuits 92 and 93. Signals passed by these blanking circuits 92 and 93 are amplified and filtered in element 73 and passed to the acoustic reproducer 74 as heretofore described.
A pulse train generator 94 produces a pulse wave train as shown in FIG. 10(a) having a selectable pulse repetition rate determined by the setting of manual control thereby establishing the basic sample period. The output pulse, from generator 94 is delayed in delay unit 95 and applied to a first ramp generator 96 and in undelayed form is applied to a second ramp generator 97. The ramp generators 96 and 97 are subject to waveform level control from manual adjust element 60 and ramp linear wave amplitude control from the manual adjust element 52. As previously stated, the fine pitch adjustment 54 may be provided for slightly modifying the ramp slope as a voice pitch adjustment by effectively altering the frequency conversion over a small range. In addition the blanking width interval of each generator is adjustable with controls 68 and 70 respectively. The outputs of the ramp generators 96 and 97 are applied respectively to delay lines 56 and 91 to control the time delay of signals passing through the respective lines in accordance with the control signals applied. By means of c or e select controls the sense of the slope of the ramp waveforms can be selected for compression or expansion.
The level and amplitude controls for setting the respective ramp generators 96 and 97 are preferably relatively adjustable to permit selection of the relationship between the two ramp waveforms. By making the delay and phasing of the unit 95 adjustable any desired delay line overlap can also be achieved. It is also possible to rearrange the components to have the complementary gating at the inputs of the two delay lines 56 and 91 with the outputs switched to be combined in a common channel to amplifier 73. This alternative discards the portion of the speech signal that is not utilized by each line before it enters the line and thus eliminates the necessity for dissipating these portions when the lines are switched between active periods.
Referring now to FIG. 10, the operation of the speech compression system of FIG. 9 will be described. The pulse train generator 94 produces the timing waveform of FIG. 10(a). This pulse triggers the transition of waveform C2 in pulse ramp generator 97 which produces the blanking pulse indicated in FIG. 10(0) with the predetermined width of B and B being determined by the blankingpulse width control 68. After the delay indicated in FIG. 10(b) the pulse from generator 94 triggers the ramp generator 96 to produce the waveform Cl shown in FIG. 10(1)). With this arrangement the control wave CI for the delay line 56 is overlapped in time by waveform C2 having slope in the same sense and bridging the steep return slope waveform of ramp wave Cl. With the asymmetrical time intervals shown in FIG. 10, the arrangement for gap filling modes of operations shown in FIGS. 5 and 6 can be practiced. By making the waveforms C1 and C2 have symmetrical rising and falling portions the arrangement is suitable for alternate switching of the lines 56 and 91 to provide alternate compressed (or expanded) chunks of the speech sample. The choice of the relative lengths of sample through line 56 and 91 will generally be dictated by manufacturing costs for the delay line. Thus for a main delay line 56 of adequate length for the compression ratio desired, a relatively shorter line 91 used only for gap filling purposes will generally be more economical. On the other hand, two full length lines which are alternately active to pass speech sample chunks thereby providing adequate time for the non-active line to be returned to its minimum delay condition will provide for smooth transitions, any desired overlap and the maximum time interval for discharging the line to minimum delay condition prior to its processing the next speech sample. The action of the system of FIG. 9 in the gap filling mode is indicated in FIG.- 10(d) and generally corresponds to that previously described with respect to FIG. 5.
The operation of the system of FIG. 9 for speech expansion, i. e., increasing the time duration for a given speech utterance and increasing the frequency components thereof from a reproducer running at a slower than recorded rate is shown in FIG. 1 1. Here the ramp generators 96 and 97 have inverted outputs to produce the expansion waveforms E1 and E2 shown in FIGS. 11(a) and 11(c) respectively and the blanking waveform has been made symmetrical such that the delay lines 56 and 91 are used alternately for approximately equal periods. By the nature of speech expansion, a gap in the signal output will always occur since the lines are controlled to change from maximum delay at the start of the sample to minimum to zero delay at the end of the sample. Thus when the line is switched to maximum delay there will inevitably be a time gap before delayed signal emerges from the output end of the line. Applying the control sequence indicated at FIG. 11 the speech samples processed by lines 56 and 91 are over lapped so as to fill the gap as indicated in FIG. 11(d) by the solid and dotted outlined signal chunks E and E The presence of a slight overlap in the reproduced signal does not significantly interfere with intelligibility since it generally is not noticeable and at worst may result in a slight echo effect of the type commonly encountered in a telephone conversation. The time expanded speech waveform obtained using the mode of operation indicated in FIG. 11 is useful for the recognition and comprehension of difficult passages and for analysis and study of foreign languages and the like.
The system shown in FIG. 12 represents a simplification of the system of FIG. 9 where a fixed delay line 101 is used in place of the second variable delay line 91 of FIG. 9. The control of blanking ciruits 92', 93' is simplified in that the variable width blanking gate B as derived from pulse train generator 94 correspondingly produces gaps in the output signals which have been delayed by passage through variable delay line 56. The fixed delay of line 101 is selected to further delay some portion of the signal emerging from the delay line 56 by an amount sufficient to fill the gap caused by blanking pulse B thereby essentially repeating some portion of each message chunk while the variable delay line 56 is switched back to its minimum delay condition. Again, this repetition is not objectionable and may merely introduce a slight echo effect which is much less objectionable than the presence of the gap in the message signal. This sequence of operation is shown in FIG. 13 where the variable chunk C and the fixed chunk C alternate in supplying the output.
' The expansion mode for operation of the circuit of FIG. 12 is shown in FIG. 14 where the ramp signals are inverted for the expansion waveform which controls the delay line 56 to vary from maximum delay to minimum delay over the linear ramp portion E shown in FIG. 14(a). The blanking waveform B is selected to pass some portion of the signal chunk through the appropriate amount of delay to fill the gap between chunks in the output as indicated in'FIG. 14(c). Thus the output is composed of chunks E and E in alternation for continuous signal.
The system of FIG. 12 could be further simplified by eliminating delay line 101 and conditioning gate 93' to introduce in the gap interval'any pseudo or noise signal from a suitable source which would simulate the frequency content of the actual speech signal. While this version would be less desirable than using the actual speech signal for gap filling it would, nevertheless, be better than reproducing the speech signal with the message gaps present since the audible effect of gaps becomes detrimental to recognition of the message content, especially at high compression ratios. This modification gives a mode of operation similar to the optional noise gap filling described for FIG. 7.
FIG. 15 shows a modification of the invention for binaural processing. The speech signal from the band pass filter 53 is applied to symmetrical variable delay lines VDLl and VDLZ controlled by waveform generator 102. The output of VDLl is applied as an input to gates 103 and 105. The output of VDL is applied as an input to gates 104 and 106. The delay line VDL, is con trolled for linear variation of delay according to the waveform of FIG. 16(c). The delay line VDL is controlled for linear variation of delay according to the waveform of FIG. 16(d). Each of these waveforms has its rapid return transition at the mid-point of the linear delay portion of the other waveform.
The gates 1t 3 and 106 are controlled by gating waveforms B and 13, shown in FIG. 16(e). Gate 103 passes signal during B and is blocked during B Gate 106 is blocked during B and passes signal during B, Amplifier 107 combines the outputs of gates 103 and 106 and applies the combined signal to an audio reproducer 108.
The gates 104 and are controlled by the gating wavefonns B and B, shown in FIG. 16(f). Gate 104 passes signal during fi gnd is blocked during B Gate 105 is blocked during B and passes signal during B Amplifier 109 combines the outputs of gates 104 and 105 and applies the combined signal to an audio reproducer l 10.
The system of FIG. 15 operates to reproduce the entire original signal (for compression ratio equal to two) since each delay line processes the portion which is the discard for the other line as is evident from FIGS. 16(a) and 16(b). For compression ratios greater than two some message discard occurs and for ratios less than two the overlap or message duplication increases in the output. By listening binaurally, however, the intelligibility is enhanced since the overall discard is eliminated (or greatly reduced for the higher compression ratios) and the overlap or repeat of message portions is not detrimental to word detection by the listener.
A binaural system without supplemental gap filling (as just described) would be achieved by removing gates 105 and 106 in FIG. 15. The lines VDL, and VDL, would supply the processed signal in alternation to the respective output transducers 108 and 110 for binaural output.
FIG. 17 shows the invention using a form of delay line capable of processing speech signals in a manner which greatly reduces the problems associated with dis carding stored information in the line. The; system shown in FIG. 17 comprises an analog shift register having a plurality of stages ASR ASR ASR,,, which has a speech signal input applied on line 111 and a compressed or expanded speech signal output on line 112. Alternate stages of the delay line are clocked by

Claims (65)

1. A processor for electric signals representing coded audible signals such as speech or the like comprising: means for deriving said electric signals as analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals; variable delay line means having an input and an output, said input coupled to said means for deriving said electric signals for propagating representations of said electric signals to said output with controllable time delay; means for controlling said delay line means for periodic linear variation of said time delay between predetermined delay values; and means coupled to said output of said delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said time delay for producing output signals reproducible as an audible representation of said electric signals having frequency components altered by substantially said factor to approximate the frequency coMponents of said audible signals.
2. A processor for electric signals representing coded audible signals such as speech or the like comprising: means for deriving said electric signals as analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals; variable delay line means having an input and an output, said input coupled to said means for deriving said electric signals for propagating representations of said electric signals to said output with controllable time delay; means for controlling said delay line means for periodic linear variation of said time delay between predetermined delay values; and means coupled to said output of said delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said time delay for producing output signals reproducible as an audible representation of said electric signals having frequency components altered by substantially said factor to approximate the frequency components of said audible signals; and in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and including means for blanking output signals from said output of said delay line means to substantially zero amplitude during said reset intervals.
3. Apparatus according to claim 2 in which said means for blanking includes means for gradually restoring the amplitude of said output signals to full amplitude after said reset intervals.
4. Apparatus according to claim 1 and including input high pass filter means, said filter means substantially eliminating components of said electrical signals from propagating through said delay line means which are below a frequency approximately equal to said factor times the reciprocal of the period of said linear variation.
5. Apparatus according to claim 4 and including band pass output filter means for limiting said output signals to a predetermined audio bandwidth.
6. Apparatus according to claim 1 and including: input high pass filter means having adjustable low frequency cutoff interposed between said means for deriving said electric signals and said input of said variable delay line means; means for adjusting the effective rate of said periodic linear variation to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
7. Apparatus according to claim 6 in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and including means for blanking output signals from said output of said delay line means to substantially zero amplitude during said reset intervals.
8. Apparatus according to claim 7 in which said means for blanking includes means for gradually restoring the amplitude of said output signals to full amplitude after said reset intervals.
9. Apparatus according to claim 2 in which said means for blanking includes means responsive to the level of said output signals being zero at the beginning of said reset interval to initiate said blanking and means responsive to the level of said output signals being zero at the end of said reset interval to terminate said blanking.
10. Apparatus according to claim 2 and including means for introducing an audio signal segmenT into for combining with said audible representation of said electric signals during said reset interval.
11. A processor for electric signals representing coded audible signals such as speech or the like comprising: means for deriving said electric signals as analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals; variable delay line means having an input and an output, said input coupled to said means for deriving said electric signals for propagating representations of said electric signals to said output with controllable time delay; means for controlling said delay line means for periodic linear variation of said time delay between predetermined delay values; and means coupled to said output of said delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said time delay for producing output signals reproducible as an audible representation of said electric signals having frequency components altered by substantially said factor to approximate the frequency components of said audible signals; and in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and further including: second delay line means coupled to said means for deriving said electric signals; and means for interrupting signals from the output of said variable delay line means and substituting signals from the output of said second delay line means during said reset intervals to produce said output signals reproducible as said audible representation.
12. Apparatus according to claim 11 in which said second delay line means include a variable delay line and including means for controlling said second delay line means for periodic linear variation of said time delay between predetermined delay values, said variation of said second delay line means occurring during said reset intervals for the first recited said delay line means.
13. A processor for electric signals representing coded audible signals such as speech or the like, said electric signals being analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals comprising: first and second variable delay line means coupled to a source of said electric signals for propagating said electric signals therein with controllable time delay; means for controlling said first and second delay line means for periodic linear variation of said time delay between predetermined delay values, said means for controlling said first and second delay line means providing periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay lines are returned to an initial predetermined delay value, said reset intervals for each said delay line means occurring within the period of the unidirectional delay variation of the other said delay line means; and means coupled to the outputs of said first and second delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said delay for producing an audible representation of said electric signals having frequency components altered by substantially said factor to approximate the frequency components of said audible signals.
14. Apparatus according to claim 13 in which said first and second delay line means vary between the same said predetermined delay values alternately with substantially equal periods of said linear variation.
15. Apparatus according to claim 14 in which said reset intervals are substantialLy shorter than the periods of said linear variation and occur for each said delay line means approximately at the mid-point of said linear variation for the other said delay line means and including: separate audio reproducer means coupled to the respective outputs of said first and second delay line means for producing simultaneously separate versions of said audible representation from each said reproducer means.
16. Apparatus according to claim 11 and including: input high pass filter means having adjustable low frequency cutoff interposed between said means for deriving said electric signals and said input of said variable delay line means; means for adjusting the effective rate of said periodic linear variation to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
17. Apparatus according to claim 16 in which said second delay line means is also a variable delay line and including means for controlling said second delay line means for periodic linear variation of said time delay between predetermined delay values, said variation of said second delay line means occurring during said reset intervals for the first recited said delay line means.
18. A processor for electric signals representing coded audible signals such as speech or the like, said electric signals being analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals comprising: first and second variable delay line means coupled to a source of said electric signals for propagating said electric signals therein with controllable time delay; input high pass filter means having adjustable low frequency cutoff interposed between said source and the input of said variable delay line means; means for controlling said first and second delay line means for periodic linear variation of said time delay between predetermined delay values, said means for controlling said first and second delay line means providing periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay lines are returned to an initial predetermined delay value, said reset intervals for each said delay line means occurring within the period of the unidirectional delay variation of the other said delay line means; means for adjusting the effective rate of said periodic linear variation to adjust the factor by which frequencies propagated through said delay line are altered; means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation; and, means coupled to the outputs of said first and second delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said delay for producing an audible representation of said electric signals having frequency components altered by substantially said given factor to approximate the frequency components of said audible signals.
19. Apparatus according to claim 18 in which said first and second delay line means vary between the same said predetermined delay values alternately with substantially equal periods of said linear variation.
20. Apparatus according to claim 19 in which said reset intervals are substantially shorter than the periods of said linear variation and occur for each said delay linE means approximately at the mid-point of said linear variation for the other said delay line means and including: separate audio reproducer means coupled to the respective outputs of said first and second delay line means for producing simultaneously separate versions of said audible representation from each said reproducer means.
21. Apparatus according to claim 11 in which said second delay line means is a fixed delay line the input of which is coupled to an output of said variable delay line means.
22. Apparatus according to claim 21 and including: input high pass filter means having adjustable low frequency cutoff interposed between said source and the input of said variable delay line means; means for adjusting the effective rate of said periodic linear variation to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
23. A processor for recordings representing coded audible signals such as speech or the like comprising: playback means operable to reproduce said recording at a rate faster by a given factor than the recording rate thereby producing electric signals which are the time compressed analog of said audible signals with frequency components increased by said factor; variable delay line means coupled to said playback means for propagating said electric signals therein with controllable time delay; means for controlling said delay line means for periodic linear increase of said time delay between predetermined delay values; and, means coupled to the output of said delay line means and responsive subtantially only to signals propagating through said delay line means which have been subject to said increase of said time delay for producing an audible representation of said electrical signals having frequency components reduced substantially by said factor to approximate the frequency components of said audible signals.
24. Apparatus according to claim 23 and including input high pass filter means, said filter means substantially eliminating components for said electrical signals from propagating through said delay line means which are below a frequency approximately equal to said factor times the reciprocal of the period of said linear variation.
25. Apparatus according to claim 24 and including band pass output filter means limiting said output representation to a predetermined audio bandwidth.
26. Apparatus according to claim 23 and including: input high pass filter means having adjustable low frequency cutoff interposed between said playback means and the input of said variable delay line means; means for adjusting the effective rate of said periodic linear variation to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
27. Apparatus according to claim 26 in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and including means for blanking output signals from said output of said delay line means to substantially zero amplitude during said reset intervals.
28. Apparatus according to claim 27 in which said means for blanking includes means for gradually restoring the amplitude of said output to full amplitude aFter said reset intervals.
29. Apparatus according to claim 2 in which said means for blanking includes means responsive to the level of said output signals being zero at the beginning of said reset interval to initiate said blanking and means responsive to the level of said output signals being zero at the end of said reset interval to terminate said blanking.
30. Apparatus according to claim 29 and including means for introducing an audio signal segment into said output signals and said audible representation of said electric signals during said reset interval.
31. Apparatus according to claim 23 in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and including: second delay line means coupled to a source of said electric signals; and means for interrupting signals from the output of said variable delay line means and substituting signals from the output of said second delay line means during said reset intervals to produce said audible representation.
32. Apparatus according to claim 31 in which said second delay line means includes a variable delay line and including means for controlling said second delay line means for periodic linear variation of said time delay between predetermined delay values, said variation of said second delay line means occurring during said reset intervals for the first recited said delay line means.
33. A processor for recordings representing coded audible signals such as speech or the like comprising: playback means operable to reproduce said recording at a rate faster by a given factor than the recording rate thereby producing electric signals which are the time compressed analog of said audible signals with frequency components increased by said factor; first and second variable delay line means coupled to said playback means for propagating said electric signals therein with controllable time delay; means for controlling said first and second delay line means for periodic linear variation of said time delay between predetermined delay values, said means for controlling said first and second delay line means providing periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay lines are returned to an initial predetermined delay value, said reset intervals for each said delay line means occurring within the period of the unidirectional delay variation of the other said delay line means; and means coupled to the outputs of said first and second delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to unidirectional variation of said delay for producing an audible representation of said electric signals having frequency components reduced by substantially said factor to approximate the frequency components of said audible signals.
34. Apparatus according to claim 33 in which said first and second delay line means vary between the same said predetermined delay values alternately with substantially equal periods of said linear variation.
35. Apparatus according to claim 34 in which said reset intervals are substantially shorter than the periods of said linear variation and occur for each said delay line means approximately at the mid-point of said linear variation for the other said delay line means and including: separate audio reproducer means coupled to the respective outputs of said first and second delay line means for producing simultaneously separate versions of said audible representation from each said reproducer means.
36. Apparatus according to claim 31 and including: input high pass filter means having adjustable low frequency cutoff interposed between said source and the input of said variable delay line means; means for adjusting the effective rate of said pEriodic linear variation to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
37. Apparatus according to claim 36 in which said second delay line means is also a variable delay line and including means for controlling said second delay line means for periodic linear variation of said time delay between predetermined delay values, said variation of said second delay line means occurring during said reset intervals for the first recited said delay line.
38. Apparatus according to claim 18 in which said source of said electric signals comprises playback means operable to reproduce a recording at a rate faster by a given factor than the recording rate, thereby producing electric signals which are the time compressed analog of said audible signals with frequency components increased by said given factor.
39. Apparatus according to claim 18 in which said first and second delay line means vary between the same said predetermined delay values alternately with substantially equal periods of said linear variation.
40. Apparatus according to claim 39 in which said reset intervals are substantially shorter than the periods of said linear variation and occur for each said delay line means approximately at the mid-point of said linear variation for the other said delay line means and including: separate audio reproducer means coupled to the respective outputs of said first and second delay line means for producing simultaneously separate versions of said audible representation from each said reproducer means.
41. Apparatus according to claim 40 in which said second delay line means is a fixed delay line the input of which is coupled to an output of said variable delay line means.
42. Apparatus according to claim 41 and including: input high pass filter means having adjustable low frequency cutoff interposed between said source and the input of said variable delay line means; means for adjusting the effective rate of said periodic linear variations to adjust said factor by which frequencies propagated through said delay line are altered; and means coupling said cutoff adjusting means and said effective rate adjusting means for related variation to eliminate low frequency components which would otherwise produce frequency components in said delay line means below an audio low frequency limit related to said factor and the period of said variation.
43. A processor for recordings representing coded audible signals such as speech or the like comprising: playback means operable to reproduce said recording at a rate slower by a given factor than the recording rate thereby producing electric signals which are the time expanded analog of said audible signals with frequency components decreased by said factor; variable delay line means coupled to said playback means for propagating said electric signals therein with controllable time delay; means for controlling said delay line means for periodic linear decrease of said time delay between predetermined delay values; and means coupled to the output of said delay line means and responsive substantially only to signals propagating through said delay line means which have been subject to said decrease of said time delay for producing an audible representation of said electrical signals having frequency components increased substantially by said factor to approximate the frequency components of said audible signals.
44. Apparatus according to claim 43 in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line Is returned to an initial predetermined delay value and including: second delay line means coupled to a source of said electric signals; and means for interrupting signals from the output of said variable delay line means and substituting signals from the output of said second delay line means during said reset intervals to produce said audible representation.
45. Apparatus according to claim 44 in which said second delay line means include a variable delay line and including means for controlling said second delay line means for periodic linear variation of said time delay between predetermined delay values, said variation of said second delay line means occurring during said reset intervals for the first recited said delay line means.
46. Apparatus according to claim 44 in which said second delay line means is a fixed delay line the input of which is coupled to an output of said variable delay line means.
47. Apparatus according to claim 1 in which said delay line means is controlled such that: the time delay difference between said delay values is d X T where d 2 c-1/c+1, c is said factor and T is said period; and the period of said variation is greater than the period of the lowest input frequency component to be processed through said line divided by said factor.
48. Apparatus according to claim 1 in which: said factor is a number greater than one for time compression and less than one for time expansion; the period of said variation is greater than the period of the lowest frequency component of said electric signals appearing at said output of said delay line; and said unidirectional variation of said delay is given by 2 (c-1/c+1)t where t is the time and c is said factor.
49. A processor according to claim 1 in which said variable delay line means is an analog shift register and said means for controlling said delay line means is a variable frequency clock pulse generator having a frequency that periodically varies inversely with time.
50. A processor according to claim 11 in which both of said delay line means are analog shift registers each having a variable frequency clock pulse generator.
51. Apparatus according to claim 1 in which said variable delay line means is a parallel r-word-n-stage shift register and including input A/D means for converting said electric signals into a succession of r-bit-digital words; output D/A means for converting the digital word from the n-th stage into an analog signal and variable frequency clock signal means for transferring the r-bit words from stage to stage through said n stages to effect linear variation of time delay for said signal.
52. Apparatus according to claim 1 in which said variable delay line means comprises input A/D converter means, serializer means to produce a digital word sequence, a serial digital shift register, parallelizer means to produce a parallel digital word and a D/A converter means; said control means comprising a variable frequency clock generator for said shift register.
53. Apparatus according to claim 1 in which said delay line means comprises an analog storage matrix and including write control means for entering said electric signals into said matrix at a first predetermined clock rate; and read control means for reading said electric signals from said matrix at a different second predetermined rate.
54. Apparatus according to claim 1 in which said variable delay line means is a digital storage matrix having A/D input means and D/A output means; and means for writing digital information into storage in said matrix at one rate and reading stored information out of said matrix at another rate.
55. The method of processing random speech signals to convey information intelligibly to a human listener at a rate different than the normal speaking equivalent for said information and without obJectionable alteration of the frequency components of said equivalent as reproduced for the listener comprising the steps of: developing a full speech signal train of said information with the elapsed time for said signal train altered into a time interval differing by a predetermined factor from that of the normal speaking equivalent for said signal train thereby changing by said factor the frequency of the spectral components in said signal train relative to said equivalent; processing said signal train by a linear periodic time delay function to alter a regular succession of predetermined length segments of said signal train into segments approximating a continuous signal with the frequency components in said continuous signal altered by said factor relative to said components in said signal train to approximate the components of said normal speaking equivalent; and reproducing said continuous signal as an intelligible representation of said information content with the elapsed time altered by said factor but with the frequency components unchanged relative to said equivalent.
56. Apparatus according to claim 2 in which said means for blanking includes means responsive to said output signals for initiating and terminating said blanking for zero output signal levels and for signal excursion adjacent said zero levels in the same direction.
57. Apparatus according to claim 43, in which said means for controlling said delay line means provides periods of linearly changing unidirectional delay variation alternating with reset intervals wherein said delay line is returned to an initial predetermined delay value and including means for blanking output signals from said output of said delay line means to substantially zero amplitude during said reset intervals.
58. A processor for electric signals representing coded audible signals such as speech or the like, said electric signals being analog representations of said audible signals with the frequency components of said electric signals related by a given factor to the frequency components of said audible signals comprising: two similar delay lines coupled to a source of said electric signals for propagating said electric signals between the input and output terminals thereof with controllable frequency transformation; means for controlling said delay lines alternately for periodic propagation of said electric signals to produce predetermined frequency transformation of signals emerging at the outputs of said delay lines; and means coupled to said outputs of both said delay line means and responsive substantially only to signals propagating through said delay lines which have been subject to said predetermined frequency transformation for producing an audible representation of said electric signals having frequency components altered by substantially said factor to approximate the frequency components of said audible signals.
59. Apparatus according to claim 58 in which said two delay lines are analog shift registers.
60. Apparatus according to claim 58 in which said analog shift registers are controlled alternately by a clock frequency periodically varied between predetermined limits.
61. The method of processing random speech signals to convey information intelligibly to a human listener at a rate different than the normal speaking equivalent for said information and without objectionable alteration of the frequency components of said equivalent as reproduced for the listener comprising the steps of: developing a full speech signal train of said information with the elapsed time for said signal train altered into a time interval differing by a predetermined factor from that of the normal speaking equivalent for said signal train thereby changing by said factor the frequency of the spectral components in said signal train relative to said equivalent; propagating said signal train through a storage channel within which it is subject to a periodic frequency transformAtion function to alter a regular succession of predetermined length input segments of said signal train into output segments approximating a continuous signal with the frequency components in said continuous signal altered by the inverse of said factor relative to said components in said signal train to approximate the components of said normal speaking equivalent; and reproducing said continuous signal as an intelligible audible representation of the information content of said signal train but with the frequency components unchanged relative to said equivalent.
62. The method according to claim 61 in which said storage channel comprises analog shift register means controlled by a clock frequency periodically varied between predetermined frequency values.
63. The method according to claim 62 in which said input segments are propagated alternately through separate analog shift registers for frequency transformation.
64. Apparatus according to claim 48 in which said delay line means comprises analog shift register means and said periodic variation of said time delay is obtained by controlling the clock frequency for said shift register means to provide said unidirectional variation of said delay to be: N (p-1/p) ( 1/ft -(1/fo ) 2 (c-1/c+1) t where ft is the clock frequency at time t, N is the number of stages of phase p and fo is the initial clock frequency.
65. In a processor for electric signals such as those representing the coded audible sounds of speech or the like, said electric signals being analog representations of said audible sounds with the frequency components of said electric signals related by a given factor to the frequency components of said audible sounds, the improvement comprising: controllable delay means having an input and an output, said input coupled to a source of said electric signals for passing signals from said source into said delay means, and said delay means passing signals therein to said output with controllable time delay; means for controlling said delay means with repetitive variation of said controllable time delay between predetermined delay values the period of said variation being greater than the period of the lowest frequency component of said electric signals at said output of said delay means thereby progressively delaying signals during each period of said repetitive variation as they pass through said delay means to obtain a predetermined frequency transformation of said electric signals as determined by the relation:
US00171571A 1971-08-13 1971-08-13 Variable delay line signal processor for sound reproduction Expired - Lifetime US3786195A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US17157171A 1971-08-13 1971-08-13

Publications (1)

Publication Number Publication Date
US3786195A true US3786195A (en) 1974-01-15

Family

ID=22624258

Family Applications (1)

Application Number Title Priority Date Filing Date
US00171571A Expired - Lifetime US3786195A (en) 1971-08-13 1971-08-13 Variable delay line signal processor for sound reproduction

Country Status (3)

Country Link
US (1) US3786195A (en)
BR (1) BR7205491D0 (en)
ZA (1) ZA725451B (en)

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3855425A (en) * 1971-10-20 1974-12-17 B Keiser Delay line time compressor and expander
US4149039A (en) * 1976-11-05 1979-04-10 Kokusai Denshin Denwa Co., Ltd. Frequency band compression of FDM using time expansion
US4207440A (en) * 1976-01-30 1980-06-10 The Vsc Company Dictation recorder with speech-extendable adjustment predetermined playback time
US4228322A (en) * 1979-01-02 1980-10-14 International Business Machines Corporation Decreasing time duration of recorded speech
EP0064750A2 (en) * 1981-05-11 1982-11-17 Integrated Sound Systems, Inc. Gapless splicing of pitch altered waveforms
US4406001A (en) * 1980-08-18 1983-09-20 The Variable Speech Control Company ("Vsc") Time compression/expansion with synchronized individual pitch correction of separate components
USRE31614E (en) * 1979-01-02 1984-06-26 International Business Machines Corporation Decreasing time duration of recorded speech
US4771345A (en) * 1985-09-12 1988-09-13 Sharp Kabushiki Kaisha Reproducing apparatus
US4805217A (en) * 1984-09-26 1989-02-14 Mitsubishi Denki Kabushiki Kaisha Receiving set with playback function
US4866647A (en) * 1988-02-04 1989-09-12 American Telephone And Telegraph Company Continuously variable digital delay circuit
EP0427953A2 (en) * 1989-10-06 1991-05-22 Matsushita Electric Industrial Co., Ltd. Apparatus and method for speech rate modification
US5175769A (en) * 1991-07-23 1992-12-29 Rolm Systems Method for time-scale modification of signals
WO1996012270A1 (en) * 1994-10-12 1996-04-25 Pixel Instruments Time compression/expansion without pitch change
US5630013A (en) * 1993-01-25 1997-05-13 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for performing time-scale modification of speech signals
US5848130A (en) * 1996-12-31 1998-12-08 At&T Corp System and method for enhanced intelligibility of voice messages
US5864792A (en) * 1995-09-30 1999-01-26 Samsung Electronics Co., Ltd. Speed-variable speech signal reproduction apparatus and method
US20020052735A1 (en) * 2000-07-17 2002-05-02 Soichi Nagano Sound reproduction apparatus and method with variable-speed fast forward/reverse
US20040015345A1 (en) * 2000-08-09 2004-01-22 Magdy Megeid Method and system for enabling audio speed conversion
US20040090555A1 (en) * 2000-08-10 2004-05-13 Magdy Megeid System and method for enabling audio speed conversion
US20040114567A1 (en) * 1995-10-05 2004-06-17 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20060161952A1 (en) * 1994-11-29 2006-07-20 Frederick Herz System and method for scheduling broadcast of an access to video programs and other data using customer profiles
US20130197905A1 (en) * 2010-10-12 2013-08-01 Nec Corporation Signal processing device, signal processing method and signal processing program
US8570328B2 (en) 2000-12-12 2013-10-29 Epl Holdings, Llc Modifying temporal sequence presentation data based on a calculated cumulative rendition period
CN111030412A (en) * 2019-12-04 2020-04-17 瑞声科技(新加坡)有限公司 Vibration waveform design method and vibration motor

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3093796A (en) * 1959-10-28 1963-06-11 Everett C Westerfield Automatic signal time compressor with gate means for controlling rate of shift register output
US3202769A (en) * 1960-08-02 1965-08-24 Columbia Broadcasting Syst Inc Apparatus for modifying the timing characteristic of a signal
US3278907A (en) * 1964-02-17 1966-10-11 Hughes Aircraft Co Time compressor
US3409736A (en) * 1965-05-17 1968-11-05 Rca Corp Phase and frequency correction system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3093796A (en) * 1959-10-28 1963-06-11 Everett C Westerfield Automatic signal time compressor with gate means for controlling rate of shift register output
US3202769A (en) * 1960-08-02 1965-08-24 Columbia Broadcasting Syst Inc Apparatus for modifying the timing characteristic of a signal
US3278907A (en) * 1964-02-17 1966-10-11 Hughes Aircraft Co Time compressor
US3409736A (en) * 1965-05-17 1968-11-05 Rca Corp Phase and frequency correction system

Cited By (90)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3855425A (en) * 1971-10-20 1974-12-17 B Keiser Delay line time compressor and expander
US4207440A (en) * 1976-01-30 1980-06-10 The Vsc Company Dictation recorder with speech-extendable adjustment predetermined playback time
US4149039A (en) * 1976-11-05 1979-04-10 Kokusai Denshin Denwa Co., Ltd. Frequency band compression of FDM using time expansion
US4228322A (en) * 1979-01-02 1980-10-14 International Business Machines Corporation Decreasing time duration of recorded speech
USRE31614E (en) * 1979-01-02 1984-06-26 International Business Machines Corporation Decreasing time duration of recorded speech
US4406001A (en) * 1980-08-18 1983-09-20 The Variable Speech Control Company ("Vsc") Time compression/expansion with synchronized individual pitch correction of separate components
EP0064750A2 (en) * 1981-05-11 1982-11-17 Integrated Sound Systems, Inc. Gapless splicing of pitch altered waveforms
EP0064750A3 (en) * 1981-05-11 1984-05-02 Integrated Sound Systems, Inc. Gapless splicing of pitch altered waveforms
US4805217A (en) * 1984-09-26 1989-02-14 Mitsubishi Denki Kabushiki Kaisha Receiving set with playback function
US4771345A (en) * 1985-09-12 1988-09-13 Sharp Kabushiki Kaisha Reproducing apparatus
US4866647A (en) * 1988-02-04 1989-09-12 American Telephone And Telegraph Company Continuously variable digital delay circuit
US5341432A (en) * 1989-10-06 1994-08-23 Matsushita Electric Industrial Co., Ltd. Apparatus and method for performing speech rate modification and improved fidelity
EP0427953A2 (en) * 1989-10-06 1991-05-22 Matsushita Electric Industrial Co., Ltd. Apparatus and method for speech rate modification
EP0427953A3 (en) * 1989-10-06 1991-05-29 Matsushita Electric Industrial Co., Ltd. Apparatus and method for speech rate modification
US5175769A (en) * 1991-07-23 1992-12-29 Rolm Systems Method for time-scale modification of signals
US5630013A (en) * 1993-01-25 1997-05-13 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for performing time-scale modification of speech signals
US8769601B2 (en) 1994-10-12 2014-07-01 J. Carl Cooper Program viewing apparatus and method
US9723357B2 (en) 1994-10-12 2017-08-01 J. Carl Cooper Program viewing apparatus and method
US6098046A (en) * 1994-10-12 2000-08-01 Pixel Instruments Frequency converter system
WO1996012270A1 (en) * 1994-10-12 1996-04-25 Pixel Instruments Time compression/expansion without pitch change
US8428427B2 (en) 1994-10-12 2013-04-23 J. Carl Cooper Television program transmission, storage and recovery with audio and video synchronization
US8185929B2 (en) 1994-10-12 2012-05-22 Cooper J Carl Program viewing apparatus and method
US20050039219A1 (en) * 1994-10-12 2005-02-17 Pixel Instruments Program viewing apparatus and method
US20100247065A1 (en) * 1994-10-12 2010-09-30 Pixel Instruments Corporation Program viewing apparatus and method
US20060015348A1 (en) * 1994-10-12 2006-01-19 Pixel Instruments Corp. Television program transmission, storage and recovery with audio and video synchronization
US20050240962A1 (en) * 1994-10-12 2005-10-27 Pixel Instruments Corp. Program viewing apparatus and method
US6901209B1 (en) * 1994-10-12 2005-05-31 Pixel Instruments Program viewing apparatus and method
US20060161952A1 (en) * 1994-11-29 2006-07-20 Frederick Herz System and method for scheduling broadcast of an access to video programs and other data using customer profiles
US5864792A (en) * 1995-09-30 1999-01-26 Samsung Electronics Co., Ltd. Speed-variable speech signal reproduction apparatus and method
US7894423B2 (en) 1995-10-05 2011-02-22 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7760703B2 (en) 1995-10-05 2010-07-20 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040174843A1 (en) * 1995-10-05 2004-09-09 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040246940A1 (en) * 1995-10-05 2004-12-09 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20050013266A1 (en) * 1995-10-05 2005-01-20 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20050036467A1 (en) * 1995-10-05 2005-02-17 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040160912A1 (en) * 1995-10-05 2004-08-19 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communication among wireless terminals and telephones
US20050083872A1 (en) * 1995-10-05 2005-04-21 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040151164A1 (en) * 1995-10-05 2004-08-05 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040151151A1 (en) * 1995-10-05 2004-08-05 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20050254475A1 (en) * 1995-10-05 2005-11-17 Kubler Joseph J Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040146037A1 (en) * 1995-10-05 2004-07-29 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040146020A1 (en) * 1995-10-05 2004-07-29 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8238264B2 (en) 1995-10-05 2012-08-07 Broadcom Corporation Hierarchical data collection network supporting packetized voice communication among wireless terminals and telephones
US7460507B2 (en) * 1995-10-05 2008-12-02 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20090022304A1 (en) * 1995-10-05 2009-01-22 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals and Telephones
US20090034454A1 (en) * 1995-10-05 2009-02-05 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals and Telephones
US20090059903A1 (en) * 1995-10-05 2009-03-05 Kubler Joseph J Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7580384B2 (en) 1995-10-05 2009-08-25 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7586907B2 (en) 1995-10-05 2009-09-08 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7586861B2 (en) 1995-10-05 2009-09-08 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7633934B2 (en) 1995-10-05 2009-12-15 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20090323680A1 (en) * 1995-10-05 2009-12-31 Kubler Joseph J Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7646743B2 (en) 1995-10-05 2010-01-12 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7688811B2 (en) 1995-10-05 2010-03-30 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20100080182A1 (en) * 1995-10-05 2010-04-01 Kubler Joseph J Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7697467B2 (en) 1995-10-05 2010-04-13 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7715375B2 (en) 1995-10-05 2010-05-11 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20100118864A1 (en) * 1995-10-05 2010-05-13 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals And Telephones
US20100142503A1 (en) * 1995-10-05 2010-06-10 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals And Telephones
US20100142518A1 (en) * 1995-10-05 2010-06-10 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals and Telephones
US20040174841A1 (en) * 1995-10-05 2004-09-09 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7768951B2 (en) 1995-10-05 2010-08-03 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20100232323A1 (en) * 1995-10-05 2010-09-16 Kubler Joseph J Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20100232312A1 (en) * 1995-10-05 2010-09-16 Kubler Joseph J Hierarchical Data Collection Network Supporting Packetized Voice Communications Among Wireless Terminals And Telephones
US20040145775A1 (en) * 1995-10-05 2004-07-29 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7848316B2 (en) 1995-10-05 2010-12-07 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US20040114567A1 (en) * 1995-10-05 2004-06-17 Kubler Joseph J. Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7899007B2 (en) 1995-10-05 2011-03-01 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7912016B2 (en) 1995-10-05 2011-03-22 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7912043B2 (en) 1995-10-05 2011-03-22 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7916706B2 (en) 1995-10-05 2011-03-29 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7920553B2 (en) 1995-10-05 2011-04-05 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7933252B2 (en) 1995-10-05 2011-04-26 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US7936713B2 (en) 1995-10-05 2011-05-03 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8018907B2 (en) 1995-10-05 2011-09-13 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8139749B2 (en) 1995-10-05 2012-03-20 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8149825B2 (en) 1995-10-05 2012-04-03 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8228879B2 (en) * 1995-10-05 2012-07-24 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US8194595B2 (en) 1995-10-05 2012-06-05 Broadcom Corporation Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones
US5848130A (en) * 1996-12-31 1998-12-08 At&T Corp System and method for enhanced intelligibility of voice messages
US20020052735A1 (en) * 2000-07-17 2002-05-02 Soichi Nagano Sound reproduction apparatus and method with variable-speed fast forward/reverse
US7363232B2 (en) * 2000-08-09 2008-04-22 Thomson Licensing Method and system for enabling audio speed conversion
US20040015345A1 (en) * 2000-08-09 2004-01-22 Magdy Megeid Method and system for enabling audio speed conversion
US20040090555A1 (en) * 2000-08-10 2004-05-13 Magdy Megeid System and method for enabling audio speed conversion
US8570328B2 (en) 2000-12-12 2013-10-29 Epl Holdings, Llc Modifying temporal sequence presentation data based on a calculated cumulative rendition period
US8797329B2 (en) 2000-12-12 2014-08-05 Epl Holdings, Llc Associating buffers with temporal sequence presentation data
US9035954B2 (en) 2000-12-12 2015-05-19 Virentem Ventures, Llc Enhancing a rendering system to distinguish presentation time from data time
US20130197905A1 (en) * 2010-10-12 2013-08-01 Nec Corporation Signal processing device, signal processing method and signal processing program
US9613632B2 (en) * 2010-10-12 2017-04-04 Nec Corporation Signal processing device, signal processing method and signal processing program
CN111030412A (en) * 2019-12-04 2020-04-17 瑞声科技(新加坡)有限公司 Vibration waveform design method and vibration motor

Also Published As

Publication number Publication date
ZA725451B (en) 1974-02-27
BR7205491D0 (en) 1973-07-03

Similar Documents

Publication Publication Date Title
US3786195A (en) Variable delay line signal processor for sound reproduction
US4406001A (en) Time compression/expansion with synchronized individual pitch correction of separate components
US6226605B1 (en) Digital voice processing apparatus providing frequency characteristic processing and/or time scale expansion
JP4895418B2 (en) Audio reproduction method and audio reproduction apparatus
JPH07118749B2 (en) Voice / data transmission equipment
KR20000010930A (en) Speaking speed changing method and device
US6009386A (en) Speech playback speed change using wavelet coding, preferably sub-band coding
JP3630609B2 (en) Audio information reproducing method and apparatus
JPH09263010A (en) Code printing apparatus
US6292454B1 (en) Apparatus and method for implementing a variable-speed audio data playback system
US3936610A (en) Dual delay line storage sound signal processor
WO2000077775A1 (en) Sound switching device
WO1996013826A1 (en) Coded data decoding device and video/audio multiplexed data decoding device using it
CS195258B2 (en) Facility for processing the signals with the controlled retardation
US3541264A (en) Apparatus for deleting a portion of a signal
JPH0329555A (en) Storage system for agenda
JPH09146587A (en) Speech speed changer
JPH06202692A (en) Control system for speech reproducing speed
JP2002156998A (en) Bit stream processing method for audio signal, recording medium where the same processing method is recorded, and processor
KR0170321B1 (en) Voice signal generating method and apparatus
JPH04367898A (en) Method and device for voice reproduction
US20060069565A1 (en) Compressed data processing apparatus and method and compressed data processing program
JP3008640B2 (en) Digital audio processing unit
US3838218A (en) Bifrequency controlled analog shift register speech processor
JPS60239129A (en) Method for compressing sound information quantity

Legal Events

Date Code Title Description
AS Assignment

Owner name: VARIABLE SPEECH CONTROL COMPANY

Free format text: CHANGE OF NAME;ASSIGNOR:USC COMPANY THE;REEL/FRAME:004022/0602

Effective date: 19811214

Owner name: VSC COMPANY THE, WESTPORT, CT A LIMITED PARTNERSHI

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST.;ASSIGNORS:FLAKS, MARVIN, ATTORNEY-IN FACT;VSC COMPANY THE;REEL/FRAME:004022/0598

Effective date: 19761215

AS Assignment

Owner name: CAMBRIDGE RESEARCH AND DEVELOPMENT GROUP, 21 BRIDG

Free format text: SECURITY INTEREST;ASSIGNOR:VARIABLE SPEECH CONTROL COMPANY THE, A LIMITED PARTNERSHIP OF CT;REEL/FRAME:004040/0166

Effective date: 19820809

STCF Information on status: patent grant

Free format text: PATENTED FILE - (OLD CASE ADDED FOR FILE TRACKING PURPOSES)