USRE36714E - Perceptual coding of audio signals - Google Patents
Perceptual coding of audio signals Download PDFInfo
- Publication number
- USRE36714E USRE36714E US08/622,313 US62231394A USRE36714E US RE36714 E USRE36714 E US RE36714E US 62231394 A US62231394 A US 62231394A US RE36714 E USRE36714 E US RE36714E
- Authority
- US
- United States
- Prior art keywords
- frequency
- iaddend
- iadd
- signals
- function
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04B—TRANSMISSION
- H04B1/00—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission
- H04B1/66—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission
- H04B1/665—Details of transmission systems, not covered by a single one of groups H04B3/00 - H04B13/00; Details of transmission systems not characterised by the medium used for transmission for reducing bandwidth of signals; for improving efficiency of transmission using psychoacoustic properties of the ear, e.g. masking effect
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
Definitions
- the present invention relates to coding of time varying signals, such as audio signals representing voice or music information.
- CD compact disc
- DAT digital audio tape
- Perceptual coding as described in the Johnston, et al paper relates to a technique for lowering required bitrates (or reapportioning available bits) in representing audio signals.
- the masking threshold for unwanted signals is identified as a function of frequency of the desired signal. Then the coarseness of quantizing used to represent a signal component of the desired signal is selected such that the quantizing noise introduced by the coding does not rise above the noise threshold, though it may be quite near this threshold.
- the masking effect of noise is dependent on the "tonelike or noiselike" nature of the signal.
- an offset for the masking threshold for each critical band is developed which depends on whether a "coefficient of tonality" for the signal in each critical band indicates that the signal is relatively more tonelike or noiselike. This coefficient of tonality is, in turn, conveniently derived from a measure of flatness of the spectrum of the signal over that critical band.
- the present invention improves on the tonality based perceptual coding techniques described in the cited copending application Ser. No. 292,598. Because the frequency analysis typically involves determining spectral information at discrete frequencies ("frequency lines") within the audio spectrum, and because a number of these discrete frequencies will, in general, fall within each critical band, the processing described in the prior application Ser. No. 292,598 and the cited Johnston paper, illustratively grouped spectral values for frequencies within each critical band. That is, the spectral processing used to determine the tonality and masking threshold was typically accomplished on a critical-band-by-critical-band basis.
- the improvements made in accordance with aspects of the present invention permit grouping of values at discrete frequencies, but also include the use of a frequency-line-by-frequency-line analysis, rather than analysis on a spectrum-wide basis, in calculating the tonality metric values.
- This line-by-line calculation is advantageously based on a history of consecutive frames of the input power spectrum, rather than on the current frame alone.
- the present invention then advantageously determines improved estimates of perceptual thresholds on a line-by-line basis, rather than on a critical-band-by-critical-band basis. In appropriate cases, the critical band masking threshold can be used.
- the tonality estimate of the present invention advantageously uses a statistic of a plurality, typically two, of the previous time frames to predict the value of a given power spectrum frequency line in the current time frame.
- This process features the use of a Euclidian distance between the predicted line and the actual line in a present frame to estimate the tonality (or noisiness) of each spectral line. It proves convenient in these calculations to perform a normalization of the estimates using the predicted and actual values.
- These tonality estimates can then be combined, e.g., on a critical-band basis, to obtain an estimate of the actual tonality. This is done for each frequency to determine the noise-masking thresholds to be used in quantizing the frequency information to be finally coded for recording, transmission or other use.
- a spreading operation known in the art e.g., that is described generally in the Schroeder, et al paper, supra, is employed in an alternative implementation of certain aspects of the improved masking threshold determination process of the present invention.
- Spreading generally relates to the masking effect on a signal at a given frequency by signals separated in frequency from the given signal frequency.
- matrix processing is disclosed which involves signal spreading effects from signals many bark frequencies away. A bark is the term used to indicated a frequency difference of one critical band.
- FIG. 1 is a block diagram of an overall system based on the present invention
- FIG. 2 is a flow chart illustrating the masking threshold processing employed in an illustrative embodiment of the coder in accordance with the present invention.
- FIG. 3 shows a detailed block diagram of a decoder that may be used in the system of FIG. 1.
- the application WO 88/01811 describes the so-called OCF coder that may be used as one alternative to the transform coder described in the Jayant, et al reference or the application Ser. No. 292,598.
- FIG. 1 of the present application discloses the overall organization of a system incorporating the present invention.
- an analog signal on input 100 is applied to preprocessor 105 where it is sampled (typically at 32 kHz) and each sample is converted to a digital sequence (typically 16 bits) in standard fashion.
- Preprocessor 105 then groups these digital values in frames (or blocks or sets) of, e.g., 512 digital values, corresponding to, e.g., 16 msec of audio input.
- frames or blocks or sets
- 512 digital values corresponding to, e.g., 16 msec of audio input.
- Other typical values for these and other system or process parameters are discussed in the ISO Document.
- each frame contains 512 ordered digital values, 256 of these values are repeated from the preceding 512-value frame.
- each input digital value appears in two successive frames, first as part of the second half of the frame and then as part of the first half of the frame.
- pre-echo and dynamic windowing techniques described, for example, in the above-referenced ISO Document.
- Other pre-processing techniques that can be included in the functionality represented by preprocessor block 105 in FIG. 1 include those described in the ISO Document.
- Perceptual coder block 110 shown in FIG. 1 includes the perceptual masking estimation improvements of the present invention and will be described in detail below.
- Quantizer/Coder block 115 in FIG. 1 represents the above-mentioned transform or OCF coder and related coder functionality described in the incorporated application Ser. No. 292,598 and the ISO Document.
- Block 120 in FIG. 1 represents the recording or transmission medium to which the coded output of quantizer/coder 115 are applied. Suitable formatting and modulation of the output signals from quantizer/coder 115 is included in the medium block 120. Such techniques are well known to the art and will be dictated by the particular medium, transmission or recording rates and other system parameters.
- the medium 120 includes noise or other corrupting influences, it may be necessary to include additional error-control devices or processes, as is well known in the art.
- the medium is an optical recording medium similar to the standard CD devices, then redundancy coding of the type common in that medium can be used with the present invention.
- the medium is one used for transmission, e.g., a broadcast, telephone, or satellite medium
- error control mechanisms Any modulation, redundancy or other coding to accommodate (or combat the effects of) the medium will, of course, be reversed upon the delivery from the channel or other medium to the decoder.
- the originally coded information provided by quantizer/coder 115 will therefore be applied at a reproduction device.
- these coded signals will be applied to decoder 130 shown in FIG. 1, and to perceptual decoder 140.
- some of the information derived by perceptual coder 110 and delivered via quantizer/coder 115 and medium 120 to the perceptual decoder 140 is in the nature of "side information.” Such side information is described more completely below and in the ISO Document.
- Other information provided by quantizer/coder 115 via medium 120 relating to the spectral coefficients of the input information is illustratively provided directly to decoder 130.
- perceptual decoder 140 After processing the side information, perceptual decoder 140 provides decoder 130 with the additional information to allow it to recreate, with little or no perceptual distortion, the original spectral signals developed in pre-processor 105. These recreated signals are then applied to post-processor 150, where the inverse MDCT or equivalent operations and D/A functions are accomplished (generally as described in application Ser. No. 292,598) to recreate the original analog signal on output 160.
- the output on 160 is in such form as to be perceived by a listener as substantially identical to that supplied on input 100.
- FIG. 2 is a flow chart representation of the processing accomplished in perceptual coder 110.
- Listing 1 attached, forms part of this application.
- This listing is an illustrative annotated FORTRAN program listing reflecting processing in accordance with aspects of the present invention relating to developing a noise masking threshold.
- a useful reference for understanding the FORTRAN processing as described herein is FX/FORTRAN Programmer's Handbook, Alliant Computer Systems Corp., July 1988.
- general purpose computers like those from Alliant Computer Systems Corp. can be used to execute the program of Listing 1.
- Table 1 is a list of constants used in connection with the illustrative program of Listing 1.
- FIG. 2 and Listing 1 will now be discussed in detail to give a fuller understanding of the illustrative embodiment of the present invention.
- Function 200 in FIG. 2 indicates the start of the processing performed in determining the improved estimates of the masking thresholds in accordance with the present invention.
- Block 210 represents the initializing functions, using the absolute threshold values from Table 1, represented by block 220 in FIG. 2.
- i is used, e.g., as the index for the critical bands, of the type described in the application Ser. No. 292,598, and has values from 0 to 25.
- the index i may be used with different ranges for other processing in other occurrences appearing in Listing 1.
- abslow is a constant assigned the indicated value to set the absolute threshold of hearing.
- rzotz is the desired sampling rate.
- rnorm is a normalization variable used in connection with the spreading function. openas is simply an operator used for opening an ascii file.
- db is a dummy variable used to calculate table entries.
- the actual threshold calculation begins with the sub-routine thrgen. Its variables r and phi are, of course, the spectral coefficients provided by preprocessor 105 in FIG. 1. They are vectors having 257 values (zero frequency, the Nyquist frequency and all intermediate components).
- Block 210 represents the initialization, using the absolute threshold information in Table 1 (shown in block 220 in FIG. 2).
- the next step in calculation of the perceptual threshold is the calculation of the tonality t(j) of the signal energy within each critical band j. This operation is indicated by block 230 in FIG. 2.
- the tonality metric is determined in accordance with the program of Listing 1 by forming
- dr and d ⁇ are the differences between the radius (r( ⁇ )) and phase ( ⁇ ( ⁇ )) of the previous calculation block and the one two previous.
- the calculation is done on a frequency line by frequency line ( ⁇ ) basis. Note that if the blocks are shortened by the dynamic windowing technique referred to in the ISO Document, the frequency lines are duplicated accordingly, so that the number of frequency lines remains the same. Additionally, the difference is multiplied accordingly in such a dynamic windowing context, so that it represents the (estimated) difference over one differently sized block.
- c values are used later to calculate the appropriate threshold in each critical band, through the calculation of t(j).
- the unspread threshold uthr(j) is calculated.
- the proper value for the masking SNR (snr db (j)), corresponding to frequency and tonality, is calculated in decibels:
- fmin is tabulated in the ISO Document and in Table 2 as an energy ratio, rather than in db.
- Table 2 also indicates critical band boundaries, expressed in terms of frequency lines for the indicated sampling rate. Then the ratio of masked noise energy to signal energy is calculated: ##EQU4## and the unspread threshold value is calculated:
- the spread threshold (sthr) is calculated from the unspread threshold, the snr(j), and the critical band energies, (P(j), according to
- mask(i-j) is tabulated at the end of the ISO Document, and represents an example modified spreading function.
- the spreading may be accomplished using the function sprdgf(j, i) given in Listing 1.
- the spread threshold is compared to the absolute threshold, and the maximum substituted in the limited threshold, lthr(j).
- lthr(j) the maximum substituted in the limited threshold
- the threshold lthr(j) is transferred to a variable named 1xmin(j) for use in the outer iteration loop described in the ISO Document.
- a final step in the threshold calculation procedure calculates an entropy measure that is used to estimate the number of bits needed for the current signal block. This estimate is derived for use by the quantizer/coder 115 using ##EQU5##
- An output of the processing described above and in Listing 1 is a set of threshold values that the quantizer/coder 115 FIG. 1 employs to efficiently encode the input signal information for transmission or storage as described above.
- the inputs to quantizer/coder 115 in FIG. 1 include spectral information derived by MDCT and other processing in accordance with functions represented by block 105 in FIG. 1, and outputs of perceptual coder 110, including the noise threshold information and perceptual energy information. Quantizer/coder 115 then processes this information and in doing so provides a bitstream to the channel or recording medium 120 in FIG. 1, which bitstream includes information divided into three main parts:
- a second part containing the scaling factors for the 23 critical bands and additional side information used for so-called adaptive window switching, when used; the length of this part can vary depending on information in the first part;
- the following bits are dependent on the side information of part 1 (e.g. subblock information is only needed if coding in subblocks is actually selected)
- a part of the Huffman code is ordered in a two-dimenional array with the number of columns depending on the longest codeword of the Huffman codetable (5, 16, 18, 22 or 19 bits for ESC-tables).
- the number of rows is the size of part 3 divided by the number of columns.
- the codewords of the higher frequencies that can not be ordered into this rectangular array are fit into the remaining gaps.
- FIG. 3 shows a detailed block diagram of a decoder in accordance with aspects of the present invention.
- FIG. 3 shows a synchronization buffer 310 which acts to appropriately buffer input bitstreams arriving on input lead 305. Error correction is then effected in the part of the system represented by block 315. This block also provides for extraction of low frequency spectral coefficients.
- Side information extracted in block 320 is demultiplexed from the other arriving information and is sentto either the Huffman coder 330 or the speech reconstruction functional elements 335.
- the actual coded spectral coefficient information is sent to the Huffman decoder itself.
- the decoder 330 is provided with a stored Huffman codebook equivalent to that maintained at the coder of FIG. 1.
- the MDCT synthesis or other frequency synthesis operation
- Standard aliasing techniques are then applied to provide samples to be converted by digital-to-analog conversion and reproduction to acoustic or other analog signals.
Abstract
Description
dr(ω)=r.sub.t-1 (ω)-r.sub.t-2 (ω)
dφ(ω)=φ.sub.t-1 (ω)-φ.sub.t-2 (ω).
r(ω)=r.sub.t-1 (ω)+dr(ω)
φ(ω)=φ.sub.t-1 (ω)+dφ(ω),
tmp(j)=max(0.05, min(0.5. C(j))),
t(j)=0.43*1n tmp(j)-0.299
snr.sub.db (j)=max(max(24.5, 15.5+j)+5.5*(1.-t(j)), fmin(j))
uthr(j)=P(j)*snr(j).
sthr(j)=max(uthr(j), snr(j)*P(i)*mask(i-j)[i>j])
thr(j)=min(lthr(j), 2*othr(j))
othr(j)=lthr(j).
__________________________________________________________________________ PART I __________________________________________________________________________ sync work (0110111) signals the start of the block 7 bit position of parts 2 & 3 (bitsav) difference between the last bit of part 2 & 12 bit and the first bit ofpart 1 word length selector for part 2 (cbtable) selects by a table a word length for 4 bit scaling factors for the 12 lower critical bands between 0..4 and for the higher critical bands between 0..3. Four combinations with a small expectation are unused number of big spectral values (bigvalues) number of pairs of spectral values that 8 bit coded with a two dimensional Huffman code, able to code values larger than 1 × 1 the so called small spectral values quantizer and global gain information (Gain) level differences between original and 7 bit quantized values in steps of 2.sup.1 Huffman codetable (iqfeld) values 0..3 select a 4 × 4, 8 × 8, 16 × 16 or 4 bit 32 × 32 codetable values > 3 select a 32 × 32 ESC-table when 31 is an ESC-character followed by (Huffman codetable-3) bits of linear transmitted part of the spectral value, that has to be added to the 31 pre-emphasis (preflag) flag, that the higher part of the spectrum 1 bit quantized with a smaller quantizer step size critical band scaling stepsize (ps-scale) flag, whether the critical band scaling 1 bitr has a stepsize of 2 or 2.sup.1 block split (split-flag) flag, whether the block is split into 1 bit subblocks (dynamic windowing) 0/1 codetable (count 1 table) selection of one of two possible codebooks 1 bit for the coding of small values (-1,0,1) DC-part of the signal (dc-value) 9 bit 55 bit __________________________________________________________________________
______________________________________ global gain for subblock 2 3 bit DC-value of subblock 2 9 bit global gain for subblock 3 3 bit DC-value of subblock 3 9 bit global gain for subblock 4 3 bit DC-value of subblock 4 9 bit scaling factors for the lower 12 *(0 . . . 4) 48 bit 12 critical bands scaling factors for the higher *(0 . . . 3) 33 bit 11 critical bands 117 bit ______________________________________
______________________________________LISTING 1 ______________________________________ c First startup routine subroutine strt( ) c sets up threshold generation tables, ithr and bval real freq(0:25)/0.,100.,200.,300.,400.,500.,630.,770., 1 920.,1080.,1270.,1480.,1720.,2000.,2320.,2700., 1 315O.,3700.,4400.,5300.,6400.,7700.,9500.,12000.,15500., 1 25000./ common/thresh/ithr(26),bval(257,rnorm(257) common/absthr/abslow(257) common/sigs/ifirst c ithr(i) is bottom of crital band i. bval is bark index c of each line write(*,*) `what spl will + - 32000 be →` read(*.*) abslev abslev = abslev - 96. abstow = 5224245.*5224245./exp(9.6*alog(10.)) ifirst = 0 write(*.*) `what is the sampling rate` read(*.*) rzotz fnyq = rzotz/2. c nyquest frequency of interest. ithr(1) = 2. i = 2 10 ithr(i) = freq(i - 1)/fnyq*256. + 2. i = i + 1 if (freq(i - 1) .It. fnyq) goto 10 c sets ithr to bottom of cb ithr(i:26) = 257 c now, set up the critical band indexing array bval(1) = 0 c first, figure out frequency, then . . . do i = 2,257,1 fre = (i - 1)/256.*fnyq c write(*,*) i,fre c fre is now the frequency of the line. convert c it to critical band number . . . do j = 0,25,1 if (fre .gt. freq(j)) k = j end do c so now, k = last CB lower than fre rpart = fre - freq(k) range = freq(k + 1) - freq(k) bval(i) = k + rpart/range end do rnorm = 1 do i = 2,257,1 imp = 0 do j = 2.257,1 imp = imp + sprdngf(bval(j),bval(i)) end do rnorm(i) = tmp end do rnorm = 1./rnorm c do i = 1.257.1 c write(*.*)i.bval(i), 10.*alog10(rnorm(i)) c end do call openas(0.`/usr/jj/nsrc/thrtry/freqlist`.0) do i = 2.257,1 read(0,*) ii.db if (ii .ne. i) then write(*,*) `freqlist is bad.` stop end if db = exp((db-abslev)/10.*alog(10.)) c write(*,*) i,db abslow(i) = abslow(i)*db end do abslow(1) = 1. write(*,*) `lowest level is `, sqrt(abslow(45)) return end c Threshold calculation program subroutine thrgen(rt,phi,thr) real r(257),phi(257) real rt(257) real thr(257) common/blnk/ or(257),ophi(257),dr(257),dphi(257) common/blk1/othr(257) real alpha(257),tr(257),tphi(257) real beta(257),bcalc(257) common/absthr/abslow(257) common/thresh/ithr(26),bval(257),rnorm(257) common/sigs/ifirst r = max(rt,.0005) bcalc = 1. if(ifirst .eq. 0) then or = 0). othr = le20 ophi = 0 dr = 0 dphi = 0 ifirst = 1 end if c this subroutine figures out the new threshold values c using line-by-line measurement. tr. = or -dr tphi = ophi ÷ dphi dr = r -or dphi = phi - ophi or = r ophi = phi alpha = sqrt((r*cos(phi) - tr*cos(tphi)) 1 *(r*cos(phi) - tr*cos(tphi)) 2 + (r*sin(phi) - tr*sin(tphi)) 3 *(r*sin(phi) - tr*sin(tphi))) 4/(r + abs(tr) + 1.) beta alpha c now, beta is the unweighted tonality factor alpha = r*r c now, the energy is in each c line. Must spread. (ecch) c write(*,*) `before spreading` thr = 0 bcalc = 0 cvdS1 cncall do i = 2.2257,1 cvdS1 cncall do j = 2.257,1 glorch = sprdngf(bval(j),bval(i)) thr(i) = alpha(j)*glorch + thr(i) bcalc(i) = alpha(j)*glorch*beta(j) + bcalc(i) c thr is the spread energy. bcalc is the weighted chaos end do c if (thr(i) .eq. 0) then c write(*,*) `zero threshold, you blew it` c stop c end if bcalc(i) = bcalc(i)/thr(i) if (bcalc(i) .gt. .5)bcalc(i) = 1. - bcalc(i) c that normalizes bcalc to 0-.5 end do c write(*.*) `after spreading` bcalc = max(bcalc,.05) bcalc = min(bcalc,.5) c bcalc is now the chaos metric, convert to the c tonality metric bcalc = -.45*alog(bcalc) ` .299 c now calculate DB bcalc = max(24 5.(15.5 + bval))*bcalc + 5.5*(1. - bcalc) bcalc = exp((-bcalc/10.) * alog (10.)) c now, bcalc it actual tonality factor, for power c space. thr = thr*rnorm*bcalc c threshold is tonality factor times energy (with normalization) thr = max(thr,abslow) alpha = thr thr = min(thr,othr*2.) othr = alpha c write(*,*) `leaving thrgen` return end c And, the spreading function function sprdngf(j,i) real i,j real sprdngf c this calculates the value of the spreading function for c the i'th bark, with the center being the j'th c bark temp1 = i - j temp2 = 15.8811389 + 7.5*(templ + .474) temp2 = temp2 - 17.5*sqrt(1. + (templ + .474)* (temp1 + .474)) if( temp2 .1e. - 100.) then temp3 = 0. else temp2 = temp2/10.*alog(10.) temp3 = exp(temp2) end if sprdngf = temp3 return end ______________________________________
TABLE I ______________________________________ Absolute Threshold File - ("freqlist" for start-up routine) ______________________________________ 1 56 3. 111 16. 166 16. 221 50. 2 27. 57 4. 112 17. 167 16. 222 50. 3 18. 58 4. 113 17. 168 16. 223 50. 4 16. 59 5. 114 17. 169 16. 224 50. 5 10. 60 5. 115 17. 170 16. 225 50. 6 9. 61 5. 116 18. 171 17. 226 50. 7 8. 62 6. 117 18. 172 17. 227 50. 8 8. 63 6. 118 18. 173 17. 228 50. 9 8. 64 6. 119 18. 174 17. 229 50. 10 8. 65 6. 120 18. 175 17. 230 50. 11 8. 66 7. 121 18. 176 17. 231 50. 12 7. 67 7. 122 18. 177 18. 232 50. 13 7. 68 7. 123 18. 178 18. 233 50. 14 7. 69 8. 124 17. 179 18. 234 60. 15 7. 70 9. 125 17. 180 18. 235 60. 16 7. 71 10. 126 16. 181 18. 236 60. 17 7. 72 10. 127 16. 182 19. 237 60. 18 7. 73 10. 128 16. 183 19. 238 60. 19 7. 74 10. 129 16. 184 19. 239 60. 20 7. 75 10. 130 15. 185 19. 240 60. 21 7. 76 10. 131 15. 186 19. 241 60. 22 7. 77 10. 132 15. 187 20. 242 60. 23 7. 78 10. 133 15. 188 21. 243 60. 24 7. 79 10. 134 14. 189 22. 244 60. 25 6. 80 10. 135 14. 190 23. 245 60. 26 5. 81 11. 136 13. 191 24. 246 60. 27 5. 82 11. 137 12. 192 25. 247 60. 28 5. 83 11. 138 12. 193 26. 248 60. 29 5. 84 11. 139 12. 194 27. 249 60. 30 5. 85 11. 140 12. 195 28. 250 60. 31 4. 86 12. 141 12. 196 29. 251 60. 32 4. 87 12. 142 12. 197 30. 252 60. 33 4. 88 12. 143 12. 198 31. 253 60. 34 4. 89 12. 144 13. 199 32. 254 60. 35 4. 90 12. 145 13. 200 33. 255 60. 36 3. 91 12. 146 14. 201 34. 256 60. 37 3. 92 13. 147 14. 202 35. 257 60. 38 3. 93 13. 148 14. 203 36. 39 3. 94 13. 149 14. 204 37. 40 2. 95 13. 150 14. 205 38. 41 2. 96 13. 151 14. 206 39. 42 1. 97 13. 152 14. 207 40. 43 1. 98 14. 153 14. 208 41. 44 1. 99 14. 154 14. 209 42. 45 1. 100 14. 155 14. 210 43. 46 0. 101 14. 156 15. 211 44. 47 0. 102 15. 157 15. 212 45. 48 0. 103 15. 158 15. 213 46. 49 0. 104 15. 159 15. 214 47. 50 0. 105 15. 160 15. 215 48. 51 0. 106 15. 161 15. 216 49. 52 2. 107 16. 162 15. 217 50. 53 2. 108 16. 163 15. 218 50. 54 2. 109 16. 164 15. 219 50. 55 3. 110 16. 165 15. 220 50. ______________________________________
TABLE 2 ______________________________________ table of critical bands and fmin (used at 48 kHz sampling frequency) The upper band edge is set to 20 kHz (line 214 at block length 256, line 428 at block length 512) The following table is used at block length 512. The table for block length 256 can easily be calculated from the table for 512 block length. The tables for other sampling rates can also be calculated from this list. cb start width fmin ______________________________________ 1 0 4 .007 2 4 4 .007 3 8 4 .007 4 12 4 .007 5 16 4 .007 6 20 4 .007 7 24 4 .007 8 28 4 .01 9 32 4 .01 10 36 4 .01 11 40 6 .01 12 46 6 .0144 13 52 8 .0225 14 60 8 .04 15 68 12 .0625 16 80 12 .09 17 92 16 .09 18 108 20 .09 19 128 26 .1225 20 154 30 .1225 21 184 38 .16 22 222 50 .2025 23 272 70 .25 24 342 86 ______________________________________
______________________________________ 17 17 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 17 17 17 17 17 17 17 14 17 17 17 17 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 17 16 17 17 17 17 17 14 18 17 17 17 17 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 17 17 17 17 17 17 17 14 18 17 17 17 17 17 17 16 17 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 16 17 17 17 17 17 17 13 18 18 17 17 17 17 17 17 17 17 16 17 17 16 17 16 17 17 17 16 16 16 17 17 17 17 17 17 17 17 17 14 18 18 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 18 14 18 18 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 14 18 18 18 18 18 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 14 19 18 18 18 18 18 18 17 17 18 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 18 13 19 19 18 18 18 18 18 18 18 17 17 18 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 17 18 14 15 15 15 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 14 13 14 14 14 8 ______________________________________
Claims (20)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/622,313 USRE36714E (en) | 1989-10-18 | 1994-11-10 | Perceptual coding of audio signals |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/423,088 US5040217A (en) | 1989-10-18 | 1989-10-18 | Perceptual coding of audio signals |
US10649993A | 1993-08-13 | 1993-08-13 | |
US08/622,313 USRE36714E (en) | 1989-10-18 | 1994-11-10 | Perceptual coding of audio signals |
Related Parent Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US07/423,088 Reissue US5040217A (en) | 1989-10-18 | 1989-10-18 | Perceptual coding of audio signals |
US10649993A Continuation | 1989-10-18 | 1993-08-13 |
Publications (1)
Publication Number | Publication Date |
---|---|
USRE36714E true USRE36714E (en) | 2000-05-23 |
Family
ID=23677639
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US07/423,088 Ceased US5040217A (en) | 1989-10-18 | 1989-10-18 | Perceptual coding of audio signals |
US08/622,313 Expired - Lifetime USRE36714E (en) | 1989-10-18 | 1994-11-10 | Perceptual coding of audio signals |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US07/423,088 Ceased US5040217A (en) | 1989-10-18 | 1989-10-18 | Perceptual coding of audio signals |
Country Status (7)
Country | Link |
---|---|
US (2) | US5040217A (en) |
EP (1) | EP0424016B1 (en) |
JP (1) | JP2756515B2 (en) |
KR (1) | KR100209870B1 (en) |
CA (1) | CA2027136C (en) |
DE (1) | DE69028675T2 (en) |
HK (1) | HK220296A (en) |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6424939B1 (en) * | 1997-07-14 | 2002-07-23 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Method for coding an audio signal |
US6654716B2 (en) * | 2000-10-20 | 2003-11-25 | Telefonaktiebolaget Lm Ericsson | Perceptually improved enhancement of encoded acoustic signals |
US6766293B1 (en) * | 1997-07-14 | 2004-07-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Method for signalling a noise substitution during audio signal coding |
US20040267522A1 (en) * | 2001-07-16 | 2004-12-30 | Eric Allamanche | Method and device for characterising a signal and for producing an indexed signal |
US6915264B2 (en) | 2001-02-22 | 2005-07-05 | Lucent Technologies Inc. | Cochlear filter bank structure for determining masked thresholds for use in perceptual audio coding |
US20050289066A1 (en) * | 2000-08-11 | 2005-12-29 | Microsoft Corporation | Audio fingerprinting |
USRE39080E1 (en) | 1988-12-30 | 2006-04-25 | Lucent Technologies Inc. | Rate loop processor for perceptual encoder/decoder |
US20070140499A1 (en) * | 2004-03-01 | 2007-06-21 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US7333930B2 (en) | 2003-03-14 | 2008-02-19 | Agere Systems Inc. | Tonal analysis for perceptual audio coding using a compressed spectral representation |
USRE40280E1 (en) | 1988-12-30 | 2008-04-29 | Lucent Technologies Inc. | Rate loop processor for perceptual encoder/decoder |
US20080195654A1 (en) * | 2001-08-20 | 2008-08-14 | Microsoft Corporation | System and methods for providing adaptive media property classification |
US20090264960A1 (en) * | 2007-07-13 | 2009-10-22 | Advanced Bionics, Llc | Tonality-Based Optimization of Sound Sensation for a Cochlear Implant Patient |
US20100042406A1 (en) * | 2002-03-04 | 2010-02-18 | James David Johnston | Audio signal processing using improved perceptual model |
US20100272285A1 (en) * | 2009-04-22 | 2010-10-28 | General Electric Company | Masking of pure tones within sound from a noise generating source |
Families Citing this family (135)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5341457A (en) * | 1988-12-30 | 1994-08-23 | At&T Bell Laboratories | Perceptual coding of audio signals |
US5434948A (en) * | 1989-06-15 | 1995-07-18 | British Telecommunications Public Limited Company | Polyphonic coding |
US5115240A (en) * | 1989-09-26 | 1992-05-19 | Sony Corporation | Method and apparatus for encoding voice signals divided into a plurality of frequency bands |
JP2969862B2 (en) * | 1989-10-04 | 1999-11-02 | 松下電器産業株式会社 | Voice recognition device |
JPH03139700A (en) * | 1989-10-25 | 1991-06-13 | Sony Corp | Audio signal reproducing device |
US6695477B1 (en) * | 1989-10-25 | 2004-02-24 | Sony Corporation | Audio signal reproducing apparatus |
US5388181A (en) * | 1990-05-29 | 1995-02-07 | Anderson; David J. | Digital audio compression system |
DE4020656A1 (en) * | 1990-06-29 | 1992-01-02 | Thomson Brandt Gmbh | METHOD FOR TRANSMITTING A SIGNAL |
US5317672A (en) * | 1991-03-05 | 1994-05-31 | Picturetel Corporation | Variable bit rate speech encoder |
CA2105387A1 (en) * | 1991-03-05 | 1992-09-06 | Antony H. Crossman | Variable bit rate speech encoder |
KR100268623B1 (en) * | 1991-06-28 | 2000-10-16 | 이데이 노부유끼 | Compressed data recording and/or reproducing apparatus and signal processing method |
US5388209A (en) * | 1991-08-02 | 1995-02-07 | Sony Corporation | Apparatus for high-speed recording compressed digital data with increased compression |
DE4212339A1 (en) * | 1991-08-12 | 1993-02-18 | Standard Elektrik Lorenz Ag | CODING PROCESS FOR AUDIO SIGNALS WITH 32 KBIT / S |
EP0786874B1 (en) * | 1991-09-30 | 2000-08-16 | Sony Corporation | Method and apparatus for audio data compression |
JP3310682B2 (en) * | 1992-01-21 | 2002-08-05 | 日本ビクター株式会社 | Audio signal encoding method and reproduction method |
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
JP3446216B2 (en) * | 1992-03-06 | 2003-09-16 | ソニー株式会社 | Audio signal processing method |
US5495552A (en) * | 1992-04-20 | 1996-02-27 | Mitsubishi Denki Kabushiki Kaisha | Methods of efficiently recording an audio signal in semiconductor memory |
JP3278900B2 (en) * | 1992-05-07 | 2002-04-30 | ソニー株式会社 | Data encoding apparatus and method |
JP3153933B2 (en) * | 1992-06-16 | 2001-04-09 | ソニー株式会社 | Data encoding device and method and data decoding device and method |
JP3508146B2 (en) * | 1992-09-11 | 2004-03-22 | ソニー株式会社 | Digital signal encoding / decoding device, digital signal encoding device, and digital signal decoding device |
JP3185413B2 (en) * | 1992-11-25 | 2001-07-09 | ソニー株式会社 | Orthogonal transform operation and inverse orthogonal transform operation method and apparatus, digital signal encoding and / or decoding apparatus |
JP3123286B2 (en) * | 1993-02-18 | 2001-01-09 | ソニー株式会社 | Digital signal processing device or method, and recording medium |
JP3186292B2 (en) * | 1993-02-02 | 2001-07-11 | ソニー株式会社 | High efficiency coding method and apparatus |
US5579404A (en) * | 1993-02-16 | 1996-11-26 | Dolby Laboratories Licensing Corporation | Digital audio limiter |
JP3186307B2 (en) * | 1993-03-09 | 2001-07-11 | ソニー株式会社 | Compressed data recording apparatus and method |
JP3123290B2 (en) * | 1993-03-09 | 2001-01-09 | ソニー株式会社 | Compressed data recording device and method, compressed data reproducing method, recording medium |
JP3173218B2 (en) * | 1993-05-10 | 2001-06-04 | ソニー株式会社 | Compressed data recording method and apparatus, compressed data reproducing method, and recording medium |
US5581654A (en) * | 1993-05-25 | 1996-12-03 | Sony Corporation | Method and apparatus for information encoding and decoding |
KR100458969B1 (en) * | 1993-05-31 | 2005-04-06 | 소니 가부시끼 가이샤 | Signal encoding or decoding apparatus, and signal encoding or decoding method |
EP0663739B1 (en) * | 1993-06-30 | 2001-08-22 | Sony Corporation | Digital signal encoding device, its decoding device, and its recording medium |
US5664057A (en) * | 1993-07-07 | 1997-09-02 | Picturetel Corporation | Fixed bit rate speech encoder/decoder |
TW272341B (en) * | 1993-07-16 | 1996-03-11 | Sony Co Ltd | |
TW327223B (en) * | 1993-09-28 | 1998-02-21 | Sony Co Ltd | Methods and apparatus for encoding an input signal broken into frequency components, methods and apparatus for decoding such encoded signal |
DE69428435T2 (en) * | 1993-11-04 | 2002-07-11 | Sony Corp | SIGNAL ENCODERS, SIGNAL DECODERS, RECORD CARRIERS AND SIGNAL ENCODER METHODS |
WO1995013660A1 (en) * | 1993-11-09 | 1995-05-18 | Sony Corporation | Quantization apparatus, quantization method, high efficiency encoder, high efficiency encoding method, decoder, high efficiency encoder and recording media |
US5488665A (en) * | 1993-11-23 | 1996-01-30 | At&T Corp. | Multi-channel perceptual audio compression system with encoding mode switching among matrixed channels |
US5608713A (en) * | 1994-02-09 | 1997-03-04 | Sony Corporation | Bit allocation of digital audio signal blocks by non-linear processing |
JPH07261797A (en) * | 1994-03-18 | 1995-10-13 | Mitsubishi Electric Corp | Signal encoding device and signal decoding device |
JP3186412B2 (en) * | 1994-04-01 | 2001-07-11 | ソニー株式会社 | Information encoding method, information decoding method, and information transmission method |
CN1103141C (en) * | 1994-04-01 | 2003-03-12 | 索尼公司 | Method and device for encoding information, method and device for decoding information, information transmitting method, and information recording medium |
JP3277677B2 (en) * | 1994-04-01 | 2002-04-22 | ソニー株式会社 | Signal encoding method and apparatus, signal recording medium, signal transmission method, and signal decoding method and apparatus |
JPH07322252A (en) * | 1994-05-23 | 1995-12-08 | Canon Inc | Image coder |
JP3250376B2 (en) * | 1994-06-13 | 2002-01-28 | ソニー株式会社 | Information encoding method and apparatus, and information decoding method and apparatus |
JP3277699B2 (en) * | 1994-06-13 | 2002-04-22 | ソニー株式会社 | Signal encoding method and apparatus, and signal decoding method and apparatus |
JP3277705B2 (en) | 1994-07-27 | 2002-04-22 | ソニー株式会社 | Information encoding apparatus and method, and information decoding apparatus and method |
JP3341474B2 (en) * | 1994-07-28 | 2002-11-05 | ソニー株式会社 | Information encoding method and decoding method, information encoding device and decoding device, and information recording medium |
US5625743A (en) * | 1994-10-07 | 1997-04-29 | Motorola, Inc. | Determining a masking level for a subband in a subband audio encoder |
ATE211869T1 (en) * | 1994-10-28 | 2002-01-15 | Rai Radiotelevisione Italiana | SUB-BAND CODING WITH TONE-BASED PREDICTION CODING IN EACH SINGLE SUB-BAND |
US5654952A (en) * | 1994-10-28 | 1997-08-05 | Sony Corporation | Digital signal encoding method and apparatus and recording medium |
KR970011727B1 (en) * | 1994-11-09 | 1997-07-14 | Daewoo Electronics Co Ltd | Apparatus for encoding of the audio signal |
JP3557674B2 (en) * | 1994-12-15 | 2004-08-25 | ソニー株式会社 | High efficiency coding method and apparatus |
JP3371590B2 (en) * | 1994-12-28 | 2003-01-27 | ソニー株式会社 | High efficiency coding method and high efficiency decoding method |
US5646961A (en) * | 1994-12-30 | 1997-07-08 | Lucent Technologies Inc. | Method for noise weighting filtering |
EP0720316B1 (en) * | 1994-12-30 | 1999-12-08 | Daewoo Electronics Co., Ltd | Adaptive digital audio encoding apparatus and a bit allocation method thereof |
KR970003559Y1 (en) * | 1994-12-30 | 1997-04-18 | 기아자동차 주식회사 | Instrument core being capable of resisting heat deformations |
US5682463A (en) * | 1995-02-06 | 1997-10-28 | Lucent Technologies Inc. | Perceptual audio compression based on loudness uncertainty |
US5699479A (en) * | 1995-02-06 | 1997-12-16 | Lucent Technologies Inc. | Tonality for perceptual audio compression based on loudness uncertainty |
FR2730594B1 (en) * | 1995-02-09 | 1997-03-21 | France Telecom | VECTOR QUANTIFICATION METHOD ON A CONSTRAINED FLOW NETWORK |
KR100346734B1 (en) * | 1995-09-22 | 2002-11-23 | 삼성전자 주식회사 | Audio coder and decoder having high speed analyzing filter and composite filter |
JPH1084284A (en) * | 1996-09-06 | 1998-03-31 | Sony Corp | Signal reproducing method and device |
JP3496411B2 (en) * | 1996-10-30 | 2004-02-09 | ソニー株式会社 | Information encoding method and decoding device |
US6147010A (en) * | 1996-11-14 | 2000-11-14 | Micron Technology, Inc. | Solvent prewet and method to dispense the solvent prewet |
US6782365B1 (en) | 1996-12-20 | 2004-08-24 | Qwest Communications International Inc. | Graphic interface system and product for editing encoded audio data |
US5845251A (en) * | 1996-12-20 | 1998-12-01 | U S West, Inc. | Method, system and product for modifying the bandwidth of subband encoded audio data |
US6516299B1 (en) | 1996-12-20 | 2003-02-04 | Qwest Communication International, Inc. | Method, system and product for modifying the dynamic range of encoded audio signals |
US5864820A (en) * | 1996-12-20 | 1999-01-26 | U S West, Inc. | Method, system and product for mixing of encoded audio signals |
US5864813A (en) * | 1996-12-20 | 1999-01-26 | U S West, Inc. | Method, system and product for harmonic enhancement of encoded audio signals |
US6463405B1 (en) | 1996-12-20 | 2002-10-08 | Eliot M. Case | Audiophile encoding of digital audio data using 2-bit polarity/magnitude indicator and 8-bit scale factor for each subband |
US6477496B1 (en) | 1996-12-20 | 2002-11-05 | Eliot M. Case | Signal synthesis by decoding subband scale factors from one audio signal and subband samples from different one |
SE512719C2 (en) * | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | A method and apparatus for reducing data flow based on harmonic bandwidth expansion |
US6266419B1 (en) * | 1997-07-03 | 2001-07-24 | At&T Corp. | Custom character-coding compression for encoding and watermarking media content |
US6111844A (en) * | 1997-07-03 | 2000-08-29 | At&T Corp. | Quality degradation through compression/decompression |
DE19840853B4 (en) * | 1997-10-24 | 2004-12-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Methods and devices for encoding an audio signal |
US6091773A (en) * | 1997-11-12 | 2000-07-18 | Sydorenko; Mark R. | Data compression method and apparatus |
US6243424B1 (en) | 1998-03-27 | 2001-06-05 | Ibiguity Digital Corporation | Method and apparatus for AM digital broadcasting |
US20060265091A1 (en) * | 1998-08-14 | 2006-11-23 | Bruce Mihura | Audio entertainment system for storing and playing audio information |
US6704705B1 (en) | 1998-09-04 | 2004-03-09 | Nortel Networks Limited | Perceptual audio coding |
US6122610A (en) * | 1998-09-23 | 2000-09-19 | Verance Corporation | Noise suppression for low bitrate speech coder |
JP2000165251A (en) * | 1998-11-27 | 2000-06-16 | Matsushita Electric Ind Co Ltd | Audio signal coding device and microphone realizing the same |
US6418404B1 (en) | 1998-12-28 | 2002-07-09 | Sony Corporation | System and method for effectively implementing fixed masking thresholds in an audio encoder device |
SE9903553D0 (en) | 1999-01-27 | 1999-10-01 | Lars Liljeryd | Enhancing conceptual performance of SBR and related coding methods by adaptive noise addition (ANA) and noise substitution limiting (NSL) |
US6480820B1 (en) | 1999-09-20 | 2002-11-12 | Advanced Cochlear Systems, Inc. | Method of processing auditory data |
DE19947877C2 (en) * | 1999-10-05 | 2001-09-13 | Fraunhofer Ges Forschung | Method and device for introducing information into a data stream and method and device for encoding an audio signal |
US7215776B1 (en) | 1999-11-09 | 2007-05-08 | University Of New Hampshire | Method and apparatus for the compression and decompression of audio files using a chaotic system |
US7286670B2 (en) * | 1999-11-09 | 2007-10-23 | Chaoticom, Inc. | Method and apparatus for chaotic opportunistic lossless compression of data |
US7215772B2 (en) * | 1999-11-09 | 2007-05-08 | Chaoticom, Inc. | Method and apparatus for remote digital key generation |
US6549544B1 (en) * | 1999-11-10 | 2003-04-15 | Ibiquity Digital Corporation | Method and apparatus for transmission and reception of FM in-band on-channel digital audio broadcasting |
US6523147B1 (en) | 1999-11-11 | 2003-02-18 | Ibiquity Digital Corporation | Method and apparatus for forward error correction coding for an AM in-band on-channel digital audio broadcasting system |
US6499010B1 (en) | 2000-01-04 | 2002-12-24 | Agere Systems Inc. | Perceptual audio coder bit allocation scheme providing improved perceptual quality consistency |
SE0001926D0 (en) | 2000-05-23 | 2000-05-23 | Lars Liljeryd | Improved spectral translation / folding in the subband domain |
US7110547B2 (en) * | 2000-06-20 | 2006-09-19 | University Of New Hampshire | Method and apparatus for the compression and decompression of image files using a chaotic system |
CA2414093A1 (en) * | 2000-06-20 | 2001-12-27 | University Of New Hampshire | Method and apparatus for the compression and decompression of audio files using a chaotic system |
US6732180B1 (en) | 2000-08-08 | 2004-05-04 | The University Of Tulsa | Method to inhibit the identification and retrieval of proprietary media via automated search engines utilized in association with computer compatible communications network |
US7447639B2 (en) | 2001-01-24 | 2008-11-04 | Nokia Corporation | System and method for error concealment in digital audio transmission |
US7069208B2 (en) * | 2001-01-24 | 2006-06-27 | Nokia, Corp. | System and method for concealment of data loss in digital audio transmission |
WO2003036621A1 (en) * | 2001-10-22 | 2003-05-01 | Motorola, Inc., A Corporation Of The State Of Delaware | Method and apparatus for enhancing loudness of an audio signal |
US7240001B2 (en) | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
GB2388502A (en) * | 2002-05-10 | 2003-11-12 | Chris Dunn | Compression of frequency domain audio signals |
US20040165734A1 (en) * | 2003-03-20 | 2004-08-26 | Bing Li | Audio system for a vehicle |
US6813661B2 (en) | 2003-03-20 | 2004-11-02 | Bing Li | Portable audio system |
US7724827B2 (en) * | 2003-09-07 | 2010-05-25 | Microsoft Corporation | Multi-layer run level encoding and decoding |
US7447317B2 (en) | 2003-10-02 | 2008-11-04 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V | Compatible multi-channel coding/decoding by weighting the downmix channel |
US7460990B2 (en) | 2004-01-23 | 2008-12-02 | Microsoft Corporation | Efficient coding of digital media spectral data using wide-sense perceptual similarity |
US20050172154A1 (en) * | 2004-01-29 | 2005-08-04 | Chaoticom, Inc. | Systems and methods for providing digital content and caller alerts to wireless network-enabled devices |
US7539870B2 (en) * | 2004-02-10 | 2009-05-26 | Microsoft Corporation | Media watermarking by biasing randomized statistics |
US7505902B2 (en) * | 2004-07-28 | 2009-03-17 | University Of Maryland | Discrimination of components of audio signals based on multiscale spectro-temporal modulations |
US7676362B2 (en) * | 2004-12-31 | 2010-03-09 | Motorola, Inc. | Method and apparatus for enhancing loudness of a speech signal |
US8280730B2 (en) | 2005-05-25 | 2012-10-02 | Motorola Mobility Llc | Method and apparatus of increasing speech intelligibility in noisy environments |
JP4639966B2 (en) * | 2005-05-31 | 2011-02-23 | ヤマハ株式会社 | Audio data compression method, audio data compression circuit, and audio data expansion circuit |
KR100718132B1 (en) * | 2005-06-24 | 2007-05-14 | 삼성전자주식회사 | Method and apparatus for generating bitstream of audio signal, audio encoding/decoding method and apparatus thereof |
US7630882B2 (en) * | 2005-07-15 | 2009-12-08 | Microsoft Corporation | Frequency segmentation to obtain bands for efficient coding of digital media |
US7562021B2 (en) * | 2005-07-15 | 2009-07-14 | Microsoft Corporation | Modification of codewords in dictionary used for efficient coding of digital media spectral data |
US8599925B2 (en) * | 2005-08-12 | 2013-12-03 | Microsoft Corporation | Efficient coding and decoding of transform blocks |
EP1943642A4 (en) * | 2005-09-27 | 2009-07-01 | Lg Electronics Inc | Method and apparatus for encoding/decoding multi-channel audio signal |
CN101427307B (en) * | 2005-09-27 | 2012-03-07 | Lg电子株式会社 | Method and apparatus for encoding/decoding multi-channel audio signal |
ES2296489B1 (en) * | 2005-12-02 | 2009-04-01 | Cesar Alonso Abad | SCALABLE METHOD OF AUDIO AND IMAGE COMPRESSION. |
KR101355376B1 (en) * | 2007-04-30 | 2014-01-23 | 삼성전자주식회사 | Method and apparatus for encoding and decoding high frequency band |
KR101411900B1 (en) * | 2007-05-08 | 2014-06-26 | 삼성전자주식회사 | Method and apparatus for encoding and decoding audio signal |
US7774205B2 (en) * | 2007-06-15 | 2010-08-10 | Microsoft Corporation | Coding of sparse digital media spectral data |
US7761290B2 (en) | 2007-06-15 | 2010-07-20 | Microsoft Corporation | Flexible frequency and time partitioning in perceptual transform coding of audio |
EP2162880B1 (en) * | 2007-06-22 | 2014-12-24 | VoiceAge Corporation | Method and device for estimating the tonality of a sound signal |
US8046214B2 (en) * | 2007-06-22 | 2011-10-25 | Microsoft Corporation | Low complexity decoder for complex transform coding of multi-channel sound |
US7885819B2 (en) | 2007-06-29 | 2011-02-08 | Microsoft Corporation | Bitstream syntax for multi-process audio decoding |
US8249883B2 (en) | 2007-10-26 | 2012-08-21 | Microsoft Corporation | Channel extension coding for multi-channel source |
KR101454867B1 (en) | 2008-03-24 | 2014-10-28 | 삼성전자주식회사 | Method and apparatus for audio signal compression |
KR101619972B1 (en) | 2008-10-02 | 2016-05-11 | 한국전자통신연구원 | Apparatus and method for coding/decoding image selectivly using descrete cosine/sine transtorm |
US20100324915A1 (en) * | 2009-06-23 | 2010-12-23 | Electronic And Telecommunications Research Institute | Encoding and decoding apparatuses for high quality multi-channel audio codec |
US20110257978A1 (en) * | 2009-10-23 | 2011-10-20 | Brainlike, Inc. | Time Series Filtering, Data Reduction and Voice Recognition in Communication Device |
US8925024B2 (en) | 2009-12-31 | 2014-12-30 | The Nielsen Company (Us), Llc | Methods and apparatus to detect commercial advertisements associated with media presentations |
JP5732994B2 (en) * | 2011-04-19 | 2015-06-10 | ソニー株式会社 | Music searching apparatus and method, program, and recording medium |
EP2717263B1 (en) * | 2012-10-05 | 2016-11-02 | Nokia Technologies Oy | Method, apparatus, and computer program product for categorical spatial analysis-synthesis on the spectrum of a multichannel audio signal |
ES2834929T3 (en) * | 2013-01-29 | 2021-06-21 | Fraunhofer Ges Forschung | Filled with noise in perceptual transform audio coding |
US9848222B2 (en) | 2015-07-15 | 2017-12-19 | The Nielsen Company (Us), Llc | Methods and apparatus to detect spillover |
US10043527B1 (en) | 2015-07-17 | 2018-08-07 | Digimarc Corporation | Human auditory system modeling with masking energy adaptation |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US28276A (en) * | 1860-05-15 | Bedstead-fastening | ||
US28488A (en) * | 1860-05-29 | Eataet-machiite | ||
US3420742A (en) * | 1964-10-16 | 1969-01-07 | Dairy Technics Inc | Milk fermenting product and method of making same |
USRE28276E (en) | 1963-06-06 | 1974-12-17 | Milk fermenting product and method of making same | |
USRE28488E (en) | 1973-11-19 | 1975-07-22 | Milk fermenting product | |
US4972484A (en) * | 1986-11-21 | 1990-11-20 | Bayerische Rundfunkwerbung Gmbh | Method of transmitting or storing masked sub-band coded audio signals |
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
US5535300A (en) * | 1988-12-30 | 1996-07-09 | At&T Corp. | Perceptual coding of audio signals using entropy coding and/or multiple power spectra |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS59129900A (en) * | 1983-01-18 | 1984-07-26 | 日本電信電話株式会社 | Band division coding system |
JPS6027459A (en) * | 1983-07-22 | 1985-02-12 | Sumitomo Metal Ind Ltd | Device for preventing corrosion of current-conducting roll and billet |
JPS60159800A (en) * | 1984-01-30 | 1985-08-21 | 日本電気株式会社 | Adaptive forecast transform encoding system |
CA1229681A (en) * | 1984-03-06 | 1987-11-24 | Kazunori Ozawa | Method and apparatus for speech-band signal coding |
JPH0833746B2 (en) * | 1987-02-17 | 1996-03-29 | シャープ株式会社 | Band division coding device for voice and musical sound |
JPH032559Y2 (en) * | 1987-03-11 | 1991-01-23 | ||
JPS63237100A (en) * | 1987-03-26 | 1988-10-03 | 沖電気工業株式会社 | Voice detector |
JPS63281200A (en) * | 1987-05-14 | 1988-11-17 | 沖電気工業株式会社 | Voice section detecting system |
JP2586043B2 (en) * | 1987-05-14 | 1997-02-26 | 日本電気株式会社 | Multi-pulse encoder |
JPS6446880A (en) * | 1987-08-17 | 1989-02-21 | Fuji Xerox Co Ltd | Image processor |
-
1989
- 1989-10-18 US US07/423,088 patent/US5040217A/en not_active Ceased
-
1990
- 1990-10-09 CA CA002027136A patent/CA2027136C/en not_active Expired - Lifetime
- 1990-10-10 EP EP90311084A patent/EP0424016B1/en not_active Expired - Lifetime
- 1990-10-10 DE DE69028675T patent/DE69028675T2/en not_active Expired - Lifetime
- 1990-10-16 KR KR1019900016433A patent/KR100209870B1/en not_active IP Right Cessation
- 1990-10-17 JP JP2276617A patent/JP2756515B2/en not_active Expired - Lifetime
-
1994
- 1994-11-10 US US08/622,313 patent/USRE36714E/en not_active Expired - Lifetime
-
1996
- 1996-12-24 HK HK220296A patent/HK220296A/en not_active IP Right Cessation
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US28276A (en) * | 1860-05-15 | Bedstead-fastening | ||
US28488A (en) * | 1860-05-29 | Eataet-machiite | ||
USRE28276E (en) | 1963-06-06 | 1974-12-17 | Milk fermenting product and method of making same | |
US3420742A (en) * | 1964-10-16 | 1969-01-07 | Dairy Technics Inc | Milk fermenting product and method of making same |
USRE28488E (en) | 1973-11-19 | 1975-07-22 | Milk fermenting product | |
US4972484A (en) * | 1986-11-21 | 1990-11-20 | Bayerische Rundfunkwerbung Gmbh | Method of transmitting or storing masked sub-band coded audio signals |
US5535300A (en) * | 1988-12-30 | 1996-07-09 | At&T Corp. | Perceptual coding of audio signals using entropy coding and/or multiple power spectra |
US5285498A (en) * | 1992-03-02 | 1994-02-08 | At&T Bell Laboratories | Method and apparatus for coding audio signals based on perceptual model |
Non-Patent Citations (16)
Title |
---|
"Critical Bands", Foundations of Modern Auditory Theory, J. V. Tobias, Chapter 5, B. Scharf, Academic Press, New York, 1970. |
"Digital audio tape for data storage", IEEE Spectrum, Oct. 1989, pp. 34-38, E. Tan and B. Vermeulen. |
"MSC: Stereo Audio Coding With CD-Quality and 256 kBIT/SEC", IEEE Transactions on Consumer Electronics, vol. CE-33, No. 4, Nov. 1987, pp. 512-519, E. F. Schroeder and H. J. Platte. |
"Optimizing digital speech coders by exploiting masking properties of the human ear", Journal of Acoustical Society of America, vol. 66 (6), Dec., 1979, pp. 1647-1652, M.R. Schroeder et al. |
"Sub-band Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation," IEEE ICASSP, 1987, J. Princen et al., pp. 2161-2164. |
"Transform Coding of Audio Signals Using Perceptual Noise Criteria", IEEE Journal On Selected Areas In Communications, vol. 6, No. 2, Feb. 1988, pp. 314-323, J. D. Johnston. |
Critical Bands , Foundations of Modern Auditory Theory , J. V. Tobias, Chapter 5, B. Scharf, Academic Press, New York, 1970. * |
Digital audio tape for data storage , IEEE Spectrum , Oct. 1989, pp. 34 38, E. Tan and B. Vermeulen. * |
FX/FORTRAN Programmer s Handbook , Alliant Computer Systems Corp., Jul. 1988. * |
FX/FORTRAN Programmer's Handbook, Alliant Computer Systems Corp., Jul. 1988. |
MSC: Stereo Audio Coding With CD Quality and 256 kBIT/SEC , IEEE Transactions on Consumer Electronics , vol. CE 33, No. 4, Nov. 1987, pp. 512 519, E. F. Schroeder and H. J. Platte. * |
N. S. Jayant and P. Noll, Digital Coding of Waveforms Principles and Applications to Speech and Video, Chapter 12, Transform Coding , 1987. * |
N. S. Jayant and P. Noll, Digital Coding of Waveforms--Principles and Applications to Speech and Video, Chapter 12, "Transform Coding", 1987. |
Optimizing digital speech coders by exploiting masking properties of the human ear , Journal of Acoustical Society of America, vol. 66 (6), Dec., 1979, pp. 1647 1652, M.R. Schroeder et al. * |
Sub band Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation, IEEE ICASSP , 1987, J. Princen et al., pp. 2161 2164. * |
Transform Coding of Audio Signals Using Perceptual Noise Criteria , IEEE Journal On Selected Areas In Communications , vol. 6, No. 2, Feb. 1988, pp. 314 323, J. D. Johnston. * |
Cited By (44)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
USRE40280E1 (en) | 1988-12-30 | 2008-04-29 | Lucent Technologies Inc. | Rate loop processor for perceptual encoder/decoder |
USRE39080E1 (en) | 1988-12-30 | 2006-04-25 | Lucent Technologies Inc. | Rate loop processor for perceptual encoder/decoder |
US6766293B1 (en) * | 1997-07-14 | 2004-07-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Method for signalling a noise substitution during audio signal coding |
US6424939B1 (en) * | 1997-07-14 | 2002-07-23 | Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. | Method for coding an audio signal |
US7240207B2 (en) * | 2000-08-11 | 2007-07-03 | Microsoft Corporation | Fingerprinting media entities employing fingerprint algorithms and bit-to-bit comparisons |
US20050289066A1 (en) * | 2000-08-11 | 2005-12-29 | Microsoft Corporation | Audio fingerprinting |
US20050289065A1 (en) * | 2000-08-11 | 2005-12-29 | Microsoft Corporation | Audio fingerprinting |
US7080253B2 (en) * | 2000-08-11 | 2006-07-18 | Microsoft Corporation | Audio fingerprinting |
US6654716B2 (en) * | 2000-10-20 | 2003-11-25 | Telefonaktiebolaget Lm Ericsson | Perceptually improved enhancement of encoded acoustic signals |
US6915264B2 (en) | 2001-02-22 | 2005-07-05 | Lucent Technologies Inc. | Cochlear filter bank structure for determining masked thresholds for use in perceptual audio coding |
US7478045B2 (en) * | 2001-07-16 | 2009-01-13 | M2Any Gmbh | Method and device for characterizing a signal and method and device for producing an indexed signal |
US20040267522A1 (en) * | 2001-07-16 | 2004-12-30 | Eric Allamanche | Method and device for characterising a signal and for producing an indexed signal |
US8082279B2 (en) | 2001-08-20 | 2011-12-20 | Microsoft Corporation | System and methods for providing adaptive media property classification |
US20080195654A1 (en) * | 2001-08-20 | 2008-08-14 | Microsoft Corporation | System and methods for providing adaptive media property classification |
US20100042406A1 (en) * | 2002-03-04 | 2010-02-18 | James David Johnston | Audio signal processing using improved perceptual model |
US7333930B2 (en) | 2003-03-14 | 2008-02-19 | Agere Systems Inc. | Tonal analysis for perceptual audio coding using a compressed spectral representation |
US20080031463A1 (en) * | 2004-03-01 | 2008-02-07 | Davis Mark F | Multichannel audio coding |
US9454969B2 (en) | 2004-03-01 | 2016-09-27 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US11308969B2 (en) | 2004-03-01 | 2022-04-19 | Dolby Laboratories Licensing Corporation | Methods and apparatus for reconstructing audio signals with decorrelation and differentially coded parameters |
EP1914722A1 (en) | 2004-03-01 | 2008-04-23 | Dolby Laboratories Licensing Corporation | Multichannel audio decoding |
EP2224430A2 (en) | 2004-03-01 | 2010-09-01 | Dolby Laboratories Licensing Corporation | Multichannel audio decoding |
US10796706B2 (en) | 2004-03-01 | 2020-10-06 | Dolby Laboratories Licensing Corporation | Methods and apparatus for reconstructing audio signals with decorrelation and differentially coded parameters |
US20070140499A1 (en) * | 2004-03-01 | 2007-06-21 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US8170882B2 (en) | 2004-03-01 | 2012-05-01 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US10460740B2 (en) | 2004-03-01 | 2019-10-29 | Dolby Laboratories Licensing Corporation | Methods and apparatus for adjusting a level of an audio signal |
US10403297B2 (en) | 2004-03-01 | 2019-09-03 | Dolby Laboratories Licensing Corporation | Methods and apparatus for adjusting a level of an audio signal |
US10269364B2 (en) | 2004-03-01 | 2019-04-23 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US8983834B2 (en) | 2004-03-01 | 2015-03-17 | Dolby Laboratories Licensing Corporation | Multichannel audio coding |
US9311922B2 (en) | 2004-03-01 | 2016-04-12 | Dolby Laboratories Licensing Corporation | Method, apparatus, and storage medium for decoding encoded audio channels |
EP2065885A1 (en) | 2004-03-01 | 2009-06-03 | Dolby Laboratories Licensing Corporation | Multichannel audio decoding |
US9520135B2 (en) | 2004-03-01 | 2016-12-13 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US9640188B2 (en) | 2004-03-01 | 2017-05-02 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US9672839B1 (en) | 2004-03-01 | 2017-06-06 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9691405B1 (en) | 2004-03-01 | 2017-06-27 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9691404B2 (en) | 2004-03-01 | 2017-06-27 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US9697842B1 (en) | 2004-03-01 | 2017-07-04 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9704499B1 (en) | 2004-03-01 | 2017-07-11 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US9715882B2 (en) | 2004-03-01 | 2017-07-25 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques |
US9779745B2 (en) | 2004-03-01 | 2017-10-03 | Dolby Laboratories Licensing Corporation | Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters |
US8914124B2 (en) | 2007-07-13 | 2014-12-16 | Advanced Bionics Ag | Tonality-based optimization of sound sensation for a cochlear implant patient |
US8412340B2 (en) * | 2007-07-13 | 2013-04-02 | Advanced Bionics, Llc | Tonality-based optimization of sound sensation for a cochlear implant patient |
US20090264960A1 (en) * | 2007-07-13 | 2009-10-22 | Advanced Bionics, Llc | Tonality-Based Optimization of Sound Sensation for a Cochlear Implant Patient |
US8223985B2 (en) | 2009-04-22 | 2012-07-17 | General Electric Company | Masking of pure tones within sound from a noise generating source |
US20100272285A1 (en) * | 2009-04-22 | 2010-10-28 | General Electric Company | Masking of pure tones within sound from a noise generating source |
Also Published As
Publication number | Publication date |
---|---|
JP2756515B2 (en) | 1998-05-25 |
DE69028675D1 (en) | 1996-10-31 |
CA2027136A1 (en) | 1991-04-19 |
US5040217A (en) | 1991-08-13 |
JPH03144700A (en) | 1991-06-20 |
KR910008975A (en) | 1991-05-31 |
DE69028675T2 (en) | 1997-02-13 |
HK220296A (en) | 1997-01-03 |
KR100209870B1 (en) | 1999-07-15 |
EP0424016A3 (en) | 1992-09-02 |
CA2027136C (en) | 2001-01-16 |
EP0424016B1 (en) | 1996-09-25 |
EP0424016A2 (en) | 1991-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
USRE36714E (en) | Perceptual coding of audio signals | |
US5341457A (en) | Perceptual coding of audio signals | |
USRE36683E (en) | Apparatus and method for audio data compression and expansion with reduced block floating overhead | |
US5664056A (en) | Digital encoder with dynamic quantization bit allocation | |
CA2140779C (en) | Method, apparatus and recording medium for coding of separated tone and noise characteristics spectral components of an acoustic signal | |
US5357594A (en) | Encoding and decoding using specially designed pairs of analysis and synthesis windows | |
KR100310214B1 (en) | Signal encoding or decoding device and recording medium | |
JP3277692B2 (en) | Information encoding method, information decoding method, and information recording medium | |
US5625746A (en) | Method and apparatus for encoding, decoding and compression of audio-type data | |
US5774844A (en) | Methods and apparatus for quantizing, encoding and decoding and recording media therefor | |
EP0545017A2 (en) | Data compression method and apparatus in which quantizing bits are allocated to a block in a present frame in response to the block in a past frame | |
EP0717392B1 (en) | Encoding method, decoding method, encoding-decoding method, encoder, decoder, and encoder-decoder | |
JPH08190764A (en) | Method and device for processing digital signal and recording medium | |
JPH0830295A (en) | Method and device for digital/audio signal recording and reproducing | |
JPS61112433A (en) | Frequency region voice encoding method and device | |
PL207862B1 (en) | Low bit-rate audio coding | |
US5754127A (en) | Information encoding method and apparatus, and information decoding method and apparatus | |
EP0376553B1 (en) | Perceptual coding of audio signals | |
JPH09134200A (en) | Method and apparatus for digital audio coding | |
KR100556505B1 (en) | Reproducing and recording apparatus, decoding apparatus, recording apparatus, reproducing and recording method, decoding method and recording method | |
JP3277699B2 (en) | Signal encoding method and apparatus, and signal decoding method and apparatus | |
JP3353868B2 (en) | Audio signal conversion encoding method and decoding method | |
US6647063B1 (en) | Information encoding method and apparatus, information decoding method and apparatus and recording medium | |
JPH11330974A (en) | Encoding method and device, decoding method and device, digital signal recording method and device, recording medium and digital transmitting method and device | |
JP3465698B2 (en) | Signal decoding method and apparatus |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
AS | Assignment |
Owner name: THE CHASE MANHATTAN BANK, AS COLLATERAL AGENT, TEX Free format text: CONDITIONAL ASSIGNMENT OF AND SECURITY INTEREST IN PATENT RIGHTS;ASSIGNOR:LUCENT TECHNOLOGIES INC. (DE CORPORATION);REEL/FRAME:011722/0048 Effective date: 20010222 |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
AS | Assignment |
Owner name: LUCENT TECHNOLOGIES INC., NEW JERSEY Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENT RIGHTS;ASSIGNOR:JPMORGAN CHASE BANK, N.A. (FORMERLY KNOWN AS THE CHASE MANHATTAN BANK), AS ADMINISTRATIVE AGENT;REEL/FRAME:018590/0287 Effective date: 20061130 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 8 |
|
FPAY | Fee payment |
Year of fee payment: 12 |
|
AS | Assignment |
Owner name: CREDIT SUISSE AG, NEW YORK Free format text: SECURITY INTEREST;ASSIGNOR:ALCATEL-LUCENT USA INC.;REEL/FRAME:030510/0627 Effective date: 20130130 |
|
AS | Assignment |
Owner name: ALCATEL-LUCENT USA INC., NEW JERSEY Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:CREDIT SUISSE AG;REEL/FRAME:033949/0531 Effective date: 20140819 |