EP0741946A1 - Video coding method using subsampling for reducing the frame memory size - Google Patents

Video coding method using subsampling for reducing the frame memory size

Info

Publication number
EP0741946A1
EP0741946A1 EP95940775A EP95940775A EP0741946A1 EP 0741946 A1 EP0741946 A1 EP 0741946A1 EP 95940775 A EP95940775 A EP 95940775A EP 95940775 A EP95940775 A EP 95940775A EP 0741946 A1 EP0741946 A1 EP 0741946A1
Authority
EP
European Patent Office
Prior art keywords
block
frame
output
pels
video data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP95940775A
Other languages
German (de)
French (fr)
Inventor
Ran Xiaonong
Mike Van Scherrenburg
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Semiconductor Corp
Original Assignee
National Semiconductor Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Semiconductor Corp filed Critical National Semiconductor Corp
Publication of EP0741946A1 publication Critical patent/EP0741946A1/en
Withdrawn legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformation in the plane of the image
    • G06T3/40Scaling the whole image or part thereof
    • G06T3/4007Interpolation-based scaling, e.g. bilinear interpolation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/149Data rate or code amount at the encoder output by estimating the code amount by means of a model, e.g. mathematical model or statistical model
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/423Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H04N19/423Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements
    • H04N19/426Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation characterised by memory arrangements using memory downsizing methods
    • H04N19/428Recompression, e.g. by spatial or temporal decimation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/567Motion estimation based on rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/59Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/80Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
    • H04N19/82Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation involving filtering within a prediction loop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output

Definitions

  • This invention relates to the compression of video signals, and more particularly to a method for reducing the required size of a frame buffer used in a video compression system.
  • Video signals for transmission typically originate from video cameras.
  • the bandwidth of these non- compressed video signals is substantial and, consequently, numerous attempts have been made to reduce the bandwidth of the signals for transmission without unduly degrading the images.
  • the frames of video signals are encoded, and redundancies in consecutive frames of the encoded signals are extracted and deleted. Only the differences between consecutive frames are then transmitted.
  • Various techniques are used in the art depending on the particular application.
  • One of the techniques for further reducing the bandwidth of moving images is generally referred to as motion compensated predictive coding.
  • Fig. 1 illustrates one type of conventional video coder which utilizes motion compensation.
  • Video signals in digital form are received at input 10. It will be assumed for Fig.
  • the video data applied to input 10 is in a well known block format wherein blocks of 8x8 picture elements (pels) in an image are sequentially applied to input 10.
  • the pels in a block are applied to input 10 in a raster scan type sequence.
  • a subtractor 12 outputs the difference between the current video signal and a predicted video signal from a predictor 14.
  • Predictor 14 includes a first frame buffer containing the full reconstructed previous frame of video data. This frame buffer is relatively large and expensive since it must typically store at least 352 x 288 pels. If each pel requires 8 bits to encode, then the frame buffer must store about 811K bits. A 1 Mbit buffer is typically used.
  • the difference between the current video frame applied to input 10 and the predicted video frame outputted from predictor 14 is outputted from subtractor 12.
  • the difference signals for an entire block are then transformed by a processor/quantizer 18 to generate transform coefficients using, for example, discrete-cosine transformation, and the coefficients are then quantized.
  • the quantized coefficients are then encoded by a coder 20 to be in a conventional variable-length type code for additional compression of the video signal.
  • the resulting difference signal is then outputted at output 21 for transmission to a receiver.
  • Motion estimator 16 compares the current block of video data received at input 10 with the data in a search window of the previous frame (in predictor 14) to identify that region or block of the reconstructed previous frame which the current block most closely resembles.
  • the search window (e.g., 16x16 pels) takes into account the anticipated worst-case motion of a block from one frame to another.
  • the search window is centered at the same location in the previous frame as the current block location in the current frame.
  • the pels of the current block are successively compared to other pels in a block within the search window to find a matching block.
  • the shift of the current block which gives the best match to the corresponding block in the previous frame is deemed to be the best estimate of the displacement of the block between frames. This best estimate is usually determined based on a mean squared error (MSE) or mean absolute difference (MAD) criteria.
  • MSE mean squared error
  • MAD mean absolute difference
  • This motion vector is also applied to the address generator controlling the addressing of the first frame buffer in predictor 14 so that the block of pels outputted by predictor 14 corresponds to the displaced block of pels applied to input 10.
  • the pels outputted by predictor 14 are motion compensated before being compared with the incoming block of pels, thereby making the predictor 14 output a better prediction of the current frame.
  • Motion estimator 16 may, instead of performing block-matching motion estimation, use a well-known pel recursive technique, which generates motion vectors to minimize the prediction error at each pel. This is more computationally intensive than the block-matching technique.
  • the receiver After the motion vector and coded difference signal are transmitted, the receiver then updates the previous frame (already stored at the receiver) using the transmitted motion vector and difference signal.
  • the frame buffer at the receiver must store the full frame, causing this frame buffer to be large and expensive.
  • the difference signal outputted by processor/quantizer 18 is also fed back to a second frame buffer in either predictor 14 or motion estimator 16 through an inverse processor/quantizer 22 and adder 24.
  • the output of adder 24 is the motion compensated predicted frame in the first frame buffer plus the difference signal.
  • the second frame buffer in predictor 14 or motion estimator 16 now stores essentially the current frame of the video image (identical to that stored at the receiver) , while the first frame buffer still stores the previous frame.
  • This second frame buffer like the first frame buffer, stores a full frame of pels and is consequently large and expensive.
  • the functions of the second frame buffer (now storing the previous frame) and the first frame buffer are reversed, so that the second frame buffer outputs blocks of pels to subtractor 12.
  • the above-described block-matching process is then repeated for the next frame.
  • a signal is transmitted which signifies to the receiver/decoder that it needs to merely repeat the pels of that block from the previous frame to obtain the pels for the current block.
  • a video compression system in accordance with the present invention uses a frame buffer which is only a fraction of the size of a full frame buffer.
  • a subsampler connected to an input of the frame buffer performs 4 to 1 subsampling on the video data to be stored in the frame buffer. This allows the frame buffer to be one-fourth the size of a full frame buffer.
  • the subsampling may even be 9 to 1, 16 to 1, or another ratio, for a concomitant decrease in frame buffer size.
  • An upsampler is connected to the output of the frame buffer for providing interpolated values between the subsample ⁇ . Novel methods of filtering and interpolating performed by the upsampler are described.
  • a new motion estimation technique is also described herein which directly detects the number of bits required to be transmitted to convey the difference between the predicted video data and the current video data, where a fewer number of bits used to convey the difference corresponds to better motion estimation.
  • the search criterion for the best estimate of movement of a block is the minimum number of bits for conveying this difference instead of minimizing the mean squared error (MSE) or mean average difference
  • Fig. l is a block diagram of a prior art video coding system with motion compensation.
  • Fig. 2 is a block diagram of a reduced size video coding/decoding (codec) system with motion estimation.
  • Fig. 3 is a flow chart describing the basic steps performed by the structure of Fig. 2 for carrying out one embodiment of the motion estimation process.
  • Fig. 4 illustrates the basic functional blocks of one embodiment of the motion estimator.
  • Fig. 5 is a block diagram of a preferred embodiment of the video coding/decoding system which uses a smaller predictor frame memory.
  • Fig. 6 illustrates the operation of the subsampler in Fig. 5.
  • Figs. 7A, 7B, 7C, 8A, and 8B illustrate the operation of the upsampler in Fig. 5.
  • Fig. 9 is a block diagram of the subsampler and upsampler of Fig. 5.
  • FIG. 2 A flow chart of the basic steps used in the motion estimation process is shown in Fig. 3.
  • digital video data is applied to input 30 of subtractor 32 in a block format.
  • a suitable block organizer buffer (not shown) organizes the input frame into non-overlapping M x N size blocks of the pel data.
  • One block may be 4x4 pels, 8x8 pels, 16x16 pels, or 32x32 pels, depending on the requirements of the system.
  • Digital signals representing each sequential pel of a block are then applied to input 30 in a well known manner.
  • a second input 34 of subtractor 32 is connected to an output of an optional low pass filter 36, which suppresses high frequency noise.
  • the low pass filter 36 filters the output of a conventional frame buffer 38 which sequentially outputs bytes representing addressed pels in the video frame stored in frame buffer 38.
  • the output of subtractor 32 is applied to a processor 39 which receives the difference signal for each pel in the block.
  • the processor 39 then performs a transform on the difference signals in the block to generate coefficients (e.g., frequency coef icients) representing the difference signals for the block.
  • This transform may be a well known discrete cosine transform or other suitable transform.
  • One other possible transform coding technique is described in U.S. Application Serial No. 08/153,931, entitled "Quadtree-Structured Walsh Transform Coding," incorporated herein by reference.
  • Transform coding is a conventional and well known method to reduce the bandwidth of video signals, and the best coding method would be based on the particular requirements for cost, complexity, and the tolerable signal-to-noise ratio of the recovered signals.
  • Quantizer 40 then quantizes the coefficients to reduce the number of levels.
  • Such a quantizer 40 is conventional and well known.
  • a motion estimator 42 is connected to an output of quantizer 40. Motion estimator 42 is also connected to an address generator for frame buffer 38 via line 45 to cause frame buffer 38 to output a block of pels which has been shifted or motion compensated for input into subtractor 32. If the initial output of quantizer 40 indicates that no significant displacement of the block has taken place between frames, then the transmitted motion vector will be zero, and coder 48 will encode and transmit to the receiver the output of quantizer 40. If motion has taken place, then motion estimator 42 attempts to motion compensate the output of frame buffer 38 to best match the outputted block to the current block being supplied at input 30.
  • the block difference at the output of subtractor 32 of the current block and the best matching block output from filter 36 will be processed by processor 39, quantized by quantizer 40, and then input to coder 48.
  • the bits output by coder 48 along with the code for the best estimate motion vector will be transmitted to the receiver through an appropriate channel 60.
  • frame buffer 38 may output a motion compensated block of pels.
  • Motion estimator 42 addresses a look-up table 44 using the quantized coefficients.
  • the look-up table 44 cross-references each quantized value with the number of bits required to be transmitted to convey that quantized value and then outputs that number to the motion estimator 42.
  • the motion estimator 42 then adds up the total number of bits required to transmit the difference signals for the block. This total number of bits represents the extent of the error between the block of the current frame and the motion compensated output of frame buffer 38.
  • Motion estimator 42 can use conventional block-matching search schemes to repeatedly shift (or motion compensate) the block of pels within a search window outputted by frame buffer 38.
  • Steps 2 and 3 in Fig. 3. For each shift of the block outputted by frame buffer 38 (which stores the previous frame) , the block is compared with a block in the current video frame and this difference is processed by processor 39 and quantized by quantizer 40.
  • Step 4 in Fig. 3. At each iteration, the look-up table 44 in conjunction with the motion estimator 42 determine the total number of bits necessary to transmit this difference.
  • the search scheme which may be a well known full search of a search window or a well known three-step search, the shift that required the fewest number of bits to be transmitted is chosen as the best estimate shift.
  • the number of bits required to be outputted by the video coder is used as the best estimate search criterion.
  • the iterative process is stopped when the current guantized difference (as measured by the number of bits required to convey the difference) is not smaller than the difference from the previous step.
  • various block displacements are first performed, and the displacement resulting in the smallest difference is then selected as the best estimate.
  • the total number of bits, as determined by the outputs of the look-up table 44, may be used by the motion estimator 42 in adaptively modifying the search strategy performed by the motion estimator 42, such as would be the procedure using a 3- step process.
  • a 3-step process is described by Musmann et al. in the article "Advances in Picture
  • the chosen best estimate shift is then applied to the address generator for frame buffer 38 via line 45, and the outputted motion compensated block is compared to the corresponding block in the current frame at input 30. (Step 9 of Fig. 3.)
  • the quantized output of quantizer 40 representing the resulting minimum difference signal is then outputted on line 46 into coder 48.
  • Coder 48 changes the quantized difference into a suitable variable length code, such as a Hoffman code. This code, which will be a minimum number of bits, is then transmitted to a receiver/decoder. (Step 10 in Fig. 3.)
  • the stored look-up table 44 values would depend on the particular quantizer 40 and coder 48 used.
  • the quantized difference signal applied to coder 48 for transmission is also applied to an input of an inverse quantizer 49.
  • the output of inverse quantizer 49 is then applied to the input of an inverse processor 50, which performs an inverse transform on the difference signals.
  • the output of inverse processor 50 is applied to an input 51 of adder 52 which has another input 53 connected to the output of filter 36.
  • the output of adder 52 on line 54 is connected to an input of frame buffer 38.
  • This output of adder 52 will thus be the motion compensated block from the previous frame, as corrected by the difference signal, corresponding to the current block.
  • this motion compensated block is then stored in a second frame buffer within frame buffer 38 so that the second frame buffer will eventually store the current frame.
  • the first frame buffer in frame buffer 38 continues to be used as the source of motion compensated blocks from the previous frame for application to subtractor 32.
  • the functions of the first frame buffer and second frame buffer are reversed, using a multiplexer, so that the second frame buffer now supplies the motion compensated blocks of the previous frame to substractor 32.
  • a motion vector corresponding to the best estimate shift, is outputted by motion estimator 42 on line 56 for transmission to a receiver.
  • This motion vector is the same vector which minimized the number of bits outputted by coder 48.
  • the receiver/decoder is also show in Fig. 2.
  • the difference signal outputted by coder 48 is transmitted over channel 60 and decoded by a suitable decoder 62.
  • the decoded difference signal is then applied to an inverse quantizer 63.
  • the resulting signal is then applied to an inverse transform processor 64.
  • the output of processor 64 is applied to an input 65 of adder 66.
  • the transmitted motion vector on line 56 is received on line 72 and used to displace the corresponding block of video data in frame buffer 67 (which contains a previous frame) to match the position of that block in the current frame.
  • This motion compensated video data in frame buffer 67 is then applied to a second input 74 of adder 66, via low pass filter 68, such that the output of adder 66 en line 70 is the reconstructed motion video data.
  • This reconstructed data is also fed back into frame buffer 67 to cause frame buffer 67 to eventually contain the current frame.
  • This reconstructed data on line 70 may be applied to a video screen via suitable conversion circuitry.
  • a difference signal and zero motion vector are transmitted which signifies to the receiver/decoder that it needs to merely repeat the pels of that block from the previous frame to obtain the pels for the current block.
  • a determination of non-movement of a block may be that the total number of bits identified by look-up table 44 is below a threshold number.
  • Fig. 4 is a simplified block diagram illustrating certain functional elements of one embodiment of the motion estimator 42. Numerous other implementations would be apparent to those skilled in the art.
  • the output of quantizer 40 is applied as an address to look-up table 44.
  • Look-up table 44 outputs the number of bits corresponding to the output of quantizer 40.
  • Adder 80 totals the number of bits representing the block of data. This total is applied to one input of a comparator 82.
  • a memory 84 stores the smallest total count thus far outputted by adder 80 for the present block. This smallest total count is applied to a second input of comparator 82.
  • comparator 82 When comparator 82 detects a smaller count outputted by adder 80, the count and the corresponding motion vector are then stored in memory 84.
  • the output of comparator 82 may be connected to a write enable port of the memory 84.
  • a conventional block shifting routine is carried out by block shifter 86, which incrementally shifts the blocks outputted by frame buffer 38 (Fig. 2) .
  • Block shifter 86 may be an ASIC.
  • the block shifter 86 causes the frame buffer 38 to output the block corresponding to that shift.
  • a control signal generated by controller 88 closes switch 90, which connects the output of quantizer 40 to coder 48.
  • the motion vector corresponding to the best estimate shift is outputted on line 56.
  • coder 48 transmits the current difference signal.
  • coder 48 is controlled to output a very low bandwidth signal to the receiver indicating such non-movement. Detecting no significant block movement may be accomplished in one embodiment by comparing the initial output of adder 80 to a threshold number, using comparator 91. If the threshold number is not exceeded, the comparator 91 controls controller 88 to close switch 90 or controls coder 48 to output a signal indicating that there has been no movement of the block.
  • Fig. 5 illustrates a preferred embodiment of a video codec which includes a subsampler 94 connected to the input of a predictor 96.
  • Predictor 96 has the same functions as the frame buffer 38 shown in Fig. 2.
  • Subsampler 94 reduces the rate at which video data outputted by adder 52 is stored in predictor 96. This reduces the storage requirements and bandwidth requirements of predictor 96, thus reducing hardware costs.
  • subsampler 94 samples every other horizontal pel and every other row of pels to provide 4:1 sampling.
  • the frame buffer in predictor 96 need be only one-fourth the size of frame buffer 38 in Fig. 2, resulting in considerable cost savings and reducing the size of the video codec.
  • An upsampler 98 is connected to the output of predictor 96 for generating interpolated data between the successive samples stored in predictor 96.
  • the output of upsampler 98 will be similar to that outputted by frame buffer 38 in Fig. 2. No filtering is performed in the embodiment of Fig. 5.
  • the receiver/decoder portion on the right side of Fig. 5 includes a similar subsampler 100, predictor 101, and upsampler 102.
  • the remainder of the receiver/decoder is identical to that of Fig. 2.
  • the size of a block processed by the system of Fig. 5 would normally be 4x4, 8x8, 16x16, or 32x32 pels. In the following description of the subsampler and upsampler operations, the block size 8x8 is assumed. The invention can be applied to other sizes in a similar manner.
  • the 4x4 block is identified by numeral 104 in Fig. 6. This means that the subsampler 94 causes every other horizontal pel in every other row in the 8x8 block 103 to be stored in the frame buffer 96.
  • the frame buffer 96 or 101 in Fig. 5 needs to be only one-fourth of the size of the conventional predictor frame buffer 38 or 67 in Fig. 2.
  • the upsampler 98 and 102 in the encoder or the decoder reads the corresponding 4x4 block 104. output from the predictor frame buffer 96 or 101 and performs a linear upsampling operation as illustrated by Figs. 7A, 7B, and 7C.
  • the linear upsampling operation of the upsampler 98 and 102 is performed in three steps. In Fig. 7A, some of the pels in the 8x8 block 105 output by the upsampler are directly replaced with the corresponding pels of the 4x4 block 104 from the predictor frame buffer 96 or 101.
  • Fig. 7B The second step performed by the upsampler 98 or 102 is shown in Fig. 7B.
  • step 7B a linear interpolation in the horizontal direction is performed to fill in some of the missing pels of the output 8x8 block 105.
  • two facing pels e.g., pels 106 and 107 in the output 4x4 block 104 (Fig. 7A) along a horizontal row are averaged together, and this average pel value is inserted (e.g., at 108) between the two facing pels.
  • Fig. 7C The third step performed by the upsampler 98 or 102 is shown in Fig. 7C where the linear interpolations are then performed in the vertical direction to fill in the remaining missing pels of the output 8x8 block 105.
  • facing pels e.g., 109, 110
  • this average pel value is inserted (e.g. , at 111) between the facing pels.
  • FIG. 8A and 8B In addition to the linear upsainpling operation described with respect to Figs. 7A, 7B, and 7C, another operation by upsampler 98 or 102 is applied to blocks with non-zero motion, as illustrated in Figs. 8A and 8B.
  • the operation of Figs. 8A and 8B provides a certain amount of filtering of the pel data output by the predictor frame buffer 96 or 101 and achieves the effect of loop filtering which is commonly used in conventional video codecs.
  • the loop filtering helps avoid any visual artifacts arising from the block motion compensation process.
  • This loop filtering operation of the upsampler 98 or 102 is performed in two steps.
  • the first step takes some of the pels (e.g., 106, 107 in Fig. 7A) which were directly copied from the 4x4 input block 104 and filters these pels in a horizontal direction.
  • the previously averaged pel values e.g., 108, 112 facing each other in the horizontal direction, calculated in Fig. 7B, are now averaged together to create new pel values, such as value 113.
  • These new values e.g. 113) then replace the original pel value (e.g., 107) between the two averaged pel values (e.g., 108, 112).
  • the loop filtering of Figs. 8A and 8B slightly blurs the block when movement is detected so that the boundaries between adjacent blocks will not be noticeable.
  • the general structure of the subsampler 94/100 and upsampler 98/102 is shown in Fig. 9.
  • the pels of the 8x8 input block (103 in Fig. 6) are received in the following order: B(0,0), B(0,l), B(0,2), ..., B(0,7), B(1,0), B(l, 1) , ..., B(l,7), ..., B(7,0), B(7,l), ..., B(7,7) .
  • Subsampler 94 or 100 is shown as address generator 120 in Fig. 9 where the address generator 120 generates sequential addresses and write-enable signals for the predictor frame buffer 96.
  • the address generator 120 At the start of block data writing, the address generator 120 generates the initial address for B(0,0). Then the address generator 120 sequentially updates the address but only asserts the write-enable signal every other clock cycle. From B(1,0) to B(l,7) , the address generator 120 does not assert write-enable. After B(l,7) , the address generator 120 generates the initial address for B(2,0), and then asserts write-enable every other clock cycle. The above procedure is then repeated similarly for the rest of the pel rows shown in Fig. 7A.
  • a dual port memory 122 of 8x8 size is needed, which can be the same on- chip memory for the transformation and quantization operations included in either the processor 39 or quantizer 40 in Fig. 5.
  • the linear interpolation steps shown in Figs. 7B and 7C are implemented using pel delay 124, adder 126, divide-by-half shifter 128, multiplexer 130, and the address generator/mux controller 132.
  • adder 126 adds a present pel value from the predictor frame buffer 96 and a previous pel value outputted by pel delay 124.
  • the divide-by-two shifter 128 then takes an average of these two pel values and applies this average to multiplexer 130.
  • Multiplexer 130 is controlled by controller 132 to alternately store the pel value from the predictor frame buffer 96 followed by the averaged pel value as shown in Fig. 7B.
  • the pel values now stored in block memory 122 are then applied to the input of the pel delay 124 in a vertical raster scan order.
  • the pel values are now interpolated vertically as shown in Fig. 7C.
  • the loop filtering is performed by pel delay 134, adder 136, adder 138, and divide-by-four shifter 140, in conjunction with the remainder of the circuit, if movement of a block of pels is detected.
  • the loop filtering for the horizontal pels in a row, shown in Fig. 8A, is performed before the loop filtering in the vertical direction, shown in Fig. 8B.
  • the divide-by-four shifter 140 outputs those filtered pel values indicated by the merging arrows in Figs. 8A and 8B.
  • One skilled in the art would readily understand the operation of the circuit of Fig. 9 and the various components necessary to implement this circuit.
  • the RGB or YUV (or other format) signals may be processed separately using the same codec or may be processed in parallel, which would be readily understood by those skilled in the art.

Abstract

A video compression system in accordance with the present invention may use a frame buffer which is only a fraction of the size of a full frame buffer. A subsampler connected to an input of the frame buffer performs 4 to 1 subsampling on the video data to be stored in the frame buffer. This allows the frame buffer to be one-fourth the size of a full frame buffer. The subsampling may even be 9 to 1, 16 to 1, or another ratio, for a concomitant decrease in frame buffer size. An upsampler is connected to the output of the frame buffer for providing interpolated and filtered values between the subsamples. Novel methods of filtering and interpolating performed by the upsampler are described. A new motion estimation technique is also described herein which directly detects the number of bits required to be transmitted to convey the difference between the predicted video data and the current video data, where a fewer number of bits used to convey the difference corresponds to better motion estimation. The search criterion for the best estimate of movement of a block is the minimum number of bits for conveying this difference.

Description

VIDEO CODING METHOD USING SUBSAMPLING FOR REDUCING THE FRAME MEMORY SIZE
FIELD OF THE INVENTION
This invention relates to the compression of video signals, and more particularly to a method for reducing the required size of a frame buffer used in a video compression system.
BACKGROUND OF THE INVENTION Video signals for transmission typically originate from video cameras. The bandwidth of these non- compressed video signals is substantial and, consequently, numerous attempts have been made to reduce the bandwidth of the signals for transmission without unduly degrading the images. Typically, to reduce bandwidth, the frames of video signals are encoded, and redundancies in consecutive frames of the encoded signals are extracted and deleted. Only the differences between consecutive frames are then transmitted. Various techniques are used in the art depending on the particular application. One of the techniques for further reducing the bandwidth of moving images is generally referred to as motion compensated predictive coding. Fig. 1 illustrates one type of conventional video coder which utilizes motion compensation. Video signals in digital form are received at input 10. It will be assumed for Fig. 1 that the video data applied to input 10 is in a well known block format wherein blocks of 8x8 picture elements (pels) in an image are sequentially applied to input 10. The pels in a block are applied to input 10 in a raster scan type sequence. A subtractor 12 outputs the difference between the current video signal and a predicted video signal from a predictor 14. Predictor 14 includes a first frame buffer containing the full reconstructed previous frame of video data. This frame buffer is relatively large and expensive since it must typically store at least 352 x 288 pels. If each pel requires 8 bits to encode, then the frame buffer must store about 811K bits. A 1 Mbit buffer is typically used.
Thus, the difference between the current video frame applied to input 10 and the predicted video frame outputted from predictor 14 is outputted from subtractor 12. The difference signals for an entire block are then transformed by a processor/quantizer 18 to generate transform coefficients using, for example, discrete-cosine transformation, and the coefficients are then quantized. The quantized coefficients are then encoded by a coder 20 to be in a conventional variable-length type code for additional compression of the video signal. The resulting difference signal is then outputted at output 21 for transmission to a receiver. Motion estimator 16 compares the current block of video data received at input 10 with the data in a search window of the previous frame (in predictor 14) to identify that region or block of the reconstructed previous frame which the current block most closely resembles. The search window (e.g., 16x16 pels) takes into account the anticipated worst-case motion of a block from one frame to another. The search window is centered at the same location in the previous frame as the current block location in the current frame. Within this search window, the pels of the current block are successively compared to other pels in a block within the search window to find a matching block. The shift of the current block which gives the best match to the corresponding block in the previous frame is deemed to be the best estimate of the displacement of the block between frames. This best estimate is usually determined based on a mean squared error (MSE) or mean absolute difference (MAD) criteria. The amount of this best estimate shift, called the motion vector, is then transmitted to the receiver/decoder,
This motion vector is also applied to the address generator controlling the addressing of the first frame buffer in predictor 14 so that the block of pels outputted by predictor 14 corresponds to the displaced block of pels applied to input 10. Thus, the pels outputted by predictor 14 are motion compensated before being compared with the incoming block of pels, thereby making the predictor 14 output a better prediction of the current frame. This results in the difference outputted by subtractor 12 being, on average, smaller, and permits the coder 20 to encode the picture using a lower bit rate than would otherwise be the case.
Motion estimator 16 may, instead of performing block-matching motion estimation, use a well-known pel recursive technique, which generates motion vectors to minimize the prediction error at each pel. This is more computationally intensive than the block-matching technique.
After the motion vector and coded difference signal are transmitted, the receiver then updates the previous frame (already stored at the receiver) using the transmitted motion vector and difference signal. The frame buffer at the receiver must store the full frame, causing this frame buffer to be large and expensive. The difference signal outputted by processor/quantizer 18 is also fed back to a second frame buffer in either predictor 14 or motion estimator 16 through an inverse processor/quantizer 22 and adder 24. The output of adder 24 is the motion compensated predicted frame in the first frame buffer plus the difference signal. Thus, the second frame buffer in predictor 14 or motion estimator 16 now stores essentially the current frame of the video image (identical to that stored at the receiver) , while the first frame buffer still stores the previous frame. This second frame buffer, like the first frame buffer, stores a full frame of pels and is consequently large and expensive. For the next frame applied to input 10, the functions of the second frame buffer (now storing the previous frame) and the first frame buffer are reversed, so that the second frame buffer outputs blocks of pels to subtractor 12. The above-described block-matching process is then repeated for the next frame.
If a determination is made that the pels of the block have not changed as compared the previous frame (which is usually the case) , a signal is transmitted which signifies to the receiver/decoder that it needs to merely repeat the pels of that block from the previous frame to obtain the pels for the current block.
The above-described video compression system is relatively large and expensive due to the complex circuitry needed to calculate the best estimate block shift and due to the need for large frame buffers capable of storing an entire frame. What is needed is a video compression system which is less expensive, smaller, and simpler than conventional video compression systems. SUMMARY
A video compression system in accordance with the present invention uses a frame buffer which is only a fraction of the size of a full frame buffer. A subsampler connected to an input of the frame buffer performs 4 to 1 subsampling on the video data to be stored in the frame buffer. This allows the frame buffer to be one-fourth the size of a full frame buffer. The subsampling may even be 9 to 1, 16 to 1, or another ratio, for a concomitant decrease in frame buffer size.
An upsampler is connected to the output of the frame buffer for providing interpolated values between the subsampleε. Novel methods of filtering and interpolating performed by the upsampler are described. A new motion estimation technique is also described herein which directly detects the number of bits required to be transmitted to convey the difference between the predicted video data and the current video data, where a fewer number of bits used to convey the difference corresponds to better motion estimation. The search criterion for the best estimate of movement of a block is the minimum number of bits for conveying this difference instead of minimizing the mean squared error (MSE) or mean average difference
(MAD) . Thus, complex calculations involving MSD or MAD are avoided. This further reduces the size of the video compression system.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. l is a block diagram of a prior art video coding system with motion compensation.
Fig. 2 is a block diagram of a reduced size video coding/decoding (codec) system with motion estimation. Fig. 3 is a flow chart describing the basic steps performed by the structure of Fig. 2 for carrying out one embodiment of the motion estimation process.
Fig. 4 illustrates the basic functional blocks of one embodiment of the motion estimator.
Fig. 5 is a block diagram of a preferred embodiment of the video coding/decoding system which uses a smaller predictor frame memory.
Fig. 6 illustrates the operation of the subsampler in Fig. 5.
Figs. 7A, 7B, 7C, 8A, and 8B illustrate the operation of the upsampler in Fig. 5.
Fig. 9 is a block diagram of the subsampler and upsampler of Fig. 5.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
One embodiment of a video codec is shown in Fig. 2. A flow chart of the basic steps used in the motion estimation process is shown in Fig. 3. In Fig. 2, digital video data is applied to input 30 of subtractor 32 in a block format. (Step 1 of Fig. 3.) A suitable block organizer buffer (not shown) organizes the input frame into non-overlapping M x N size blocks of the pel data. One block may be 4x4 pels, 8x8 pels, 16x16 pels, or 32x32 pels, depending on the requirements of the system. Digital signals representing each sequential pel of a block are then applied to input 30 in a well known manner.
A second input 34 of subtractor 32 is connected to an output of an optional low pass filter 36, which suppresses high frequency noise. The low pass filter 36 filters the output of a conventional frame buffer 38 which sequentially outputs bytes representing addressed pels in the video frame stored in frame buffer 38. The output of subtractor 32 is applied to a processor 39 which receives the difference signal for each pel in the block. The processor 39 then performs a transform on the difference signals in the block to generate coefficients (e.g., frequency coef icients) representing the difference signals for the block. This transform may be a well known discrete cosine transform or other suitable transform. One other possible transform coding technique is described in U.S. Application Serial No. 08/153,931, entitled "Quadtree-Structured Walsh Transform Coding," incorporated herein by reference.
Transform coding is a conventional and well known method to reduce the bandwidth of video signals, and the best coding method would be based on the particular requirements for cost, complexity, and the tolerable signal-to-noise ratio of the recovered signals.
Quantizer 40 then quantizes the coefficients to reduce the number of levels. Such a quantizer 40 is conventional and well known.
A motion estimator 42 is connected to an output of quantizer 40. Motion estimator 42 is also connected to an address generator for frame buffer 38 via line 45 to cause frame buffer 38 to output a block of pels which has been shifted or motion compensated for input into subtractor 32. If the initial output of quantizer 40 indicates that no significant displacement of the block has taken place between frames, then the transmitted motion vector will be zero, and coder 48 will encode and transmit to the receiver the output of quantizer 40. If motion has taken place, then motion estimator 42 attempts to motion compensate the output of frame buffer 38 to best match the outputted block to the current block being supplied at input 30. The block difference at the output of subtractor 32 of the current block and the best matching block output from filter 36 will be processed by processor 39, quantized by quantizer 40, and then input to coder 48. The bits output by coder 48 along with the code for the best estimate motion vector will be transmitted to the receiver through an appropriate channel 60.
Other techniques may be used to cause frame buffer 38 to output a motion compensated block of pels.
Motion estimator 42 addresses a look-up table 44 using the quantized coefficients. The look-up table 44 cross-references each quantized value with the number of bits required to be transmitted to convey that quantized value and then outputs that number to the motion estimator 42. The motion estimator 42 then adds up the total number of bits required to transmit the difference signals for the block. This total number of bits represents the extent of the error between the block of the current frame and the motion compensated output of frame buffer 38.
Motion estimator 42 can use conventional block-matching search schemes to repeatedly shift (or motion compensate) the block of pels within a search window outputted by frame buffer 38. (Steps 2 and 3 in Fig. 3.) For each shift of the block outputted by frame buffer 38 (which stores the previous frame) , the block is compared with a block in the current video frame and this difference is processed by processor 39 and quantized by quantizer 40. (Step 4 in Fig. 3.) At each iteration, the look-up table 44 in conjunction with the motion estimator 42 determine the total number of bits necessary to transmit this difference. (Step 5 in Fig. 3. ) At the end of the search scheme, which may be a well known full search of a search window or a well known three-step search, the shift that required the fewest number of bits to be transmitted is chosen as the best estimate shift. (Steps 6, 7 and 8 of Fig. 3.) Thus, instead of calculating mean square error or mean absolute difference at each iteration to determine the error between the motion compensated block and the current block applied to input 30, the number of bits required to be outputted by the video coder is used as the best estimate search criterion. In one type of searching scheme, the iterative process is stopped when the current guantized difference (as measured by the number of bits required to convey the difference) is not smaller than the difference from the previous step. In another search scheme, various block displacements are first performed, and the displacement resulting in the smallest difference is then selected as the best estimate.
In one embodiment, the total number of bits, as determined by the outputs of the look-up table 44, may be used by the motion estimator 42 in adaptively modifying the search strategy performed by the motion estimator 42, such as would be the procedure using a 3- step process. Such a 3-step process .is described by Musmann et al. in the article "Advances in Picture
Coding," Proceedings of the IEEE, Vol. 73, pp. 523-548, April, 1985. Other searches may use a logarithmetic 2- D search, such as described by Musmann et al. The book "Digital Pictures," by Natravali et al., Plenum Press, 1988, describes additional search techniques.
The chosen best estimate shift is then applied to the address generator for frame buffer 38 via line 45, and the outputted motion compensated block is compared to the corresponding block in the current frame at input 30. (Step 9 of Fig. 3.)
The quantized output of quantizer 40 representing the resulting minimum difference signal is then outputted on line 46 into coder 48. Coder 48 changes the quantized difference into a suitable variable length code, such as a Hoffman code. This code, which will be a minimum number of bits, is then transmitted to a receiver/decoder. (Step 10 in Fig. 3.)
The stored look-up table 44 values would depend on the particular quantizer 40 and coder 48 used. The quantized difference signal applied to coder 48 for transmission is also applied to an input of an inverse quantizer 49. The output of inverse quantizer 49 is then applied to the input of an inverse processor 50, which performs an inverse transform on the difference signals. The output of inverse processor 50 is applied to an input 51 of adder 52 which has another input 53 connected to the output of filter 36. The output of adder 52 on line 54 is connected to an input of frame buffer 38. This output of adder 52 will thus be the motion compensated block from the previous frame, as corrected by the difference signal, corresponding to the current block. In one embodiment, this motion compensated block is then stored in a second frame buffer within frame buffer 38 so that the second frame buffer will eventually store the current frame.
The first frame buffer in frame buffer 38 continues to be used as the source of motion compensated blocks from the previous frame for application to subtractor 32. For the next frame to be applied to input 30, the functions of the first frame buffer and second frame buffer are reversed, using a multiplexer, so that the second frame buffer now supplies the motion compensated blocks of the previous frame to substractor 32.
Other ways to store a current frame in frame buffer 38 may also be used. Such methods may simply update portions of a single frame buffer with motion compensated data once those portions of the frame buffer have already been addressed for providing a motion compensated block to subtractor 32. This would avoid the need for two frame buffers. Such techniques are well known in the art.
A motion vector, corresponding to the best estimate shift, is outputted by motion estimator 42 on line 56 for transmission to a receiver. This motion vector is the same vector which minimized the number of bits outputted by coder 48.
The receiver/decoder is also show in Fig. 2. The difference signal outputted by coder 48 is transmitted over channel 60 and decoded by a suitable decoder 62. The decoded difference signal is then applied to an inverse quantizer 63. The resulting signal is then applied to an inverse transform processor 64. The output of processor 64 is applied to an input 65 of adder 66.
The transmitted motion vector on line 56 is received on line 72 and used to displace the corresponding block of video data in frame buffer 67 (which contains a previous frame) to match the position of that block in the current frame. This motion compensated video data in frame buffer 67 is then applied to a second input 74 of adder 66, via low pass filter 68, such that the output of adder 66 en line 70 is the reconstructed motion video data. This reconstructed data is also fed back into frame buffer 67 to cause frame buffer 67 to eventually contain the current frame. This reconstructed data on line 70 may be applied to a video screen via suitable conversion circuitry. If a determination is made by the motion estimator 42 that the pels of a block have not significantly changed as compared with the previous frame (which is usually the case) , a difference signal and zero motion vector are transmitted which signifies to the receiver/decoder that it needs to merely repeat the pels of that block from the previous frame to obtain the pels for the current block. Such a determination of non-movement of a block may be that the total number of bits identified by look-up table 44 is below a threshold number. Fig. 4 is a simplified block diagram illustrating certain functional elements of one embodiment of the motion estimator 42. Numerous other implementations would be apparent to those skilled in the art.
The output of quantizer 40 is applied as an address to look-up table 44. Look-up table 44 outputs the number of bits corresponding to the output of quantizer 40. Adder 80 totals the number of bits representing the block of data. This total is applied to one input of a comparator 82. A memory 84 stores the smallest total count thus far outputted by adder 80 for the present block. This smallest total count is applied to a second input of comparator 82.
When comparator 82 detects a smaller count outputted by adder 80, the count and the corresponding motion vector are then stored in memory 84. The output of comparator 82 may be connected to a write enable port of the memory 84.
A conventional block shifting routine is carried out by block shifter 86, which incrementally shifts the blocks outputted by frame buffer 38 (Fig. 2) . Block shifter 86 may be an ASIC. When the minimum total count and best estimate shift have been determined, the block shifter 86 causes the frame buffer 38 to output the block corresponding to that shift. A control signal generated by controller 88 closes switch 90, which connects the output of quantizer 40 to coder 48. The motion vector corresponding to the best estimate shift is outputted on line 56.
If the initial output of quantizer 40 indicates that no significant movement of the block has taken place since the last frame, a zero motion vector is transmitted and coder 48 transmits the current difference signal. In one embodiment, coder 48 is controlled to output a very low bandwidth signal to the receiver indicating such non-movement. Detecting no significant block movement may be accomplished in one embodiment by comparing the initial output of adder 80 to a threshold number, using comparator 91. If the threshold number is not exceeded, the comparator 91 controls controller 88 to close switch 90 or controls coder 48 to output a signal indicating that there has been no movement of the block.
Fig. 5 illustrates a preferred embodiment of a video codec which includes a subsampler 94 connected to the input of a predictor 96. Predictor 96 has the same functions as the frame buffer 38 shown in Fig. 2. Subsampler 94 reduces the rate at which video data outputted by adder 52 is stored in predictor 96. This reduces the storage requirements and bandwidth requirements of predictor 96, thus reducing hardware costs. In one embodiment, subsampler 94 samples every other horizontal pel and every other row of pels to provide 4:1 sampling. Thus, the frame buffer in predictor 96 need be only one-fourth the size of frame buffer 38 in Fig. 2, resulting in considerable cost savings and reducing the size of the video codec. An upsampler 98 is connected to the output of predictor 96 for generating interpolated data between the successive samples stored in predictor 96. The output of upsampler 98 will be similar to that outputted by frame buffer 38 in Fig. 2. No filtering is performed in the embodiment of Fig. 5.
The remainder of the circuit of Fig. 5 is identical to that of Fig. 2.
The receiver/decoder portion on the right side of Fig. 5 includes a similar subsampler 100, predictor 101, and upsampler 102. The remainder of the receiver/decoder is identical to that of Fig. 2.
The size of a block processed by the system of Fig. 5 would normally be 4x4, 8x8, 16x16, or 32x32 pels. In the following description of the subsampler and upsampler operations, the block size 8x8 is assumed. The invention can be applied to other sizes in a similar manner.
Since the blocks are individually written into the predictor frame buffer 96, the aliasing effects for the pels across the block boundaries cannot be reduced. Thus, no subsampling filtering is applied. This leads to a simple decimation operation performed in subsamplers 94 and 100 of Fig. 5. The subsampling operation is illustrated in Fig. 6, where pel values are located at the intersections of the two grids. In Fig. 6, if the input 8x8 block 103 to the subsampler 94 or 100 is denoted by B(i,j) for i,j = 0, 1, ... 7, where i is the row and j is the pixel along that row, then the output 4x4 block S(i,j) = B(2i,2j) for i,j = 0, 1, 2, 3. The 4x4 block is identified by numeral 104 in Fig. 6. This means that the subsampler 94 causes every other horizontal pel in every other row in the 8x8 block 103 to be stored in the frame buffer 96. Thus, the frame buffer 96 or 101 in Fig. 5 needs to be only one-fourth of the size of the conventional predictor frame buffer 38 or 67 in Fig. 2.
To obtain a predicted 8x8 block for application to subtractor 32 or adder 66 in Fig. 5, the upsampler 98 and 102 in the encoder or the decoder reads the corresponding 4x4 block 104. output from the predictor frame buffer 96 or 101 and performs a linear upsampling operation as illustrated by Figs. 7A, 7B, and 7C. The linear upsampling operation of the upsampler 98 and 102 is performed in three steps. In Fig. 7A, some of the pels in the 8x8 block 105 output by the upsampler are directly replaced with the corresponding pels of the 4x4 block 104 from the predictor frame buffer 96 or 101. More specifically, if the output 8x8 block 105 of the upsampler 98 or 102 is denoted by C(i,j) for i,j = 0, 1, ... 7, and the input 4x4 block 104 from the predictor frame buffer 96 or 101 is denoted as T(i,j) for i,j = 0, 1, 2, 3, then the step performed in Fig. 7A is to set C(2i,2j) = T(i,j) , for i,j = 0, 1, 2, 3.
The second step performed by the upsampler 98 or 102 is shown in Fig. 7B. In step 7B, a linear interpolation in the horizontal direction is performed to fill in some of the missing pels of the output 8x8 block 105. In Fig. 7B, two facing pels (e.g., pels 106 and 107) in the output 4x4 block 104 (Fig. 7A) along a horizontal row are averaged together, and this average pel value is inserted (e.g., at 108) between the two facing pels. This is illustrated in Fig. 7B by arrows from two facing pels 106 and 107 merging into the averaged pel value inserted between the facing pels. Note how pels in the penultimate position in a row are simply replicated into the last position of the row. More specifically, C(i,j) = [C(i,j - 1) + C(i,j + l)]/2, for i = 0, 2, 4, 6 and j = 1, 3, 5, and C(i,7) = C(i,6) for i = 0, 2, 4, 6.
The third step performed by the upsampler 98 or 102 is shown in Fig. 7C where the linear interpolations are then performed in the vertical direction to fill in the remaining missing pels of the output 8x8 block 105. As seen from Fig. 7C, facing pels (e.g., 109, 110) along the vertical direction are averaged together, and this average pel value is inserted (e.g. , at 111) between the facing pels. Note how pels in the bottom row are simply replications of those pels from the penultimate row. More specifically, C(i,j) = [C(i-1, j) + C(i+l,j) ]/2, for i = 1, 3, 5, and j = 0, 1, ... 7, and C(7,j) = C(6,j) for j = 0, 1, ... 7.
Only the above-described linear interpolation (Figs. 7B and 7C) is performed by the upsampler 98 or 102 on those blocks for which zero movement is detected. Zero movement is detected, as previously described, by the quantized difference signal being below a threshold value. When such zero movement is detected, the resulting 8x8 block 105 of pels output from upsampler 98 or 102 is applied to downstream circuitry as shown in Fig. 5.
In addition to the linear upsainpling operation described with respect to Figs. 7A, 7B, and 7C, another operation by upsampler 98 or 102 is applied to blocks with non-zero motion, as illustrated in Figs. 8A and 8B. The operation of Figs. 8A and 8B provides a certain amount of filtering of the pel data output by the predictor frame buffer 96 or 101 and achieves the effect of loop filtering which is commonly used in conventional video codecs. The loop filtering helps avoid any visual artifacts arising from the block motion compensation process. This loop filtering operation of the upsampler 98 or 102 is performed in two steps.
The first step, illustrated in Fig. 8A, takes some of the pels (e.g., 106, 107 in Fig. 7A) which were directly copied from the 4x4 input block 104 and filters these pels in a horizontal direction. The previously averaged pel values (e.g., 108, 112) facing each other in the horizontal direction, calculated in Fig. 7B, are now averaged together to create new pel values, such as value 113. These new values (e.g. 113) then replace the original pel value (e.g., 107) between the two averaged pel values (e.g., 108, 112). The pel values (e.g., 109) in the left most pel position of a row are not used in this calculation. More specifically, C(i,j) = [C(i,j - 1) + 2C(i,j) + C(i,j + i)]/4, for i = o, 2, 4, 6, 7 and j = 2, 4, 6.
The second step performed in the upsampler 98 or 102 for loop filtering is illustrated in Fig. 8B where some of the pels which were directly copied from the 4x4 input block 104 in Fig. 7A are filtered in the vertical direction. This is performed by averaging two pel values (e.g. , Ill, 114) along the vertical direction and replacing the middle original pel value (e.g., 110 in Fig. 7C) with this averaged pel value (e.g., at 115) . More speci ically, C(i,j) = (C(i - 1, j) + 2C(i,j) + C(i + 1, j)]/4, for i = 2, 4, 6 and j = 0, 2, 4, 6, 7. The loop filtering of Figs. 8A and 8B slightly blurs the block when movement is detected so that the boundaries between adjacent blocks will not be noticeable.
The general structure of the subsampler 94/100 and upsampler 98/102 is shown in Fig. 9. The pels of the 8x8 input block (103 in Fig. 6) are received in the following order: B(0,0), B(0,l), B(0,2), ..., B(0,7), B(1,0), B(l, 1) , ..., B(l,7), ..., B(7,0), B(7,l), ..., B(7,7) .
Subsampler 94 or 100 is shown as address generator 120 in Fig. 9 where the address generator 120 generates sequential addresses and write-enable signals for the predictor frame buffer 96. At the start of block data writing, the address generator 120 generates the initial address for B(0,0). Then the address generator 120 sequentially updates the address but only asserts the write-enable signal every other clock cycle. From B(1,0) to B(l,7) , the address generator 120 does not assert write-enable. After B(l,7) , the address generator 120 generates the initial address for B(2,0), and then asserts write-enable every other clock cycle. The above procedure is then repeated similarly for the rest of the pel rows shown in Fig. 7A.
For the upsampler 98 or 102, a dual port memory 122 of 8x8 size is needed, which can be the same on- chip memory for the transformation and quantization operations included in either the processor 39 or quantizer 40 in Fig. 5. The linear interpolation steps shown in Figs. 7B and 7C are implemented using pel delay 124, adder 126, divide-by-half shifter 128, multiplexer 130, and the address generator/mux controller 132. For horizontal interpolation, adder 126 adds a present pel value from the predictor frame buffer 96 and a previous pel value outputted by pel delay 124. The divide-by-two shifter 128 then takes an average of these two pel values and applies this average to multiplexer 130. Multiplexer 130 is controlled by controller 132 to alternately store the pel value from the predictor frame buffer 96 followed by the averaged pel value as shown in Fig. 7B. For the vertical interpolation shown in Fig. 7C, the pel values now stored in block memory 122 are then applied to the input of the pel delay 124 in a vertical raster scan order. The pel values are now interpolated vertically as shown in Fig. 7C. The loop filtering is performed by pel delay 134, adder 136, adder 138, and divide-by-four shifter 140, in conjunction with the remainder of the circuit, if movement of a block of pels is detected.
The loop filtering for the horizontal pels in a row, shown in Fig. 8A, is performed before the loop filtering in the vertical direction, shown in Fig. 8B. The divide-by-four shifter 140 outputs those filtered pel values indicated by the merging arrows in Figs. 8A and 8B. One skilled in the art would readily understand the operation of the circuit of Fig. 9 and the various components necessary to implement this circuit.
Other implementations for subsampling and upsampling of data may also be used. The various circuits and algorithms for carrying out all the remaining embodiments of Figs. 2 and 4-9 would be readily understood by those skilled in the art.
When compressing/decompressing color video signals, the RGB or YUV (or other format) signals may be processed separately using the same codec or may be processed in parallel, which would be readily understood by those skilled in the art.
While particular embodiments of the present invention have been shown and described, it will be obvious to those skilled in the art that changes and modifications may be made without departing from this invention in its broader aspects and, therefore, the appended claims are to encompass within their scope all such changes and modifications as fall within the true spirit and scope of this invention.

Claims

CLAIMSWhat is claimed is:
1. A method for generating compressed video data comprising the steps of: receiving first video data at one or more input terminals representing a first block of M x N picture elements (pels) from a first frame of Q x R pels; subsampling second video data representing a second block of M x N pels, by outputting only selected subsampled video data from said second block, so as to reduce said second block to a subsampled block of S x T pels; storing in a memory said subsampled block representing S x T pels; addressing said memory to output said subsampled block representing S x T pels; upsampling said subsampled block output from said memory, by interpolating pel values, to output an upsampled block of M x N pels; generating a difference signal representing differences between said first block of pels from said first frame and said upsampled block; transmitting a signal corresponding to said difference signal to a receiver; adding said difference signal to said upsampled block to form an input block of video data for subsampling; and repeating said steps of receiving, subsampling, storing, addressing, upsampling, generating, transmitting, and adding for a plurality of blocks of M x N pels applied to said one or more input terminals until all video data in said first frame has been received and difference signals for said first frame have been generated.
2. A method of Claim 1 wherein said step of subsampling comprises the step of performing 4 to 1 subsampling of said second video data such that a representation of said first frame is stored in said memory using one-fourth of the pels in said first frame.
3. A method of Claim 1 wherein said step of subsampling comprises the step of performing 9 to 1 subsampling of said second video data such that a representation of said first frame is stored in said memory using one-ninth of the pels in said first frame.
4. A method of Claim 1 wherein said step of subsampling comprises the step of performing 16 to l subsampling of said second video data such that a representation of said first frame is stored in said memory using one-sixteenth of the pe s in said first frame.
5. The method of Claim 1 wherein said step of subsampling is performing by controlling a write-enable signal for said memory while also addressing said memory so as to store only selected subsampled video data in said memory.
6. The method of Claim 1 wherein said memory is approximately one-quarter the size of a memory required to store a full first frame.
7. The method of Claim 1 wherein said step of upsampling averages selected pel values together to created interpolated pels for said upsampled block.
8. The method of Claim 1 wherein said step of upsampling comprises a first upsampling process for upsampling said subsampled block of SxT pels when no movement of said block is detected between said first frame and a previous frame, and performs an additional second upsampling process for low pass filtering of said subsampled block when it is detected that there has been movement of said block between said first frame and said previous frame.
9. The method of Claim 8 wherein said low pass filtering performs additional averaging of pel values over that performed during said first upsampling process.
10. The method of Claim 1 further comprising the steps of: motion compensating said subsampled block output from said memory if it is determined in said step of generating a difference signal that movement has occurred between a previous frame and said first frame; detecting a number of bits required to be transmitted in order to convey said difference signal; selecting for motion compensation of said subsampled block from said memory a motion vector which produces a fewest number of bits required to be transmitted to convey said difference signal; and transmitting said motion vector and said difference signal which resulted in said fewest number of bits required to convey said difference signal.
11. The method of Claim 10 further comprising the step of transmitting a signal indicating that no movement of said first block contained in said first frame has occurred between said previous frame and said first frame if, in said step of generating a difference signal, a determination is made that no movement has occurred.
12. The method of Claim 11 wherein said difference signal addresses a look-up table which cross-references said difference signal to the number of bits required to be transmitted in order to convey said difference signal.
13. The method of Claim 12 wherein said look-up table cross-references said difference signal with the anticipated output of a coder which converts a difference signal into a variable-length bit stream.
14. The method of Claim 10 wherein said steps of generating a difference signal, motion compensating said subsampled block, and detecting a number of bits are repeated for different motion compensations of said subsampled block from said memory to identify a particular motion vector in said step of selecting which resulted in said fewest number of bits to convey said difference signal.
15. A circuit for generating compressed video data comprising: a subtractor having a first input connected to a source of video data from a first frame and having a second input connected to a output of an upsampler, said output of said upsampler providing video data from a previous frame, an output of said subtractor providing a difference signal; an adder having a first input connected to said output of said upsampler and having a second input connected to receive a signal corresponding to said difference signal; a subsampler having an input connected to an output of said adder; and a memory having an input connected to an output of said subsampler and having an output connected to an input of said upsampler.
16. The circuit of Claim 15 wherein said subsampler subsamples at a 4 to 1 ratio to output an amount of data for said first frame approximately egual to one-guarter of that data in said first frame.
17. The circuit of Claim 15 wherein said subsampler subsamples at a 9 to 1 ratio to output an amount of data for said first frame approximately egual to one-ninth of that data in said first frame.
18. The circuit of Claim 15 wherein said upsampler generates interpolated video data based upon an output of said memory.
19. The circuit of Claim 18 wherein said upsampler also provides low pass filtering of video data output from said memory only in the event that movement of an image represented by said video data is detected between said previous frame and said first frame.
20. The circuit of Claim 15 further comprising: a motion estimator connected to said output of said subtractor, said motion estimator comprising: a converter for converting a difference value corresponding to said output of said subtractor to a number of bits reguired to be transmitted in order to convey said difference value; a detector for receiving said number of bits required to convey said difference value and for determining whether said number is the fewest number of bits detected for conveying said difference value; and a motion vector generator for motion compensating said video data from said previous frame for application to said substractor and for generating for transmission the motion vector which resulted in said difference value corresponding to said fewest number of bits.
EP95940775A 1994-11-30 1995-11-20 Video coding method using subsampling for reducing the frame memory size Withdrawn EP0741946A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US08/348,272 US5644361A (en) 1994-11-30 1994-11-30 Subsampled frame storage technique for reduced memory size
US348272 1994-11-30
PCT/US1995/015135 WO1996017478A1 (en) 1994-11-30 1995-11-20 Video coding method using subsampling for reducing the frame memory size

Publications (1)

Publication Number Publication Date
EP0741946A1 true EP0741946A1 (en) 1996-11-13

Family

ID=23367315

Family Applications (1)

Application Number Title Priority Date Filing Date
EP95940775A Withdrawn EP0741946A1 (en) 1994-11-30 1995-11-20 Video coding method using subsampling for reducing the frame memory size

Country Status (4)

Country Link
US (2) US5644361A (en)
EP (1) EP0741946A1 (en)
KR (1) KR970700980A (en)
WO (1) WO1996017478A1 (en)

Families Citing this family (87)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE59505097D1 (en) * 1994-08-24 1999-03-25 Siemens Ag METHOD FOR DECODING COMPRESSED VIDEO DATA WITH REDUCED STORAGE NEEDS
JP3655651B2 (en) * 1994-09-02 2005-06-02 テキサス インスツルメンツ インコーポレイテツド Data processing device
DE19524688C1 (en) * 1995-07-06 1997-01-23 Siemens Ag Method for decoding and encoding a compressed video data stream with reduced memory requirements
US6873738B2 (en) * 1995-10-02 2005-03-29 Sony Corporation Hierarchical image processor for encoding or decoding, and memory on the same chip
US6035369A (en) * 1995-10-19 2000-03-07 Rambus Inc. Method and apparatus for providing a memory with write enable information
GB2306829A (en) * 1995-10-20 1997-05-07 Amstrad Plc Digital video decoding apparatus
US5777677A (en) * 1996-02-09 1998-07-07 International Business Machines Corporation Approximate MPEG decoder with compressed reference frames
EP0793389B1 (en) * 1996-02-27 2001-08-16 STMicroelectronics S.r.l. Memory reduction in the MPEG-2 main profile main level decoder
JP3575508B2 (en) * 1996-03-04 2004-10-13 Kddi株式会社 Encoded video playback device
US5825424A (en) * 1996-06-19 1998-10-20 Thomson Consumer Electronics, Inc. MPEG system which decompresses and recompresses image data before storing image data in a memory and in accordance with a resolution of a display device
US6222886B1 (en) * 1996-06-24 2001-04-24 Kabushiki Kaisha Toshiba Compression based reduced memory video decoder
US6256348B1 (en) * 1996-08-30 2001-07-03 Texas Instruments Incorporated Reduced memory MPEG video decoder circuits and methods
JP3466032B2 (en) * 1996-10-24 2003-11-10 富士通株式会社 Video encoding device and decoding device
US6002438A (en) * 1996-10-25 1999-12-14 Texas Instruments Incorporated Method and apparatus for storing decoded video information
US6028635A (en) * 1996-12-03 2000-02-22 Stmicroelectronics, Inc. Reducing the memory required for decompression by storing compressed information using DCT based techniques
CN100518319C (en) * 1996-12-18 2009-07-22 汤姆森消费电子有限公司 Fixed-length block data compression and decompression method
US6594315B1 (en) 1996-12-18 2003-07-15 Thomson Licensing S.A. Formatting of recompressed data in an MPEG decoder
US6879631B1 (en) 1996-12-18 2005-04-12 Thomson Licensing S.A. Selective compression network in an MPEG compatible decoder
US6184935B1 (en) 1997-03-12 2001-02-06 Matsushita Electric Industrial, Co. Ltd. Upsampling filter and half-pixel generator for an HDTV downconversion system
JP3844844B2 (en) * 1997-06-06 2006-11-15 富士通株式会社 Moving picture coding apparatus and moving picture coding method
US5933195A (en) * 1997-09-26 1999-08-03 Sarnoff Corporation Method and apparatus memory requirements for storing reference frames in a video decoder
US6370192B1 (en) 1997-11-20 2002-04-09 Hitachi America, Ltd. Methods and apparatus for decoding different portions of a video image at different resolutions
US6137837A (en) * 1998-01-23 2000-10-24 Motorola, Inc. Motion estimation for digital video with reduced number of search window pixels
JP4264571B2 (en) * 1998-04-09 2009-05-20 ソニー株式会社 Digital image decoding apparatus and method, and recording medium
US6310919B1 (en) * 1998-05-07 2001-10-30 Sarnoff Corporation Method and apparatus for adaptively scaling motion vector information in an information stream decoder
JP4214562B2 (en) * 1998-06-26 2009-01-28 ソニー株式会社 Decoding device
US6167153A (en) * 1998-07-20 2000-12-26 Xerox Corporation Method of compressing JPEG files
JP2000059790A (en) * 1998-08-05 2000-02-25 Victor Co Of Japan Ltd Dynamic image code string converter and method therefor
US6487249B2 (en) 1998-10-09 2002-11-26 Matsushita Electric Industrial Co., Ltd. Efficient down conversion system for 2:1 decimation
US6137539A (en) * 1998-10-09 2000-10-24 Matshushita Electric Industrial Co, Ltd Digital television status display
DE69933858T2 (en) * 1998-12-10 2007-05-24 Matsushita Electric Industrial Co., Ltd., Kadoma ARITHMETIC FILTER DEVICE
US6680974B1 (en) * 1999-12-02 2004-01-20 Lucent Technologies Inc. Methods and apparatus for context selection of block transform coefficients
US6473460B1 (en) * 2000-03-31 2002-10-29 Matsushita Electric Industrial Co., Ltd. Method and apparatus for calculating motion vectors
US6459452B1 (en) * 2000-03-31 2002-10-01 Intel Corporation System, method, and apparatus for decoding digital video sequences
DE10022264A1 (en) * 2000-05-08 2001-11-22 Micronas Munich Gmbh Video signal decoder for television receiver uses sampling to provide reduced reference image data blocks provided with movement compensation before reconstruction and summation with difference image data blocks
US6982764B1 (en) * 2000-05-25 2006-01-03 Northrop Grumman Corporation Image enhancement
KR100370076B1 (en) * 2000-07-27 2003-01-30 엘지전자 주식회사 video decoder with down conversion function and method of decoding a video signal
JP3643777B2 (en) * 2001-02-28 2005-04-27 三洋電機株式会社 Moving picture decoding apparatus and moving picture decoding method
US7602847B1 (en) 2001-03-27 2009-10-13 Vixs Systems, Inc. Device and method for compression of a video stream
US20070053428A1 (en) * 2001-03-30 2007-03-08 Vixs Systems, Inc. Managed degradation of a video stream
US8107524B2 (en) * 2001-03-30 2012-01-31 Vixs Systems, Inc. Adaptive bandwidth footprint matching for multiple compressed video streams in a fixed bandwidth network
WO2003001814A1 (en) * 2001-06-26 2003-01-03 Koninklijke Philips Electronics N.V. Video coding method
US7675972B1 (en) * 2001-07-30 2010-03-09 Vixs Systems, Inc. System and method for multiple channel video transcoding
US6959348B1 (en) * 2001-07-30 2005-10-25 Vixs Systems, Inc. Method and system for accessing data
US20030039226A1 (en) * 2001-08-24 2003-02-27 Kwak Joseph A. Physical layer automatic repeat request (ARQ)
EP1442608B1 (en) * 2001-10-29 2015-05-06 Ceva D.S.P. Ltd. Method and apparatus for motion estimation in a sequence of digital images
US7139330B1 (en) 2001-10-31 2006-11-21 Vixs Systems, Inc. System for signal mixing and method thereof
US7596127B1 (en) 2001-10-31 2009-09-29 Vixs Systems, Inc. System for allocating data in a communications system and method thereof
US7106715B1 (en) 2001-11-16 2006-09-12 Vixs Systems, Inc. System for providing data to multiple devices and method thereof
US7403564B2 (en) * 2001-11-21 2008-07-22 Vixs Systems, Inc. System and method for multiple channel video transcoding
US7356079B2 (en) * 2001-11-21 2008-04-08 Vixs Systems Inc. Method and system for rate control during video transcoding
US7165180B1 (en) 2001-11-27 2007-01-16 Vixs Systems, Inc. Monolithic semiconductor device for preventing external access to an encryption key
KR100463515B1 (en) * 2001-12-29 2004-12-29 엘지전자 주식회사 Video decoding system
US7596179B2 (en) * 2002-02-27 2009-09-29 Hewlett-Packard Development Company, L.P. Reducing the resolution of media data
US7310679B1 (en) 2002-04-29 2007-12-18 Vixs Systems Inc. Method and system for transmitting video content while preventing other transmissions in a contention-based network
US7120253B2 (en) * 2002-05-02 2006-10-10 Vixs Systems, Inc. Method and system for protecting video data
US20040141555A1 (en) * 2003-01-16 2004-07-22 Rault Patrick M. Method of motion vector prediction and system thereof
US7408989B2 (en) * 2003-01-16 2008-08-05 Vix5 Systems Inc Method of video encoding using windows and system thereof
US7327784B2 (en) * 2003-02-24 2008-02-05 Vixs Systems, Inc. Method and system for transcoding video data
US7606305B1 (en) 2003-02-24 2009-10-20 Vixs Systems, Inc. Method and system for transcoding video data
US7133452B1 (en) 2003-02-24 2006-11-07 Vixs Systems, Inc. Method and system for transcoding video data
US7130350B1 (en) 2003-02-28 2006-10-31 Vixs Systems, Inc. Method and system for encoding and decoding data in a video stream
US7519115B2 (en) * 2003-03-31 2009-04-14 Duma Video, Inc. Video compression method and apparatus
US7551671B2 (en) * 2003-04-16 2009-06-23 General Dynamics Decision Systems, Inc. System and method for transmission of video signals using multiple channels
US7739105B2 (en) * 2003-06-13 2010-06-15 Vixs Systems, Inc. System and method for processing audio frames
US7277101B2 (en) 2003-09-29 2007-10-02 Vixs Systems Inc Method and system for scaling images
US7668396B2 (en) * 2003-09-29 2010-02-23 Vixs Systems, Inc. Method and system for noise reduction in an image
US20050099534A1 (en) * 2003-10-23 2005-05-12 Aufranc Richard E.Jr. Display system for an interlaced image frame with a wobbling device
US7406598B2 (en) * 2004-02-17 2008-07-29 Vixs Systems Inc. Method and system for secure content distribution
US7421048B2 (en) * 2005-01-20 2008-09-02 Vixs Systems, Inc. System and method for multimedia delivery in a wireless environment
US7609766B2 (en) * 2005-02-08 2009-10-27 Vixs Systems, Inc. System of intra-picture complexity preprocessing
US8290057B2 (en) 2005-02-18 2012-10-16 Mediatek Incorporation Method of decoding a digital video sequence and related apparatus
US8949920B2 (en) * 2005-03-17 2015-02-03 Vixs Systems Inc. System and method for storage device emulation in a multimedia processing system
US7400869B2 (en) * 2005-03-22 2008-07-15 Vixs Systems Inc. System and method for adaptive DC offset compensation in wireless transmissions
US7983458B2 (en) * 2005-09-20 2011-07-19 Capso Vision, Inc. In vivo autonomous camera with on-board data storage or digital wireless transmission in regulatory approved band
US7707485B2 (en) 2005-09-28 2010-04-27 Vixs Systems, Inc. System and method for dynamic transrating based on content
US20070112826A1 (en) * 2005-11-10 2007-05-17 Vixs Systems, Inc. Multimedia transcoding based on remaining storage capacity
US8131995B2 (en) * 2006-01-24 2012-03-06 Vixs Systems, Inc. Processing feature revocation and reinvocation
GB2435360B (en) 2006-02-16 2009-09-23 Imagination Tech Ltd Method and apparatus for determining motion between video images
US8472527B2 (en) * 2006-09-13 2013-06-25 Texas Instruments Incorporated Hierarchical motion estimation using original frame for sub-sampled reference
US7940973B2 (en) * 2006-09-19 2011-05-10 Capso Vision Inc. Capture control for in vivo camera
US8121195B2 (en) * 2006-11-30 2012-02-21 Lsi Corporation Memory reduced H264/MPEG-4 AVC codec
US20130156113A1 (en) * 2010-08-17 2013-06-20 Streamworks International, S.A. Video signal processing
GB2488396A (en) * 2010-08-17 2012-08-29 Streamworks Internat S A Video signal processing
EP2429192A1 (en) * 2010-08-17 2012-03-14 Streamworks International S.A. Video signal processing
US10674171B2 (en) 2011-09-27 2020-06-02 Telefonaktiebolaget Lm Ericsson (Publ) Decoders and methods thereof for managing pictures in video decoding process
US9053366B2 (en) 2013-02-20 2015-06-09 Honeywell International Inc. System and method for detecting motion in compressed video

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4383272A (en) * 1981-04-13 1983-05-10 Bell Telephone Laboratories, Incorporated Video signal interpolation using motion estimation
US4488175A (en) * 1982-06-28 1984-12-11 At&T Bell Laboratories DPCM Video signal processing technique with spatial subsampling
CA1277416C (en) * 1984-08-13 1990-12-04 Akihiro Furukawa Inter-frame predictive coding apparatus for video signal
DE3688093T2 (en) * 1985-03-22 1993-10-21 Toray Industries Transparent object and process for its manufacture.
AU579550B2 (en) * 1985-06-10 1988-11-24 Nec Corporation Movement compensation predictive encoder for a moving picture signal with a reduced amount of information
WO1987005770A1 (en) * 1986-03-19 1987-09-24 British Broadcasting Corporation Video signal processing for bandwidth reduction
US4922273A (en) * 1987-04-02 1990-05-01 Konica Corporation Compression method of halftone image data
ES2048823T3 (en) * 1988-02-23 1994-04-01 Philips Nv SPACE-TEMPORAL SUB-SAMPLING DEVICE FOR VIDEO NUMERICAL REPRESENTATIVE NAMES OF A SERIES OF LINKED OR SEQUENTIAL IMAGES; HIGH DEFINITION TV TRANSMISSION SYSTEM INCLUDING A DEVICE. OF THIS TYPE AND STAGES OF EMISSION AND RECEPTION FOR A SYSTEM. OF THIS TYPE.
FR2633137B1 (en) * 1988-06-21 1990-11-09 Labo Electronique Physique HIGH DEFINITION TELEVISION TRANSMISSION AND RECEPTION SYSTEM WITH IMPROVED SPEED ESTIMATOR AND REDUCED DATA RATE
JPH0294891A (en) * 1988-09-30 1990-04-05 Toshiba Corp Movement detection circuit
GB8909498D0 (en) * 1989-04-26 1989-06-14 British Telecomm Motion estimator
US5193003A (en) * 1989-08-15 1993-03-09 Sony Corporation Apparatus for decoding digital video data with small memory requirement
GB2241402B (en) * 1990-02-26 1994-02-02 Sony Corp Encoding and decoding of MAC video signals
US5043808A (en) * 1990-03-19 1991-08-27 At&T Bell Laboratories High definition television arrangement employing motion compensated prediction error signals
US5128759A (en) * 1990-05-31 1992-07-07 Sony Corporation Video signal decoder
JPH04104681A (en) * 1990-08-24 1992-04-07 Victor Co Of Japan Ltd Motion vector detection device
US5200820A (en) * 1991-04-26 1993-04-06 Bell Communications Research, Inc. Block-matching motion estimator for video coder
US5151784A (en) * 1991-04-30 1992-09-29 At&T Bell Laboratories Multiple frame motion estimation
US5184218A (en) * 1991-07-03 1993-02-02 Wavephore, Inc. Bandwidth compression and expansion system
US5418617A (en) * 1991-08-08 1995-05-23 Matsushita Electric Corporation Of America Motion compensation using minimum bits per motion block as criterion for block matching
EP0535272A1 (en) * 1991-10-02 1993-04-07 Alcatel N.V. Hybrid encoder arrangement for an image processing system
US5227878A (en) * 1991-11-15 1993-07-13 At&T Bell Laboratories Adaptive coding and decoding of frames and fields of video
KR0152013B1 (en) * 1992-05-08 1998-10-15 강진구 Moving estimation device considering variable length coding
JP3545000B2 (en) * 1992-11-02 2004-07-21 ソニー株式会社 Image signal encoding device, image signal decoding device
US5329318A (en) * 1993-05-13 1994-07-12 Intel Corporation Method for optimizing image motion estimation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9617478A1 *

Also Published As

Publication number Publication date
KR970700980A (en) 1997-02-12
US5581302A (en) 1996-12-03
US5644361A (en) 1997-07-01
WO1996017478A1 (en) 1996-06-06

Similar Documents

Publication Publication Date Title
US5644361A (en) Subsampled frame storage technique for reduced memory size
JP3314929B2 (en) Video signal encoding circuit
US8401079B2 (en) Image coding apparatus, image coding method, image decoding apparatus, image decoding method and communication apparatus
KR970000683B1 (en) Resolution adaptive video compression/decompression method and apparatus
EP0630157B1 (en) Systems and methods for coding alternate fields of interlaced video sequences
US5686962A (en) Motion image coder using pre-filter to reduce quantization error
US5610657A (en) Video compression using an iterative error data coding method
US5812787A (en) Video coding scheme with foreground/background separation
US5812788A (en) Encoding/decoding video signals using quantization tables based on explicitly encoded base and scale matrices
US5703966A (en) Block selection using motion estimation error
US6414992B1 (en) Optimal encoding of motion compensated video
EP0396360A2 (en) Apparatus for inter-frame predictive encoding of video signal
US6125146A (en) Method and device for decoding coded digital video signals
US5721822A (en) Run-length encoding/decoding video signals using scan patterns explicitly encoded into bitstreams
US5778190A (en) Encoding video signals using multi-phase motion estimation
WO1996031069A1 (en) Motion vector based frame insertion process for increasing the frame rate of moving images
US5627601A (en) Motion estimation with bit rate criterion
US5751861A (en) Reducing residual artifacts in video coding schemes with integer motion compensation
EP0920204A1 (en) MPEG2 decoder with reduced RAM requisite by recompression using adaptive tree search vector quantization
US6008849A (en) Method and system for decoding coded video signals
EP0714208B1 (en) Method and system for decoding coded video signals
US5394190A (en) Video signal encoding apparatus
KR19980017213A (en) Image Decoding System with Compensation Function for Degraded Image
CN113994684A (en) Method and apparatus for prediction dependent residual scaling for video coding
JPH09261661A (en) Method for forming bidirectional coding picture from two reference pictures

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): DE FR GB

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 19961207