WO1992001267A1 - Charge domain block matching processor - Google Patents

Charge domain block matching processor Download PDF

Info

Publication number
WO1992001267A1
WO1992001267A1 PCT/US1991/004790 US9104790W WO9201267A1 WO 1992001267 A1 WO1992001267 A1 WO 1992001267A1 US 9104790 W US9104790 W US 9104790W WO 9201267 A1 WO9201267 A1 WO 9201267A1
Authority
WO
WIPO (PCT)
Prior art keywords
charge
network
networks
packets
magnitude
Prior art date
Application number
PCT/US1991/004790
Other languages
French (fr)
Inventor
Alice M. Chiang
Original Assignee
Massachusetts Institute Of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Massachusetts Institute Of Technology filed Critical Massachusetts Institute Of Technology
Publication of WO1992001267A1 publication Critical patent/WO1992001267A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation

Definitions

  • the present invention is in the field of integrated circuit networks and more particularly directed to charge domain processing networks.
  • Intraframe coders are effective to remove spatial redundancy from within respective video frames.
  • Interframe coders generally use some form of predictive coding based on the information in a previous frame to remove redundancy between frames.
  • Tremendous advantages are gained by incorporating motion detection and compensation (MDC) in an interframe coder. This is due to the fact that, in an area where motion is occurring, a better prediction can be formed based on the direction of motion.
  • MDC motion detection and compensation
  • Transform image coding based on the discrete cosine transform (DCT) algorithm has been shown to provide good-quality, low-bit-rate image transmission.
  • DCT discrete cosine transform
  • a hybrid interframe/intraframe coder offers the potential for achieving a higher bit-rate reduction.
  • Devices implementing such techniques are particularly advantageous, for example, in high-definition television (HDTV) receivers capable of accommodating different transmission formats and different field rates, and for low-bit-rate video communications in general.
  • HDTV high-definition television
  • MDC permits implementation of efficient interpolation algorithms to reconstruct missing fields following the use of temporal subsampling to reduce transmission bandwidth.
  • MDE real-time motion detection and estimation
  • NTSC National Television System Committee
  • PAL Phase Alternating Line
  • SECAM Sequential Couleur a Memoire
  • the luminance resolution of the future HDTV is at least four times, better than the NTSC system; to transcode such a high resolution video between different standards requires a temporal interpolation processor capable of performing more than 16 billion operations/s.
  • the availability of a low-cost field-rate converter would considerably simplify the trade-offs between different transmission formats.
  • BMA full search block matching algorithm
  • the BMA effectively passes a subarray, or template, of data values over the larger full array and identifies the particular location of the subarray which provides the highest spatial correlation (i.e. best match) of the subarray to the underlying array.
  • the full array may be a 31x31 pixel rectangular array
  • the template may be a 16x16 pixel rectangular array.
  • the computational requirement for motion compensation coding with a 16x16 full-search BMA is 3.8 billion computations/s for present NTSC video, and is expected to be on the order of 16 billion computations/s for future HDTV signals. For a digital HDTV transmitter and receiver, this computation rate requirement is near the upper limit of present digital technology, and consequently would result in highly complex and correspondingly costly equipment. Accordingly, it is an object of the present invention to provide an improved charge domain processing network.
  • Another object is to provide a charge domain full search block matching algorithm processing system.
  • the present invention is a full search block matching charge domain processor for determining the best matching mxn element subarray of an MxN element array with an mxn element template, where m ⁇ . M and n ⁇ _ N.
  • the processor indicates a tapped delay line, a template network, m groups of n absolute magnitude difference (AMD) networks, m groups of n summing networks, m accumulator networks, a column accumulator network, and .a column minimum difference network.
  • AMD absolute magnitude difference
  • m groups of n summing networks m groups of n summing networks
  • m accumulator networks a column accumulator network
  • column minimum difference network a column minimum difference network.
  • all of these elements are charge domain devices, but one or more may have other forms.
  • the charge domain tapped delay line includes at least (mxn) + (m-l)x(N-n) stages and has a serial input channel and m groups of n parallel output channels. Each of the n output channels in each group is coupled to an associated one of n successive stages in a corresponding group of stages in the delay line, and each of the groups of n successive stages in the delay line is separated by N-n successive stages in the delay line.
  • the delay line is adapted for establishing a succession of at least mxn charge packets therein in response to a succession of applied input signals, where each of the packets has a magnitude corresponding to an associated one of a corresponding succession of elements of the array.
  • the charge packets may be selectively shifted from stage-to-stage along the delay line at an input rate.
  • Each of N floating gate sensing electrodes overlies one of the stages and is adapted to provide a potential thereon representative of the magnitude of a charge packet currently within its underlying stage, where the potential is a search signal associated with the electrode.
  • the template network includes a charge storage region and is adapted for establishing a succession of mxn charge packets in that storage region at the input rate, where each of the packets has a magnitude corresponding to an associated element in the template.
  • a floating gate sensing electrode overlies the storage region and is adapted to provide a potential thereon representative of the magnitude of a charge packet currently underlying the electrode, where the potential is a template signal.
  • Each AMD network of the m groups of n charge domain AMD networks has a template input port, a search input port, and a charge domain output port.
  • Each of the n AMD networks of each of the groups has its search input port coupled to an associated sensing electrode of an associated group of the output channels of the delay line, and has its template input port coupled to the sensing electrode of the template network.
  • Each of the AMD networks provides at its output port a succession of charge packets at the input rate, where each of the charge packets is representative of the absolute magnitude of the difference between the current one of the search signals at its search input port and the current one of the template signals at the template input port.
  • Each of the summing networks is associated with an AMD network in an associated one of the m groups of AMD networks, and each provides at an output thereof a succession of output charge packets at a summing rate equal to l/(mxn) times the input rate.
  • Each of these output charge packet has a magnitude representative of the siirn of the magnitude of mxn successive charge packets provided at the output port of its associated AMD network.
  • Each row accumulator network is associated with one of the m groups of n AMD networks, and substantially simultaneously receives the output charge packets of its associated n summing networks at distinct locations in the row accumulator network.
  • the received charge packets are transferred to establish a succession of the output charge packets at an output port thereof at an accumulator rate equal to 1/m times the input rate.
  • the column accumulator network is associated with the m row accumulator networks. That network substantially simultaneously receives the succession of output charge packets of said m row accumulator networks at distinct locations in the column accumulator network, and transfers those received charge packets to establish a succession of output charge packets at an output port thereof at a rate equal to the input rate.
  • the charge domain column minimum difference network is associated with the column accumulator network, and is adapted to store a charge packet therein.
  • the succession of said output charge packets is received one at a time at the input rate, and at the receipt of each of the charge packets, retains and stores only the smaller magnitude one of the just received charge packet and the next previously stored charge packet.
  • the magnitude of the charge packet stored at the column MD network is representative of the best matching mxn subarray of the elements that have been represented by charge packets in the delay line.
  • separate row minimum difference (MD) networks may be associated with each of the M row accumulator networks and be adapted to store a charge packet therein.
  • Each row MD network receives the succession of output charge packets one at a time at the accumulator rate, and at the receipt of each charge packet, retains and stores only the smaller magnitude one of the just received charge packet and a previously stored charge packet.
  • the output from the respective row MD networks are received at the distinct locations in the column accumulator network. Upon receipt, those charge packets are transferred in succession to the output part of that accumulator at the accumulator (i.e. relatively low) clock rate.
  • the column MD network operates as described above to identify the best match.
  • FIG. 1 shows in schematic block diagram form, a charge domain processing system embodying the present invention
  • FIG. 1A shows in schematic block diagram from, an alternative charge domain processing system embodying the invention
  • FIG. 2 shows in schematic block diagram form an absolute magnitude difference minimum detection network of the system of FIG. 1;
  • FIG. 3 shows in schematic block diagram form, a minimum detection network of the system of FIG. 1.
  • the preferred embodiment of the invention is a very large scale integration (VLSI) implementation of a motion detection and estimation video processor.
  • VLSI very large scale integration
  • Motion estimation and compensation techniques have a wide range of applications in the video signal processing area, including video signal coding in order to remove redundancy along the temporal domain, and therefore, to reduce transmission bit-rates.
  • MC motion compensation
  • the displacement of objects is estimated from one field, or frame, to the next. A prediction is then formed in the direction of the motion. This process gives lower prediction errors than does a standard interframe prediction process, i.e., a process with no motion compensation.
  • MC systems have been used to transmit tele ision broadcast signals and to code video conferencing signals.
  • a block-match MC system the current array of pixels, or frames, in a video sequence is segmented into subarrays, or blocks. Each block is compared to blocks in the previous frame within a search area to detect the displacement or the motion of objects. According to a matching criterion, such as the mean of the absolute difference (MAD) or the mean-square-error, a motion vector is extracted for each block. With the motion-vector information, a receiver can reconstruct the current frame using the available previous frame data and the coded frame difference (as determined by the best matched block). Among several possible searching algorithms, full search, which searches all possible blocks within the tracking range in the previous frame, gives the most accurate estimation of the motion vector.
  • a matching criterion such as the mean of the absolute difference (MAD) or the mean-square-error
  • the full-search block-matching method requires a large number of computations. For example, for a given 16xl6-pixel block X m ,n in the present frame, and a 31x31-pixel search area Ym + l,n + j in the previous frame, the displacement vector (i,j) is determined by the minimum of the distortion function D:
  • the search requires about 16 billion operations and 8 billion data accesses per second.
  • the preferred form of the present invention implements the full search BMA. That embodiment includes a charge-domain serial tapped delay line as an input buffer, and an array of charge domain signal processors. The delay line shifts and holds analog sampled data which are in the form of charge packets. At each stage of delay, the signals are nondestructively sensed and coupled to a corresponding signal processor, and the sampled data are transferred and subsequently processed in parallel.
  • the processed data from all the processors can be read out either in a parallel or serial format through a parallel-in-serial-out output buffer.
  • a parallel-in-serial-out output buffer In this structure, only the serial input I buffer has to be clocked at the system throughput rate; the internal clock rate of each processor is reduced by the number of parallel processors.
  • all of the computation functions are performed in the charge domain, and local charge domain memories are included for storing the processed signal.
  • charge domain full search BMA processing system 10 is shown in schematic block diagram form in FIG. 1.
  • System 10 is adapted to determine the best matching mxn element subarray of an MxN element array with an mxn template, where m ⁇ .M and n_N.
  • AMD absolute magnitude difference
  • the tapped delay line 12 has the same general configuration as the tapped delay line disclosed in U.S. Patent No. 4,464,726, and generally includes at least (mxn) + (m-l)x(N-n) stages.
  • a serial input is provided at line 12a for receiving a succession of applied values representative of the MxN elements of the current frame array. Those values are clocked through the delay line 12 at an input rate R 0 (in response to an input clock signal generated by controller 40) .
  • the template network 20 As the input data is passed through the delay line 12 at rate R 0 , the template network 20 generates at that same rate, a succession of template values corresponding to the values of the mxn subarray (or block) from the previous frame. Those values establish a succession of charge packets under the template input in each of the AMD networks in groups 1 through 16.
  • each AMD network During each clock period of the R 0 input clock signal, each AMD network provides at its output port a charge packet representative of the absolute magnitude of the difference of the magnitudes of the charge packets at the search input and the template input. Each of these charge packets is applied to an associated one of the summing networks.
  • the charge packets are accumulated in a summing network associated with each AMD network to provide a composite charge packet at its output having a magnitude substantially equal to the sum of the magnitudes of the charge packets provided by the AMD network during those mxn input clock periods.
  • the row minimum detector networks, MD1 through MD16 each operate to receive the succession of charge packets from its associated row accumulator network.
  • the charge packets stored in each of the row MD networks are transferred to correspondingly positioned locations in the column accumulator 30, and then those charge packets are stepped together at the accumulator clock rate (1/m x R 0 ) through accumulator 30 and applied by way of an output port 30a to the column minimum detection network 35.
  • the controller 40 provides the appropriate clock signals to accomplish the various charge transfer operations.
  • the controller 40 tracks the various charge packets identified by the respective row and column minimum detection networks so that "position" of the mxn subarray is known for the particular charge packet that is ultimately determined to be the best match (i.e. the packet stored at minimum detection network 35) .
  • FIG. 1A An alternative system 10' embodying the invention is shown is FIG. 1A, where elements corresponding to elements in the embodiment of FIG. 1 are denoted by the same reference numerals.
  • the system 10' is substantially the same as the system 10, except that there are no elements corresponding to minimum detection (MD) networks MD1-MD16.
  • MD minimum detection
  • the output charge packets from row accumulators ACC1-ACC16 are loaded directly into their respective locations in column accumulator 30.
  • the stored packets in the column accumulator 30 are ' then clocked through the accumulator 30 at the input rate R 0 to the column minimum detector network 35.
  • the system 10' requires a high clock rate (R 0 ) column accumulator 30 and minimum detector network (compared to the system 10 which utilizes a low clock rate (1/rn x R 0 ) column accumulator 30 and minimum detector network 35), but it eliminates the need for the row minimum detectors networks (corresponding to MD1-MD16 in system 10), i.e. only a single minimum detector network 35 is required.
  • FIG. 2 a preferred form of a charge domain absolute magnitude difference network is shown schematically in FIG. 2.
  • the illustrated network denoted by reference designation 50, is a two input channel device.
  • Network 50 includes a U-shaped charge transport medium 52 having one of charge injection diodes 54 and 56 positioned at each end.
  • a first pair of gate electrodes 60 and 62 overlies a first leg of the U-shaped medium 52 with gate 60 being between diode 54 and gate 62
  • a second pair of gate electrodes 64 and 66 overlies the second leg of the medium 52 with gate 64 being between diodes 56 and gate 66.
  • Gates 60 and 66 are electrically coupled together and also to template network 20, thereby coupling a template signal to device 50.
  • Gates 62 and 64 are electrically coupled together and also to the associated one of the output channels from tapped delay line 12, thereby coupling a search signal to device 50.
  • a charge packet is provided at the output portion 52a of medium 52, which packet has a magnitude proportional to the absolute magnitude of the difference between the template signal at gates 60 and 66 and the search signal at gates 62 and 64.
  • FIG. 3 a preferred form of a charge domain minimum detection network is shown schematically in FIG. 3.
  • the illustrated network denoted by reference designation 70, can detect and store the smallest of a succession of applied charge packets at a clock rate. At each clock period, the network 70 examines a previously stored charge packet and a currently applied charge packet, and detects and stores the smaller packet, and discards the larger packet.
  • An associated buffer 82 and digital counter 80 keep track of the identity (i.e. time of receipt) of the stored packet.
  • the network 70 includes a two stage CCD tapped delay line 72 having floating gate sensing electrodes 72a and 72b, an FET flip-flop 74, a charge sink 76, and transfer gates 78a and 78b coupled to an associated one of sensing electrodes 72a and 72b.
  • Each of electrodes 72a and 72b is coupled by way of an associated one of FET's 74a and 74b to an associated input of the flip-flop 74.
  • the outputs A and B of flip-flop 74 are coupled to associated ones of charge domain transfer gates 78a and 78b.
  • the output A of flip-flop 74 is coupled to the counter 80 to update the address value for the charge packet currently stored in charge sink 76.
  • packets of charge Qi and Q2 are loaded into the tapped delay line 72 under gates 72a and 72b, respectively. If Qi is larger than Q2, output A of the flip-flop 74 is high, the transfer gate 78a is "on” and the transfer gate 78b is “off”. As a result, packet Q2 will remain in the delay line 72 and Qi is discarded (i.e. transferred to charge sink 76). Alternatively, if Qi is smaller than Q2, the transfer gate 78b is "on” and the gate 78a is “off”. Under these conditions, Qi is retained and Q2 is discarded (i.e. transferred to charge sink 76).
  • the retained smaller charge packet is shifted under gate 72a and a new charge packet Q3 is loaded under gate 72b.
  • the whole procedure repeats until the last charge packet in the succession passes through network 70.
  • the digital counter 80 is updated as required to identify the source (i.e. time of arrival) of the stored packet. Whenever the flip-flop 74 output A signals a new charge packet is smaller than the currently stored minimum, the number in the counter 80 updates a buffer 82 which saves the address of the minimum charge packet.
  • network 70 not only selects the smallest charge packet, but also records its location.

Abstract

A full search block matching algorithm includes a charge-domain serial tapped delay line as an input buffer, and an array of charge domain signal processors. The delay line shifts and holds analog sampled data which are in the form of charge packets. At each stage of delay, the signals are nondestructively sensed and coupled to a corresponding signal processor, and the sampled data are transferred and subsequently processed in parallel. The processed data from all the processors can be read out either in a parallel or serial format through a parallel-in-serial-out output buffer. In this structure, only the serial input buffer has to be clocked at the system throughout rate; the internal clock rate of each processor is reduced by the number of parallel processors. Within each processor, all of the computation functions are performed in the charge domain, and local charge domain memories are included for storing the processed signal.

Description

CHARGE DOMAIN BLOCK MATCHING PROCESSOR
BACKGROUND OF THE INVENTION
The present invention is in the field of integrated circuit networks and more particularly directed to charge domain processing networks.
The need for transmitting data representative of an image through a relatively low data-rate channel is the driving force for the development of video bandwidth compression techniques. There are two basic classes of video compression algorithms: intraframe and interframe coding. Intraframe coders are effective to remove spatial redundancy from within respective video frames. Interframe coders generally use some form of predictive coding based on the information in a previous frame to remove redundancy between frames. Tremendous advantages are gained by incorporating motion detection and compensation (MDC) in an interframe coder. This is due to the fact that, in an area where motion is occurring, a better prediction can be formed based on the direction of motion. Transform image coding based on the discrete cosine transform (DCT) algorithm has been shown to provide good-quality, low-bit-rate image transmission. A hybrid interframe/intraframe coder offers the potential for achieving a higher bit-rate reduction. Devices implementing such techniques are particularly advantageous, for example, in high-definition television (HDTV) receivers capable of accommodating different transmission formats and different field rates, and for low-bit-rate video communications in general. In video receivers, the use of MDC permits implementation of efficient interpolation algorithms to reconstruct missing fields following the use of temporal subsampling to reduce transmission bandwidth.
In automatic target tracking applications of video systems, it is important to detect the displacement of objects between pairs of successive image frames. Such signal processing generally requires a real-time motion detection and estimation (MDE) device for optimum performance.
The development of improved television systems is presently being limited by the use of different field rate standards. At present, there are three international television standards: the National Television System Committee (NTSC) , the Phase Alternating Line (PAL) and the Sequential Couleur a Memoire (SECAM) . The problem of frame conversion between different standards arises when there is no integral relationship between the two field rates; it is necessary to construct pictures for intermediate times. Linear interpolation without motion compensation would degrade the picture quality. While motion-compensated transcoding between PAL and NTSC has become technically quite successful, the transcoders are very expensive because temporal interpolation is required. The luminance resolution of the future HDTV is at least four times, better than the NTSC system; to transcode such a high resolution video between different standards requires a temporal interpolation processor capable of performing more than 16 billion operations/s. The availability of a low-cost field-rate converter would considerably simplify the trade-offs between different transmission formats.
An important aspect of most efficient MDC and MDE techniques is the applying a full search block matching algorithm (BMA) to an array of signal elements representative of an image frame. The BMA effectively passes a subarray, or template, of data values over the larger full array and identifies the particular location of the subarray which provides the highest spatial correlation (i.e. best match) of the subarray to the underlying array. In typical applications, the full array may be a 31x31 pixel rectangular array, and the template may be a 16x16 pixel rectangular array. By way of example, the computational requirement for motion compensation coding with a 16x16 full-search BMA is 3.8 billion computations/s for present NTSC video, and is expected to be on the order of 16 billion computations/s for future HDTV signals. For a digital HDTV transmitter and receiver, this computation rate requirement is near the upper limit of present digital technology, and consequently would result in highly complex and correspondingly costly equipment. Accordingly, it is an object of the present invention to provide an improved charge domain processing network.-
Another object is to provide a charge domain full search block matching algorithm processing system.
SUMMARY OF THE INVENTION
The present invention is a full search block matching charge domain processor for determining the best matching mxn element subarray of an MxN element array with an mxn element template, where m <. M and n <_ N.
The processor indicates a tapped delay line, a template network, m groups of n absolute magnitude difference (AMD) networks, m groups of n summing networks, m accumulator networks, a column accumulator network, and .a column minimum difference network. In this preferred form, all of these elements are charge domain devices, but one or more may have other forms.
The charge domain tapped delay line includes at least (mxn) + (m-l)x(N-n) stages and has a serial input channel and m groups of n parallel output channels. Each of the n output channels in each group is coupled to an associated one of n successive stages in a corresponding group of stages in the delay line, and each of the groups of n successive stages in the delay line is separated by N-n successive stages in the delay line.
The delay line is adapted for establishing a succession of at least mxn charge packets therein in response to a succession of applied input signals, where each of the packets has a magnitude corresponding to an associated one of a corresponding succession of elements of the array. The charge packets may be selectively shifted from stage-to-stage along the delay line at an input rate.
Each of N floating gate sensing electrodes overlies one of the stages and is adapted to provide a potential thereon representative of the magnitude of a charge packet currently within its underlying stage, where the potential is a search signal associated with the electrode.
The template network includes a charge storage region and is adapted for establishing a succession of mxn charge packets in that storage region at the input rate, where each of the packets has a magnitude corresponding to an associated element in the template. A floating gate sensing electrode overlies the storage region and is adapted to provide a potential thereon representative of the magnitude of a charge packet currently underlying the electrode, where the potential is a template signal. Each AMD network of the m groups of n charge domain AMD networks, has a template input port, a search input port, and a charge domain output port. Each of the n AMD networks of each of the groups has its search input port coupled to an associated sensing electrode of an associated group of the output channels of the delay line, and has its template input port coupled to the sensing electrode of the template network. Each of the AMD networks provides at its output port a succession of charge packets at the input rate, where each of the charge packets is representative of the absolute magnitude of the difference between the current one of the search signals at its search input port and the current one of the template signals at the template input port.
Each of the summing networks is associated with an AMD network in an associated one of the m groups of AMD networks, and each provides at an output thereof a succession of output charge packets at a summing rate equal to l/(mxn) times the input rate. Each of these output charge packet has a magnitude representative of the siirn of the magnitude of mxn successive charge packets provided at the output port of its associated AMD network.
Each row accumulator network is associated with one of the m groups of n AMD networks, and substantially simultaneously receives the output charge packets of its associated n summing networks at distinct locations in the row accumulator network. The received charge packets are transferred to establish a succession of the output charge packets at an output port thereof at an accumulator rate equal to 1/m times the input rate.
The column accumulator network is associated with the m row accumulator networks. That network substantially simultaneously receives the succession of output charge packets of said m row accumulator networks at distinct locations in the column accumulator network, and transfers those received charge packets to establish a succession of output charge packets at an output port thereof at a rate equal to the input rate.
The charge domain column minimum difference network is associated with the column accumulator network, and is adapted to store a charge packet therein. The succession of said output charge packets is received one at a time at the input rate, and at the receipt of each of the charge packets, retains and stores only the smaller magnitude one of the just received charge packet and the next previously stored charge packet.
The magnitude of the charge packet stored at the column MD network is representative of the best matching mxn subarray of the elements that have been represented by charge packets in the delay line. In an alternative form, separate row minimum difference (MD) networks may be associated with each of the M row accumulator networks and be adapted to store a charge packet therein. Each row MD network receives the succession of output charge packets one at a time at the accumulator rate, and at the receipt of each charge packet, retains and stores only the smaller magnitude one of the just received charge packet and a previously stored charge packet.
In the latter form of the invention, the output from the respective row MD networks are received at the distinct locations in the column accumulator network. Upon receipt, those charge packets are transferred in succession to the output part of that accumulator at the accumulator (i.e. relatively low) clock rate. The column MD network operates as described above to identify the best match.
BRIEF DESCRIPTION OF THE DRAWINGS
The foregoing and other, objects of this invention, the various features thereof, as well as the invention itself, may be more fully understood from the following description, when read together with the accompanying drawings in which:
FIG. 1 shows in schematic block diagram form, a charge domain processing system embodying the present invention; FIG. 1A shows in schematic block diagram from, an alternative charge domain processing system embodying the invention;
FIG. 2 shows in schematic block diagram form an absolute magnitude difference minimum detection network of the system of FIG. 1; and
FIG. 3 shows in schematic block diagram form, a minimum detection network of the system of FIG. 1.
DESCRIPTION OF THE PREFERRED EMBODIMENT
The preferred embodiment of the invention is a very large scale integration (VLSI) implementation of a motion detection and estimation video processor. Motion estimation and compensation techniques have a wide range of applications in the video signal processing area, including video signal coding in order to remove redundancy along the temporal domain, and therefore, to reduce transmission bit-rates. In a motion compensation (MC) predictive algorithm, the displacement of objects is estimated from one field, or frame, to the next. A prediction is then formed in the direction of the motion. This process gives lower prediction errors than does a standard interframe prediction process, i.e., a process with no motion compensation. MC systems have been used to transmit tele ision broadcast signals and to code video conferencing signals. In a block-match MC system, the current array of pixels, or frames, in a video sequence is segmented into subarrays, or blocks. Each block is compared to blocks in the previous frame within a search area to detect the displacement or the motion of objects. According to a matching criterion, such as the mean of the absolute difference (MAD) or the mean-square-error, a motion vector is extracted for each block. With the motion-vector information, a receiver can reconstruct the current frame using the available previous frame data and the coded frame difference (as determined by the best matched block). Among several possible searching algorithms, full search, which searches all possible blocks within the tracking range in the previous frame, gives the most accurate estimation of the motion vector. However, the full-search block-matching method requires a large number of computations. For example, for a given 16xl6-pixel block Xm,n in the present frame, and a 31x31-pixel search area Ym+l,n+j in the previous frame, the displacement vector (i,j) is determined by the minimum of the distortion function D:
16 16 D(i,j) = I ∑d(Xm,n *-m+ι,n+3 ) m=ln=l
where d(x,y) = | x-y | (MAD) for every possible displacement i and j . For a frame size of 1024x1024 pixels and a frame rate of 30 frames/s, the search requires about 16 billion operations and 8 billion data accesses per second. The preferred form of the present invention implements the full search BMA. That embodiment includes a charge-domain serial tapped delay line as an input buffer, and an array of charge domain signal processors. The delay line shifts and holds analog sampled data which are in the form of charge packets. At each stage of delay, the signals are nondestructively sensed and coupled to a corresponding signal processor, and the sampled data are transferred and subsequently processed in parallel. The processed data from all the processors can be read out either in a parallel or serial format through a parallel-in-serial-out output buffer. In this structure, only the serial input I buffer has to be clocked at the system throughput rate; the internal clock rate of each processor is reduced by the number of parallel processors. Within each processor, all of the computation functions are performed in the charge domain, and local charge domain memories are included for storing the processed signal.
The preferred embodiment, charge domain full search BMA processing system 10, is shown in schematic block diagram form in FIG. 1. System 10 is adapted to determine the best matching mxn element subarray of an MxN element array with an mxn template, where m^.M and n_N. In the illustrated embodiment, m=n=16 and M*=N=31. The system 10 includes a charge domain tapped delay line 12, a template network 20, m(=16) groups (denoted GRP1 through GRP16 in FIG. 1) of n(=16) charge domain absolute magnitude difference (AMD) networks (denoted AMD1 through AMD16 in each of the groups) , 16 groups (denoted GRP1 through GRP16 in FIG. 1) of 16 charge domain summing networks (denoted Jl through ∑16 in each of the groups), 16 charge domain row accumulator networks (denoted ACC1 through ACC16), 16 charge domain row minimum detector networks (denoted MD1 through MD16), a charge domain column accumulator network 30, a charge domain column minimum detector network 35, and a BMA controller 40.
The tapped delay line 12 has the same general configuration as the tapped delay line disclosed in U.S. Patent No. 4,464,726, and generally includes at least (mxn) + (m-l)x(N-n) stages. A serial input is provided at line 12a for receiving a succession of applied values representative of the MxN elements of the current frame array. Those values are clocked through the delay line 12 at an input rate R0 (in response to an input clock signal generated by controller 40) .
Delay line 12 has m(=16) groups of n(=16) parallel output channels, each group of channels being associated with a corresponding group of successive stages of line 12, where each group is separated by N-n successive stages in the delay line 12. With this configuration, as the input data is clocked through delay line 12, the values (charge packets) at the output channels of line 12 correspond to that which would result from sliding an mxn window (or patch) over the MxN data array, as in the configurations disclosed in U.S. Patent No. 4,313,159. The respective output channels from delay line 12 establish charge packets under the search input of an associated AMD network.
As the input data is passed through the delay line 12 at rate R0, the template network 20 generates at that same rate, a succession of template values corresponding to the values of the mxn subarray (or block) from the previous frame. Those values establish a succession of charge packets under the template input in each of the AMD networks in groups 1 through 16.
During each clock period of the R0 input clock signal, each AMD network provides at its output port a charge packet representative of the absolute magnitude of the difference of the magnitudes of the charge packets at the search input and the template input. Each of these charge packets is applied to an associated one of the summing networks.
For a period corresponding to mxn (=256) input clock periods, the charge packets are accumulated in a summing network associated with each AMD network to provide a composite charge packet at its output having a magnitude substantially equal to the sum of the magnitudes of the charge packets provided by the AMD network during those mxn input clock periods. Each summing network in each group applies its output charge packet to an associated one of the m(=16) charge domain row accumulator networks, at correspondingly positioned locations along the accumulator. Under the control of a row accumulator clock signal (at a "row accumulator rate" equal to 1/m times the input rate R0) from controller 40, the received charge packets in all of the row accumulators are stepped together through the accumulators so that a succession of those charge packets are provided at the output ports of those accumulators.
The row minimum detector networks, MD1 through MD16, each operate to receive the succession of charge packets from its associated row accumulator network. Upon receipt of each charge packet, a row MD network compares the magnitude of that just received packet with a stored charge packet from the previous accumulator clock period, and stores the lesser-magnitude charge packet and dispenses with the greater-magnitude charge packet. This process repeats for n(=16) accumulator clock periods so that at the end of that process, each MD network has stored the minimum magnitude charge packet received from all of the summing networks of its associated group.
At the end of each set of n(=16) accumulator clock periods, the charge packets stored in each of the row MD networks are transferred to correspondingly positioned locations in the column accumulator 30, and then those charge packets are stepped together at the accumulator clock rate (1/m x R0) through accumulator 30 and applied by way of an output port 30a to the column minimum detection network 35.
The column minimum detector network 35 is similar to each of the row minimum detector networks. That detector 35, after m(=16) input clock periods, stores or retains only the minimum magnitude one of charge packets that had been loaded into column accumulator 30. This stored charge packet is representative of the best match of all of the positions of the mxn subarray over the full MxN array.
During the above-described operation, the controller 40 provides the appropriate clock signals to accomplish the various charge transfer operations. In addition, the controller 40 tracks the various charge packets identified by the respective row and column minimum detection networks so that "position" of the mxn subarray is known for the particular charge packet that is ultimately determined to be the best match (i.e. the packet stored at minimum detection network 35) .
An alternative system 10' embodying the invention is shown is FIG. 1A, where elements corresponding to elements in the embodiment of FIG. 1 are denoted by the same reference numerals. The system 10' is substantially the same as the system 10, except that there are no elements corresponding to minimum detection (MD) networks MD1-MD16. In system 10, the output charge packets from row accumulators ACC1-ACC16 are loaded directly into their respective locations in column accumulator 30. The stored packets in the column accumulator 30 are ' then clocked through the accumulator 30 at the input rate R0 to the column minimum detector network 35. Thus, the system 10' requires a high clock rate (R0) column accumulator 30 and minimum detector network (compared to the system 10 which utilizes a low clock rate (1/rn x R0) column accumulator 30 and minimum detector network 35), but it eliminates the need for the row minimum detectors networks (corresponding to MD1-MD16 in system 10), i.e. only a single minimum detector network 35 is required.
By way of example, a preferred form of a charge domain absolute magnitude difference network is shown schematically in FIG. 2. The illustrated network, denoted by reference designation 50, is a two input channel device. Network 50 includes a U-shaped charge transport medium 52 having one of charge injection diodes 54 and 56 positioned at each end. A first pair of gate electrodes 60 and 62 overlies a first leg of the U-shaped medium 52 with gate 60 being between diode 54 and gate 62, and a second pair of gate electrodes 64 and 66 overlies the second leg of the medium 52 with gate 64 being between diodes 56 and gate 66. Gates 60 and 66 are electrically coupled together and also to template network 20, thereby coupling a template signal to device 50. Gates 62 and 64 are electrically coupled together and also to the associated one of the output channels from tapped delay line 12, thereby coupling a search signal to device 50.
With this configuration, a charge packet is provided at the output portion 52a of medium 52, which packet has a magnitude proportional to the absolute magnitude of the difference between the template signal at gates 60 and 66 and the search signal at gates 62 and 64.
Also, by way of example, a preferred form of a charge domain minimum detection network is shown schematically in FIG. 3. The illustrated network, denoted by reference designation 70, can detect and store the smallest of a succession of applied charge packets at a clock rate. At each clock period, the network 70 examines a previously stored charge packet and a currently applied charge packet, and detects and stores the smaller packet, and discards the larger packet. An associated buffer 82 and digital counter 80 keep track of the identity (i.e. time of receipt) of the stored packet.
The network 70 includes a two stage CCD tapped delay line 72 having floating gate sensing electrodes 72a and 72b, an FET flip-flop 74, a charge sink 76, and transfer gates 78a and 78b coupled to an associated one of sensing electrodes 72a and 72b. Each of electrodes 72a and 72b is coupled by way of an associated one of FET's 74a and 74b to an associated input of the flip-flop 74. The outputs A and B of flip-flop 74 are coupled to associated ones of charge domain transfer gates 78a and 78b. The output A of flip-flop 74 is coupled to the counter 80 to update the address value for the charge packet currently stored in charge sink 76.
In the operation of network 70, packets of charge Qi and Q2 are loaded into the tapped delay line 72 under gates 72a and 72b, respectively. If Qi is larger than Q2, output A of the flip-flop 74 is high, the transfer gate 78a is "on" and the transfer gate 78b is "off". As a result, packet Q2 will remain in the delay line 72 and Qi is discarded (i.e. transferred to charge sink 76). Alternatively, if Qi is smaller than Q2, the transfer gate 78b is "on" and the gate 78a is "off". Under these conditions, Qi is retained and Q2 is discarded (i.e. transferred to charge sink 76). In a serial mode, the retained smaller charge packet is shifted under gate 72a and a new charge packet Q3 is loaded under gate 72b. The whole procedure repeats until the last charge packet in the succession passes through network 70. The digital counter 80 is updated as required to identify the source (i.e. time of arrival) of the stored packet. Whenever the flip-flop 74 output A signals a new charge packet is smaller than the currently stored minimum, the number in the counter 80 updates a buffer 82 which saves the address of the minimum charge packet. Thus, network 70 not only selects the smallest charge packet, but also records its location.
The invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.
What is claimed is:

Claims

Claims :
1. A full search block matching processor for determining the best matching mxn element subarray of an MxN element array with an mxn element template, where m <_ M and n <. N, comprising:
A. a charge domain tapped delay line including:
i. at least (mxn) + (m-l)x(N-n) stages and having a serial input channel and m groups of n parallel output channels, each of said n output channels in each group being coupled to an associated one of n successive stages in a corresponding group of stages in said delay line, wherein each of said groups of n successive stages in said delay line are separated by N-n successive stages in said delay line,
ii. means for establishing a succession of at least mxn charge packets in said delay line in response to a succession of applied input signals, each of said packets having a magnitude corresponding to an associated one of a corresponding succession of elements of said array,
iii. means for shifting said charge packets from stage-to-stage along said delay line at an input rate, and iv. N floating gate sensing electrodes, each of said electrodes overlying one of said stages and being adapted to provide a potential thereon representative of the magnitude of a charge packet currently within its underlying stage, said potential being a search signal associated with said electrode,
10
B. a template network including:
i. a charge storage region and an associated means for establishing a
15 succession of mxn charge packets in said storage region at said input rate, each of said packets having a magnitude corresponding to an associated element in said template,
20 ii. a floating gate sensing electrode overlying said storage region and being adapted to provide a potential thereon representative of the magnitude of a
25 charge packet currently underlying said electrode, said potential being a template signal,
C. m groups of n charge domain absolute magnitude difference. (AMD) networks, each of said AMD networks:
i. having a template input port, a search input port, and a charge domain output port, wherein each of the n AMD networks of each of said groups has its search input port coupled to an associated sensing electrode of an associated group of said output channels of said delay line, and has its template input port coupled to said sensing electrode of said template network,
ii. each of said AMD networks includes means for providing at its output port a succession of charge packets at said input rate, each of said charge packets being representative of the absolute magnitude of the difference between the current one of said search signals at its search input port and the current one of said template signals at said template input port,
D. m groups of n charge domain summing networks, each of said summing networks:
i. being associated with an AMD network in an associated one of said m groups of AMD networks, and ii. including means for providing at an output thereof a succession of output charge packets at a summing rate equal to l/(mxn) times said input rate, each of said output charge packets having a magnitude representative of the sum of the magnitude of mxn successive charge packets provided at said output port of said associated AMD network.
10
E. m charge domain row accumulator networks, each row accumulator network:
i. being associated with one of said m 15 groups of n AMD networks, and
ii. including means for substantially simultaneously receiving said output charge packets of its associated n 20 summing networks at distinct locations in said row accumulator network, and
iii. means for transferring said received charge packets to establish a 25 succession of said output charge packets at an output port thereof at an accumulator rate equal to 1/m times said input rate.
F. m charge domain row minimum difference (MD) networks, each row MD network:
i. being associated with one of said m row accumulator networks, and
ii. including means for storing a charge packet therein, and
iii. including means for receiving said succession of said output charge packets one at a time at said accumulator rate, and means operative at the receipt of each of said charge packets for retaining and storing only the smaller magnitude one of said just received charge packet and a previously stored charge packet,
G. a charge domain column accumulator network, said column accumulator network:
i. being associated with said m row MD networks, and
ii. including means for substantially simultaneously receiving said stored charge packets of said m row MD networks at distinct locations in said column accumulator network, and iii. means for transferring said received charge packets to establish a succession of output charge packets at an output port thereof at a rate equal to said accumulator rate, and
H. a charge domain column minimum difference (MD) network, said column MD network:
i. being associated with one of said column accumulator networks, and
ii. including means for storing a charge packet therein, and
iii. including means for receiving said succession of said output charge packets one at a time at said accumulator rate, and means operative at the receipt of each of said charge packets for retaining and storing only the smaller magnitude one of said just received charge packet and the next previously stored charge packet.
whereby the magnitude of said charge packet stored at said column MD network is representative of said best matching mxn subarray of said elements that have been represented by charge packets in said delay line.
2. A processor according to claim 1 further comprising:
identifying means for identifying which mxn charge packet of said charge packets established in said delay line generates said stored charge packet in said column MD network.
3. A processor according to claim 2 wherein said identifying means includes:
means for identifying the AMD network which provides the minimum magnitude charge packet of said charge packets provided by its associated n row MD networks for each set of mxn input clock rate pulses, and
means for identifying the row MD network which provides the minimum magnitude charge packet of said charge packets provided by said m row MD networks.
4. A processor according to claim 1 wherein N=31, M=31, m=16 and n=16.
5. A full search block matching processor for determining the best matching mxn element subarray of an MxN element array with an mxn element template, where m M and n <_ N, comprising:
A. a charge domain tapped delay line including:
i. at least (mxn) + (m-l)x(N-n) stages and having a serial input channel and 5 groups of n parallel output channels, each of said n output channels in each group being coupled to an associated one of n successive stages in a corresponding group of stages in said 10 delay line, wherein each of said groups of n successive stages in said delay line are separated by N-n successive stages in said delay line,
15 ii. means for establishing a succession of at least mxn charge packets in said delay line in response to a succession of applied input signals, each of said packets having a magnitude
20 corresponding to an associated one of a corresponding succession of elements of said array,
iii. means for shifting said charge packets 25 from stage-to-stage along said delay line at an input rate, and
iv. N floating gate sensing electrodes, each of said electrodes overlying one of said stages and being adapted to provide a potential thereon representative of the magnitude of a charge packet currently within its underlying stage, said potential being a search signal associated with said electrode,
B. a template network including:
i. a charge storage region and an associated means for establishing a succession of mxn charge packets in said storage region at said input rate, each of said packets having a magnitude corresponding to an associated element in said template,
ii. a floating gate sensing electrode overlying said storage region and being adapted to provide a potential thereon representative of the magnitude of a charge packet currently underlying said electrode, said potential being a template signal,
C. m groups of n charge domain absolute magnitude difference (AMD) networks, each of said AMD networks: i. having a template input port, a search input port, and a charge domain output port, wherein each of the n AMD networks of each of said groups has its search input port coupled to an associated sensing electrode of an associated group of said output channels of said delay line, and has its template input port coupled to said sensing electrode of said template network,
ii. each of said AMD networks includes means for providing at its output port a succession of charge packets at said input rate, each of said charge packets being representative of the absolute magnitude of the difference between the current one of said search signals at its search input port and the current one of said template signals at said template input port,
D. m groups of n charge domain summing networks, each of said summing networks:
i. being associated with an AMD network in an associated one of said m groups of AMD networks, and ii. including means for providing at an output thereof a succession of output charge packets at a summing rate equal to l/(mxn) times said input rate, each of said output charge packets having a magnitude representative of the sum of the magnitude of mxn successive charge packets provided at said output port of said associated AMD network.
10
E. m charge domain row accumulator networks, each row accumulator network:
i. being associated with one of said m 15 groups of n AMD networks, and
ii. including means for substantially simultaneously receiving said output charge packets of its associated n 20 summing networks at distinct locations in said row accumulator network, and
iii. means for transferring said received charge packets to establish a 25 succession of said output charge packets at an output port thereof at an accumulator rate equal to 1/m times said input rate.
F. a charge domain column accumulator network, said column accumulator network:
i. being associated with said m row accumulator networks, and
ii. including means for substantially simultaneously receiving said succession of output charge packets of said m row accumulator networks at distinct locations in said column accumulator network, and
iii. means for transferring said received charge packets to establish a succession of output charge packets at an output port thereof at a rate equal to said input rate, and
G. a charge domain column minimum difference (MD) network, said column MD network:
i. being associated with said column accumulator network, and
ii. including means for storing a charge packet therein, and
iii. including means for receiving said succession of said output charge packets one at a time at said input rate, and means operative at the receipt of each of said charge packets for retaining and storing only the smaller magnitude one of said just received charge packet and the next previously stored charge packet,
whereby the magnitude of said charge packet stored at said column MD network is representative of said best matching mxn subarray of said elements that have been represented by charge packets in said delay line.
6. A processor according to claim 5 further comprising:
identifying means for identifying which mxn charge packet of said charge packets established in said delay line generates said stored charge packet in said column MD network.
7. A processor according to claim 6 wherein said identifying means includes:
means for identifying the AMD network which provides the minimum magnitude charge packet of said charge packets provided by its associated n row MD networks for each set of mxn input clock rate pulses, and means for identifying the row MD network which provides the minimum magnitude charge packet of said charge packets provided by said m row MD networks.
8. A processor according to claim 5 wherein N=31, M=31, m=16 and n»16.
PCT/US1991/004790 1990-07-11 1991-07-08 Charge domain block matching processor WO1992001267A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US07/551,947 US5030953A (en) 1990-07-11 1990-07-11 Charge domain block matching processor
US551,947 1990-07-11

Publications (1)

Publication Number Publication Date
WO1992001267A1 true WO1992001267A1 (en) 1992-01-23

Family

ID=24203320

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1991/004790 WO1992001267A1 (en) 1990-07-11 1991-07-08 Charge domain block matching processor

Country Status (2)

Country Link
US (1) US5030953A (en)
WO (1) WO1992001267A1 (en)

Families Citing this family (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0385884A (en) * 1989-08-29 1991-04-11 Sony Corp Image movement detecting circuit
JPH04139986A (en) * 1990-09-29 1992-05-13 Victor Co Of Japan Ltd Coding/decoding device for movement compensation prediction of picture signal
US5337085A (en) * 1992-04-10 1994-08-09 Comsat Corporation Coding technique for high definition television signals
US5659315A (en) * 1992-05-19 1997-08-19 Mandl; William J. Method and apparatus for multiplexed oversampled analog to digital modulation
US5248971A (en) * 1992-05-19 1993-09-28 Mandl William J Method and apparatus for multiplexed oversampled analog to digital modulation
US5461423A (en) * 1992-05-29 1995-10-24 Sony Corporation Apparatus for generating a motion vector with half-pixel precision for use in compressing a digital motion picture signal
JPH05336514A (en) * 1992-05-29 1993-12-17 Sony Corp Image encoding device
KR0160618B1 (en) * 1992-10-27 1999-01-15 윤종용 Device and method for real time moving estimation
DE69327040T2 (en) * 1993-02-22 2000-04-13 Ind Tech Res Inst Block matching architecture with multiple modules
KR950014343B1 (en) * 1993-05-20 1995-11-24 한국방송공사 Method of and apparatus for motion estimation of video data
EP0652676A1 (en) * 1993-11-08 1995-05-10 Sony Corporation Apparatus and method for compressing a digital motion picture signal
US5736757A (en) * 1994-07-07 1998-04-07 Massachusetts Institute Of Technology Charge-domain generation and replication devices
US5579007A (en) * 1994-07-07 1996-11-26 Massachusetts Institute Of Technology Charge-to-digital converter
US5706059A (en) * 1994-11-30 1998-01-06 National Semiconductor Corp. Motion estimation using a hierarchical search
US5957846A (en) * 1995-06-29 1999-09-28 Teratech Corporation Portable ultrasound imaging system
US7500952B1 (en) 1995-06-29 2009-03-10 Teratech Corporation Portable ultrasound imaging system
US5590658A (en) * 1995-06-29 1997-01-07 Teratech Corporation Portable ultrasound imaging system
US5964709A (en) * 1995-06-29 1999-10-12 Teratech Corporation Portable ultrasound imaging system
US8241217B2 (en) 1995-06-29 2012-08-14 Teratech Corporation Portable ultrasound imaging data
US5973742A (en) * 1996-05-24 1999-10-26 Lsi Logic Corporation System and method for performing motion estimation with reduced memory loading latency
US5652625A (en) * 1996-06-19 1997-07-29 United Microelectronics Corp. Multi-stage pipeline architecture for motion estimation block matching
US5990852A (en) * 1996-10-31 1999-11-23 Fujitsu Limited Display screen duplication system and method
US6111816A (en) * 1997-02-03 2000-08-29 Teratech Corporation Multi-dimensional beamforming device
US6721235B2 (en) 1997-02-03 2004-04-13 Teratech Corporation Steerable beamforming system
US6292433B1 (en) 1997-02-03 2001-09-18 Teratech Corporation Multi-dimensional beamforming device
US6842401B2 (en) 2000-04-06 2005-01-11 Teratech Corporation Sonar beamforming system
JP2003265468A (en) * 2002-03-19 2003-09-24 Ge Medical Systems Global Technology Co Llc Diagnosis information generating device and ultrasonograph
US20050096545A1 (en) * 2003-10-30 2005-05-05 Haider Bruno H. Methods and apparatus for transducer probe
US7527591B2 (en) * 2003-11-21 2009-05-05 General Electric Company Ultrasound probe distributed beamformer
US20050113698A1 (en) * 2003-11-21 2005-05-26 Kjell Kristoffersen Ultrasound probe transceiver circuitry
US7527592B2 (en) * 2003-11-21 2009-05-05 General Electric Company Ultrasound probe sub-aperture processing
US7015854B2 (en) * 2004-07-12 2006-03-21 Massachusetts Institute Of Technology Charge-domain A/D converter employing multiple pipelines for improved precision
US6972707B1 (en) * 2004-07-12 2005-12-06 Massachusetts Institute Of Technology Sub-ranging pipelined charge-domain analog-to-digital converter with improved resolution and reduced power consumption
US7768528B1 (en) * 2005-11-09 2010-08-03 Image Metrics Limited Replacement of faces in existing video
GB2439119B (en) * 2006-06-12 2011-04-20 Tandberg Television Asa Motion estimator
US20100228130A1 (en) * 2009-03-09 2010-09-09 Teratech Corporation Portable ultrasound imaging system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4464726A (en) * 1981-09-08 1984-08-07 Massachusetts Institute Of Technology Charge domain parallel processing network
US4547864A (en) * 1982-01-07 1985-10-15 Canon Kabushiki Kaisha Correlation detecting device
US4833636A (en) * 1987-06-19 1989-05-23 Fuji Photo Film Co., Ltd. Analog, two signal correlator

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4313159A (en) * 1979-02-21 1982-01-26 Massachusetts Institute Of Technology Data storage and access apparatus
US4264983A (en) * 1979-03-19 1981-04-28 Tektronix, Inc. Time-multiplexed CCD transversal filter
US4354257A (en) * 1980-05-23 1982-10-12 Fairchild Camera And Instrument Corporation Sense amplifier for CCD memory
US4514821A (en) * 1982-09-27 1985-04-30 Rca Corporation Electro-optical CCD transversal filter with analog reference weights
FR2560723B1 (en) * 1984-03-05 1989-02-24 Thomson Csf NON-RECURSIBLE ANALOG INTEGRATER
FR2566162B1 (en) * 1984-06-13 1986-08-29 Thomson Csf ANALOG IMAGE MEMORY DEVICE USING LOAD TRANSFER
US4625293A (en) * 1984-12-07 1986-11-25 The United States Of America As Represented By The Secretary Of The Air Force Parallel programmable charge domain device
US4922452A (en) * 1987-11-16 1990-05-01 Analytek, Ltd. 10 Gigasample/sec two-stage analog storage integrated circuit for transient digitizing and imaging oscillography

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4464726A (en) * 1981-09-08 1984-08-07 Massachusetts Institute Of Technology Charge domain parallel processing network
US4547864A (en) * 1982-01-07 1985-10-15 Canon Kabushiki Kaisha Correlation detecting device
US4833636A (en) * 1987-06-19 1989-05-23 Fuji Photo Film Co., Ltd. Analog, two signal correlator

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
IEEE JOURNAL OF SOLID-STATE CIRCUITS. vol. SC-18, no. 6, December 1983, NEW YORK US pages 736 - 744; B. E. BURKE ET AL.: 'A CCD Time-Integrating Correlator ' see page 736, right column, paragraph 3 - page 737, left column, paragraph 1; figures 1,2 *

Also Published As

Publication number Publication date
US5030953A (en) 1991-07-09

Similar Documents

Publication Publication Date Title
US5030953A (en) Charge domain block matching processor
US7362808B2 (en) Device for and method of estimating motion in video encoder
Chen et al. One-dimensional full search motion estimation algorithm for video coding
US5347309A (en) Image coding method and apparatus
US5781249A (en) Full or partial search block matching dependent on candidate vector prediction distortion
US6289052B1 (en) Methods and apparatus for motion estimation using causal templates
US5617144A (en) Image processing system using pixel-by-pixel motion estimation and frame decimation
EP0395440B1 (en) Apparatus for adaptive interframe predictive encoding of video signal
WO2006063191A2 (en) Motion estimation techniques for video encoding
EP1653744A1 (en) Non-integer pixel sharing for video encoding
EP1262073A2 (en) Methods and apparatus for motion estimation using neighboring macroblocks
JPH0879768A (en) Method and device for estimating bidirectional movement
US5717470A (en) Method and apparatus for detecting optimum motion vectors based on a hierarchical motion estimation approach
JPH0799663A (en) Movement-vector detector
EP0721284B1 (en) An image processing system using pixel-by-pixel motion estimation and frame decimation
EP0720356B1 (en) Apparatus for determining motion vectors
JPH08265765A (en) Image coding system and motion compensating device for use therein
US5731851A (en) Method for determining feature points based on hierarchical block searching technique
EP0731612B1 (en) Apparatus for encoding a video signal using search grids for motion estimation and compensation
US20090080527A1 (en) Method and Apparatus for Providing a Fast Motion Estimation Process
JPH09284777A (en) Video coding method and device using motion compensation without motion vector
JP2624013B2 (en) Image coding method and apparatus
KR100203638B1 (en) Method for estimating motion using half-pixel by half-pixel
Kim et al. Improved block-matching algorithm considering edge components
KR100213276B1 (en) Three step searching circuit of image processor

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FR GB GR IT LU NL SE