US8576912B2 - Encoding apparatus and decoding apparatus - Google Patents

Encoding apparatus and decoding apparatus Download PDF

Info

Publication number
US8576912B2
US8576912B2 US12/458,729 US45872909A US8576912B2 US 8576912 B2 US8576912 B2 US 8576912B2 US 45872909 A US45872909 A US 45872909A US 8576912 B2 US8576912 B2 US 8576912B2
Authority
US
United States
Prior art keywords
vector
direct
mvl
pixel
direct vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related, expires
Application number
US12/458,729
Other versions
US20100054335A1 (en
Inventor
Akihiro Yamori
Satoshi Shimada
Akira Nakagawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd filed Critical Fujitsu Ltd
Assigned to FUJITSU LIMITED reassignment FUJITSU LIMITED ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: NAKAGAWA, AKIRA, SHIMADA, SATOSHI, YAMORI, AKIHIRO
Publication of US20100054335A1 publication Critical patent/US20100054335A1/en
Priority to US13/672,056 priority Critical patent/US8761256B2/en
Application granted granted Critical
Publication of US8576912B2 publication Critical patent/US8576912B2/en
Priority to US14/202,426 priority patent/US8971409B2/en
Expired - Fee Related legal-status Critical Current
Adjusted expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/16Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter for a given display mode, e.g. for interlaced or progressive display mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • H04N19/52Processing of motion vectors by encoding by predictive encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/523Motion estimation or motion compensation with sub-pixel accuracy
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/573Motion compensation with multiple frame prediction using two or more reference frames in a given prediction direction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties
    • H04N19/14Coding unit complexity, e.g. amount of activity or edge presence estimation

Definitions

  • the embodiments discussed herein are directed to an encoding apparatus and a decoding apparatus.
  • H.264 and Moving Picture Experts Group-4 (MPEG-4) Part 10 Advanced Video Coding have been standardized as methods for encoding moving image data, and brought to attention. Because the H.264 and the MPEG-4 Part 10 AVC are technologically the same, they will be referred to as H.264/AVC in the following.
  • the H.264/AVC has five encoding modes including an intra prediction, a forward prediction, a backward prediction, a bidirectional prediction, and a direct mode, as encoding modes for a bi-directional predictive picture (B-picture).
  • the direct mode is a newly added mode, and is a method to determine a motion vector of a current macroblock from motion vectors of temporally or spatially adjacent macroblocks, by focusing on the continuity of the moving image data.
  • FIG. 9 is a schematic of a direct vector (frame structure).
  • the temporal direct mode is simply referred to as direct mode.
  • a motion vector of a macroblock included in a picture processed immediately before and placed at the same position as the current macroblock is selected as a reference vector, and a motion vector of the current macroblock (hereinafter, referred to as direct vector) is determined by temporally scaling the selected reference vector.
  • direct vector a motion vector of the current macroblock
  • a reference picture in the past direction is called List 0
  • a reference picture in the future direction is called List 1
  • a direct vector in the forward direction (mvL 0 ) and a direct vector in the backward direction (mvL 1 ) of the macroblock on the current B picture (CurrPic) are determined.
  • the direct mode is performed based on the following principle. Assume that a macroblock placed at the same position as that on the reference picture in the future direction (colPic) has predicted and indicated a region on the reference picture in the past direction, by a reference vector (mvCol). In this case, it is likely to consider that a certain object included in the reference picture in the future direction (colPic) moved from the reference picture in the past direction (refPicCol) along a vector, which is the reference vector (mvCol), in space time. If this is so, the object also passes through the current picture (CurrPic) interposed between the reference picture in the future direction (colPic) and the reference picture in the past direction (refPicCol), along the reference vector (mvCol).
  • the macroblock on the current picture is predicted from the reference picture in the future direction (colPic) and the reference picture in the past direction (refPicCol), by using a vector parallel to the reference vector (mvCol) in space time.
  • Formulae for calculating a vector parallel to the reference vector (mvCol) that indicates from the reference picture in the future direction (colPic) to the reference picture in the past direction (refPicCol) in space time, between CurrPic and refPicCol, and between CurrPic and colPic are depicted below.
  • Formulae for calculating a normal direct vector are as follows: mvL 0 is a direct vector from the current picture (CurrPic) to the reference picture in the past direction (refPicCol), and mvL 1 is a direct vector from the current picture (CurrPic) to the reference picture in the future direction (colPic).
  • mvL 0 mvCol ⁇ tb/td (1)
  • mvL 1 mvL 0 ⁇ mvCol (2)
  • td is the time distance from the reference picture in the future direction (colPic) to the reference picture in the past direction (refPicCol)
  • tb is the time distance from the current picture (CurrPic) to the reference picture in the past direction (refPicCol).
  • the direct vectors (mvL 0 and mvL 1 ) determined here are calculated on assumption that the picture has a frame structure.
  • the vector in the forward direction and the vector in the backward direction are used as examples for conveniently explaining the direct vector.
  • the vectors mvL 0 and mvL 1 are not fixed to either of the forward direction or the backward direction, in the H.264 and MPEG-4 Part 10 AVC. Accordingly, the similar calculation can be carried out by using a vector of the combination of the forward direction/forward direction or the backward direction/backward direction.
  • the vectors mvL 0 and mvL 1 used for calculation are referred to as a first vector (direct vector) and a second vector (direct vector).
  • the encoding efficiency is decreased, when a picture having a field structure is encoded in the direct mode.
  • a pixel with opposite parity is referred to, while a picture having a field structure is encoded in the direct mode, an error corresponding to the difference in the parities occurs in the vector. Accordingly, the encoding efficiency is decreased.
  • FIG. 10 is a schematic of a direct vector (field structure).
  • a direct vector field structure
  • pixels in the Bottom_field are shifted in the downward direction by 0.5 pixel across the field, compared with the pixels in the Top_field.
  • the direct vectors mvL 0 and mvL 1 do not become parallel to the reference vector (mvCol) in space time.
  • Such a direct vector is not the maximum likelihood, thereby decreasing the encoding efficiency.
  • an encoding apparatus having a direct mode as a prediction mode includes a reference vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a reference vector in the direct mode, performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained reference vector; and a direct vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a first and a second direct vector by temporally scaling the reference vector corrected by the reference vector correcting unit, performs correction by adding or subtracting a value corresponding to a half pixel to or from values of obtained direct vectors.
  • FIG. 1A is a schematic of an outline of an encoding apparatus and a decoding apparatus according to a first embodiment of the present invention
  • FIG. 1B is a diagram for explaining the outline of the encoding apparatus
  • FIG. 1C is formulae for explaining the outline of the encoding apparatus
  • FIG. 2 is a block diagram of a configuration of the encoding apparatus according to the first embodiment
  • FIG. 3 is a flowchart of a processing procedure performed by the encoding apparatus according to the first embodiment
  • FIG. 4 is a block diagram of a configuration of the decoding apparatus according to the first embodiment
  • FIG. 5 is a flowchart of a processing procedure performed by the decoding apparatus according to the first embodiment
  • FIGS. 6A and 6B are schematics of advantages of the first embodiment
  • FIG. 7 is a schematic of a computer that executes an encoding program
  • FIG. 8 is a schematic of a computer that executes a decoding program
  • FIG. 9 is a schematic of a direct vector (frame structure).
  • FIG. 10 is a schematic of a direct vector (field structure).
  • FIGS. 1A to 1C depict the outlines of the encoding apparatus and the decoding apparatus according to the first embodiment.
  • FIG. 1A if a pixel with opposite parity is referred to, while a picture having a field structure is encoded in the direct mode, an error corresponding to the difference in the parities occurs in the vector.
  • a pixel “2” in a Top_field refers to a pixel “1” in a Top_field
  • the vertical component of a vector is “one pixel” derived from “2” ⁇ “1” (see FIG. 1B ), and a value of the vertical component of the vector is correctly calculated as “ ⁇ 4(1 pixel/0.25 pixel)”.
  • a vertical component of a vector is also “one pixel” derived from “2” ⁇ “1”, and a value of the vertical component of the vector is also calculated as “ ⁇ 4(1 pixel/0.25 pixel)”.
  • the pixels in the Bottom_field are shifted by 0.5 pixel across the field in the downward direction, compared with the pixels in the Top_field. Accordingly, under normal circumstances, the vertical component of the vector should be “1.5 pixel” (see FIG. 1B ), and the value of the vertical component of the vector should be calculated as “ ⁇ 6(1.5 pixel/0.25 pixel)”.
  • the encoding apparatus and the decoding apparatus correct the error.
  • the encoding apparatus and the decoding apparatus according to the first embodiment correct a reference vector and a direct vector, by using three formulae depicted in FIG. 1C .
  • Formula (A) in FIG. 1C is a formula for correcting a reference vector.
  • the encoding apparatus and the decoding apparatus according to the first embodiment if parity of a reference picture in the past direction and a parity of a reference picture in the future direction are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the reference vector.
  • Formula (B) in FIG. 1C is a formula for correcting a first direct vector.
  • the encoding apparatus and the decoding apparatus according to the first embodiment if a parity of a reference picture in the past direction and a parity of a current picture are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the first direct vector.
  • Formula (C) in FIG. 1C is a formula for correcting a second direct vector.
  • the encoding apparatus and the decoding apparatus according to the first embodiment if a parity of a reference picture in the future direction and a parity of the current picture are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the second direct vector.
  • the encoding apparatus and the decoding apparatus perform correction by adding or subtracting an error, which is a half pixel corresponding to the difference in the parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
  • FIG. 2 is a block diagram of the configuration of the encoding apparatus according to the first embodiment.
  • an encoding apparatus 100 includes a frame memory 110 , an orthogonal transformation/quantization unit 111 , an inverse orthogonal transformation/inverse quantization unit 112 , a frame memory 113 , and a motion compensating unit (intra predicting unit) 114 .
  • the encoding apparatus 100 also includes a motion vector detecting unit 115 , a direct vector determining unit 116 , a prediction mode determining unit 117 , and a variable length coding unit 118 . These units are the same as those included in a conventional encoding apparatus.
  • the encoding apparatus 100 according to the first embodiment includes a reference vector correcting unit 120 and a direct vector correcting unit 121 as indicated by the thick frames in FIG. 2 .
  • the encoding apparatus 100 stores an input image into the frame memory 110 and divides the input image into macroblock units.
  • the encoding apparatus 100 performs processing in a macroblock unit.
  • the motion vector detecting unit 115 detects a motion vector of the input image stored in the frame memory 110 , from a reference image stored in the frame memory 113 .
  • the motion compensating unit 114 then generates a prediction image, by using the reference image stored in the frame memory 113 and the motion vector detected by the motion vector detecting unit 115 .
  • the prediction mode determining unit 117 determines a prediction mode capable of efficient encoding, by comparing between an encoding of the prediction image generated by the motion compensating unit 114 and an encoding of the input image. If the prediction mode determining unit 117 determines that an interframe coding is performed, a prediction difference image that is a difference between the input image stored in the frame memory 110 and the prediction image generated by the motion compensating unit 114 is fed into the orthogonal transformation/quantization unit 111 . If the prediction mode determining unit 117 determines that an intraframe coding is performed, the input image stored in the frame memory 110 is directly fed into the orthogonal transformation/quantization unit 111 .
  • the orthogonal transformation/quantization unit 111 orthogonally transforms and quantizes the prediction difference image and feeds into the variable length coding unit 118 .
  • the variable length coding unit 118 encodes the prediction difference image and outputs the image as a bit stream.
  • the prediction difference image orthogonally transformed and quantized by the orthogonal transformation/quantization unit 111 is inversely orthogonally transformed and is inversely quantized by the inverse orthogonal transformation/inverse quantization unit 112 , for the motion compensation prediction performed in the next frame.
  • the prediction difference image is decoded by using the prediction image generated by the motion compensating unit 114 and is stored in the frame memory 113 .
  • the encoding apparatus 100 includes the reference vector correcting unit 120 and the direct vector correcting unit 121 .
  • the reference vector correcting unit 120 and the direct vector correcting unit 121 operate when the input image is a picture having a field structure and is in the temporal direct mode.
  • the reference vector correcting unit 120 when a reference vector is detected by the motion vector detecting unit 115 , determines whether a pixel with opposite parity is referred to. If it is determined that a pixel with opposite parity is referred to, the reference vector correcting unit 120 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the reference vector value being detected. The reference vector correcting unit 120 then transmits the corrected reference vector to the direct vector determining unit 116 .
  • the reference vector correcting unit 120 performs correction by using the following formulae. The calculation is performed based on 0.25 pixel accuracy. The variables will now be described.
  • mvCol_correct corrected value of a reference vector (mvCol)
  • mvL 0 _correct corrected value of a first direct vector (mvL 0 )
  • mvL 1 _correct corrected value of a second direct vector (mvL 1 )
  • the reference vector correcting unit 120 performs correction using Formula (3), by considering the parity of the reference picture in the future direction (colPic), which is a reference source of a reference vector (mvCol), and the parity of a reference picture in the past direction (refPicCol), which is a picture to be referenced.
  • mvCol _correct mvCol+ 2 ⁇ (isBottomFieldrefPic Col ⁇ isBottomField Col ) (3)
  • Formula (3) regardless of parity, it is possible to calculate the correct direction of the reference vector (mvCol) in space time.
  • the reference vector correcting unit 120 transmits the reference vector detected by the motion vector detecting unit 115 to the direct vector determining unit 116 .
  • the direct vector determining unit 116 calculates a direct vector by using the corrected reference vector transmitted from the reference vector correcting unit 120 , and transmits the calculated direct vector to the direct vector correcting unit 121 .
  • the direct vector determining unit 116 performs correction using Formulae (4) and (5).
  • the direct vector determining unit 116 calculates a direct vector parallel to the corrected reference vector (mvCol_correct), by using the corrected reference vector (mvCol_correct) and a distance of each picture in the temporal direction.
  • mvL 0_correct mvCol _correct ⁇ tb/td (4)
  • mvL 1_correct mvL 0_correct ⁇ mvCol _correct (5)
  • the direct vector correcting unit 121 determines whether the direct vector transmitted from the direct vector determining unit 116 is obtained by referring to a pixel with opposite parity. If it is determined that a pixel with opposite parity is referred to, the direct vector correcting unit 121 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the transmitted direct vector. The direct vector correcting unit 121 then transmits the corrected direct vector to the prediction mode determining unit 117 .
  • the direct vector correcting unit 121 performs correction on the first direct vector (mvL 0 ), using Formula (6), by considering the parity between the current picture (CurrPic) and the reference picture in the past direction (refPicCol).
  • the direct vector correcting unit 121 performs correction on the second director vector (mvL 1 ), using Formula (7), by considering the parity between the current picture (CurrPic) and the reference picture in the future direction (colPic).
  • the direct vector correcting unit 121 transmits the direct vector transmitted from the direct vector determining unit 116 to the prediction mode determining unit 117 .
  • FIG. 3 is a flowchart of the processing procedure performed by the encoding apparatus according to the first embodiment.
  • the processing procedure depicted in FIG. 3 is the detailed processing procedure performed when an input image is a picture having a field structure and is in the temporal direct mode.
  • the encoding apparatus 100 determines whether an input image has a field structure and is in the temporal direct mode (Step S 101 ). If the input image does not have a field structure nor is in the temporal direct mode (No at Step S 101 ), the encoding apparatus 100 performs a normal process (Step S 102 ), and proceeds to Step S 106 .
  • the reference vector correcting unit 120 corrects the reference vector by considering the field relationships (Step S 103 ).
  • the direct vector determining unit 116 then calculates a direct vector (Step S 104 ). Subsequently, the direct vector correcting unit 121 corrects the first direct vector and the second direct vector by considering the field relationships (Step S 105 ).
  • the prediction mode determining unit 117 determines a prediction mode (Step S 106 ). Subsequently, the macroblock is encoded by the orthogonal transformation/quantization unit 111 and the variable length coding unit 118 (Step S 107 ).
  • the encoding apparatus 100 determines whether a processing of one picture has finished (Step S 108 ). If the processing of one picture has finished (Yes at Step S 108 ), the encoding apparatus 100 finishes the process. If the processing of one picture is not yet finished (No at Step S 108 ), the encoding apparatus 100 returns to the process in which the reference vector correcting unit 120 corrects a reference vector (Step S 103 ).
  • FIG. 4 is a block diagram of the configuration of the decoding apparatus according to the first embodiment.
  • a decoding apparatus 200 includes a variable length decoding unit 210 , a prediction mode determining unit 211 , and an inverse orthogonal transformation/inverse quantization unit 212 .
  • the decoding apparatus 200 also includes a vector determining unit 213 , a motion compensating unit (intra predicting unit) 214 , a frame memory 215 , and a direct vector determining unit 216 . These units are the same as those included in a conventional decoding apparatus.
  • the decoding apparatus 200 according to the first embodiment includes a reference vector correcting unit 220 and a direct vector correcting unit 221 as indicated by the thick frames in FIG. 4 .
  • the decoding apparatus 200 feeds a bit stream transmitted from the encoding apparatus 100 into the variable length decoding unit 210 .
  • the variable length decoding unit 210 decodes the upper level header information of the input bit stream, and divides into macroblock units.
  • the decoding apparatus 200 performs processing in a macroblock unit.
  • the prediction mode determining unit 211 determines a prediction mode of decoding data in a macroblock unit. If the prediction mode determining unit 211 determines that an interframe coding is performed, the vector determining unit 213 extracts a motion vector from the bit stream. The motion compensating unit 214 then generates a prediction image, by using the motion vector extracted by the vector determining unit 213 and a decoded image stored in the frame memory 215 . The prediction image generated by the motion compensating unit 214 is added to the decoded result of the prediction difference image generated by the inverse orthogonal transformation/inverse quantization unit 212 , and is output as a decoded image.
  • the prediction mode determining unit 211 determines that an intraframe coding is performed, the signal extracted from the bit stream is directly decoded by the inverse orthogonal transformation/inverse quantization unit 212 , and output as a decoded image.
  • the decoded image is stored in the frame memory 215 , for the motion compensation process performed in the next frame.
  • the decoding apparatus 200 includes the reference vector correcting unit 220 and the direct vector correcting unit 221 .
  • the reference vector correcting unit 220 and the direct vector correcting unit 221 operate when the input image is a picture having a field structure and is in the temporal direct mode.
  • the reference vector correcting unit 220 determines whether a pixel with opposite parity is referred to, when a reference vector is determined by the vector determining unit 213 . If it is determined that a pixel with opposite parity is referred to, the reference vector correcting unit 220 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the detected reference vector. The reference vector correcting unit 220 then transmits the corrected reference vector to the direct vector determining unit 216 .
  • the reference vector correcting unit 220 performs correction by using the following formula. The calculation is performed based on 0.25 pixel accuracy. The variables will now be described.
  • mvCol_correct corrected value of a reference vector (mvCol)
  • mvL 0 _correct corrected value of a first direct vector (mvL 0 )
  • mvL 1 _correct corrected value of a second direct vector (mvL 1 )
  • the reference vector correcting unit 220 performs correction using Formula (8), by considering the parity of the reference picture in the future direction (colPic), which is a reference source of the reference vector (mvCol), and the parity of the reference picture in the past direction (refPicCol), which is a picture to be referenced.
  • mvCol _correct mvCol+ 2 ⁇ (isBottomFieldrefPic Col ⁇ isBottomField Col ) (8)
  • Formula (8) regardless of parity, it is possible to calculate the correct direction of the reference vector (mvCol) in space time.
  • the reference vector correcting unit 220 transmits the reference vector determined by the vector determining unit 213 to the direct vector determining unit 216 .
  • the direct vector determining unit 216 calculates a direct vector by using the corrected reference vector transmitted from the reference vector correcting unit 220 , and transmits the calculated direct vector to the direct vector correcting unit 221 .
  • the direct vector determining unit 216 performs correction using Formulae (9) and (10). In other words, the direct vector determining unit 216 calculates a direct vector parallel to the corrected reference vector (mvCol_correct), by using the corrected reference vector (mvCol_correct) and a distance of each picture in the temporal direction.
  • mvL 0_correct mvCol _correct ⁇ tb/td (9)
  • mvL 1_correct mvL 0_correct ⁇ mvCol _correct (10)
  • the direct vector correcting unit 221 determines whether the direct vector transmitted from the direct vector determining unit 216 is obtained by referring to a pixel with opposite parity. If it is determined that a pixel with opposite parity is referred to, the direct vector correcting unit 221 performs correction by adding or subtracting a value corresponding to 0.5 pixel to or from a value of the transmitted direct vector. The direct vector correcting unit 221 then transmits the corrected direct vector to the motion compensating unit 214 .
  • the direct vector correcting unit 221 performs correction on the first direct vector (mvL 0 ) using Formula (11), by considering the parity between the current picture (CurrPic) and the reference picture in the past direction (refPicCol).
  • the direct vector correcting unit 221 performs correction on the second direct vector (mvL 1 ) using Formula (12), by considering the parity between the current picture (CurrPic) and the reference picture in the future direction (colPic).
  • the direct vector correcting unit 221 transmits the direct vector transmitted from the direct vector determining unit 216 to the motion compensating unit 214 .
  • FIG. 5 is a flowchart of the processing procedure performed by the decoding apparatus according to the first embodiment.
  • the processing procedure depicted in FIG. 5 is the detailed processing procedure performed when an input image is a picture having a field structure and is in the temporal direct mode.
  • the prediction mode determining unit 211 determines whether an input stream has a field structure and is in the temporal direct mode (Step S 201 ). If the input stream does not have a field structure nor is in the temporal direct mode (No at Step S 201 ), the decoding apparatus 200 performs a normal process (Step S 202 ), and proceeds to Step S 206 .
  • the reference vector correcting unit 220 corrects the reference vector by considering the field relationships (Step S 203 ).
  • the direct vector determining unit 216 then calculates a direct vector (Step S 204 ). Subsequently, the direct vector correcting unit 221 corrects the first direct vector and the second direct vector by considering the field relationships (Step S 205 ). The macroblock is then decoded by the motion compensating unit 214 (Step S 206 ).
  • the decoding apparatus 200 determines whether a processing of one picture has finished (Step S 207 ). If the processing of one picture has finished (Yes at Step S 207 ), the decoding apparatus 200 finishes the process. If the processing of one picture is not yet finished (No at Step S 207 ), the decoding apparatus 200 returns to the process (Step S 203 ) in which the reference vector correcting unit 220 corrects the reference vector.
  • the encoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained reference vector, if a pixel with opposite parity is referred to so as to obtain a reference vector in the direct mode.
  • the encoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained direct vector, if a pixel with opposite parity is referred to so as to obtain a direct vector from the corrected reference vector.
  • the correction is performed by adding or subtracting an error, which is a half pixel corresponding to the difference in parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
  • the pixels in the Bottom_field are shifted in the downward direction for 0.5 pixel across the field, compared with the pixels in the Top_field. Accordingly, the direct vectors mvL 0 and mvL 1 do not become parallel to the reference vector (mvCol) in space time, with the vector obtained by referring to a pixel with opposite parity, while the picture having a field structure is encoded in the direct mode.
  • the reference vector (mvCol) and the direct vectors mvL 0 and mvl 1 are all 0 vectors, their directions are all different in relation to the parity. Because the conventional direct vector does not become maximum likelihood, the encoding efficiency is decreased.
  • the correction is performed on the reference vector and the direct vector.
  • the direct vectors mvL 0 and mvL 1 become parallel to the reference vector (mvCol) in space time. Because the direct vector becomes maximum likelihood, it is possible to prevent the encoding efficiency from being decreased.
  • the reference vector (mvCol) and the direct vectors mvL 0 and mvL 1 need not necessarily be 0 vectors.
  • the decoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel, to or from a value of the obtained reference vector. If a pixel with opposite parity is referred to so as to obtain the direct vector from the corrected reference vector, the decoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel, to or from a value of the obtained direct vector.
  • the correction is performed by adding or subtracting an error, which is a half pixel corresponding to the difference in parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
  • the present invention is not limited thereto.
  • the present invention may similarly be applied to a method in which only the encoding apparatus is used for correction, or a method in which only the decoding apparatus is used for correction.
  • each apparatus depicted in the drawings are functionally conceptual, and are not necessarily be physically configured as illustrated (such as FIGS. 2 and 4 ).
  • the specific mode of dispersion and integration of each apparatus is not limited to the ones depicted in the drawings, and all or a part of the apparatus can be functionally or physically dispersed or integrated in an optional unit, depending on various kinds of load and the status of use.
  • All or an optional part of the respective processing functions carried out in each apparatus are realized by a central processing unit (CPU) and a computer program analyzed and executed by the CPU, or may be realized as hardware by the wired logic.
  • CPU central processing unit
  • FIG. 7 is a schematic of a computer that executes the encoding program
  • FIG. 8 is a schematic of a computer that executes the decoding program.
  • an encoding program (computer) 10 is connected to a cache 12 , a random access memory (RAM) 11 , a hard disk drive (HDD) 13 , a read only memory (ROM) 14 , and a CPU 15 via a bus 16 .
  • the ROM 14 includes an encoding program that can exercise functions similar to those of the embodiments. In other words, as depicted in FIG.
  • the ROM 14 includes an orthogonal transformation/quantization program 14 a , an inverse orthogonal transformation/inverse quantization program 14 b , a motion compensation program 14 c , a motion vector detection program 14 d , a reference vector correction program 14 e , a direct vector determination program 14 f , a direct vector correction program 14 g , a prediction mode determination program 14 h , and a variable length coding program 14 i.
  • the CPU 15 reads and executes the programs 14 a to 14 i , and thus the programs 14 a to 14 i , as depicted in FIG. 7 , function as an orthogonal transformation/quantization process 15 a , an inverse orthogonal transformation/inverse quantization process 15 b , a motion compensation process 15 c , a motion vector detection process 15 d , a reference vector correction process 15 e , a direct vector determination process 15 f , a direct vector correction process 15 g , a prediction mode determination process 15 h , and a variable length coding process 15 i .
  • the processes 15 a to 15 i respectively correspond to the orthogonal transformation/quantization unit 111 , the inverse orthogonal transformation/inverse quantization unit 112 , the motion compensating unit 114 , the motion vector detecting unit 115 , the reference vector correcting unit 120 , the direct vector determining unit 116 , the direct vector correcting unit 121 , the prediction mode determining unit 117 , and the variable length coding unit 118 , depicted in FIG. 2 .
  • the RAM 11 includes a frame memory 11 a and a frame memory 11 b .
  • the frame memory 11 a and the frame memory 11 b respectively correspond to the frame memory 110 and the frame memory 113 depicted in FIG. 2 .
  • the computer programs 14 a to 14 i need not necessarily be stored in the ROM 14 in advance.
  • the computer programs 14 a to 14 i may be stored in a “portable physical medium” such as a flexible disk (FD), a compact disk read only memory (CD-ROM), a magneto optical (MO) disk, a digital versatile disk (DVD), an optical disk, and an integrated circuit (IC) card that can be inserted into the computer 10 ; in a “fixed physical medium” such as a hard disk drive (HDD) provided inside and outside of the computer 10 ; or in “another computer (or server)” connected to the computer 10 via a public line, the Internet, a local area network (LAN), or a wide area network (WAN).
  • the computer 10 can read out each computer program, and execute the program.
  • a decoding program (computer) 20 having a configuration connected to a cache 22 , a RAM 21 , a HDD 23 , a ROM 24 , and a CPU 25 , via a bus 26 .
  • the ROM 24 includes a decoding program that can exercise functions similar to those of the embodiments. In other words, as depicted in FIG.
  • the ROM 24 includes a variable length decoding program 24 a , a prediction mode determination program 24 b , an inverse orthogonal transformation/inverse quantization program 24 c , a motion compensation program 24 d , a vector determination program 24 e , a reference vector correction program 24 f , a direct vector determination program 24 g , and a direct vector correction program 24 h.
  • the CPU 25 reads and executes the computer programs 24 a to 24 h , and thus the computer programs 24 a to 24 h , as depicted in FIG. 8 , function as a variable length decoding process 25 a , a prediction mode determination process 25 b , an inverse orthogonal transformation/inverse quantization process 25 c , a motion compensation process 25 d , a vector determination process 25 e , a reference vector correction process 25 f , a direct vector determination process 25 g , and a direct vector correction process 25 h .
  • the processes 25 a to 25 h respectively correspond to the variable length decoding unit 210 , the prediction mode determining unit 211 , the inverse orthogonal transformation/inverse quantization unit 212 , the motion compensating unit 214 , the vector determining unit 213 , the reference vector correcting unit 220 , the direct vector determining unit 216 , and the direct vector correcting unit 221 depicted in FIG. 4 .
  • the RAM 21 includes a frame memory 21 a .
  • the frame memory 21 a corresponds to the frame memory 215 depicted in FIG. 4 .
  • the computer programs 24 a to 24 h need not necessarily be stored in the ROM 24 in advance.
  • the computer programs 24 a to 24 h may be stored in a “portable physical medium” such as a FD, a CD-ROM, an MO disk, a DVD, an optical disk, and an IC card that can be inserted into the computer 20 ; in a “fixed physical medium” such as a HDD provided inside and outside of the computer 20 ; or in “another computer (or server)” connected to the computer 20 via a public line, the Internet, a LAN, or a WAN.
  • the computer 20 can read out each computer program, and execute the program.
  • the encoding apparatus and the decoding apparatus can prevent the encoding efficiency from being decreased.

Abstract

An encoding apparatus having a direct mode as a prediction mode, when a pixel with opposite parity from a top field to a bottom field, for example, is referred to for obtaining a reference vector in the direct mode, performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained reference vector. The encoding apparatus, when a pixel with opposite parity is referred to for obtaining a first and a second direct vector by temporally scaling the corrected reference vector, performs correction by adding or subtracting a value corresponding to a half pixel to or from values of the obtained direct vectors.

Description

CROSS-REFERENCE TO RELATED APPLICATION
This application is based upon and claims the benefit of priority of the prior Japanese Patent Application No. 2008-225305, filed on Sep. 2, 2008, the entire contents of which are incorporated herein by reference.
FIELD
The embodiments discussed herein are directed to an encoding apparatus and a decoding apparatus.
BACKGROUND
H.264 and Moving Picture Experts Group-4 (MPEG-4) Part 10 Advanced Video Coding (AVC) have been standardized as methods for encoding moving image data, and brought to attention. Because the H.264 and the MPEG-4 Part 10 AVC are technologically the same, they will be referred to as H.264/AVC in the following.
The H.264/AVC has five encoding modes including an intra prediction, a forward prediction, a backward prediction, a bidirectional prediction, and a direct mode, as encoding modes for a bi-directional predictive picture (B-picture). In particular, the direct mode is a newly added mode, and is a method to determine a motion vector of a current macroblock from motion vectors of temporally or spatially adjacent macroblocks, by focusing on the continuity of the moving image data.
The principle of a temporal direct mode in the direct modes will now be described with reference to FIG. 9. FIG. 9 is a schematic of a direct vector (frame structure). The temporal direct mode is simply referred to as direct mode.
In the direct mode, a motion vector of a macroblock included in a picture processed immediately before and placed at the same position as the current macroblock is selected as a reference vector, and a motion vector of the current macroblock (hereinafter, referred to as direct vector) is determined by temporally scaling the selected reference vector. In a normal encoding order of moving image data, a certain B-picture is processed subsequent to a reference picture in the forward direction (in the past direction temporally) and a reference picture in the backward direction (in the future direction temporally). Accordingly, a picture processed immediately before the certain B picture is a reference picture in the future direction temporally.
Generally, a reference picture in the past direction is called List0, and a reference picture in the future direction is called List1. In the direct mode, as depicted in FIG. 9, a motion vector of a macroblock (refPicCol is a picture to be referenced) placed at the same position as the reference picture in the future direction (colPic: a picture of Ref_idx=0 of List1) is selected as a reference vector (mvCol). By temporally scaling (at a spacing ratio of picture order count (POC)) the selected reference vector, a direct vector in the forward direction (mvL0) and a direct vector in the backward direction (mvL1) of the macroblock on the current B picture (CurrPic) are determined.
More specifically, the direct mode is performed based on the following principle. Assume that a macroblock placed at the same position as that on the reference picture in the future direction (colPic) has predicted and indicated a region on the reference picture in the past direction, by a reference vector (mvCol). In this case, it is likely to consider that a certain object included in the reference picture in the future direction (colPic) moved from the reference picture in the past direction (refPicCol) along a vector, which is the reference vector (mvCol), in space time. If this is so, the object also passes through the current picture (CurrPic) interposed between the reference picture in the future direction (colPic) and the reference picture in the past direction (refPicCol), along the reference vector (mvCol). Based on the assumption, in the direct mode, the macroblock on the current picture (CurrPic) is predicted from the reference picture in the future direction (colPic) and the reference picture in the past direction (refPicCol), by using a vector parallel to the reference vector (mvCol) in space time. Formulae for calculating a vector parallel to the reference vector (mvCol) that indicates from the reference picture in the future direction (colPic) to the reference picture in the past direction (refPicCol) in space time, between CurrPic and refPicCol, and between CurrPic and colPic are depicted below.
Formulae for calculating a normal direct vector are as follows: mvL0 is a direct vector from the current picture (CurrPic) to the reference picture in the past direction (refPicCol), and mvL1 is a direct vector from the current picture (CurrPic) to the reference picture in the future direction (colPic).
mvL0=mvCol×tb/td  (1)
mvL1=mvL0−mvCol  (2)
where td is the time distance from the reference picture in the future direction (colPic) to the reference picture in the past direction (refPicCol), and tb is the time distance from the current picture (CurrPic) to the reference picture in the past direction (refPicCol). The direct vectors (mvL0 and mvL1) determined here are calculated on assumption that the picture has a frame structure.
The vector in the forward direction and the vector in the backward direction are used as examples for conveniently explaining the direct vector. However, the vectors mvL0 and mvL1 are not fixed to either of the forward direction or the backward direction, in the H.264 and MPEG-4 Part 10 AVC. Accordingly, the similar calculation can be carried out by using a vector of the combination of the forward direction/forward direction or the backward direction/backward direction. In the following description on the direct vector, the vectors mvL0 and mvL1 used for calculation are referred to as a first vector (direct vector) and a second vector (direct vector).
For example, methods of switching coefficients depending on the time distance, if a pixel with opposite parity is referred to, while a picture having a field structure is encoded in the direct mode, have been disclosed.
In the technologies disclosed in, for example, Japanese Laid-open Patent Publication No. 2004-048632 and Published Japanese Translation of PCT Application No. 2005-510984.
In the conventional technologies, the encoding efficiency is decreased, when a picture having a field structure is encoded in the direct mode. In other words, in the conventional technologies, if a pixel with opposite parity is referred to, while a picture having a field structure is encoded in the direct mode, an error corresponding to the difference in the parities occurs in the vector. Accordingly, the encoding efficiency is decreased.
FIG. 10 is a schematic of a direct vector (field structure). As depicted in FIG. 10, in a picture having a field structure, pixels in the Bottom_field are shifted in the downward direction by 0.5 pixel across the field, compared with the pixels in the Top_field. Accordingly, with a vector obtained by referring to a pixel with opposite parity, while a picture having a field structure is encoded in the direct mode, the direct vectors mvL0 and mvL1 do not become parallel to the reference vector (mvCol) in space time. Such a direct vector is not the maximum likelihood, thereby decreasing the encoding efficiency.
SUMMARY
According to an aspect of the invention, an encoding apparatus having a direct mode as a prediction mode includes a reference vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a reference vector in the direct mode, performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained reference vector; and a direct vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a first and a second direct vector by temporally scaling the reference vector corrected by the reference vector correcting unit, performs correction by adding or subtracting a value corresponding to a half pixel to or from values of obtained direct vectors.
The object and advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the claims.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory and are not restrictive of the invention, as claimed.
BRIEF DESCRIPTION OF DRAWINGS
FIG. 1A is a schematic of an outline of an encoding apparatus and a decoding apparatus according to a first embodiment of the present invention;
FIG. 1B is a diagram for explaining the outline of the encoding apparatus;
FIG. 1C is formulae for explaining the outline of the encoding apparatus;
FIG. 2 is a block diagram of a configuration of the encoding apparatus according to the first embodiment;
FIG. 3 is a flowchart of a processing procedure performed by the encoding apparatus according to the first embodiment;
FIG. 4 is a block diagram of a configuration of the decoding apparatus according to the first embodiment;
FIG. 5 is a flowchart of a processing procedure performed by the decoding apparatus according to the first embodiment;
FIGS. 6A and 6B are schematics of advantages of the first embodiment;
FIG. 7 is a schematic of a computer that executes an encoding program;
FIG. 8 is a schematic of a computer that executes a decoding program;
FIG. 9 is a schematic of a direct vector (frame structure); and
FIG. 10 is a schematic of a direct vector (field structure).
DESCRIPTION OF EMBODIMENTS
Preferred embodiments of the present invention will be explained with reference to accompanying drawings. Outlines of an encoding apparatus and a decoding apparatus according to a first embodiment of the present invention are described first, followed by a configuration of the encoding apparatus and its processing procedure, a configuration of the decoding apparatus and its processing procedure according to the first embodiment, and advantages of the first embodiment. Other embodiments will be explained in the end.
[a] First Embodiment
[Outlines of Encoding Apparatus and Decoding Apparatus]
With reference to FIGS. 1A to 1C, outlines of the encoding apparatus and the decoding apparatus according to the first embodiment of the present invention will be described. FIGS. 1A to 1C depict the outlines of the encoding apparatus and the decoding apparatus according to the first embodiment.
As depicted in FIG. 1A, if a pixel with opposite parity is referred to, while a picture having a field structure is encoded in the direct mode, an error corresponding to the difference in the parities occurs in the vector. For example, as depicted in FIG. 1A, if a pixel “2” in a Top_field refers to a pixel “1” in a Top_field, the vertical component of a vector is “one pixel” derived from “2”−“1” (see FIG. 1B), and a value of the vertical component of the vector is correctly calculated as “−4(1 pixel/0.25 pixel)”.
As depicted in FIG. 1A, if a pixel “2” in a Bottom_field refers to a pixel “1” in a Top_field, a vertical component of a vector is also “one pixel” derived from “2”−“1”, and a value of the vertical component of the vector is also calculated as “−4(1 pixel/0.25 pixel)”. However, in a picture having a field structure, as depicted in FIG. 1A, the pixels in the Bottom_field are shifted by 0.5 pixel across the field in the downward direction, compared with the pixels in the Top_field. Accordingly, under normal circumstances, the vertical component of the vector should be “1.5 pixel” (see FIG. 1B), and the value of the vertical component of the vector should be calculated as “−6(1.5 pixel/0.25 pixel)”.
In this manner, if a pixel with opposite parity is referred to, an error of 0.5 pixel corresponding to the difference in the parities occurs in the vector. Accordingly, the encoding apparatus and the decoding apparatus according to the first embodiment correct the error.
More specifically, the encoding apparatus and the decoding apparatus according to the first embodiment correct a reference vector and a direct vector, by using three formulae depicted in FIG. 1C. Formula (A) in FIG. 1C is a formula for correcting a reference vector. As depicted in Formula (A), the encoding apparatus and the decoding apparatus according to the first embodiment, if parity of a reference picture in the past direction and a parity of a reference picture in the future direction are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the reference vector.
Formula (B) in FIG. 1C is a formula for correcting a first direct vector. As depicted in Formula (B), the encoding apparatus and the decoding apparatus according to the first embodiment, if a parity of a reference picture in the past direction and a parity of a current picture are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the first direct vector.
Formula (C) in FIG. 1C is a formula for correcting a second direct vector. As depicted in Formula (C), the encoding apparatus and the decoding apparatus according to the first embodiment, if a parity of a reference picture in the future direction and a parity of the current picture are different, perform correction by adding or subtracting “2” corresponding to 0.5 pixel, to or from a value of the second direct vector.
In this manner, the encoding apparatus and the decoding apparatus according to the first embodiment perform correction by adding or subtracting an error, which is a half pixel corresponding to the difference in the parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
[Configuration of Encoding Apparatus]
A configuration of the encoding apparatus according to the first embodiment will now be described with reference to FIG. 2. FIG. 2 is a block diagram of the configuration of the encoding apparatus according to the first embodiment.
As depicted in FIG. 2, an encoding apparatus 100 according to the first embodiment includes a frame memory 110, an orthogonal transformation/quantization unit 111, an inverse orthogonal transformation/inverse quantization unit 112, a frame memory 113, and a motion compensating unit (intra predicting unit) 114. The encoding apparatus 100 also includes a motion vector detecting unit 115, a direct vector determining unit 116, a prediction mode determining unit 117, and a variable length coding unit 118. These units are the same as those included in a conventional encoding apparatus. Additionally, the encoding apparatus 100 according to the first embodiment includes a reference vector correcting unit 120 and a direct vector correcting unit 121 as indicated by the thick frames in FIG. 2.
A normal processing performed by the encoding apparatus 100 according to the first embodiment will be described. The encoding apparatus 100 stores an input image into the frame memory 110 and divides the input image into macroblock units. The encoding apparatus 100 performs processing in a macroblock unit.
In the encoding apparatus 100, the motion vector detecting unit 115 detects a motion vector of the input image stored in the frame memory 110, from a reference image stored in the frame memory 113. The motion compensating unit 114 then generates a prediction image, by using the reference image stored in the frame memory 113 and the motion vector detected by the motion vector detecting unit 115.
The prediction mode determining unit 117 determines a prediction mode capable of efficient encoding, by comparing between an encoding of the prediction image generated by the motion compensating unit 114 and an encoding of the input image. If the prediction mode determining unit 117 determines that an interframe coding is performed, a prediction difference image that is a difference between the input image stored in the frame memory 110 and the prediction image generated by the motion compensating unit 114 is fed into the orthogonal transformation/quantization unit 111. If the prediction mode determining unit 117 determines that an intraframe coding is performed, the input image stored in the frame memory 110 is directly fed into the orthogonal transformation/quantization unit 111.
The orthogonal transformation/quantization unit 111 orthogonally transforms and quantizes the prediction difference image and feeds into the variable length coding unit 118. The variable length coding unit 118 encodes the prediction difference image and outputs the image as a bit stream. The prediction difference image orthogonally transformed and quantized by the orthogonal transformation/quantization unit 111 is inversely orthogonally transformed and is inversely quantized by the inverse orthogonal transformation/inverse quantization unit 112, for the motion compensation prediction performed in the next frame. The prediction difference image is decoded by using the prediction image generated by the motion compensating unit 114 and is stored in the frame memory 113.
The encoding apparatus 100 according to the first embodiment includes the reference vector correcting unit 120 and the direct vector correcting unit 121. The reference vector correcting unit 120 and the direct vector correcting unit 121 operate when the input image is a picture having a field structure and is in the temporal direct mode.
The reference vector correcting unit 120, when a reference vector is detected by the motion vector detecting unit 115, determines whether a pixel with opposite parity is referred to. If it is determined that a pixel with opposite parity is referred to, the reference vector correcting unit 120 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the reference vector value being detected. The reference vector correcting unit 120 then transmits the corrected reference vector to the direct vector determining unit 116.
More specifically, the reference vector correcting unit 120 performs correction by using the following formulae. The calculation is performed based on 0.25 pixel accuracy. The variables will now be described.
isBottomFieldCurr: parity (Top: 0, Bottom: 1) of a current picture (CurrPic)
isBottomFieldCol: parity (Top: 0, Bottom: 1) of a reference picture in the future direction (colPic)
isBottomFieldrefPicCol: parity (Top: 0, Bottom: 1) of a reference picture in the past direction (refPicCol)
mvCol_correct: corrected value of a reference vector (mvCol)
mvL0_correct: corrected value of a first direct vector (mvL0)
mvL1_correct: corrected value of a second direct vector (mvL1)
The reference vector correcting unit 120 performs correction using Formula (3), by considering the parity of the reference picture in the future direction (colPic), which is a reference source of a reference vector (mvCol), and the parity of a reference picture in the past direction (refPicCol), which is a picture to be referenced.
mvCol_correct=mvCol+2×(isBottomFieldrefPicCol−isBottomFieldCol)  (3)
By using Formula (3), regardless of parity, it is possible to calculate the correct direction of the reference vector (mvCol) in space time.
If it is determined that a pixel with opposite parity is not referred to, the reference vector correcting unit 120 transmits the reference vector detected by the motion vector detecting unit 115 to the direct vector determining unit 116.
The direct vector determining unit 116 calculates a direct vector by using the corrected reference vector transmitted from the reference vector correcting unit 120, and transmits the calculated direct vector to the direct vector correcting unit 121.
More specifically, the direct vector determining unit 116 performs correction using Formulae (4) and (5). In other words, the direct vector determining unit 116 calculates a direct vector parallel to the corrected reference vector (mvCol_correct), by using the corrected reference vector (mvCol_correct) and a distance of each picture in the temporal direction.
mvL0_correct=mvCol_correct×tb/td  (4)
mvL1_correct=mvL0_correct−mvCol_correct  (5)
The direct vector correcting unit 121 determines whether the direct vector transmitted from the direct vector determining unit 116 is obtained by referring to a pixel with opposite parity. If it is determined that a pixel with opposite parity is referred to, the direct vector correcting unit 121 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the transmitted direct vector. The direct vector correcting unit 121 then transmits the corrected direct vector to the prediction mode determining unit 117.
More specifically, the direct vector correcting unit 121 performs correction on the first direct vector (mvL0), using Formula (6), by considering the parity between the current picture (CurrPic) and the reference picture in the past direction (refPicCol). The direct vector correcting unit 121 performs correction on the second director vector (mvL1), using Formula (7), by considering the parity between the current picture (CurrPic) and the reference picture in the future direction (colPic).
mvL0_correct=mvL0+2×(isBottomFieldrefPicCol−isBottomFieldCurr)  (6)
mvL1_correct=mvL1+2×(isBottomFieldCol−isBottomFieldCurr)  (7)
If it is determined that a pixel with opposite parity is not referred to, the direct vector correcting unit 121 transmits the direct vector transmitted from the direct vector determining unit 116 to the prediction mode determining unit 117.
[Processing Procedure Performed by Encoding Apparatus]
A processing procedure performed by the encoding apparatus according to the first embodiment will now be described with reference to FIG. 3. FIG. 3 is a flowchart of the processing procedure performed by the encoding apparatus according to the first embodiment. The processing procedure depicted in FIG. 3 is the detailed processing procedure performed when an input image is a picture having a field structure and is in the temporal direct mode.
The encoding apparatus 100 determines whether an input image has a field structure and is in the temporal direct mode (Step S101). If the input image does not have a field structure nor is in the temporal direct mode (No at Step S101), the encoding apparatus 100 performs a normal process (Step S102), and proceeds to Step S106.
If the input image has a field structure and is in the temporal direct mode (Yes at Step S101), the reference vector correcting unit 120 corrects the reference vector by considering the field relationships (Step S103).
The direct vector determining unit 116 then calculates a direct vector (Step S104). Subsequently, the direct vector correcting unit 121 corrects the first direct vector and the second direct vector by considering the field relationships (Step S105).
The prediction mode determining unit 117 then determines a prediction mode (Step S106). Subsequently, the macroblock is encoded by the orthogonal transformation/quantization unit 111 and the variable length coding unit 118 (Step S107).
The encoding apparatus 100 determines whether a processing of one picture has finished (Step S108). If the processing of one picture has finished (Yes at Step S108), the encoding apparatus 100 finishes the process. If the processing of one picture is not yet finished (No at Step S108), the encoding apparatus 100 returns to the process in which the reference vector correcting unit 120 corrects a reference vector (Step S103).
[Configuration of Decoding Apparatus]
A configuration of a decoding apparatus according to the first embodiment will now be described with reference to FIG. 4. FIG. 4 is a block diagram of the configuration of the decoding apparatus according to the first embodiment.
As depicted in FIG. 4, a decoding apparatus 200 according to the first embodiment includes a variable length decoding unit 210, a prediction mode determining unit 211, and an inverse orthogonal transformation/inverse quantization unit 212. The decoding apparatus 200 also includes a vector determining unit 213, a motion compensating unit (intra predicting unit) 214, a frame memory 215, and a direct vector determining unit 216. These units are the same as those included in a conventional decoding apparatus. Additionally, the decoding apparatus 200 according to the first embodiment includes a reference vector correcting unit 220 and a direct vector correcting unit 221 as indicated by the thick frames in FIG. 4.
A normal processing performed by the decoding apparatus 200 according to the first embodiment will now be described. The decoding apparatus 200 feeds a bit stream transmitted from the encoding apparatus 100 into the variable length decoding unit 210. The variable length decoding unit 210 decodes the upper level header information of the input bit stream, and divides into macroblock units. The decoding apparatus 200 performs processing in a macroblock unit.
In the decoding apparatus 200, the prediction mode determining unit 211 determines a prediction mode of decoding data in a macroblock unit. If the prediction mode determining unit 211 determines that an interframe coding is performed, the vector determining unit 213 extracts a motion vector from the bit stream. The motion compensating unit 214 then generates a prediction image, by using the motion vector extracted by the vector determining unit 213 and a decoded image stored in the frame memory 215. The prediction image generated by the motion compensating unit 214 is added to the decoded result of the prediction difference image generated by the inverse orthogonal transformation/inverse quantization unit 212, and is output as a decoded image.
If the prediction mode determining unit 211 determines that an intraframe coding is performed, the signal extracted from the bit stream is directly decoded by the inverse orthogonal transformation/inverse quantization unit 212, and output as a decoded image. The decoded image is stored in the frame memory 215, for the motion compensation process performed in the next frame.
The decoding apparatus 200 according to the first embodiment includes the reference vector correcting unit 220 and the direct vector correcting unit 221. The reference vector correcting unit 220 and the direct vector correcting unit 221 operate when the input image is a picture having a field structure and is in the temporal direct mode.
The reference vector correcting unit 220 determines whether a pixel with opposite parity is referred to, when a reference vector is determined by the vector determining unit 213. If it is determined that a pixel with opposite parity is referred to, the reference vector correcting unit 220 performs correction by adding or subtracting a value corresponding to 0.5 pixel, to or from a value of the detected reference vector. The reference vector correcting unit 220 then transmits the corrected reference vector to the direct vector determining unit 216.
More specifically, the reference vector correcting unit 220 performs correction by using the following formula. The calculation is performed based on 0.25 pixel accuracy. The variables will now be described.
isBottomFieldCurr: parity (Top: 0, Bottom: 1) of a current picture (CurrPic)
isBottomFieldCol: parity (Top: 0, Bottom: 1) of a reference picture in the future direction (colPic)
isBottomFieldrefPicCol: parity (Top: 0, Bottom: 1) of a reference picture in the past direction (refPicCol)
mvCol_correct: corrected value of a reference vector (mvCol)
mvL0_correct: corrected value of a first direct vector (mvL0)
mvL1_correct: corrected value of a second direct vector (mvL1)
The reference vector correcting unit 220 performs correction using Formula (8), by considering the parity of the reference picture in the future direction (colPic), which is a reference source of the reference vector (mvCol), and the parity of the reference picture in the past direction (refPicCol), which is a picture to be referenced.
mvCol_correct=mvCol+2×(isBottomFieldrefPicCol−isBottomFieldCol)  (8)
By using Formula (8), regardless of parity, it is possible to calculate the correct direction of the reference vector (mvCol) in space time.
If it is determined that a pixel with opposite parity is not referred to, the reference vector correcting unit 220 transmits the reference vector determined by the vector determining unit 213 to the direct vector determining unit 216.
The direct vector determining unit 216 calculates a direct vector by using the corrected reference vector transmitted from the reference vector correcting unit 220, and transmits the calculated direct vector to the direct vector correcting unit 221.
More specifically, the direct vector determining unit 216 performs correction using Formulae (9) and (10). In other words, the direct vector determining unit 216 calculates a direct vector parallel to the corrected reference vector (mvCol_correct), by using the corrected reference vector (mvCol_correct) and a distance of each picture in the temporal direction.
mvL0_correct=mvCol_correct×tb/td  (9)
mvL1_correct=mvL0_correct−mvCol_correct  (10)
The direct vector correcting unit 221 determines whether the direct vector transmitted from the direct vector determining unit 216 is obtained by referring to a pixel with opposite parity. If it is determined that a pixel with opposite parity is referred to, the direct vector correcting unit 221 performs correction by adding or subtracting a value corresponding to 0.5 pixel to or from a value of the transmitted direct vector. The direct vector correcting unit 221 then transmits the corrected direct vector to the motion compensating unit 214.
More specifically, the direct vector correcting unit 221 performs correction on the first direct vector (mvL0) using Formula (11), by considering the parity between the current picture (CurrPic) and the reference picture in the past direction (refPicCol). The direct vector correcting unit 221 performs correction on the second direct vector (mvL1) using Formula (12), by considering the parity between the current picture (CurrPic) and the reference picture in the future direction (colPic).
mvL0_correct=mvL0+2×(isBottomFieldrefPicCol−isBottomFieldCurr)  (11)
mvL1_correct=mvL1+2×(isBottomFieldCol−isBottomFieldCurr)  (12)
If it is determined that a pixel with opposite parity is not referred to, the direct vector correcting unit 221 transmits the direct vector transmitted from the direct vector determining unit 216 to the motion compensating unit 214.
[Processing Procedure Performed by Decoding Apparatus]
With reference to FIG. 5, a processing procedure performed by the decoding apparatus according to the first embodiment will be described. FIG. 5 is a flowchart of the processing procedure performed by the decoding apparatus according to the first embodiment. The processing procedure depicted in FIG. 5 is the detailed processing procedure performed when an input image is a picture having a field structure and is in the temporal direct mode.
In the decoding apparatus 200, the prediction mode determining unit 211 determines whether an input stream has a field structure and is in the temporal direct mode (Step S201). If the input stream does not have a field structure nor is in the temporal direct mode (No at Step S201), the decoding apparatus 200 performs a normal process (Step S202), and proceeds to Step S206.
If the input stream has a field structure and is in the temporal direct mode (Yes at Step S201), the reference vector correcting unit 220 corrects the reference vector by considering the field relationships (Step S203).
The direct vector determining unit 216 then calculates a direct vector (Step S204). Subsequently, the direct vector correcting unit 221 corrects the first direct vector and the second direct vector by considering the field relationships (Step S205). The macroblock is then decoded by the motion compensating unit 214 (Step S206).
The decoding apparatus 200 determines whether a processing of one picture has finished (Step S207). If the processing of one picture has finished (Yes at Step S207), the decoding apparatus 200 finishes the process. If the processing of one picture is not yet finished (No at Step S207), the decoding apparatus 200 returns to the process (Step S203) in which the reference vector correcting unit 220 corrects the reference vector.
[Advantages]
As described above, according to the first embodiment, the encoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained reference vector, if a pixel with opposite parity is referred to so as to obtain a reference vector in the direct mode. The encoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel to or from a value of the obtained direct vector, if a pixel with opposite parity is referred to so as to obtain a direct vector from the corrected reference vector.
In this manner, with the encoding apparatus according to the first embodiment, the correction is performed by adding or subtracting an error, which is a half pixel corresponding to the difference in parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
In other words, as depicted in FIG. 6A, in the picture having a field structure, the pixels in the Bottom_field are shifted in the downward direction for 0.5 pixel across the field, compared with the pixels in the Top_field. Accordingly, the direct vectors mvL0 and mvL1 do not become parallel to the reference vector (mvCol) in space time, with the vector obtained by referring to a pixel with opposite parity, while the picture having a field structure is encoded in the direct mode. In other words, for example, in FIG. 6A, although the reference vector (mvCol) and the direct vectors mvL0 and mvl1 are all 0 vectors, their directions are all different in relation to the parity. Because the conventional direct vector does not become maximum likelihood, the encoding efficiency is decreased.
Alternatively, with the encoding apparatus according to the first embodiment, the correction is performed on the reference vector and the direct vector. As a result, as depicted in FIG. 6B, the direct vectors mvL0 and mvL1 become parallel to the reference vector (mvCol) in space time. Because the direct vector becomes maximum likelihood, it is possible to prevent the encoding efficiency from being decreased. However, the reference vector (mvCol) and the direct vectors mvL0 and mvL1 need not necessarily be 0 vectors.
Similarly, according to the first embodiment, if a pixel with opposite parity is referred to so as to obtain the reference vector in the direct mode, the decoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel, to or from a value of the obtained reference vector. If a pixel with opposite parity is referred to so as to obtain the direct vector from the corrected reference vector, the decoding apparatus performs correction by adding or subtracting a value corresponding to a half pixel, to or from a value of the obtained direct vector.
In this manner, with the decoding apparatus according to the first embodiment, the correction is performed by adding or subtracting an error, which is a half pixel corresponding to the difference in parities. Accordingly, the vector is properly corrected, thereby preventing the encoding efficiency from being decreased.
[b] Other Embodiments
While embodiments of the present invention have been described, it is to be understood that various other modifications may be made in addition to the embodiments of the present invention.
[System Configuration etc.]
In the first embodiment, a method of correcting the reference vector and the direct vector by using both of the encoding apparatus and the decoding apparatus has been described. However, the present invention is not limited thereto. The present invention may similarly be applied to a method in which only the encoding apparatus is used for correction, or a method in which only the decoding apparatus is used for correction.
Of the processes described in the present embodiments, all or a part of the processes described as being automatically performed may be manually performed, or all or a part of the processes described as being manually performed may be automatically performed with a known method. The information including the processing procedure (such as FIGS. 3 and 5), specific names, and various kinds of data and parameters depicted in the specification or in the drawings can be optionally changed, unless otherwise specified.
The respective constituents of each apparatus depicted in the drawings are functionally conceptual, and are not necessarily be physically configured as illustrated (such as FIGS. 2 and 4). In other words, the specific mode of dispersion and integration of each apparatus is not limited to the ones depicted in the drawings, and all or a part of the apparatus can be functionally or physically dispersed or integrated in an optional unit, depending on various kinds of load and the status of use. All or an optional part of the respective processing functions carried out in each apparatus are realized by a central processing unit (CPU) and a computer program analyzed and executed by the CPU, or may be realized as hardware by the wired logic.
[Encoding Program and Decoding Program]
Various types of processing described in the embodiments can be realized by executing prepared computer programs with a computer such as a personal computer and a work station. With reference to FIGS. 7 and 8, examples of a computer that executes an encoding program and a computer that executes a decoding program having the similar functions as those of the embodiments will now be described. FIG. 7 is a schematic of a computer that executes the encoding program, and FIG. 8 is a schematic of a computer that executes the decoding program.
As depicted in FIG. 7, an encoding program (computer) 10 is connected to a cache 12, a random access memory (RAM) 11, a hard disk drive (HDD) 13, a read only memory (ROM) 14, and a CPU 15 via a bus 16. The ROM 14 includes an encoding program that can exercise functions similar to those of the embodiments. In other words, as depicted in FIG. 7, the ROM 14 includes an orthogonal transformation/quantization program 14 a, an inverse orthogonal transformation/inverse quantization program 14 b, a motion compensation program 14 c, a motion vector detection program 14 d, a reference vector correction program 14 e, a direct vector determination program 14 f, a direct vector correction program 14 g, a prediction mode determination program 14 h, and a variable length coding program 14 i.
The CPU 15 reads and executes the programs 14 a to 14 i, and thus the programs 14 a to 14 i, as depicted in FIG. 7, function as an orthogonal transformation/quantization process 15 a, an inverse orthogonal transformation/inverse quantization process 15 b, a motion compensation process 15 c, a motion vector detection process 15 d, a reference vector correction process 15 e, a direct vector determination process 15 f, a direct vector correction process 15 g, a prediction mode determination process 15 h, and a variable length coding process 15 i. The processes 15 a to 15 i respectively correspond to the orthogonal transformation/quantization unit 111, the inverse orthogonal transformation/inverse quantization unit 112, the motion compensating unit 114, the motion vector detecting unit 115, the reference vector correcting unit 120, the direct vector determining unit 116, the direct vector correcting unit 121, the prediction mode determining unit 117, and the variable length coding unit 118, depicted in FIG. 2.
The RAM 11, as depicted in FIG. 7, includes a frame memory 11 a and a frame memory 11 b. The frame memory 11 a and the frame memory 11 b respectively correspond to the frame memory 110 and the frame memory 113 depicted in FIG. 2.
The computer programs 14 a to 14 i need not necessarily be stored in the ROM 14 in advance. For example, the computer programs 14 a to 14 i may be stored in a “portable physical medium” such as a flexible disk (FD), a compact disk read only memory (CD-ROM), a magneto optical (MO) disk, a digital versatile disk (DVD), an optical disk, and an integrated circuit (IC) card that can be inserted into the computer 10; in a “fixed physical medium” such as a hard disk drive (HDD) provided inside and outside of the computer 10; or in “another computer (or server)” connected to the computer 10 via a public line, the Internet, a local area network (LAN), or a wide area network (WAN). The computer 10 can read out each computer program, and execute the program.
As depicted in FIG. 8, a decoding program (computer) 20 having a configuration connected to a cache 22, a RAM 21, a HDD 23, a ROM 24, and a CPU 25, via a bus 26. The ROM 24 includes a decoding program that can exercise functions similar to those of the embodiments. In other words, as depicted in FIG. 8, the ROM 24 includes a variable length decoding program 24 a, a prediction mode determination program 24 b, an inverse orthogonal transformation/inverse quantization program 24 c, a motion compensation program 24 d, a vector determination program 24 e, a reference vector correction program 24 f, a direct vector determination program 24 g, and a direct vector correction program 24 h.
The CPU 25 reads and executes the computer programs 24 a to 24 h, and thus the computer programs 24 a to 24 h, as depicted in FIG. 8, function as a variable length decoding process 25 a, a prediction mode determination process 25 b, an inverse orthogonal transformation/inverse quantization process 25 c, a motion compensation process 25 d, a vector determination process 25 e, a reference vector correction process 25 f, a direct vector determination process 25 g, and a direct vector correction process 25 h. The processes 25 a to 25 h respectively correspond to the variable length decoding unit 210, the prediction mode determining unit 211, the inverse orthogonal transformation/inverse quantization unit 212, the motion compensating unit 214, the vector determining unit 213, the reference vector correcting unit 220, the direct vector determining unit 216, and the direct vector correcting unit 221 depicted in FIG. 4.
The RAM 21, as depicted in FIG. 8, includes a frame memory 21 a. The frame memory 21 a corresponds to the frame memory 215 depicted in FIG. 4.
The computer programs 24 a to 24 h need not necessarily be stored in the ROM 24 in advance. For example, the computer programs 24 a to 24 h may be stored in a “portable physical medium” such as a FD, a CD-ROM, an MO disk, a DVD, an optical disk, and an IC card that can be inserted into the computer 20; in a “fixed physical medium” such as a HDD provided inside and outside of the computer 20; or in “another computer (or server)” connected to the computer 20 via a public line, the Internet, a LAN, or a WAN. The computer 20 can read out each computer program, and execute the program.
As described above, the encoding apparatus and the decoding apparatus according to an embodiment can prevent the encoding efficiency from being decreased.
All examples and conditional language recited herein are intended for pedagogical purposes to aid the reader in understanding the invention and the concepts contributed by the inventor to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions, nor does the organization of such examples in the specification relate to a showing of the superiority and inferiority of the invention. Although the embodiments of the present inventions have been described in detail, it should be understood that the various changes, substitutions, and alterations could be made hereto without departing from the spirit and scope of the invention.

Claims (2)

What is claimed is:
1. An encoding apparatus having a direct mode as a prediction mode, the encoding apparatus comprising:
a reference vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a reference vector in the direct mode, performs correction by adding or subtracting a value corresponding to a half pixel to or from a vertical component of a value of the obtained reference vector; and
a direct vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a first and a second direct vector by temporally scaling the reference vector corrected by the reference vector correcting unit, performs correction by adding or subtracting a value corresponding to a half pixel to or from the vertical component of the value of the obtained first direct vector, wherein
the reference vector correcting unit performs correction on the reference vector by using
mvCol_correct=mvCol+2×(isBottomFieldrefPicCol−isBottomFieldCol), where mvCol is the reference vector, mvCol_correct is the corrected value of the reference vector, isBottomFieldrefPicCol is parity of a reference picture in first direct vector direction, and isBottomFieldCol is parity of a reference picture in second direct vector direction, and
the direct vector correcting unit performs correction on the first direct vector by using
mvL0_correct=mvL0+2×(isBottomFieldrefPicCol−isBottomFieldCurr), where mvL0 is the first direct vector, mvL0_correct is the corrected value of the first direct vector, and isBottomFieldCurr is parity of a current picture, and
the direct vector correcting unit performs correction on the second direct vector by using
mvL1_correct=mvL1+2×(isBottomFieldCol−isBottomFieldCurr), where mvL1 is the second direct vector, and mvL1_correct is the corrected value of the second direct vector.
2. A decoding apparatus having a direct mode as a prediction mode, the decoding apparatus comprising:
a reference vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a reference vector in the direct mode, performs correction by adding or subtracting a value corresponding to a half pixel to or from a vertical component of a value of the obtained reference vector; and
a direct vector correcting unit that, when a pixel with opposite parity is referred to for obtaining a first and a second direct vector by temporally scaling the reference vector corrected by the reference vector correcting unit, performs correction by adding or subtracting a value corresponding to a half pixel to or from the vertical component of the value of the obtained first direct vector, wherein
the reference vector correcting unit performs correction on the reference vector by using
mvCol_correct=mvCol+2×(isBottomFieldrefPicCol−isBottomFieldCol), where mvCol is the reference vector, mvCol_correct is the corrected value of the reference vector, isBottomFieldrefPicCol is parity of a reference picture in first direct vector direction, and isBottomFieldCol is parity of a reference picture in second direct vector direction, and
the direct vector correcting unit performs correction on the first direct vector by using
mvL0_correct=mvL0+2×(isBottomFieldrefPicCol−isBottomFieldCurr), where mvL0 is the first direct vector, mvL0_correct is the corrected value of the first direct vector, and isBottomFieldCurr is parity of a current picture, and
the direct vector correcting unit performs correction on the second direct vector by using
mvL1_correct=mvL1+2×(isBottomFieldCol−isBottomFieldCurr), where mvL1 is the second direct vector, and mvL1_correct is the corrected value of the second direct vector.
US12/458,729 2008-09-02 2009-07-21 Encoding apparatus and decoding apparatus Expired - Fee Related US8576912B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US13/672,056 US8761256B2 (en) 2008-09-02 2012-11-08 Encoding apparatus and decoding apparatus
US14/202,426 US8971409B2 (en) 2008-09-02 2014-03-10 Encoding apparatus and decoding apparatus

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2008225305A JP5578775B2 (en) 2008-09-02 2008-09-02 Encoding device and decoding device
JP2008-225305 2008-09-02

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/672,056 Continuation US8761256B2 (en) 2008-09-02 2012-11-08 Encoding apparatus and decoding apparatus

Publications (2)

Publication Number Publication Date
US20100054335A1 US20100054335A1 (en) 2010-03-04
US8576912B2 true US8576912B2 (en) 2013-11-05

Family

ID=40956688

Family Applications (3)

Application Number Title Priority Date Filing Date
US12/458,729 Expired - Fee Related US8576912B2 (en) 2008-09-02 2009-07-21 Encoding apparatus and decoding apparatus
US13/672,056 Expired - Fee Related US8761256B2 (en) 2008-09-02 2012-11-08 Encoding apparatus and decoding apparatus
US14/202,426 Expired - Fee Related US8971409B2 (en) 2008-09-02 2014-03-10 Encoding apparatus and decoding apparatus

Family Applications After (2)

Application Number Title Priority Date Filing Date
US13/672,056 Expired - Fee Related US8761256B2 (en) 2008-09-02 2012-11-08 Encoding apparatus and decoding apparatus
US14/202,426 Expired - Fee Related US8971409B2 (en) 2008-09-02 2014-03-10 Encoding apparatus and decoding apparatus

Country Status (5)

Country Link
US (3) US8576912B2 (en)
EP (1) EP2160035A1 (en)
JP (1) JP5578775B2 (en)
KR (1) KR101070930B1 (en)
CN (1) CN101668212B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5578775B2 (en) 2008-09-02 2014-08-27 富士通株式会社 Encoding device and decoding device
US20130094774A1 (en) * 2011-10-13 2013-04-18 Sharp Laboratories Of America, Inc. Tracking a reference picture based on a designated picture on an electronic device
US8768079B2 (en) 2011-10-13 2014-07-01 Sharp Laboratories Of America, Inc. Tracking a reference picture on an electronic device
JP5821542B2 (en) * 2011-11-07 2015-11-24 富士通株式会社 Video encoding device and video decoding device
JP5895469B2 (en) * 2011-11-18 2016-03-30 富士通株式会社 Video encoding device and video decoding device
US9621889B2 (en) 2012-03-02 2017-04-11 Sun Patent Trust Image encoding method, image decoding method, image encoding apparatus, image decoding apparatus, and image coding apparatus

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0955607A2 (en) 1998-05-07 1999-11-10 Sarnoff Corporation Method and apparatus for adaptively scaling motion vector information
WO2003047271A2 (en) 2001-11-21 2003-06-05 General Instrument Corporation Picture level adaptive frame/field coding for digital video signal
US6584155B2 (en) * 1999-12-27 2003-06-24 Kabushiki Kaisha Toshiba Method and system for estimating motion vector
JP2004048632A (en) 2002-05-16 2004-02-12 Matsushita Electric Ind Co Ltd Method for encoding and decoding motion picture
JP2004056400A (en) 2002-07-18 2004-02-19 Fujitsu Ltd Moving picture encoding method and moving picture decoding method
EP1406453A1 (en) 2002-10-04 2004-04-07 Lg Electronics Inc. Direct mode motion vector calculation method for B picture
EP1408694A1 (en) 2002-10-07 2004-04-14 Matsushita Electric Industrial Co., Ltd. Direct mode motion vector calculation
US20050053292A1 (en) 2003-09-07 2005-03-10 Microsoft Corporation Advanced bi-directional predictive coding of interlaced video
US20050147171A1 (en) 2002-09-06 2005-07-07 Fujitsu Limited Motion picture encoding device and motion picture decoding device
EP1562385A1 (en) 2002-11-01 2005-08-10 Matsushita Electric Industrial Co., Ltd. Motion picture encoding method and motion picture decoding method
US20060256866A1 (en) * 2005-05-13 2006-11-16 Streaming Networks (Pvt.) Ltd. Method and system for providing bi-directionally predicted video coding
CN101039433A (en) 2006-03-15 2007-09-19 富士通株式会社 Video coding method, video coding apparatus and video coding program
US20080063075A1 (en) * 2002-04-19 2008-03-13 Satoshi Kondo Motion vector calculation method

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100910769B1 (en) * 2002-06-11 2009-08-04 삼성테크윈 주식회사 IC card and manufacturing method thereof
JP5578775B2 (en) 2008-09-02 2014-08-27 富士通株式会社 Encoding device and decoding device

Patent Citations (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0955607A2 (en) 1998-05-07 1999-11-10 Sarnoff Corporation Method and apparatus for adaptively scaling motion vector information
US6584155B2 (en) * 1999-12-27 2003-06-24 Kabushiki Kaisha Toshiba Method and system for estimating motion vector
WO2003047271A2 (en) 2001-11-21 2003-06-05 General Instrument Corporation Picture level adaptive frame/field coding for digital video signal
JP2005510984A (en) 2001-11-21 2005-04-21 ジェネラル・インストルメント・コーポレーション Picture-level adaptive frame / field coding for digital video content
US20080063075A1 (en) * 2002-04-19 2008-03-13 Satoshi Kondo Motion vector calculation method
JP2004048632A (en) 2002-05-16 2004-02-12 Matsushita Electric Ind Co Ltd Method for encoding and decoding motion picture
JP2004056400A (en) 2002-07-18 2004-02-19 Fujitsu Ltd Moving picture encoding method and moving picture decoding method
US20050147171A1 (en) 2002-09-06 2005-07-07 Fujitsu Limited Motion picture encoding device and motion picture decoding device
EP1406453A1 (en) 2002-10-04 2004-04-07 Lg Electronics Inc. Direct mode motion vector calculation method for B picture
EP1408694A1 (en) 2002-10-07 2004-04-14 Matsushita Electric Industrial Co., Ltd. Direct mode motion vector calculation
EP1562385A1 (en) 2002-11-01 2005-08-10 Matsushita Electric Industrial Co., Ltd. Motion picture encoding method and motion picture decoding method
US20050053149A1 (en) * 2003-09-07 2005-03-10 Microsoft Corporation Direct mode motion vectors for Bi-directionally predicted interlaced pictures
US20050053292A1 (en) 2003-09-07 2005-03-10 Microsoft Corporation Advanced bi-directional predictive coding of interlaced video
US20060256866A1 (en) * 2005-05-13 2006-11-16 Streaming Networks (Pvt.) Ltd. Method and system for providing bi-directionally predicted video coding
CN101039433A (en) 2006-03-15 2007-09-19 富士通株式会社 Video coding method, video coding apparatus and video coding program
EP1835748A1 (en) 2006-03-15 2007-09-19 Fujitsu Ltd. Video coding method, apparatus and program
US8144776B2 (en) 2006-03-15 2012-03-27 Fujitsu Limited Direct mode video coding using variable selection criterion

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Chinese Office Action mailed May 3, 2012 issued in corresponding Chinese Patent Application No. 200910163336.9.
European Oral Proceedings dated Dec. 22, 2011 issued in corresponding European Patent Application No. 09166101.7.

Also Published As

Publication number Publication date
US8971409B2 (en) 2015-03-03
EP2160035A1 (en) 2010-03-03
US20100054335A1 (en) 2010-03-04
KR101070930B1 (en) 2011-10-06
JP2010062769A (en) 2010-03-18
JP5578775B2 (en) 2014-08-27
US20140185677A1 (en) 2014-07-03
CN101668212A (en) 2010-03-10
CN101668212B (en) 2014-05-07
US8761256B2 (en) 2014-06-24
KR20100027959A (en) 2010-03-11
US20130064297A1 (en) 2013-03-14

Similar Documents

Publication Publication Date Title
US11956462B2 (en) Video processing methods and apparatuses for sub-block motion compensation in video coding systems
US10715827B2 (en) Multi-hypotheses merge mode
US10205960B2 (en) Moving picture encoding apparatus, moving picture decoding apparatus, moving picture encoding method, moving picture decoding method, moving picture encoding program, and moving picture decoding program
US8971409B2 (en) Encoding apparatus and decoding apparatus
US8553779B2 (en) Method and apparatus for encoding/decoding motion vector information
US11924413B2 (en) Intra prediction for multi-hypothesis
KR101422422B1 (en) System and method for enhanced dmvd processing
US8542739B2 (en) Method of estimating disparity vector using camera parameters, apparatus for encoding and decoding multi-view picture using the disparity vector estimation method, and computer-readable recording medium storing a program for executing the method
US20110176611A1 (en) Methods for decoder-side motion vector derivation
JP5310614B2 (en) Moving picture coding apparatus, moving picture coding method, moving picture decoding apparatus, and moving picture decoding method
JP2004032355A (en) Motion picture encoding method, motion picture decoding method, and apparatus for the both method
JP5281597B2 (en) Motion vector prediction method, motion vector prediction apparatus, and motion vector prediction program
CN117915109A (en) Method for not performing correction according to slice similarity corrected by decoding end motion vector based on bilinear interpolation

Legal Events

Date Code Title Description
AS Assignment

Owner name: FUJITSU LIMITED,JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YAMORI, AKIHIRO;SHIMADA, SATOSHI;NAKAGAWA, AKIRA;REEL/FRAME:023026/0708

Effective date: 20090602

Owner name: FUJITSU LIMITED, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:YAMORI, AKIHIRO;SHIMADA, SATOSHI;NAKAGAWA, AKIRA;REEL/FRAME:023026/0708

Effective date: 20090602

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20211105