WO2006136544A2 - Procedure and device for detecting fades in a sequence of images - Google Patents

Procedure and device for detecting fades in a sequence of images Download PDF

Info

Publication number
WO2006136544A2
WO2006136544A2 PCT/EP2006/063328 EP2006063328W WO2006136544A2 WO 2006136544 A2 WO2006136544 A2 WO 2006136544A2 EP 2006063328 W EP2006063328 W EP 2006063328W WO 2006136544 A2 WO2006136544 A2 WO 2006136544A2
Authority
WO
WIPO (PCT)
Prior art keywords
image
images
fade
detection window
window
Prior art date
Application number
PCT/EP2006/063328
Other languages
French (fr)
Other versions
WO2006136544A3 (en
Inventor
Ludovic Noblet
Eric Mercier
Michel Jouan
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Priority to JP2008517478A priority Critical patent/JP2008544666A/en
Priority to EP06763780A priority patent/EP1894417A2/en
Priority to US11/922,439 priority patent/US8165337B2/en
Priority to CN200680022280XA priority patent/CN101204093B/en
Publication of WO2006136544A2 publication Critical patent/WO2006136544A2/en
Publication of WO2006136544A3 publication Critical patent/WO2006136544A3/en
Priority to US13/425,645 priority patent/US20120176548A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/87Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/136Incoming video signal characteristics or properties

Definitions

  • the invention relates to a device and a procedure for detecting fades in a sequence of images.
  • a shot or sequence shot is an uninterrupted sequence of images filmed by a camera or a camcorder in a single picture capture.
  • This shot is the basic unit used to analyse or construct a video content.
  • a video sequence generally comprises a succession of shots interconnected with the aid of video editing procedures. These procedures in particular make it possible to create an abrupt transition between two shots or else a progressive transition according to which an image is progressively changed into another image by mixing of the two images (e.g. a fade).
  • the detection of these transitions makes it possible in the case of an image sequence coding application to improve the coding quality by taking account of these transitions in the choices (for example structure of a group of images or "GOP") made by the coder. This detection also makes it possible to retrieve the shot boundaries and narrative units so as in particular to allow non-linear navigation around the content.
  • the invention is aimed at detecting progressive transitions, more particularly fades, in a sequence of images.
  • the invention relates to a method for detecting a fade in a sequence of images comprising pixels or image points with each of which is associated at least one luminance value. It comprises the following steps:
  • the luminance level associated with an image is equal to the sum of the luminance values associated with each of the pixels of the image.
  • the method furthermore comprises an additional step for validating the presence of a fade in the detection window if a level of motion associated with the detection window is less than a predetermined threshold.
  • the level of motion associated with the current image is equal to the ratio between the temporal activity calculated between the current image and the image preceding the current image, termed the preceding image, and the spatial activity calculated for the current image.
  • the level of motion associated with the decision window is equal to the level of motion associated with the last image of the decision window.
  • the temporal activity between the image and the preceding image is equal to the sum of the absolute values of the pixel-wise differences between the luminance values associated with each pixel of the image and the luminance values associated with each pixel of the preceding image.
  • the spatial activity of the image is equal to the sum of the absolute values of the luminance values associated with each pixel of the image.
  • the invention also relates to a device for detecting fades in a sequence of images comprising pixels or image points with each of which is associated at least one luminance value. It comprises: - means (32, 33, 34) of calculation for calculating, for each of the images of a window N consecutive images in length, termed the detection window, and for the image preceding the detection window, a luminance level on the basis of the luminance values associated with each of the pixels of each of the images; - means (32, 33, 34) of detection for detecting a fade, for each of the
  • N images of the detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding the current image; and - means for deducing (32, 33, 34) the presence of a fade for the detection window if for at least k images of the detection window a fade has been detected by the detection means, with 2 ⁇ k ⁇ N.
  • the invention relates furthermore to a video coding device which comprises coding means and a device for detecting fades according to the invention.
  • the invention relates moreover to a device for structuring a sequence of images characterized in that it comprises indexation means and a device for detecting fades according to the invention.
  • FIG. 5 illustrates a device according to the invention
  • FIG. 6 illustrates an image sequence coding device according to the invention.
  • FIG. 7 illustrates an image sequence structuring device according to the invention.
  • the invention is aimed at detecting fades in a sequence of images, the sequence being progressive or interlaced, each image comprising pixels with each of which is associated in particular a luminance value.
  • each image comprises an even field and an odd field.
  • Each field itself comprises pixels or image points with each of which is associated in particular a luminance value.
  • a fade from f to s is called a fade-in and vice versa
  • a fade from s to f is called a fade-out.
  • - (x,y) are the coordinates of the pixels in the image.
  • the factor x(n) is homogeneous inside an image or frame (i.e. independent of x and y) and lies between 0 and 1.
  • L e (n) ( ⁇ -x(n))*L s (n) + x(n)L f (n).
  • x(n) is split up in the following manner:
  • N k ⁇ and k 2 are two coefficients making it possible to express the linear nature of the fading factor.
  • the variation in the luminance level between two images (or frames) is dependent on the variation in the luminance level between the start and the end of the fade [L 8 -Lf), and is inversely proportional to the duration (N images or frames) of the fade.
  • the method according to the invention is therefore based on the detection of a variation in the luminance level between two successive images. This variation is thereafter compared with an experimentally defined threshold.
  • the method is split up into 4 steps referenced 10 to 13.
  • the modules represented are functional units, which may or may not correspond to physically distinguishable units.
  • Step 10 consists in calculating, over a window N images in length, termed the detection window, a luminance level for each image of the detection window and for the image of the sequence which precedes this window.
  • the luminance level of an image is the sum over the whole of the image of the luminance values associated with each of the pixels of this image.
  • a window N images in length is a portion of the sequence comprising N successive images.
  • Step 11 consists in detecting a fade, for each image of the detection window, by comparing the variation in the luminance level between this image, termed the current image, and the image which precedes it with a threshold.
  • This threshold is a multiple of the size of the image as a number of pixels. For example, it is equal to 1.75 multiplied by the width and by the height of the image as a number of pixels. If the variation in the luminance level is greater than this threshold then a fade is detected for the current image. Depending on whether one is seeking to detect fade-in or fade-out, the variation in the luminance level is calculated differently.
  • the variation in the luminance level is equal to the difference between the luminance level of the image preceding the current image and the luminance level of the current image.
  • the variation in the luminance level is equal to the difference between the luminance level of the current image and the luminance level of the image preceding it.
  • this step consists in calculating the number of images of the detection window for which a fade has been detected.
  • the results arising from the previous step for fade-in and fade-out are combined thereby making it possible to detect a fade whether it be a fade-in or fade-out.
  • a fade-in or fade-out has been detected in step 11 for at least k (2 ⁇ k ⁇ N) images of the detection window, we deduce therefrom the presence of a fade for the detection window.
  • the length of the detection window is preferably equal to 8 and k is equal to 6. This step in particular makes it possible to avoid determining false fades due to noise, or else to an instability of the brightness in the course of the sequence.
  • the result of the previous step 12 is combined with a local evaluation of the level of motion of the decision window.
  • This level of motion can be provided for example by an image level decision module (for example decision of the frame/image mode of coding, decision of the type of the image I 1 P, or B%) of a coding device.
  • the level of motion calculated for the last image of the decision window is the level of motion associated with the decision window. If the presence of a fade has been deduced for the detection window at the previous step 12 and if the level of motion for this window is less than a certain threshold, then the presence of a fade for the detection window is validated.
  • the decision window can for example have a size of three frames.
  • This window shown in Figure 2 can in particular be used, in a coding device, by an image level decision module.
  • the window is positioned so as to take an image level decision for the image referenced Pk corresponding to the frames referenced FM and Fj-2- More precisely, the image P k is either an image consisting of the union of the two frames F ⁇ i and F ⁇ 2 ("frame picture"), or an image consisting of two independent frames F ⁇ i and F i-2 ("field picture").
  • the image level decisions (for example decision of the "frame pictureTfield picture” mode of coding, decision of the type of the image I, P, or B?) apply at least over two successive frames.
  • a frame is repeated. The size of the decision window is therefore three frames.
  • the variation in the level of luminance calculated in step 11 is calculated between the frame positioned in Fn in the sliding window and the frame at the same position in the sliding window such as positioned so as to take an image level decision for the preceding image referenced P ⁇ -i.
  • Figures 3 and 4 represent the position of the decision window for the following image Pk+i. Specifically, if Pk is composed of the union of F ⁇ i and Fj -2 and if Fj is not a repeated frame or if Pk is composed of two independent frames F ⁇ i and Fj -2 , then the following position of the decision window for the image Pk+i is illustrated by Figure 3.
  • the present invention also relates to a device for detecting fades, referenced 30 in Figure 5, implementing the method described previously. Only the essential elements of the device are represented in Figure 5.
  • the device 30 comprises: a random access memory 32 (RAM or similar component), a read only memory 33 (hard disk or similar component), a processing unit 34 such as a microprocessor or a similar component and an input /output interface 35. These elements are linked together by an address and data bus 31.
  • the read only memory 33 contains the algorithms implementing steps 10 to 12 and optionally step 13 of the method according to the invention.
  • the processing unit 34 loads and executes the instructions of these algorithms.
  • the random access memory 32 in particular comprises the programs for operating the processing unit 34 loaded on power-up of the appliance, as well as the images to be processed.
  • the input /output interface 35 has the function of receiving the input signal (i.e. the source image sequence) and outputs the result of the fade detection according to steps 10 to 12 (or 13) of the method of the invention.
  • the device for detecting fades 30 can be used in a coding device referenced 40 in Figure 6.
  • This device comprises in particular coding means 41 for coding the source image sequence 400.
  • the coding means comprise at least decision means making it possible to select the type of image (e.g. intra (I image), predicted (P images), bidirectional (B images)).
  • This module can for example implement the MPEG-2 coding standard defined in particular in the document ISO/IEC 13818-2 (entitled "Information technology -- Generic coding of moving pictures and associated audio information: Video"). More generally, it can implement any standard for coding image sequences. In particular it uses the information provided by the device 30 to dynamically adapt the type of image. This can make it possible to improve the cost of compression and the quality of the decoded images.
  • the device for detecting fades 30 can also be integrated with a device for structuring image sequences, referenced 50 in Figure 5.
  • This device 50 comprises in particular indexation means 51 making it possible to create a description 501 of the source image sequence 400 (e.g. temporal marker, otherwise known as a "time code", of start and end of fades).
  • This device 50 in particular makes it possible to retrieve the shot boundaries and narrative units so as to make it possible to navigate non-linearly through the sequence or to generate digests. More precisely, it makes it possible to retrieve the appropriate shot boundaries when a fade has been introduced during editing by virtue of the device for detecting fades 30 and therefore provides better bases for structuring into narrative units.
  • the knowledge of the fades gives high-level information on the structuring of the video.
  • the device 50 therefore makes it possible to structure the sequence for example into chapters and sub-chapters in the case of digital sequences of images as on a DVD.
  • a fade to black or fade-out followed by a fade from black or fade-in very often indicates a change of "chapter".
  • slow motions - which make it possible to detect the interesting phases of a game - are very often delimited by fades.

Abstract

The invention relates to a method for detecting a fade in a sequence of images which comprises the following steps: - calculating (10), for each of the images of a window N consecutive images in length, termed the detection window, and for the image preceding the detection window, a luminance level on the basis of the luminance values associated with each of the pixels of each of the images; - detecting (11) a fade, for each of the N images of the detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding the current image; and - deducing therefrom (12) the presence of a fade for the detection window if, for at least k images of the detection window, a fade has been detected, with 2≤k≤N.

Description

PROCEDURE AND DEVICE FOR DETECTING FADES IN A SEQUENCE OF IMAGES
1. Field of the invention The invention relates to a device and a procedure for detecting fades in a sequence of images.
2. State of the art
In the field of image processing (e.g. compression, content enhancement), the methods applied to image sequences, for example to estimate a motion between two images or else to temporally filter the images of the sequence, often require temporal tracking of the information. But a break in temporal continuity in a sequence of images disrupts the implementation of these methods. This more or less sharp break corresponds to a more or less significant modification (e.g. transition) of the content between two successive images. Its identification is necessary so as to take account of this discontinuity during the application of these methods and thus improve their robustness.
More particularly, a shot or sequence shot is an uninterrupted sequence of images filmed by a camera or a camcorder in a single picture capture. This shot is the basic unit used to analyse or construct a video content. A video sequence generally comprises a succession of shots interconnected with the aid of video editing procedures. These procedures in particular make it possible to create an abrupt transition between two shots or else a progressive transition according to which an image is progressively changed into another image by mixing of the two images (e.g. a fade). The detection of these transitions makes it possible in the case of an image sequence coding application to improve the coding quality by taking account of these transitions in the choices (for example structure of a group of images or "GOP") made by the coder. This detection also makes it possible to retrieve the shot boundaries and narrative units so as in particular to allow non-linear navigation around the content. 3. Summary of the invention
The invention is aimed at detecting progressive transitions, more particularly fades, in a sequence of images. The invention relates to a method for detecting a fade in a sequence of images comprising pixels or image points with each of which is associated at least one luminance value. It comprises the following steps:
- calculating (10), for each of the images of a window N consecutive images in length, termed the detection window, and for the image preceding the detection window, a luminance level on the basis of the luminance values associated with each of the pixels of each of the images;
- detecting (11) a fade, for each of the N images of the detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding the current image; and
- deducing therefrom (12) the presence of a fade for the detection window if, for at least k images of the detection window, a fade has been detected, with 2<k≤N. Preferably, the luminance level associated with an image is equal to the sum of the luminance values associated with each of the pixels of the image.
Advantageously, the method furthermore comprises an additional step for validating the presence of a fade in the detection window if a level of motion associated with the detection window is less than a predetermined threshold.
Preferably, the level of motion associated with the current image is equal to the ratio between the temporal activity calculated between the current image and the image preceding the current image, termed the preceding image, and the spatial activity calculated for the current image. Moreover, the level of motion associated with the decision window is equal to the level of motion associated with the last image of the decision window.
Preferably, the temporal activity between the image and the preceding image is equal to the sum of the absolute values of the pixel-wise differences between the luminance values associated with each pixel of the image and the luminance values associated with each pixel of the preceding image. Furthermore, the spatial activity of the image is equal to the sum of the absolute values of the luminance values associated with each pixel of the image.
The invention also relates to a device for detecting fades in a sequence of images comprising pixels or image points with each of which is associated at least one luminance value. It comprises: - means (32, 33, 34) of calculation for calculating, for each of the images of a window N consecutive images in length, termed the detection window, and for the image preceding the detection window, a luminance level on the basis of the luminance values associated with each of the pixels of each of the images; - means (32, 33, 34) of detection for detecting a fade, for each of the
N images of the detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding the current image; and - means for deducing (32, 33, 34) the presence of a fade for the detection window if for at least k images of the detection window a fade has been detected by the detection means, with 2<k≤N.
The invention relates furthermore to a video coding device which comprises coding means and a device for detecting fades according to the invention.
The invention relates moreover to a device for structuring a sequence of images characterized in that it comprises indexation means and a device for detecting fades according to the invention.
4. Lists of figures
The invention will be better understood and illustrated by means of wholly non-limiting advantageous exemplary embodiments and modes of implementation with reference to the appended figures in which: - Figure 1 illustrates a method for detecting fades according to the invention;
- Figure 2 illustrates a position of a decision window;
- Figure 3 illustrates a displacement of a decision window; - Figure 4 illustrates another displacement of a decision window;
- Figure 5 illustrates a device according to the invention;
- Figure 6 illustrates an image sequence coding device according to the invention; and
- Figure 7 illustrates an image sequence structuring device according to the invention.
5. Detailed description of the invention
The invention is aimed at detecting fades in a sequence of images, the sequence being progressive or interlaced, each image comprising pixels with each of which is associated in particular a luminance value. In the case of interlaced sequences, each image comprises an even field and an odd field.
Each field itself comprises pixels or image points with each of which is associated in particular a luminance value. Considering a sequence of images s and a sequence of images f (for example a sequence of black or nearly uniform images), a fade from f to s is called a fade-in and vice versa a fade from s to f is called a fade-out. The sequence of images comprising the fade from s to for from f to s is defined as follows: e(x,y,n) = (1-x(n)).s(x,y,n) + x(n).f(x,y,n) where: - x(n) is the fade factor, - n is the index of the image in the sequence, and
- (x,y) are the coordinates of the pixels in the image.
The factor x(n) is homogeneous inside an image or frame (i.e. independent of x and y) and lies between 0 and 1. We define the luminance level Le(n) for a given image or frame n in the following manner:
X Y X Y 400 = ∑ L e(x,y, n) = ∑ ∑ [(I - *(«)) * s(x, y, ή) + x(ή)f(x, y, n)] x=0 y=0 x=0 y=0
X Y X Y
It follows that L» = 0 -*(»))£ ∑s(χ,y,n) + x(n)∑ ∑f(χ,y,n) and therefore that
X=O y=0 x=0 y=0
Le(n) = (\ -x(n))*Ls(n) + x(n)Lf(n). By assuming that the image f(x,y,n) is homogeneous, has a luminance level equal to Lf and that Lf(n) = Lf(n + \), then the difference between the luminance level associated with image or frame n and the luminance level associated with image or frame n+1 is equal to: Le (n + 1) - Le (n) = (1 - x(n + 1)) * Ls (n + 1) - (1 - JC(Λ)) * Ls (n) + (x(n + 1) - jc(«))Z,y
Considering that: L s (n + 1) - Ls (n) = ΔLs (n + 1) _ the luminance difference is then equal to:
Le(n + \)-Le(n) = ALs(n + \)*(l-x(n))-(x(n + \)-x(n))*(Ls(n + \)-Lf)
If the fade factor x(n) is linear for a fade which lasts N images or frames from an image (or frame) of index 0 to an image (or frame) of index N then x(n) is split up in the following manner:
x(n) = k, + k7
N kή and k2 are two coefficients making it possible to express the linear nature of the fading factor. In the case of a fade-in: x(0) = 1 and X(TV)=O. In the case of a fade-out: x(0)=0 and x(7γ)=1. It follows that the variation in the luminance level between images n and n+1 is equal to:
Le(n + l)-Le(n) = ALs(n+l)*(l-kι-k2÷)-^*(Ls(n + \)-Lf)
N} N The variation in the luminance level between two successive images (or frames) of the sequence s is generally negligible with respect to the luminance variation between the sequence s and the sequence f, thereby making it possible to simplify the equation above as follows:
Le(n+\)-Le(n)~-^*(Ls(n + \)-Lf)
By making the assumption of the stationnarity of the luminance level along the sequence s, the equation simplifies in the following manner:
Le(n+\)-Le(n)~-^ Zl*(Ls-Lf)
N The latter assumption is valid most of the time insofar as most fades occur on the basis of a sequence of weak motion. Thus during a fade, the variation in the luminance level between two images (or frames) is dependent on the variation in the luminance level between the start and the end of the fade [L8-Lf), and is inversely proportional to the duration (N images or frames) of the fade. The method according to the invention is therefore based on the detection of a variation in the luminance level between two successive images. This variation is thereafter compared with an experimentally defined threshold. According to a preferred embodiment, illustrated by Figure 1 , the method is split up into 4 steps referenced 10 to 13. In this figure, the modules represented are functional units, which may or may not correspond to physically distinguishable units. For example, these modules or some of them may be grouped together in a single component, or constitute functionalities of one and the same software. A contrario, certain modules may possibly be composed of separate physical entities. Hereinafter in the document, to simplify the description the term image is employed to designate either an image composed of two frames or one frame alone. Step 10 consists in calculating, over a window N images in length, termed the detection window, a luminance level for each image of the detection window and for the image of the sequence which precedes this window. As defined previously, the luminance level of an image is the sum over the whole of the image of the luminance values associated with each of the pixels of this image. A window N images in length is a portion of the sequence comprising N successive images.
Step 11 consists in detecting a fade, for each image of the detection window, by comparing the variation in the luminance level between this image, termed the current image, and the image which precedes it with a threshold. This threshold is a multiple of the size of the image as a number of pixels. For example, it is equal to 1.75 multiplied by the width and by the height of the image as a number of pixels. If the variation in the luminance level is greater than this threshold then a fade is detected for the current image. Depending on whether one is seeking to detect fade-in or fade-out, the variation in the luminance level is calculated differently. Thus, in the case of a fade-in, the variation in the luminance level is equal to the difference between the luminance level of the image preceding the current image and the luminance level of the current image. In the case of a fade-out, the variation in the luminance level is equal to the difference between the luminance level of the current image and the luminance level of the image preceding it. This step makes it possible to detect, for each image of the detection window, fade-ins and fade-outs separately. Advantageously, different thresholds can be defined so as to detect the fade-ins and the fade-outs. Step 12 consists in deducing the presence of a fade for the detection window on the basis of the detections performed in step 11. More precisely, this step consists in calculating the number of images of the detection window for which a fade has been detected. During this step, the results arising from the previous step for fade-in and fade-out are combined thereby making it possible to detect a fade whether it be a fade-in or fade-out. For this purpose, if a fade-in or fade-out has been detected in step 11 for at least k (2<k≤N) images of the detection window, we deduce therefrom the presence of a fade for the detection window. The length of the detection window is preferably equal to 8 and k is equal to 6. This step in particular makes it possible to avoid determining false fades due to noise, or else to an instability of the brightness in the course of the sequence.
Advantageously, during an optional step 13, the result of the previous step 12 is combined with a local evaluation of the level of motion of the decision window. This makes it possible to avoid overdetections on portions of sequence of images with strong motion that might possibly be wrongly regarded as fade. This level of motion can be provided for example by an image level decision module (for example decision of the frame/image mode of coding, decision of the type of the image I1 P, or B...) of a coding device. It can be calculated for an image by computing the ratio between the temporal activity calculated between this image and the image preceding it (for example the sum of the absolute values of the differences, pixel-wise, between the luminance values associated with each pixel of this image and the luminance values associated with each pixel of the image preceding it) and the spatial activity calculated for this image (for example the sum over the image of the absolute values of the luminance values associated with each pixel of this image). More precisely, the level of motion calculated for the last image of the decision window is the level of motion associated with the decision window. If the presence of a fade has been deduced for the detection window at the previous step 12 and if the level of motion for this window is less than a certain threshold, then the presence of a fade for the detection window is validated.
In the particular case of an interlaced sequence, it is advantageous to define a sliding window, termed the decision window. This window can for example have a size of three frames. This window shown in Figure 2 can in particular be used, in a coding device, by an image level decision module. In Figure 2, the window is positioned so as to take an image level decision for the image referenced Pk corresponding to the frames referenced FM and Fj-2- More precisely, the image Pk is either an image consisting of the union of the two frames Fμi and Fμ2 ("frame picture"), or an image consisting of two independent frames Fμi and Fi-2 ("field picture"). In the case of the MPEG-2 standard defined in the document ISO/IEC 13818-2 (entitled "Information technology -- Generic coding of moving pictures and associated audio information: Video"), the image level decisions (for example decision of the "frame pictureTfield picture" mode of coding, decision of the type of the image I, P, or B...) apply at least over two successive frames. In the particular case of a sequence having undergone a transformation of 24-image adjustment type ("detelecine" or "3:2 pulldown"), a frame is repeated. The size of the decision window is therefore three frames. In order to tie the fade detection to the image level decisions, the variation in the level of luminance calculated in step 11 , is calculated between the frame positioned in Fn in the sliding window and the frame at the same position in the sliding window such as positioned so as to take an image level decision for the preceding image referenced Pκ-i. Figures 3 and 4 represent the position of the decision window for the following image Pk+i. Specifically, if Pk is composed of the union of Fμi and Fj-2 and if Fj is not a repeated frame or if Pk is composed of two independent frames Fμi and Fj-2, then the following position of the decision window for the image Pk+i is illustrated by Figure 3. On the other hand, if Pk is composed of the union of Fμi and Fi-2 and if Fi is a repeated frame ("detelecine" case), then the following position of the decision window for the image Pk+i is illustrated by Figure 4. The present invention also relates to a device for detecting fades, referenced 30 in Figure 5, implementing the method described previously. Only the essential elements of the device are represented in Figure 5. The device 30 comprises: a random access memory 32 (RAM or similar component), a read only memory 33 (hard disk or similar component), a processing unit 34 such as a microprocessor or a similar component and an input /output interface 35. These elements are linked together by an address and data bus 31. The read only memory 33 contains the algorithms implementing steps 10 to 12 and optionally step 13 of the method according to the invention. On power-up, the processing unit 34 loads and executes the instructions of these algorithms. The random access memory 32 in particular comprises the programs for operating the processing unit 34 loaded on power-up of the appliance, as well as the images to be processed. The input /output interface 35 has the function of receiving the input signal (i.e. the source image sequence) and outputs the result of the fade detection according to steps 10 to 12 (or 13) of the method of the invention.
The present invention applies to the field of image sequence compression. Specifically, the device for detecting fades 30 can be used in a coding device referenced 40 in Figure 6. This device comprises in particular coding means 41 for coding the source image sequence 400. The coding means comprise at least decision means making it possible to select the type of image (e.g. intra (I image), predicted (P images), bidirectional (B images)). This module can for example implement the MPEG-2 coding standard defined in particular in the document ISO/IEC 13818-2 (entitled "Information technology -- Generic coding of moving pictures and associated audio information: Video"). More generally, it can implement any standard for coding image sequences. In particular it uses the information provided by the device 30 to dynamically adapt the type of image. This can make it possible to improve the cost of compression and the quality of the decoded images.
The device for detecting fades 30 can also be integrated with a device for structuring image sequences, referenced 50 in Figure 5. This device 50 comprises in particular indexation means 51 making it possible to create a description 501 of the source image sequence 400 (e.g. temporal marker, otherwise known as a "time code", of start and end of fades). This device 50 in particular makes it possible to retrieve the shot boundaries and narrative units so as to make it possible to navigate non-linearly through the sequence or to generate digests. More precisely, it makes it possible to retrieve the appropriate shot boundaries when a fade has been introduced during editing by virtue of the device for detecting fades 30 and therefore provides better bases for structuring into narrative units. Moreover the knowledge of the fades gives high-level information on the structuring of the video. The device 50 therefore makes it possible to structure the sequence for example into chapters and sub-chapters in the case of digital sequences of images as on a DVD. Specifically, in a film, a fade to black or fade-out followed by a fade from black or fade-in very often indicates a change of "chapter". Moreover, in the sports retransmissions, slow motions - which make it possible to detect the interesting phases of a game - are very often delimited by fades.
Of course, the invention is not limited to the exemplary embodiments mentioned above. In particular, the person skilled in the art can introduce any variant to the embodiments set forth and combine them to benefit from their various advantages.

Claims

Claims
1. Method for detecting a fade in a sequence of images, said images comprising pixels or image points with each of which is associated at least one luminance value, characterized in that it comprises the following steps:
- calculating (10), for each of the images of a window N consecutive images in length, termed detection window, and for the image preceding said detection window, a luminance level on the basis of the luminance values associated with each of said pixels of each of said images;
- detecting (11 ) a fade, for each of said N images of said detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding said current image; and
- deducing therefrom (12) the presence of a fade for said detection window if, for at least k images of said detection window, a fade has been detected, with 2<k≤N.
2. Method according to Claim 1 , characterized in that the luminance level associated with an image is equal to the sum of the luminance values associated with each of said pixels of said image.
3. Method according to one of Claims 1 or 2, characterized in that it furthermore comprises an additional step (13) for validating the presence of a fade in the detection window if a level of motion associated with said detection window is less than a predetermined threshold.
4. Method according to Claim 3, characterized in that the level of motion associated with an image is equal to the ratio between the temporal activity calculated between said image and the image preceding said image, termed the preceding image, and the spatial activity calculated for said image and in that the level of motion associated with said decision window is equal to the level of motion associated with the last image of said decision window.
5. Method according to Claim 4, characterized in that said temporal activity between said image and said preceding image is equal to the sum of the absolute values of the pixel-wise differences between the luminance values associated with each pixel of said image and the luminance values associated with each pixel of said preceding image and in that the spatial activity of said image is equal to the sum of the absolute values of the luminance values associated with each pixel of said image.
6. Device for detecting fades in a sequence of images comprising pixels or image points with each of which is associated at least one luminance value, characterized in that it comprises:
- means (32, 33, 34) of calculation for calculating, for each of the images of a window N consecutive images in length, termed the detection window, and for the image preceding said detection window, a luminance level on the basis of the luminance values associated with each of said pixels of each of said images;
- means (32, 33, 34) of detection for detecting a fade, for each of said N images of said detection window, based on the difference between the luminance level associated with the current image and the luminance level associated with the image preceding said current image; and
- means for deducing (32, 33, 34) the presence of a fade for said detection window if for at least k images of said detection window a fade has been detected by the detection means, with 2<k≤N.
7. Video coding device (40), characterized in that it comprises coding means (41) and a device for detecting fades (30) according to Claim 6.
8. Device (50) for structuring a sequence of images, characterized in that it comprises indexation means (51 ) and a device for detecting fades (30) according to Claim 6.
PCT/EP2006/063328 2005-06-23 2006-06-20 Procedure and device for detecting fades in a sequence of images WO2006136544A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP2008517478A JP2008544666A (en) 2005-06-23 2006-06-20 Procedure and apparatus for detecting fades in an image sequence
EP06763780A EP1894417A2 (en) 2005-06-23 2006-06-20 Procedure and device for detecting fades in a sequence of images
US11/922,439 US8165337B2 (en) 2005-06-23 2006-06-20 Procedure and device for detecting fades in a sequence of images
CN200680022280XA CN101204093B (en) 2005-06-23 2006-06-20 Procedure and device for detecting fades in a sequence of images
US13/425,645 US20120176548A1 (en) 2005-06-23 2012-03-21 Procedure and device for detecting fades in a sequence of images

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0551728A FR2887731A1 (en) 2005-06-23 2005-06-23 METHOD AND DEVICE FOR DETECTING FOUNDED IN IMAGE SEQUENCE
FR0551728 2005-06-23

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/425,645 Continuation US20120176548A1 (en) 2005-06-23 2012-03-21 Procedure and device for detecting fades in a sequence of images

Publications (2)

Publication Number Publication Date
WO2006136544A2 true WO2006136544A2 (en) 2006-12-28
WO2006136544A3 WO2006136544A3 (en) 2007-09-07

Family

ID=35717654

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2006/063328 WO2006136544A2 (en) 2005-06-23 2006-06-20 Procedure and device for detecting fades in a sequence of images

Country Status (7)

Country Link
US (2) US8165337B2 (en)
EP (1) EP1894417A2 (en)
JP (1) JP2008544666A (en)
KR (1) KR20080027255A (en)
CN (1) CN101204093B (en)
FR (1) FR2887731A1 (en)
WO (1) WO2006136544A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013138489A (en) * 2007-09-28 2013-07-11 Dolby Lab Licensing Corp Treating video information

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5561477A (en) 1994-10-26 1996-10-01 Thomson Consumer Electronics, Inc. System for coding a video signal in the presence of an image intensity gradient
JPH09172596A (en) * 1995-10-17 1997-06-30 Canon Inc Image processing unit and image processing method
US5929902A (en) * 1996-02-28 1999-07-27 C-Cube Microsystems Method and apparatus for inverse telecine processing by fitting 3:2 pull-down patterns
CN1166751A (en) * 1996-04-27 1997-12-03 日本胜利株式会社 Picture switching apparatus
US5920360A (en) * 1996-06-07 1999-07-06 Electronic Data Systems Corporation Method and system for detecting fade transitions in a video signal
JP3604864B2 (en) 1997-04-25 2004-12-22 シャープ株式会社 Video encoding device
US6195458B1 (en) * 1997-07-29 2001-02-27 Eastman Kodak Company Method for content-based temporal segmentation of video
US6084641A (en) * 1997-08-06 2000-07-04 General Instrument Corporation Fade detector for digital video
US6459459B1 (en) * 1998-01-07 2002-10-01 Sharp Laboratories Of America, Inc. Method for detecting transitions in sampled digital video sequences
JP3166716B2 (en) * 1998-08-12 2001-05-14 日本電気株式会社 Fade image-adaptive moving image encoding apparatus and encoding method
JP3957915B2 (en) * 1999-03-08 2007-08-15 パイオニア株式会社 Fade detection device and information encoding device
KR20010087552A (en) * 2000-03-07 2001-09-21 구자홍 Dissolve/fade detection method for mpeg-compressed video using spatio-temporal distribution of the macro blocks
JP2001292372A (en) 2000-04-04 2001-10-19 Pioneer Electronic Corp Image change detector, image change detecting method, image encoder and information recording medium in which program for detecting image change is computer- readably recorded
CN100370818C (en) * 2000-05-23 2008-02-20 皇家菲利浦电子有限公司 Commercial-break detection device
JP4328000B2 (en) 2000-08-02 2009-09-09 富士通株式会社 Moving picture coding apparatus and moving picture special effect scene detecting apparatus
US7298962B2 (en) * 2003-05-12 2007-11-20 Macrovision Corporation Method and apparatus for reducing and restoring the effectiveness of a commercial skip system
US7949051B2 (en) * 2004-05-26 2011-05-24 Broadcom Corporation Mosquito noise detection and reduction

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
BESCOS J: "Real-Time Shot Change Detection Over Online MPEG-2 Video" IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 14, no. 4, April 2004 (2004-04), pages 475-484, XP011110017 ISSN: 1051-8215 *
FORD R M ET AL: "Metrics for shot boundary detection in digital video sequences" MULTIMEDIA SYSTEMS, ACM, NEW YORK, NY, US, vol. 8, no. 1, January 2000 (2000-01), pages 37-46, XP002419455 ISSN: 0942-4962 *
FURHT B ET AL: "Technique for detection of editing effects" HANDBOOK OF VIDEO DATABASES. DESIGN AND APPLICATIONS, CRC PRESS,, US, 2004, pages 156-176, XP002370037 *
LEFEVRE S ET AL: "A review of real-time segmentation of uncompressed video sequences for content-based search and retrieval" February 2003 (2003-02), REAL-TIME IMAGING, ACADEMIC PRESS LIMITED, GB, PAGE(S) 73-98 , XP004413477 ISSN: 1077-2014 page 76, paragraph 2.1 *
YEO B-L ET AL: "RAPID SCENE ANALYSIS ON COMPRESSED VIDEO" IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 5, no. 6, 1 December 1995 (1995-12-01), pages 533-544, XP000545960 ISSN: 1051-8215 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013138489A (en) * 2007-09-28 2013-07-11 Dolby Lab Licensing Corp Treating video information

Also Published As

Publication number Publication date
US8165337B2 (en) 2012-04-24
CN101204093A (en) 2008-06-18
US20090129687A1 (en) 2009-05-21
WO2006136544A3 (en) 2007-09-07
US20120176548A1 (en) 2012-07-12
EP1894417A2 (en) 2008-03-05
JP2008544666A (en) 2008-12-04
FR2887731A1 (en) 2006-12-29
CN101204093B (en) 2010-12-15
KR20080027255A (en) 2008-03-26

Similar Documents

Publication Publication Date Title
KR100468967B1 (en) Thumbnail image generating system
EP2193663B1 (en) Treating video information
US6940910B2 (en) Method of detecting dissolve/fade in MPEG-compressed video environment
US20100302453A1 (en) Detection of gradual transitions in video sequences
US20060193387A1 (en) Extracting key frames from a video sequence
US20070139552A1 (en) Unified approach to film mode detection
JPH10304374A (en) Moving image encoding device
JP2004529578A (en) Detection of subtitles in video signals
US8259172B2 (en) Picture searching apparatus
JP3714871B2 (en) Method for detecting transitions in a sampled digital video sequence
WO2007013238A1 (en) Video processing device and vide processing method
US20030058947A1 (en) Scene cut detection in a video bitstream
US8401070B2 (en) Method for robust inverse telecine
US8165337B2 (en) Procedure and device for detecting fades in a sequence of images
US20080247656A1 (en) Method and apparatus for compression of video signals containing fades and flashes
JP2004282318A (en) Scene change detection method and scene change detection apparatus
JPH1023421A (en) Moving picture processing method
KR20020040503A (en) Shot detecting method of video stream
Kuo et al. Detection of H. 264 shot change using intra predicted direction
JP3333956B2 (en) Cut point detection method and device
Park et al. Automatic dissolve detection scheme based on visual rhythm spectrum
KR20030082794A (en) Method and apparatus for shot conversion detection of video encoder

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 1020077029367

Country of ref document: KR

WWE Wipo information: entry into national phase

Ref document number: 2006763780

Country of ref document: EP

Ref document number: 9743/DELNP/2007

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 200680022280.X

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2008517478

Country of ref document: JP

WWW Wipo information: withdrawn in national office

Ref document number: DE

WWP Wipo information: published in national office

Ref document number: 2006763780

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 11922439

Country of ref document: US