We detect remake movies based on the similiarity of audio hightlighted segments. The steps involved in detecting remake movies are
  1. Audio Highlight Detection
  2. Tuple Matrix Formation
  3. Tuple Comparison for Detecting Remakes

Audio Highlight Detection

The intensity of a segment of an audio signal is summarized by the root-mean-square value. The audio track of a video is divided into windows of equal size and the rms value is computed for each audio window. From the resulting rms sequence, the rms ratio is computed for successive items in the sequence.

The rms ratio is marked as low when the value is below a user defined threshold. In our implementation, we use 5 as the threshold, and the video frames corresponding to such windows are considered as "important".

Tuple Matrix Formation

For the audio highlighted segments, wavelet transform is computed from the rms signal of the corresponding audio .


RMS waveform of audio signal detected as highlight and its corresponding wavelet transform

This wavelet transformed signal is treated as image and is splitted into windows of small interval both vertically and horizontally. For each window, dominant value is computed and stored which represents tuple image.

      
Segmenting wavelet signal to form tuples

From these tuples number values are extracted based on the dominant value and tuple matrix is formed.


Tuple formation

Tuple Comparison for Detecting Remakes

We compute two similarity metrics from this tuple matrix. Our first feature captures information about different types of music that occurs in the audio. The tuples in the tuple matrix along with its frequency of occurrences forms our first feature. The frequencies of the intersecting tuples is summed up to form similarity measure. This measure is normalized by dividing it by the total number of tuples in both audios.

Our second feature works with the sequence of tuple in tuple matrices. For each tuple matrix, the number of common tuples from the tuple matrixes are computed. This results in a similarity matrix of size N/c$ where each point corresponds to a tuple. The similarity measure is computed by summing up the exponential values of data from the similarity matrix. This measure is used for identifying remakes which have same music.

Tuple formation