Detecting Violent Content in Hollywood Movies and User-Generated Videos - Smart Information Systems: Computational Intelligence for Real-Life Applications - page 303

Information Technology Reference

In-Depth Information

Fig. 11.6

Ranked List view of our visualization tool

11.5.3 Annotations

The training set of the MediaEval 2013 VSD task provides annotations for 18 Hol-

lywood movies. The annotations mark the presence of audio, visual and audio-

visual concepts such as explosions, gunshots, screams, blood, fights, car chases, fire,

firearms, coldweapons and gore. The user can query any or all movies for any of these

concepts (e.g., show all segments with fire in Saving Private Ryan ). The annotations

are then displayed in a view (Fig. 11.7 ) similar to the one of the Ranked List .

11.5.4 Online Analysis

The Online Analysis (Fig. 11.8 ) executes our VSD pipeline to any video hosted by

YouTube (or any other site supported by the youtube-dl script). After the user entered

the URL, the video is downloaded, transcoded, and split into segments. The MFCC

feature vectors of the audio of each segment are subsequently computed and used

to build mid-level features with sparse coding and vector quantization as explained

in [ 1 ]. Both mid-level feature representations are used to classify the segment and

produce two violence scores. Even though our methods only use audio features, the

Online Analysis pipeline can be applied to any method using audio, visual, or audio-

visual features. In addition to the Ranked List view, the Online Analysis produces a

Next Page

Smart Information Systems: Computational Intelligence for Real-Life Applications

Search WWH ::

Custom Search

Home