Video segmentation into shots is the first step for video indexing and searching. Videos shots are mostly very small in duration\nand do not give meaningful insight of the visual contents. However, grouping of shots based on similar visual contents gives a\nbetter understanding of the video scene; grouping of similar shots is known as scene boundary detection or video segmentation\ninto scenes. In this paper, we propose a model for video segmentation into visual scenes using bag of visual word (BoVW) model.\nInitially, the video is divided into the shots which are later represented by a set of key frames. Key frames are further represented by\nBoVW feature vectors which are quite short and compact compared to classical BoVW model implementations. Two variations of\nBoVWmodel are used: (1) classical BoVWmodel and (2) Vector of LinearlyAggregated Descriptors (VLAD)which is an extension\nof classical BoVW model. The similarity of the shots is computed by the distances between their key frames feature vectors within\nthe sliding window of length.................
Loading....