A relevance feedback technique for multimodal retrieval of news videos
EUROCON 2005 - The International Conference on Computer as a Tool
139 - 142
Content-based retrieval in news video databases has become an important task with the availability of large quantities of data in both public and proprietary archives. We describe a relevance feedback technique that captures the significance of different features at different spatial locations in an image. Spatial content is modeled by partitioning images into non-overlapping grid cells. Contributions of different features at different locations are modeled using weights defined for each feature in each grid cell. These weights are iteratively updated based on the user's feedback, given as positive and negative labels on the retrieval results. Given this labeling, the weight updating scheme uses the ratio of the standard deviation of the distances between relevant and irrelevant images to the standard deviation of the distances between relevant images. The proposed technique is evaluated quantitatively and qualitatively using shots related to several sports from the news video collection of the TRECVID video retrieval evaluation. In these experiments, the weights captured the relative contributions of different features and spatial locations. © 2005 IEEE.
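The weight update described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract specifies only the ratio of standard deviations, so the Euclidean distance, the normalization of weights to sum to one, the small `eps` guard, the weighted-sum ranking distance, and all function names here are assumptions made for illustration. Each image is represented as a nested list `image[cell][feature]` holding a feature vector.

```python
import math
from itertools import combinations, product
from statistics import pstdev


def cell_distance(a, b):
    """Euclidean distance between two feature vectors (assumed metric)."""
    return math.dist(a, b)


def update_weights(relevant, irrelevant, eps=1e-9):
    """Compute one weight per (grid cell, feature) from labeled images.

    Needs at least two relevant images and one irrelevant image.
    weight = std(relevant-to-irrelevant distances)
             / std(relevant-to-relevant distances)
    """
    n_cells = len(relevant[0])
    n_feats = len(relevant[0][0])
    weights = [[0.0] * n_feats for _ in range(n_cells)]
    for c in range(n_cells):
        for f in range(n_feats):
            cross = [cell_distance(r[c][f], n[c][f])
                     for r, n in product(relevant, irrelevant)]
            within = [cell_distance(a[c][f], b[c][f])
                      for a, b in combinations(relevant, 2)]
            # eps avoids division by zero (an assumption, not from the paper)
            weights[c][f] = pstdev(cross) / (pstdev(within) + eps)
    # Normalizing to sum to 1 is an assumption for comparability
    total = sum(sum(row) for row in weights)
    if total > 0:
        weights = [[w / total for w in row] for row in weights]
    return weights


def weighted_distance(query, image, weights):
    """Combine per-cell, per-feature distances as a weighted sum (assumed)."""
    return sum(weights[c][f] * cell_distance(query[c][f], image[c][f])
               for c in range(len(weights))
               for f in range(len(weights[c])))


# Toy data: 2 grid cells, 1 feature per cell, 1-dimensional feature vectors.
relevant = [[[[0.0]], [[0.0]]], [[[1.0]], [[1.0]]], [[[3.0]], [[3.0]]]]
irrelevant = [[[[10.0]], [[10.0]]], [[[20.0]], [[10.0]]]]
weights = update_weights(relevant, irrelevant)
```

In an iterative setting, `update_weights` would be called again after each round of user labeling, and `weighted_distance` would re-rank the collection with the new weights.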
Video signal processing
Content based retrieval
Published Version (Please cite this version): https://doi.org/10.1109/EURCON.2005.1629878