dc.contributor.author: Baştan, Muhammet [en_US]
dc.contributor.author: Duygulu, Pınar [en_US]
dc.coverage.spatial: Tempe, AZ, USA
dc.date.accessioned: 2016-02-08T11:48:49Z
dc.date.available: 2016-02-08T11:48:49Z
dc.date.issued: 2006-07 [en_US]
dc.identifier.uri: http://hdl.handle.net/11693/27256
dc.description: Date of Conference: 13-15 July, 2006
dc.description: Conference name: 5th International Conference on Image and Video Retrieval. CIVR 2006: Image and Video Retrieval
dc.description.abstract: We propose a new approach to recognizing objects and scenes in news videos, motivated by the availability of large video collections. This approach treats the recognition problem as the translation of visual elements to words. The correspondences between visual elements and words are learned using methods adapted from statistical machine translation and are used to predict words for particular image regions (region naming), for entire images (auto-annotation), or to associate the automatically generated speech transcript text with the correct video frames (video alignment). Experimental results are presented on the TRECVID 2004 data set, which consists of about 150 hours of news videos associated with manual annotations and speech transcript text. The results show that retrieval performance can be improved by associating visual and textual elements. In addition, an extensive analysis of features is provided and a method to combine features is proposed. © Springer-Verlag Berlin Heidelberg 2006. [en_US]
dc.language.iso: English [en_US]
dc.source.title: 5th International Conference on Image and Video Retrieval. CIVR 2006: Image and Video Retrieval [en_US]
dc.relation.isversionof: https://doi.org/10.1007/11788034_39
dc.subject: Feature extraction [en_US]
dc.subject: Image analysis [en_US]
dc.subject: Multimedia systems [en_US]
dc.subject: Speech recognition [en_US]
dc.subject: Statistical methods [en_US]
dc.subject: News videos [en_US]
dc.subject: Statistical machine translation [en_US]
dc.subject: Video collections [en_US]
dc.subject: Video frames [en_US]
dc.subject: Object recognition [en_US]
dc.title: Recognizing objects and scenes in news videos [en_US]
dc.type: Conference Paper [en_US]
dc.department: Department of Computer Engineering [en_US]
dc.citation.spage: 380 [en_US]
dc.citation.epage: 390 [en_US]
dc.identifier.doi: 10.1007/11788034_39
dc.publisher: Springer [en_US]

