    • Automatic detection of salient objects and spatial relations in videos for a video database system 

      Sevilmiş, T.; Baştan, M.; Güdükbay, Uğur; Ulusoy, Özgür (Elsevier BV, 2008-10)
      Multimedia databases have gained popularity due to rapidly growing quantities of multimedia data and the need to perform efficient indexing, retrieval and analysis of this data. One downside of multimedia databases is the ...
    • BilVideo-7: an MPEG-7-compatible video indexing and retrieval system

      Baştan, M.; Çam, H.; Güdükbay, Uğur; Ulusoy, Özgür (Institute of Electrical and Electronics Engineers, 2010-07)
      BilVideo-7 is an MPEG-7-compatible, distributed, video indexing and retrieval system that supports complex multimodal queries in a unified framework.
    • HandVR: a hand-gesture-based interface to a video retrieval system 

      Genç, S.; Baştan, M.; Güdükbay, Uğur; Atalay, V.; Ulusoy, Özgür (Springer UK, 2015)
      Using one’s hands in human–computer interaction increases both the effectiveness of computer usage and the speed of interaction. One way of accomplishing this goal is to utilize computer vision techniques to develop ...
    • Keyframe labeling technique for surveillance event classification 

      Şaykol, E.; Baştan, M.; Güdükbay, Uğur; Ulusoy, Özgür (SPIE - International Society for Optical Engineering, 2010)
      The huge amount of video data generated by surveillance systems necessitates the use of automatic tools for their efficient analysis, indexing, and retrieval. Automated access to the semantic content of surveillance videos ...
    • Mobile multi-view object image search 

      Çalışır, F.; Baştan, M.; Ulusoy, Özgür; Güdükbay, Uğur (Springer New York LLC, 2017)
      High user interaction capability of mobile devices can help improve the accuracy of mobile visual search systems. At query time, it is possible to capture multiple views of an object from different viewing angles and at ...
    • Multimedia translation for linking visual data to semantics in videos 

      Duygulu, P.; Baştan, M. (Springer, 2011-01)
      The semantic gap problem, which can be described as the disconnection between low-level multimedia data and high-level semantics, is an important obstacle to building real-world multimedia systems. The recently developed ...
    • Translating images to words for recognizing objects in large image and video collections 

      Duygulu, P.; Baştan, M.; Forsyth, D. (Springer, 2006)
      We present a new approach to the object recognition problem, motivated by the recent availability of large annotated image and video collections. This approach considers object recognition as the translation of visual ...
    • Video copy detection using multiple visual cues and MPEG-7 descriptors 

      Küçüktunç, O.; Baştan, M.; Güdükbay, Uğur; Ulusoy, Özgür (Academic Press, 2010)
      We propose a video copy detection framework that detects copy segments by fusing the results of three different techniques: facial shot matching, activity subsequence matching, and non-facial shot matching using low-level ...