Now showing items 21-40 of 58

    • GCap: Graph-based automatic image captioning 

      Pan J.-Y.; Yang H.-J.; Faloutsos C.; Duygulu, Pınar (IEEE, 2004)
      Given an image, how do we automatically assign keywords to it? In this paper, we propose a novel, graph-based approach (GCap) which outperforms previously reported methods for automatic image captioning. Moreover, it is ...
    • Gibbs random field model based 3-D motion estimation from video sequences 

      Alatan, A. A.; Levent, O. (IEEE, 1994)
      In contrast to previous global 3D motion concept, a Gibbs random field based method, which models local interactions between motion parameters defined at each point on the object, is proposed. An energy function which gives ...
    • Ground-nesting insects could use visual tracking for monitoring nest position during learning flights 

      Samet, Nermin; Zeil, J.; Mair, E.; Boeddeker, N.; Stürzl, W. (Springer Verlag, 2014-07)
      Ants, bees and wasps are central place foragers. They leave their nests to forage and routinely return to their home-base. Most are guided by memories of the visual panorama and the visual appearance of the local nest ...
    • HandVR: a hand-gesture-based interface to a video retrieval system 

      Genç, S.; Baştan M.; Güdükbay, Uğur; Atalay, V.; Ulusoy, Özgür (Springer U K, 2015)
      Using one’s hands in human–computer interaction increases both the effectiveness of computer usage and the speed of interaction. One way of accomplishing this goal is to utilize computer vision techniques to develop ...
    • Identification of illustrators 

      Şener Fadime; Samet, Nermin; Duygulu-Şahin Pınar (2012-10)
      This paper is motivated by a book in which artists and illustrators from all over the world offer their personal interpretations of the declaration of human rights in pictures [1]. It was enthusiastic for a young reader ...
    • Joint estimation and optimum encoding of depth field for 3-D object-based video coding 

      Alatan, A. Aydın; Onural, Levent (IEEE, 1996-09)
      3-D motion models can be used to remove temporal redundancy between image frames. For efficient encoding using 3-D motion information, apart from the 3-D motion parameters, a dense depth field must also be encoded to achieve ...
    • Localization of diagnostically relevant regions of interest in whole slide images: a comparative study 

      Mercan, E.; Aksoy, S.; Shapiro, L. G.; Weaver, D. L.; Brunyé, T. T.; Elmore, J. G. (Springer New York LLC, 2016-08)
      Whole slide digital imaging technology enables researchers to study pathologists’ interpretive behavior as they view digital slides and gain new understanding of the diagnostic medical decision-making process. In this ...
    • Mağaza katalogları içerisinde resim arama 

      Baysal, Sermetcan; Kurt, Mehmet Can; Aydoğdu, Gonca; Damcı, Pelin; Telmen, İlay; Duygulu, Pınar (IEEE, 2009-04)
      In this paper, an overview of an application, which aims to make significant improvements on access methods to the online shopping catalogs, is presented. In current online shopping sites, only browsing and semantic based ...
    • Map building with multiple range measurements using morphological surface profile extraction 

      Barshan, B.; Başkent, D. (IEEE, Piscataway, NJ, United States, 1999)
      A novel method is described for surface profile extraction based on morphological processing of multiple range sensor data. The approach taken is extremely flexible and robust, in addition to being simple and straightforward. ...
    • Motion capture and human pose reconstruction from a single-view video sequence 

      Güdükbay, Uğur; Demir, I.; Dedeoǧlu, Y. (Academic Press, 2013)
      We propose a framework to reconstruct the 3D pose of a human for animation from a sequence of single-view video frames. The framework for pose construction starts with background estimation and the performer's silhouette ...
    • An MPEG-7 compatible video retrieval system with integrated support for complex multimodal queries 

      Baştan, Muhammet; Çam, Hayati; Güdükbay, Uğur; Ulusoy, Özgür (IEEE Computer Society, 2019)
      We present BilVideo-7, an MPEG-7 compatible, video indexing and retrieval system that supports complex multimodal queries in a unified framework. An MPEG-7 profile is developed to represent the videos by decomposing them ...
    • Multi-channel TDMA scheduling in wireless sensor networks 

      Uyanık, Özge (Bilkent University, 2013)
      The Multiple Instance Learning (MIL) paradigm arises to be useful in many application domains, whereas it is particularly suitable for computer vision problems due to the difficulty of obtaining manual labeling. Multiple ...
    • Multi-label multi-modal classification of movie scenes 

      Türköz, Irmak (Bilkent University, 2022-09)
      Promoting movies through their trailers provides valuable information that can help viewers and investors form expectations about the movie’s future success. Recent research confirmed that the audience prefers to watch ...
    • Nesne tanımada bağlam ve anlambilimsel sınıflandırmanın önemi: Bilgisayarla görme ve insanda görme alanlarındaki çalışmalar 

      Aksoy, Selim; Boyacı, Hüseyin; Gökçay, D. (IEEE, 2008-04)
      Sahne sınıflandırması ve nesne tanıma, bilgisayarla görme alanında fok uzun yıllardır üzerinde çalışılan temel problemlerdir. Bilgisayarlara kazandırılmaya çalışılan, sahnelerin ve içerdikleri nesnelerin otomatik olarak ...
    • Neural networks for improved target differentiation and localization with sonar 

      Ayrulu, B.; Barshan, B. (Pergamon Press, 2001)
      This study investigates the processing of sonar signals using neural networks for robust differentiation of commonly encountered features in indoor robot environments. Differentiation of such features is of interest for ...
    • Nitelik tabanlı sınıflandırıcılar ve koşullu rastgele alan ile dikkat çeken görsel bölge tespiti 

      Demirel, B.; Cinbiş, Ramazan Gökberk; İkizler-Cinbiş, N. (IEEE, 2016-05)
      Dikkat çeken görsel bölge tahmini, resimlerde ya da sahnelerde insan gözünün öncelikli olarak odaklandıgı bölgeleri bulmayı amaçlayan bir bilgisayarlı görü problemidir. Pekçok bilgisayarlı görü problemi bir sahnedeki ...
    • Object rigidity and reflectivity identification based on motion analysis 

      Zang, D.; Schrater P.R.; Doerschner, Katja (IEEE, 2010)
      Rigidity and reflectivity are important properties of objects, identifying these properties is a fundamental problem for many computer vision applications like motion and tracking. In this paper, we extend our previous ...
    • On recognizing actions in still images via multiple features 

      Şener, Fadime; Bas, C.; Ikizler-Cinbis, N. (Springer, Berlin, Heidelberg, 2012)
      We propose a multi-cue based approach for recognizing human actions in still images, where relevant object regions are discovered and utilized in a weakly supervised manner. Our approach does not require any explicitly ...
    • Perception of 3-D Surfaces from 2-D Contours 

      Ulupinar F.; Nevatia, R. (1993)
      Inference of 3-D shape from 2-D contours in a single image is an important problem in machine vision. We survey classes of techniques proposed in the past and provide a critical analysis. We propose that two kinds of ...
    • Physically-based simulation of hair strips in real-time 

      Taşkıran, Hasan Dogu; Güdükbay, Uğur (UNION Agency - Science Press, 2005)
      In this paper, we present our implementation of physically-based simulation of hair strips. We used a mass-spring model followed by a hybrid approach where particle systems and the method of clustering of hair strands are ...