Utilizing multiple instance learning for computer vision tasks

Şener, Fadime

Utilizing multiple instance learning for computer vision tasks

buir.advisor	Şahin, Pınar Duygulu
dc.contributor.author	Şener, Fadime
dc.date.accessioned	2016-01-08T18:26:16Z
dc.date.available	2016-01-08T18:26:16Z
dc.date.issued	2013
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references leaves 81-89.	en_US
dc.description.abstract	The Multiple Instance Learning (MIL) paradigm arises to be useful in many application domains, whereas it is particularly suitable for computer vision problems due to the difficulty of obtaining manual labeling. Multiple Instance Learning methods have large applicability to a variety of challenging learning problems in computer vision, including object recognition and detection, tracking, image classification, scene classification and more. As opposed to working with single instances as in standard supervised learning, Multiple Instance Learning operates over bags of instances. A bag is labeled as positive if it is known to contain at least one positive instance; otherwise it is labeled as negative. The overall learning task is to learn a model for some concept using a training set that is formed of bags. A vital component of using Multiple Instance Learning in computer vision is its design for abstracting the visual problem to multi-instance representation, which involves determining what the bag is and what are the instances in the bag. In this context, we consider three different computer vision problems and propose solutions for each of them via novel representations. The first problem is image retrieval and re-ranking; we propose a method that automatically constructs multiple candidate Multi-instance bags, which are likely to contain relevant images. The second problem we look into is recognizing actions from still images, where we extract several candidate object regions and approach the problem of identifying related objects from a weakly supervised point of view. Finally, we address the recognition of human interactions in videos within a MIL framework. In human interaction recognition, videos may be composed of frames of different activities, and the task is to identify the interaction in spite of irrelevant activities that are scattered through the video. To overcome this problem, we use the idea of Multiple Instance Learning to tackle irrelevant actions in the whole video sequence classification. Each of the outlined problems are tested on benchmark datasets of the problems and compared with the state-of-the-art. The experimental results verify the advantages of the proposed MIL approaches to these vision problems.	en_US
dc.description.statementofresponsibility	Şener, Fadime	en_US
dc.format.extent	xx, 89 leaves, illustrations, graphics	en_US
dc.identifier.itemid	B139319
dc.identifier.uri	http://hdl.handle.net/11693/15890
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Computer vision	en_US
dc.subject	Multiple instance learning	en_US
dc.subject	Image retrieval	en_US
dc.subject	Image re-ranking	en_US
dc.subject	Action recognition in images	en_US
dc.subject	Interaction recognition	en_US
dc.subject	Multiple features	en_US
dc.subject.lcc	TA1634 .S45 2013	en_US
dc.subject.lcsh	Computer vision.	en_US
dc.subject.lcsh	Image processing--Digital techniques--Mathematical models.	en_US
dc.subject.lcsh	Optical pattern recognition.	en_US
dc.subject.lcsh	Human locomotion--Computer simulation.	en_US
dc.title	Utilizing multiple instance learning for computer vision tasks	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Computer Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Master's
thesis.degree.name	MS (Master of Science)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 0006589.pdf
Size:: 50.41 MB
Format:: Adobe Portable Document Format

Download

Collections

Graduate School of Engineering and Science