Multiple view human activity recognition

Pehlivan, Selen

Multiple view human activity recognition

buir.advisor	Duygulu, Pınar
dc.contributor.author	Pehlivan, Selen
dc.date.accessioned	2016-01-08T18:23:14Z
dc.date.available	2016-01-08T18:23:14Z
dc.date.issued	2012
dc.description	Cataloged from PDF version of article.	en_US
dc.description	Includes bibliographical references leaves 94-100.	en_US
dc.description.abstract	This thesis explores the human activity recognition problem when multiple views are available. We follow two main directions: we first present a system that performs volume matching using constructed 3D volumes from calibrated cameras, then we present a flexible system based on frame matching directly using multiple views. We examine the multiple view systems compared to single view systems, and measure the performance improvements in recognition using more views by various experiments. Initial part of the thesis introduces compact representations for volumetric data gained through reconstruction. The video frames recorded by many cameras with significant overlap are fused by reconstruction, and the reconstructed volumes are used as substitutes of action poses. We propose new pose descriptors over these three dimensional volumes. Our first descriptor is based on the histogram of oriented cylinders in various sizes and orientations. We then propose another descriptor which is view-independent, and which does not require pose alignment. We show the importance of discriminative pose representations within simpler activity classification schemes. Activity recognition framework based on volume matching presents promising results compared to the state-of-the-art. Volume reconstruction is one natural approach for multi camera data fusion, but there can be few cameras with overlapping views. In the second part of the thesis, we introduce an architecture that is adaptable to various number of cameras and features. The system collects and fuses activity judgments from cameras using a voting scheme. The architecture requires no camera calibration. Performance generally improves when there are more cameras and more features; training and test cameras do not need to overlap; camera drop in or drop out is handled easily with little penalty. Experiments support the performance penalties, and advantages for using multiple views versus single view.	en_US
dc.description.statementofresponsibility	Pehlivan, Selen	en_US
dc.format.extent	xix, 100 leaves, illustrations	en_US
dc.identifier.uri	http://hdl.handle.net/11693/15693
dc.language.iso	English	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	Video analysis	en_US
dc.subject	Human activity recognition	en_US
dc.subject	Multiple views	en_US
dc.subject	Multiple cameras	en_US
dc.subject	Pose representation	en_US
dc.subject.lcc	QP301 .P44 2012	en_US
dc.subject.lcsh	Human locomotion--Computer simulation.	en_US
dc.subject.lcsh	Body, Human--Computer simulation.	en_US
dc.subject.lcsh	Image processing--Digital techniques.	en_US
dc.subject.lcsh	Computer simulation.	en_US
dc.subject.lcsh	Digital computer vision.	en_US
dc.subject.lcsh	Pattern recognition systems.	en_US
dc.title	Multiple view human activity recognition	en_US
dc.type	Thesis	en_US
thesis.degree.discipline	Computer Engineering
thesis.degree.grantor	Bilkent University
thesis.degree.level	Doctoral
thesis.degree.name	Ph.D. (Doctor of Philosophy)

Files

Original bundle

Now showing 1 - 1 of 1

Name:: 0006408.pdf
Size:: 5.76 MB
Format:: Adobe Portable Document Format

Download

Collections

Graduate School of Engineering and Science