Pose sentences : a new representation for understanding human actions
buir.advisor | Duygulu, Pınar | |
dc.contributor.author | Hatun, Kardelen | |
dc.date.accessioned | 2016-01-08T18:07:42Z | |
dc.date.available | 2016-01-08T18:07:42Z | |
dc.date.issued | 2008 | |
dc.description | Ankara : The Department of Computer Engineering and the Institute of Engineering and Science of Bilkent University, 2008. | en_US |
dc.description | Thesis (Master's) -- Bilkent University, 2008. | en_US |
dc.description | Includes bibliographical references leaves 55-58. | en_US |
dc.description.abstract | In this thesis we address the problem of human action recognition from video sequences. Our main contribution to the literature is the compact use of poses while representing videos and most importantly considering actions as pose-sentences and exploit string matching approaches for classification. We focus on single actions, where the actor performs one simple action through the video sequence. We represent actions as documents consisting of words, where a word refers to a pose in a frame. We think pose information is a powerful source for describing actions. In search of a robust pose descriptor, we make use of four well-known techniques to extract pose information, Histogram of Oriented Gradients, k-Adjacent Segments, Shape Context and Optical Flow Histograms. To represent actions, first we generate a codebook which will act as a dictionary for our action dataset. Action sequences are then represented using a sequence of pose-words, as posesentences. The similarity between two actions are obtained using string matching techniques. We also apply a bag-of-poses approach for comparison purposes and show the superiority of pose-sentences. We test the efficiency of our method with two widely used benchmark datasets, Weizmann and KTH. We show that pose is indeed very descriptive while representing actions, and without having to examine complex dynamic characteristics of actions, one can apply simple techniques with equally successful results. | en_US |
dc.description.provenance | Made available in DSpace on 2016-01-08T18:07:42Z (GMT). No. of bitstreams: 1 0003639.pdf: 3316251 bytes, checksum: c2631e601dd45888b286443c83f8247e (MD5) | en |
dc.description.statementofresponsibility | Hatun, Kardelen | en_US |
dc.format.extent | xi, 58 leaves, illustrations, graphs | en_US |
dc.identifier.itemid | BILKUTUPB109730 | |
dc.identifier.uri | http://hdl.handle.net/11693/14772 | |
dc.language.iso | English | en_US |
dc.rights | info:eu-repo/semantics/openAccess | en_US |
dc.subject | Human motion | en_US |
dc.subject | Action recognition | en_US |
dc.subject | String matching | en_US |
dc.subject | Bag-of-words | en_US |
dc.subject.lcc | TA1650 .H38 2008 | en_US |
dc.subject.lcsh | Optical pattern recognition. | en_US |
dc.subject.lcsh | Computer vision. | en_US |
dc.subject.lcsh | Image processing--Digital techniques. | en_US |
dc.subject.lcsh | Body, Human--Computer simulation. | en_US |
dc.title | Pose sentences : a new representation for understanding human actions | en_US |
dc.type | Thesis | en_US |
thesis.degree.discipline | Computer Engineering | |
thesis.degree.grantor | Bilkent University | |
thesis.degree.level | Master's | |
thesis.degree.name | MS (Master of Science) |
Files
Original bundle
1 - 1 of 1