Browsing by Subject "Video signal processing"
Item Open Access
Automatic detection of salient objects and spatial relations in videos for a video database system (Elsevier BV, 2008-10)
Sevilmiş, T.; Baştan, M.; Güdükbay, Uğur; Ulusoy, Özgür
Multimedia databases have gained popularity due to rapidly growing quantities of multimedia data and the need to perform efficient indexing, retrieval, and analysis of this data. One downside of multimedia databases is the necessity to process the data for feature extraction and labeling prior to storage and querying. The huge amount of data makes it impossible to complete this task manually. We propose a tool for the automatic detection and tracking of salient objects, and derivation of spatio-temporal relations between them in video. Our system aims to significantly reduce the work of manually selecting and labeling objects by detecting and tracking the salient objects, so that the label for each object needs to be entered only once within each shot instead of in every frame in which the object appears. This is also required as a first step in a fully automatic video database management system in which the labeling should also be done automatically. The proposed framework covers a scalable architecture for video processing and stages of shot boundary detection, salient object detection and tracking, and knowledge-base construction for effective spatio-temporal object querying. © 2008 Elsevier B.V. All rights reserved.

Item Open Access
Çarpıcıdan bağımsız ortak fark matrisi kullanarak video ve görüntü işleme [Video and image processing using a multiplier-free co-difference matrix] (IEEE, 2009-04)
Çetin, A. Enis; Duman, Kaan; Tuna, Hakan; Eryıldırım, Abdulkadir
In this paper, we present a fast algorithm that can be used for moving object tracking, face detection, license plate detection, and region description, based on a region descriptor obtained by defining a new operator that forms a semigroup over the real numbers. This new operator requires no multiplications. Using this operator, we define a matrix, called the co-difference matrix, that characterizes image regions. In the license plate detection application, we estimate co-difference matrices from license plate regions and store them in a database. To identify license plate regions in real-time video, we first determine the images in the video that contain moving regions; we then decide whether a region contains a license plate by comparing the co-difference matrices of plate-sized regions, either inside the moving regions or in the whole image, with the license plate co-difference matrices in the database.

Item Open Access
A fast algorithm for subpixel accuracy image stabilization for digital film and video (SPIE, 1998)
Eroğlu, Çiğdem; Erdem, A. T.
This paper introduces a novel method for subpixel accuracy stabilization of unsteady digital films and video sequences. The proposed method offers a near-closed-form solution to the estimation of the global subpixel displacement between two frames that causes their misregistration. The criterion function used is the mean-squared error over the displaced frames, in which image intensities at subpixel locations are evaluated using bilinear interpolation. The proposed algorithm is both faster and more accurate than the search-based solutions found in the literature. Experimental results also demonstrate the superiority of the proposed method over the spatio-temporal differentiation and surface fitting algorithms. Furthermore, the proposed algorithm is designed so that it is insensitive to frame-to-frame intensity variations. It is also possible to estimate any affine motion between two frames by applying the proposed algorithm on three non-collinear points in the unsteady frame.

Item Open Access
Gibbs random field model based 3-D motion estimation from video sequences (IEEE, 1994)
Alatan, A. A.; Levent, O.
In contrast to the previous global 3-D motion concept, a Gibbs random field based method, which models local interactions between motion parameters defined at each point on the object, is proposed. An energy function, which gives the joint probability distribution of motion vectors, is constructed. The energy function is minimized in order to find the most likely motion vector set. Some convergence problems, due to the ill-posedness of the problem, are overcome by using the concept of hierarchical rigidity. In hierarchical rigidity, the objects are assumed to be almost rigid at the coarsest level, and this rigidity is weakened at each level until the finest level is reached. The propagation of motion information between levels is encouraged. At the finest level, each point has a motion vector associated with it, and the interactions between these vectors are described by the energy function. The minimization of the energy function is achieved by using hierarchical rigidity, without becoming trapped in a local minimum. The results are promising.

Item Open Access
A histogram-based approach for object-based query-by-shape-and-color in image and video databases (Elsevier, 2005)
Şaykol, E.; Güdükbay, Uğur; Ulusoy, Özgür
Considering the fact that querying by low-level object features is essential in image and video data, an efficient approach for querying and retrieval by shape and color is proposed. The approach employs three specialized histograms (distance, angle, and color histograms) to store feature-based information that is extracted from objects. The objects can be extracted from images or video frames. The proposed histogram-based approach is used as a component in the query-by-feature subsystem of a video database management system. The color and shape information is handled together to enrich the querying capabilities for content-based retrieval.
The evaluation of the retrieval effectiveness and the robustness of the proposed approach is presented via performance experiments. © 2005 Elsevier Ltd. All rights reserved.

Item Open Access
Impact of scalability in video transmission in promotion-capable differentiated services networks (IEEE, 2002-09)
Gürses, E.; Akar, G. B.; Akar, Nail
Transmission of high quality video over the Internet faces many challenges, including the unpredictable packet loss characteristics of the current Internet and the heterogeneity of receivers in terms of their bandwidth and processing capabilities. To address these challenges, we propose an architecture in this paper that is based on the temporally scalable and error resilient video coding mode of the H.263+ codec. In this architecture, the video frames will be transported over a new generation IP network that supports differentiated services (Diffserv). We also propose a novel Two Rate Three Color Promotion-Capable Marker (trTCPCM) to be used at the edge of the Diffserv network. Our simulation study demonstrates that an average of 30 dB can be achieved in the case of highly congested links.

Item Open Access
Improvement of face detection algorithms for news videos (IEEE, 2005)
Ikizler, Nazlı; Duygulu, Pınar
People are the most important subjects in news videos, and for proper retrieval of person images, face detection is a crucial step. However, face detection and recognition in news videos is a very challenging task due to the huge irregularities and high noise level in the data. This study presents a method that combines skin detection and Schneiderman-Kanade face detection to improve face detection performance in news videos for better retrieval. This method has been tested on the TRECVID 2003 dataset and the results are very promising. © 2005 IEEE.

Item Open Access
Introduction to the issue on emerging techniques in 3-D (IEEE, 2012)
Alatan, A. A.; Ostermann, J.; Onural, L.; AlRegib, G.; Mattoccia, S.; Yuan, C.
The fifteen papers in this special section focus on three-dimensional (3-D) content, with particular emphasis on the fusion of conventional camera outputs with those captured by other modalities, such as active sensors, multi-spectral data, or dynamic range images, as well as on applications that support the measurement and improvement of 3-D content.

Item Open Access
Iterative technique for 3-D motion estimation in videophone applications (IEEE, 1994-04)
Bozdağı, Gözde; Tekalp, A. M.; Onural, Levent
In object based coding of facial images, the accuracy of motion and depth parameter estimates strongly affects the coding efficiency. We propose an improved algorithm based on stochastic relaxation for 3-D motion and depth estimation that converges to the true motion and depth parameters even in the presence of 50% error in the initial depth estimates. The proposed method is compared with an existing algorithm (MBASIC) for different numbers of point correspondences. The simulation results show that the proposed method provides significantly better results than the MBASIC algorithm.

Item Open Access
Moving object detection using adaptive subband decomposition and fractional lower-order statistics in video sequences (Elsevier, 2002)
Bagci, A. M.; Yardimci, Y.; Çetin, A. Enis
In this paper, a moving object detection method in video sequences is described. In the first step, the camera motion is eliminated using motion compensation. An adaptive subband decomposition structure is then used to analyze the motion compensated image. In the "low-high" and "high-low" subimages, moving objects appear as outliers, and they are detected using a statistical detection test based on fractional lower-order statistics. It turns out that the distribution of the subimage pixels is almost Gaussian in general.
On the other hand, at the object boundaries the distribution of the pixels in the subimages deviates from Gaussianity due to the existence of outliers. By detecting the regions containing outliers, the boundaries of the moving objects are estimated. Simulation examples are presented. © 2002 Elsevier Science B.V. All rights reserved.

Item Open Access
Moving shadow detection in video using cepstrum (SAGE, 2013)
Cogun, F.; Çetin, A. Enis
Moving shadows cause problems in various applications such as image segmentation and object tracking. The main cause of these problems is the misclassification of shadow pixels as target pixels. Therefore, an accurate and reliable shadow detection method is essential to realize intelligent video processing applications. In this paper, a cepstrum-based method for moving shadow detection is presented. The proposed method is tested on outdoor and indoor video sequences using well-known benchmark test sets. To show the improvements over previous approaches, quantitative metrics are introduced and comparisons based on these metrics are made. © 2013 Cogun and Cetin; licensee InTech.

Item Open Access
Real time hand gesture recognition for computer interaction (IEEE, 2014-04)
Farooq, J.; Ali, Muhaddisa Barat
Hand gesture recognition is a natural and intuitive way to interact with the computer, since interactions with the computer can be increased through multidimensional use of hand gestures as compared to other input methods. The purpose of this paper is to explore three different techniques for hand gesture recognition (HGR) using fingertip detection. A new approach called 'Curvature of Perimeter' is presented with its application as a virtual mouse. The presented system uses only a webcam and algorithms developed using the computer vision, image, and video processing toolboxes of Matlab. © 2014 IEEE.

Item Open Access
Real-time fire and flame detection in video (IEEE, 2005)
Dedeoğlu, Yigithan; Töreyin, B. Ugur; Güdükbay, Uğur; Çetin, A. Enis
This paper proposes a novel method to detect fire and/or flame by processing the video data generated by an ordinary camera monitoring a scene. In addition to ordinary motion and color clues, flame and fire flicker is detected by analyzing the video in the wavelet domain. Periodic behavior in flame boundaries is detected by performing a temporal wavelet transform. Color variations in fire are detected by computing the spatial wavelet transform of moving fire-colored regions. Other clues used in the fire detection algorithm include the irregularity of the boundary of the fire-colored region and the growth of such regions in time. All of the above clues are combined to reach a final decision.

Item Open Access
A relevance feedback technique for multimodal retrieval of news videos (IEEE, 2005-11)
Aksoy, Selim; Çavuş, Özge
Content-based retrieval in news video databases has become an important task with the availability of large quantities of data in both public and proprietary archives. We describe a relevance feedback technique that captures the significance of different features at different spatial locations in an image. Spatial content is modeled by partitioning images into non-overlapping grid cells. Contributions of different features at different locations are modeled using weights defined for each feature in each grid cell. These weights are iteratively updated based on the user's feedback in terms of positive and negative labeling of retrieval results. Given this labeling, the weight updating scheme uses the ratios of standard deviations of the distances between relevant and irrelevant images to the standard deviations of the distances between relevant images. The proposed technique is quantitatively and qualitatively evaluated using shots related to several sports from the news video collection of the TRECVID video retrieval evaluation, where the weights could capture relative contributions of different features and spatial locations.
© 2005 IEEE.

Item Open Access
Robust transmission of multi-view video streams using flexible macroblock ordering and systematic LT codes (IEEE, 2007)
Argyropoulos, S.; Tan, A. Serdar; Thomos, N.; Arıkan, Erdal; Strintzis, M. G.
The transmission of fully compatible H.264/AVC multi-view video coded streams over packet erasure networks is examined. Macroblock classification into unequally important slice groups is considered using the Flexible Macroblock Ordering (FMO) tool of H.264/AVC. Systematic LT codes are used for error protection due to their low complexity and advanced performance. The optimal slice grouping and channel rate allocation are jointly determined by an iterative optimization algorithm based on dynamic programming. The experimental evaluation clearly demonstrates the validity of the proposed method.

Item Open Access
A simple and effective mechanism for stored video streaming with TCP transport and server-side adaptive frame discard (Elsevier, 2005)
Gürses, E.; Akar, G. B.; Akar, N.
Transmission control protocol (TCP), with its well-established congestion control mechanism, is the prevailing transport layer protocol for non-real time data in current Internet Protocol (IP) networks. It would be desirable to transmit any type of multimedia data using TCP in order to take advantage of the extensive operational experience behind TCP in the Internet. However, some features of TCP, including retransmissions and variations in throughput and delay, although not catastrophic for non-real time data, may result in inefficiencies for video streaming applications. In this paper, we propose an architecture which consists of an input buffer at the server side, coupled with the congestion control mechanism of TCP at the transport layer, for efficiently streaming stored video in the best-effort Internet.
The proposed buffer management scheme selectively discards low priority frames from its head-end, which otherwise would jeopardize the successful playout of high priority frames. Moreover, the proposed discarding policy is adaptive to changes in the bandwidth available to the video stream. © 2004 Elsevier B.V. All rights reserved.

Item Open Access
Tracking motion and intensity variations using hierarchical 2-D mesh modeling for synthetic object transfiguration (1996-11)
Toklu, C.; Erdem, A. T.; Sezan, M. I.; Tekalp, A. M.
We propose a method for tracking the motion and intensity variations of a 2-D mildly deformable image object using a hierarchical 2-D mesh model. The proposed method is applied to synthetic object transfiguration, namely, replacing an object in a real video clip with another synthetic or natural object via digital postprocessing. Successful transfiguration requires accurate tracking of both motion and intensity (contrast and brightness) variations of the object-to-be-replaced so that the replacement object can be rendered in exactly the same way from a single still picture. The proposed method is capable of tracking image regions corresponding to scene objects with nonplanar and/or mildly deforming surfaces, accounting for intensity variations, and is shown to be effective with real image sequences.

Item Open Access
Video fire detection-Review (Elsevier, 2013)
Çetin, A. Enis; Dimitropoulos, K.; Gouverneur, B.; Grammalidis, N.; Günay, O.; Habiboğlu, Y. H.; Töreyin, B. U.; Verstockt, S.
This is a review article describing the recent developments in Video based Fire Detection (VFD). Video surveillance cameras and computer vision methods are widely used in many security applications. It is also possible to use security cameras and special purpose infrared surveillance cameras for fire detection. This requires intelligent video processing techniques for detection and analysis of uncontrolled fire behavior.
VFD may help reduce the detection time compared to currently available sensors, both indoors and outdoors, because cameras can monitor "volumes" and do not have the transport delay that traditional "point" sensors suffer from. It is possible to cover an area of 100 km² using a single pan-tilt-zoom camera placed on a hilltop for wildfire detection. Another benefit of VFD systems is that they can provide crucial information about the size and growth of the fire and the direction of smoke propagation.