Now showing items 1-12 of 12

    • Ensemble pruning for text categorization based on data partitioning 

      Toraman, C.; Can F. (2011)
      Ensemble methods can improve the effectiveness in text categorization. Due to computation cost of ensemble approaches there is a need for pruning ensembles. In this work we study ensemble pruning based on data partitioning. ...
    • Graph aware caching policy for distributed graph stores 

      Aksu H.; Canim, M.; Chang, Y.-C.; Korpeoglu I.; Ulusoy Ö. (Institute of Electrical and Electronics Engineers Inc., 2015)
      Graph stores are becoming increasingly popular among NOSQL applications seeking flexibility and heterogeneity in managing linked data. Conceptually and in practice, applications ranging from social networks, knowledge ...
    • IBM streams processing language: analyzing big data in motion 

      Hirzel M.; Andrade, H.; Gedik, B.; Jacques-Silva, R.; Khandekar, R.; Kumar, V.; Mendell, M.; Nasgaard, H.; Schneider S.; Soule´, R.; Wu, K. L. (I B M Corp., 2013-05-17)
      The IBM Streams Processing Language (SPL) is the programming language for IBM InfoSphere® Streams, a platform for analyzing Big Data in motion. By “Big Data in motion,” we mean continuous data streams at high data-transfer ...
    • L1 norm based multiplication-free cosine similarity measures for big data analysis 

      Akbas, C.E.; Bozkurt, A.; Arslan, M.T.; Aslanoglu H.; Cetin, A.E. (Institute of Electrical and Electronics Engineers Inc., 2014)
      The cosine similarity measure is widely used in big data analysis to compare vectors. In this article a new set of vector similarity measures are proposed. New vector similarity measures are based on a multiplication-free ...
    • Land cover classification with multi-sensor fusion of partly missing data 

      Aksoy, S.; Koperski, K.; Tusk, C.; Marchisio, G. (American Society for Photogrammetry and Remote Sensing, 2009-05)
      We describe a system that uses decision tree-based tools for seamless acquisition of knowledge for classification of remotely sensed imagery. We concentrate on three important problems in this process: information fusion, ...
    • Object-oriented query language facilitating construction of new objects 

      Alhajj, R.; Arkun, M. E. (Elsevier, 1993)
      In object-oriented database systems, messages can be used to manipulate the database; however, a query language is still a required component of any kind of database system. In the paper, we describe a query language for ...
    • Pipelined fission for stream programs with dynamic selectivity and partitioned state 

      Gedik, B.; Özsema, H. G.; Öztürk, Ö. (Academic Press, 2016)
      There is an ever increasing rate of digital information available in the form of online data streams. In many application domains, high throughput processing of such data is a critical requirement for keeping up with the ...
    • A privacy-preserving solution for the bipartite ranking problem 

      Faramarzi, N. S.; Ayday, E.; Güvenir, H. A. (Institute of Electrical and Electronics Engineers Inc., 2017)
      In this paper, we propose an efficient solution for the privacy-preserving of a bipartite ranking algorithm. The bipartite ranking problem can be considered as finding a function that ranks positive instances (in a dataset) ...
    • Processing real-time transactions in a replicated database system 

      Ulusoy, Ö. (Springer/Kluwer Academic Publishers, 1994)
      A database system supporting a real-time application has to provide real-time information to the executing transactions. Each real-time transaction is associated with a timing constraint, typically in the form of a deadline. ...
    • Query model for object-oriented databases 

      Alhajj, R.; Arkun, M. E. (Publ by IEEE, Los Alamitos, CA, United States, 1993)
      A query language should be a part of any database system. While the relational model has a well defined underlying query model, the object-oriented database systems have been criticized for not having such a query model. ...
    • Safe data parallelism for general streaming 

      Schneider S.; Hirzel M.; Gedik, B.; Wu, Kun-Lung (Institute of Electrical and Electronics Engineers, 2015)
      Streaming applications process possibly infinite streams of data and often have both high throughput and low latency requirements. They are comprised of operator graphs that produce and consume data tuples. General streaming ...
    • SPL: an extensible language for distributed stream processing 

      Hirzel M.; Schneider S.; Gedik, B. (Association for Computing Machinery, 2017)
      Big data is revolutionizing how all sectors of our economy do business, including telecommunication, transportation, medical, and finance. Big data comes in two flavors: data at rest and data in motion. Processing data in ...