L1 norm based multiplication-free cosine similarity measures for big data analysis
2014 International Workshop on Computational Intelligence for Multimedia Understanding, IWCIM 2014
Institute of Electrical and Electronics Engineers Inc.
Please cite this item using this persistent URL: http://hdl.handle.net/11693/28674
The cosine similarity measure is widely used in big data analysis to compare vectors. In this article, a new set of vector similarity measures is proposed. The new measures are based on a multiplication-free operator that requires only additions and sign operations. A vector 'product' using the multiplication-free operator is also defined. The new vector product induces the ℓ1-norm. As a result, the new cosine-like similarity measures are normalized by the ℓ1-norms of the vectors. They can be computed using the MapReduce framework. Simulation examples are presented. © 2014 IEEE.
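The abstract does not spell out the operator, so the following is only a minimal sketch under stated assumptions. It assumes the multiplication-free operator has the form a ⊕ b = sign(a·b)(|a| + |b|), that the vector 'product' is the sum of this operator over components, and that the similarity is normalized by the sum of the two ℓ1-norms. The function names (`mf_op`, `mf_dot`, `mf_similarity`) are hypothetical and chosen for illustration; the paper's exact definitions and normalization may differ.

```python
def mf_op(a, b):
    """Assumed multiplication-free operator: sign(a*b) * (|a| + |b|).

    The sign of the product is derived from the signs of the operands,
    so only comparisons, absolute values and one addition are needed.
    """
    if a == 0 or b == 0:
        return 0                      # sign(a*b) is 0 when either operand is 0
    magnitude = abs(a) + abs(b)
    same_sign = (a > 0) == (b > 0)
    return magnitude if same_sign else -magnitude


def mf_dot(x, y):
    """Assumed vector 'product': sum of mf_op over matching components."""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(mf_op(a, b) for a, b in zip(x, y))


def l1_norm(x):
    """The l1-norm: sum of absolute values of the components."""
    return sum(abs(a) for a in x)


def mf_similarity(x, y):
    """Cosine-like similarity normalized by l1-norms (assumed normalization).

    Dividing by ||x||_1 + ||y||_1 keeps the value in [-1, 1] and gives 1
    when x == y. The single division is the only non-additive step.
    """
    denom = l1_norm(x) + l1_norm(y)
    if denom == 0:
        return 0.0                    # convention chosen here for two zero vectors
    return mf_dot(x, y) / denom
```

Under these assumptions, `mf_dot(x, x)` equals `2 * l1_norm(x)`, which is one way to see how such a product induces the ℓ1-norm. Because both the product and the norms are plain sums over components, partial sums could be computed per data partition and then added together, which fits a map-then-reduce style of computation.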
Showing items related by title, author, creator and subject.
Şenel L.K.; Yücesoy V.; Koç A.; Çukur T. (Institute of Electrical and Electronics Engineers Inc., 2017) This paper studies cross-lingual semantic similarity (CLSS) between five European languages (i.e. English, French, German, Spanish and Italian) via unsupervised word embeddings from a cross-lingual lexicon. The vocabulary ...
Varol, E.; Can F.; Aykanat, C.; Kaya O. (2011) We study a generalized version of the near-duplicate detection problem which concerns whether a document is a subset of another document. In text-based applications, document containment can be observed in exact-duplicates, ...
Saygin, Y.; Ulusoy, Ö. (IEEE, 2001) Fuzzy sets and fuzzy logic research aims to bridge the gap between the crisp world of math and the real world. Fuzzy set theory was applied to many different areas, from control to databases. Sometimes the number of events ...