BUIR Repository :: Browsing by Subject "Artificial intelligence"

Browsing by Subject "Artificial intelligence"

Now showing 1 - 20 of 62

Open Access
Accurate simulation of reflector antennas by the complex source-dual series approach
(Institute of Electrical and Electronics Engineers, 1995-08) Oğuzer, T.; Altıntaş, A.; Nosich, A. I.
The radiation from circular cylindrical reflector antennas is treated in an accurate manner for both polarizations. The problem is first formulated in terms of the dual series equations and then is regularized by the Riemann-Hilbert problem technique. The resulting matrix equation is solved numeridy with a guaranteed accuracy, and remarkably Little CPU time is needed. The feed directivity is included in the analysis by the complex source point method. Various characteristic patterns are obtained for the front and offset-fed reflector antenna geometries with this analysis, and some comparisons are made with the high frequency techniques. The directivity and radiated power properties are also studied.
Open Access
Adaptive ensemble learning with confidence bounds for personalized diagnosis
(AAAI Press, 2016) Tekin, Cem; Yoon, J.; Van Der Schaar, M.
With the advances in the field of medical informatics, automated clinical decision support systems are becoming the de facto standard in personalized diagnosis. In order to establish high accuracy and confidence in personalized diagnosis, massive amounts of distributed, heterogeneous, correlated and high-dimensional patient data from different sources such as wearable sensors, mobile applications, Electronic Health Record (EHR) databases etc. need to be processed. This requires learning both locally and globally due to privacy constraints and/or distributed nature of the multimodal medical data. In the last decade, a large number of meta-learning techniques have been proposed in which local learners make online predictions based on their locally-collected data instances, and feed these predictions to an ensemble learner, which fuses them and issues a global prediction. However, most of these works do not provide performance guarantees or, when they do, these guarantees are asymptotic. None of these existing works provide confidence estimates about the issued predictions or rate of learning guarantees for the ensemble learner. In this paper, we provide a systematic ensemble learning method called Hedged Bandits, which comes with both long run (asymptotic) and short run (rate of learning) performance guarantees. Moreover, we show that our proposed method outperforms all existing ensemble learning techniques, even in the presence of concept drift.
Open Access
Adaptive hierarchical space partitioning for online classification
(IEEE, 2016) Kılıç, O. Fatih; Vanlı, N. D.; Özkan, H.; Delibalta, İ.; Kozat, Süleyman Serdar
We propose an online algorithm for supervised learning with strong performance guarantees under the empirical zero-one loss. The proposed method adaptively partitions the feature space in a hierarchical manner and generates a powerful finite combination of basic models. This provides algorithm to obtain a strong classification method which enables it to create a linear piecewise classifier model that can work well under highly non-linear complex data. The introduced algorithm also have scalable computational complexity that scales linearly with dimension of the feature space, depth of the partitioning and number of processed data. Through experiments we show that the introduced algorithm outperforms the state-of-the-art ensemble techniques over various well-known machine learning data sets.
Open Access
Alternate strategies for tutorial modules in intelligent tutoring systems
(ASME, 1992) Cankat, E.; Güvenir, Altay H.
Intelligent Tutoring Systems (ITS) have now reached a structure in which the major components of an instructional system are separated in a way that provides both the system and the student with a flexibility within the learning environment. This atmosphere is an interactive, realistic scene similar to actual face-to-face teacher and student instructional environment. A major problem in determining the tutorial strategy of an ITS is how to balance the executive control of the system. In other words to what extend will the teacher be allowed to control the session and where is the point the computer will not allow any external interference. The dividing line is hard to draw and very sensitive to changes in the instructional strategy. Another aspect is that even the student would like to take over the control of execution mostly by asking questions. Now we have three main elements of the system: computer, teacher and student. Who should possess what level of executive control? This is the question we would like to discuss from different points of view in the domain of teaching science courses at secondary school level.
Open Access
An analysis of manipulated information and respective alternative costs in information systems and in decision making structures
(International Institute of Informatics and Systemics, IIIS, 2006) Güvenen O.; Öztürk, M.H.
Today Information Technologies create base for the most important decision support systems for the practices in academia, business and politics. The effectiveness and success of operations that are supported by information systems are directly correlated with the quantity, accuracy, timing, credibility and the quality of the information that prevails in the system. Rapid development of these technologies in recent decades allows high level of information transaction and communication through the whole world. The quantity of information that flows through information systems has increased tremendously. New researches and technological applications in this area aim to improve the systems quantitatively. However, despite a huge and continuous increase in information flow, the quality and reliability of the information in the systems are doubtful from many perspectives. We believe that quality and reliability considerations in information technologies are not handled by researchers and users adequately. So in this research we decided to discuss about quality and reliability aspects of the information flow. To be able to evaluate the information from qualitative perspectives, we believe that it is crucial to handle the problem in science and especially in social sciences by endogenising socio-economic phenomena and science methodology approaches. We hope this work will create a stimulus for researchers of Information Technologies and Systems to give importance to the reliability and quality of information issues.
Open Access
Application of the RIMARC algorithm to a large data set of action potentials and clinical parameters for risk prediction of atrial fibrillation
(Springer, 2015) Ravens, U.; Katircioglu-Öztürk, D.; Wettwer, E.; Christ, T.; Dobrev, D.; Voigt, N.; Poulet, C.; Loose, S.; Simon, J.; Stein, A.; Matschke, K.; Knaut, M.; Oto, E.; Oto, A.; Güvenir, H. A.
Ex vivo recorded action potentials (APs) in human right atrial tissue from patients in sinus rhythm (SR) or atrial fibrillation (AF) display a characteristic spike-and-dome or triangular shape, respectively, but variability is huge within each rhythm group. The aim of our study was to apply the machine-learning algorithm ranking instances by maximizing the area under the ROC curve (RIMARC) to a large data set of 480 APs combined with retrospectively collected general clinical parameters and to test whether the rules learned by the RIMARC algorithm can be used for accurately classifying the preoperative rhythm status. APs were included from 221 SR and 158 AF patients. During a learning phase, the RIMARC algorithm established a ranking order of 62 features by predictive value for SR or AF. The model was then challenged with an additional test set of features from 28 patients in whom rhythm status was blinded. The accuracy of the risk prediction for AF by the model was very good (0.93) when all features were used. Without the seven AP features, accuracy still reached 0.71. In conclusion, we have shown that training the machine-learning algorithm RIMARC with an experimental and clinical data set allows predicting a classification in a test data set with high accuracy. In a clinical setting, this approach may prove useful for finding hypothesis-generating associations between different parameters.
Open Access
Artificial intelligence-based hybrid anomaly detection and clinical decision support techniques for automated detection of cardiovascular diseases and Covid-19
(Bilkent University, 2023-10) Terzi, Merve Begüm
Coronary artery diseases are the leading cause of death worldwide, and early diagnosis is crucial for timely treatment. To address this, we present a novel automated arti cial intelligence-based hybrid anomaly detection technique com posed of various signal processing, feature extraction, supervised, and unsuper vised machine learning methods. By jointly and simultaneously analyzing 12-lead electrocardiogram (ECG) and cardiac sympathetic nerve activity (CSNA) data, the automated arti cial intelligence-based hybrid anomaly detection technique performs fast, early, and accurate diagnosis of coronary artery diseases. To develop and evaluate the proposed automated arti cial intelligence-based hybrid anomaly detection technique, we utilized the fully labeled STAFF III and PTBD databases, which contain 12-lead wideband raw recordings non invasively acquired from 260 subjects. Using the wideband raw recordings in these databases, we developed a signal processing technique that simultaneously detects the 12-lead ECG and CSNA signals of all subjects. Subsequently, using the pre-processed 12-lead ECG and CSNA signals, we developed a time-domain feature extraction technique that extracts the statistical CSNA and ECG features critical for the reliable diagnosis of coronary artery diseases. Using the extracted discriminative features, we developed a supervised classi cation technique based on arti cial neural networks that simultaneously detects anomalies in the 12-lead ECG and CSNA data. Furthermore, we developed an unsupervised clustering technique based on the Gaussian mixture model and Neyman-Pearson criterion that performs robust detection of the outliers corresponding to coronary artery diseases. By using the automated arti cial intelligence-based hybrid anomaly detection technique, we have demonstrated a signi cant association between the increase in the amplitude of CSNA signal and anomalies in ECG signal during coronary artery diseases. The automated arti cial intelligence-based hybrid anomaly de tection technique performed highly reliable detection of coronary artery diseases with a sensitivity of 98.48%, speci city of 97.73%, accuracy of 98.11%, positive predictive value (PPV) of 97.74%, negative predictive value (NPV) of 98.47%, and F1-score of 98.11%. Hence, the arti cial intelligence-based hybrid anomaly detection technique has superior performance compared to the gold standard diagnostic test ECG in diagnosing coronary artery diseases. Additionally, it out performed other techniques developed in this study that separately utilize either only CSNA data or only ECG data. Therefore, it signi cantly increases the detec tion performance of coronary artery diseases by taking advantage of the diversity in di erent data types and leveraging their strengths. Furthermore, its perfor mance is comparatively better than that of most previously proposed machine and deep learning methods that exclusively used ECG data to diagnose or clas sify coronary artery diseases. It also has a very short implementation time, which is highly desirable for real-time detection of coronary artery diseases in clinical practice. The proposed automated arti cial intelligence-based hybrid anomaly detection technique may serve as an e cient decision-support system to increase physicians' success in achieving fast, early, and accurate diagnosis of coronary artery diseases. It may be highly bene cial and valuable, particularly for asymptomatic coronary artery disease patients, for whom the diagnostic information provided by ECG alone is not su cient to reliably diagnose the disease. Hence, it may signi cantly improve patient outcomes, enable timely treatments, and reduce the mortality associated with cardiovascular diseases. Secondly, we propose a new automated arti cial intelligence-based hybrid clinical decision support technique that jointly analyzes reverse transcriptase polymerase chain reaction (RT-PCR) curves, thorax computed tomography im ages, and laboratory data to perform fast and accurate diagnosis of Coronavirus disease 2019 (COVID-19). For this purpose, we retrospectively created the fully labeled Ankara University Faculty of Medicine COVID-19 (AUFM-CoV) database, which contains a wide variety of medical data, including RT-PCR curves, thorax computed tomogra phy images, and laboratory data. The AUFM-CoV is the most comprehensive database that includes thorax computed tomography images of COVID-19 pneu monia (CVP), other viral and bacterial pneumonias (VBP), and parenchymal lung diseases (PLD), all of which present signi cant challenges for di erential diagnosis. We developed a new automated arti cial intelligence-based hybrid clinical de cision support technique, which is an ensemble learning technique consisting of two preprocessing methods, long short-term memory network-based deep learning method, convolutional neural network-based deep learning method, and arti cial neural network-based machine learning method. By jointly analyzing RT-PCR curves, thorax computed tomography images, and laboratory data, the proposed automated arti cial intelligence-based hybrid clinical decision support technique bene ts from the diversity in di erent data types that are critical for the reliable detection of COVID-19 and leverages their strengths. The multi-class classi cation performance results of the proposed convolu tional neural network-based deep learning method on the AUFM-CoV database showed that it achieved highly reliable detection of COVID-19 with a sensitivity of 91.9%, speci city of 92.5%, precision of 80.4%, and F1-score of 86%. There fore, it outperformed thorax computed tomography in terms of the speci city of COVID-19 diagnosis. Moreover, the convolutional neural network-based deep learning method has been shown to very successfully distinguish COVID-19 pneumonia (CVP) from other viral and bacterial pneumonias (VBP) and parenchymal lung diseases (PLD), which exhibit very similar radiological ndings. Therefore, it has great potential to be successfully used in the di erential diagnosis of pulmonary dis eases containing ground-glass opacities. The binary classi cation performance results of the proposed convolutional neural network-based deep learning method showed that it achieved a sensitivity of 91.5%, speci city of 94.8%, precision of 85.6%, and F1-score of 88.4% in diagnosing COVID-19. Hence, it has compara ble sensitivity to thorax computed tomography in diagnosing COVID-19. Additionally, the binary classi cation performance results of the proposed long short-term memory network-based deep learning method on the AUFM-CoV database showed that it performed highly reliable detection of COVID-19 with a sensitivity of 96.6%, speci city of 99.2%, precision of 98.1%, and F1-score of 97.3%. Thus, it outperformed the gold standard RT-PCR test in terms of the sensitivity of COVID-19 diagnosis Furthermore, the multi-class classi cation performance results of the proposed automated arti cial intelligence-based hybrid clinical decision support technique on the AUFM-CoV database showed that it diagnosed COVID-19 with a sen sitivity of 66.3%, speci city of 94.9%, precision of 80%, and F1-score of 73%. Hence, it has been shown to very successfully perform the di erential diagnosis of COVID-19 pneumonia (CVP) and other pneumonias. The binary classi cation performance results of the automated arti cial intelligence-based hybrid clinical decision support technique revealed that it diagnosed COVID-19 with a sensi tivity of 90%, speci city of 92.8%, precision of 91.8%, and F1-score of 90.9%. Therefore, it exhibits superior sensitivity and speci city compared to laboratory data in COVID-19 diagnosis. The performance results of the proposed automated arti cial intelligence-based hybrid clinical decision support technique on the AUFM-CoV database demon strate its ability to provide highly reliable diagnosis of COVID-19 by jointly ana lyzing RT-PCR data, thorax computed tomography images, and laboratory data. Consequently, it may signi cantly increase the success of physicians in diagnosing COVID-19, assist them in rapidly isolating and treating COVID-19 patients, and reduce their workload in daily clinical practice.
Embargo
Artificial neural network and decision tree–based models for prediction and validation of in vitro organogenesis of two hydrophytes—Hemianthus callitrichoides and Riccia fluitans
(Springer, 0202-08-02) Özcan, Esra; Atar, Hasan Hüseyin; Ali, Seyid Amjad; Aasim, Muhammad
The application of plant tissue culture protocols for aquatic plants has been widely adopted in recent years to produce cost-effective plants for aquarium industry. In vitro regeneration protocol for the two different hydrophytes Hemianthus callitrichoides (Cuba) and Riccia fluitans were optimized for appropriate basal medium, sucrose, agar, and plant growth regulator concentration. The MS No:3B and SH + MSVit basal medium yielded a maximum clump diameter of 5.53 cm for H. callitrichoides and 3.65 cm for R. fluitans. The application of 20 g/L sucrose was found appropriate for yielding larger clumps in both species. Solidification of the medium with 1 g/L agar was optimized for inducing larger clumps with rooting for both species. Provision of basal medium with any concentration of 6-benzylaminopurine (BAP) and α-naphthaleneacetic acid (NAA) was found detrimental for inducing larger clumps for both species. The largest clumps of H. callitrichoides (5.51 cm) and R. fluitans (4.59 cm) were obtained on basal medium without any plant growth regulators. The attained data was also predicted and validated by employing multilayer perceptron (MLP), random forest (RF), and extreme gradient boosting (XGBoost) algorithms. The performance of the models was tested with three different performance metrics, namely, coefficient of regression (R2), means square error (MSE), and mean absolute error (MAE). Results revealed that MLP and RF models performed better than the XGBoost model. The protocols developed in this study have shown promising outcomes and the findings can irrefutably assist to produce H. callitrichoides and R. fluitans on a large scale for the local aquarium industry.
Open Access
Assembly line balancing using genetic algorithms
(Kluwer Academic Publishers, 2000) Sabuncuoğlu İ.; Erel, E.; Tanyer, M.
Assembly Line Balancing (ALB) is one of the important problems of production/operations management area. As small improvements in the performance of the system can lead to significant monetary consequences, it is of utmost importance to develop practical solution procedures that yield high-quality design decisions with minimal computational requirements. Due to the NP-hard nature of the ALB problem, heuristics are generally used to solve real life problems. In this paper, we propose an efficient heuristic to solve the deterministic and single-model ALB problem. The proposed heuristic is a Genetic Algorithm (GA) with a special chromosome structure that is partitioned dynamically through the evolution process. Elitism is also implemented in the model by using some concepts of Simulated Annealing (SA). In this context, the proposed approach can be viewed as a unified framework which combines several new concepts of AI in the algorithmic design. Our computational experiments with the proposed algorithm indicate that it outperforms the existing heuristics on several test problems.
Open Access
Big-data streaming applications scheduling based on staged multi-armed bandits
(Institute of Electrical and Electronics Engineers, 2016) Kanoun, K.; Tekin, C.; Atienza, D.; Van Der Schaar, M.
Several techniques have been recently proposed to adapt Big-Data streaming applications to existing many core platforms. Among these techniques, online reinforcement learning methods have been proposed that learn how to adapt at run-time the throughput and resources allocated to the various streaming tasks depending on dynamically changing data stream characteristics and the desired applications performance (e.g., accuracy). However, most of state-of-the-art techniques consider only one single stream input in its application model input and assume that the system knows the amount of resources to allocate to each task to achieve a desired performance. To address these limitations, in this paper we propose a new systematic and efficient methodology and associated algorithms for online learning and energy-efficient scheduling of Big-Data streaming applications with multiple streams on many core systems with resource constraints. We formalize the problem of multi-stream scheduling as a staged decision problem in which the performance obtained for various resource allocations is unknown. The proposed scheduling methodology uses a novel class of online adaptive learning techniques which we refer to as staged multi-armed bandits (S-MAB). Our scheduler is able to learn online which processing method to assign to each stream and how to allocate its resources over time in order to maximize the performance on the fly, at run-time, without having access to any offline information. The proposed scheduler, applied on a face detection streaming application and without using any offline information, is able to achieve similar performance compared to an optimal semi-online solution that has full knowledge of the input stream where the differences in throughput, observed quality, resource usage and energy efficiency are less than 1, 0.3, 0.2 and 4 percent respectively.
Open Access
Boosted LMS-based piecewise linear adaptive filters
(IEEE, 2016) Kari, Dariush; Marivani, Iman; Delibalta, İ.; Kozat, Süleyman Serdar
We introduce the boosting notion extensively used in different machine learning applications to adaptive signal processing literature and implement several different adaptive filtering algorithms. In this framework, we have several adaptive constituent filters that run in parallel. For each newly received input vector and observation pair, each filter adapts itself based on the performance of the other adaptive filters in the mixture on this current data pair. These relative updates provide the boosting effect such that the filters in the mixture learn a different attribute of the data providing diversity. The outputs of these constituent filters are then combined using adaptive mixture approaches. We provide the computational complexity bounds for the boosted adaptive filters. The introduced methods demonstrate improvement in the performances of conventional adaptive filtering algorithms due to the boosting effect.
Open Access
Çağrı merkezi metin madenciliği yaklaşımı
(IEEE, 2017-05) Yiğit, İ. O.; Ateş, A. F.; Güvercin, Mehmet; Ferhatosmanoğlu, Hakan; Gedik, Buğra
Günümüzde çağrı merkezlerindeki görüşme kayıtlarının sesten metne dönüştürülebilmesi görüşme kaydı metinleri üzerinde metin madenciliği yöntemlerinin uygulanmasını mümkün kılmaktadır. Bu çalışma kapsamında görüşme kaydı metinleri kullanarak görüşmenin içeriğinin duygu yönünden (olumlu/olumsuz) değerlendirilmesi, müşteri memnuniyetinin ve müşteri temsilcisi performansının ölçülmesi amaçlanmaktadır. Yapılan çalışmada görüşme kaydı metinlerinden metin madenciliği yöntemleri ile yeni özellikler çıkarılmıştır. Metinlerden elde edilen özelliklerden yararlanılarak sınıflandırma ve regresyon yöntemleriyle görüşme kayıtlarının içeriklerinin değerlendirilmesini sağlayacak tahmin modelleri oluşturulmuştur. Bu çalışma sonucunda ortaya çıkarılan tahmin modellerinin Türk Telekom bünyesindeki çağrı merkezlerinde kullanılması hedeflenmektedir.
Open Access
Chatbots and mental health: Insights into the safety of generative AI
(John Wiley & Sons Ltd., 2023-10-26) De Freitas, Julian; Uğuralp, Ahmet Kaan; Oğuz-Uğuralp, Zeliha; Puntoni, Stefano
Chatbots are now able to engage in sophisticated conversations with consumers. Due to the “black box” nature of the algorithms, it is impossible to predict in advance how these conversations will unfold. Behavioral research provides little insight into potential safety issues emerging from the current rapid deployment of this technology at scale. We begin to address this urgent question by focusing on the context of mental health and “companion AI”: Applications designed to provide consumers with synthetic interaction partners. Studies 1a and 1b present field evidence: Actual consumer interactions with two different companion AIs. Study 2 reports an extensive performance test of several commercially available companion AIs. Study 3 is an experiment testing consumer reaction to risky and unhelpful chatbot responses. The findings show that (1) mental health crises are apparent in a nonnegligible minority of conversations with users; (2) companion AIs are often unable to recognize, and respond appropriately to, signs of distress; and (3) consumers display negative reactions to unhelpful and risky chatbot responses, highlighting emerging reputational risks for generative AI companies.
Open Access
Classification by voting feature intervals
(Springer, 1997-04) Demiröz, Gülşen; Güvenir, H. Altay
A new classification algorithm called VFI (for Voting Feature Intervals) is proposed. A concept is represented by a set of feature intervals on each feature dimension separately. Each feature participates in the classification by distributing real-valued votes among classes. The class receiving the highest vote is declared to be the predicted class. VFI is compared with the Naive Bayesian Classifier, which also considers each feature separately. Experiments on real-world datasets show that VFI achieves comparably and even better than NBC in terms of classification accuracy. Moreover, VFI is faster than NBC on all datasets. © Springer-Verlag Berlin Heidelberg 1997.
Open Access
Classification of regional ionospheric disturbance based on machine learning techniques
(European Space Agency, 2016) Terzi, Merve Begüm; Arıkan, Orhan; Karatay, S.; Arıkan, F.; Gulyaeva, T.
In this study, Total Electron Content (TEC) estimated from GPS receivers is used to model the regional and local variability that differs from global activity along with solar and geomagnetic indices. For the automated classification of regional disturbances, a classification technique based on a robust machine learning technique that have found wide spread use, Support Vector Machine (SVM) is proposed. Performance of developed classification technique is demonstrated for midlatitude ionosphere over Anatolia using TEC estimates generated from GPS data provided by Turkish National Permanent GPS Network (TNPGN-Active) for solar maximum year of 2011. As a result of implementing developed classification technique to Global Ionospheric Map (GIM) TEC data, which is provided by the NASA Jet Propulsion Laboratory (JPL), it is shown that SVM can be a suitable learning method to detect anomalies in TEC variations.
Open Access
A comparison of state-of-the-art machine learning algorithms on fault indication and remaining useful life determination by telemetry data
(IEEE, 2021-11-15) Ünal, Aras Fırat; Kaleli, Ali Yücel; Ummak, Emre; Albayrak, Özlem; Younas, M.; Awan, I.; Unal, P.
Contemporary trends in the diffusion of artificial intelligence technologies has increased the number of studies on predictive maintenance, a recent focus of interest in many industrial domains. Despite the increased interest in the use of machine learning for predictive maintenance, few studies involve thorough comparisons of machine learning algorithms' performance on predictive maintenance applications. This work aims to predict the remaining useful life and machine failures and compares five different algorithms: Random Forest, Gradient Boosted Tree, K-Nearest Neighbors, Multilayer Perceptron and LightGBM. Our results suggest better performances for binary classification using Random Forest, and for regression using LightGRM comnared to other selected algorithms.
Open Access
Concave measures and the fuzzy core of exchange economies with heterogeneous divisible commodities
(Elsevier BV, 2012) Hüsseinov, F.; Sagara, N.
The main purpose of this paper is to prove the existence of the fuzzy core of an exchange economy with a heterogeneous divisible commodity in which preferences of individuals are given by nonadditive utility functions defined on a σ-algebra of admissible pieces of the total endowment of the commodity. The problem is formulated as the partitioning of a measurable space among finitely many individuals. Applying the Yosida-Hewitt decomposition theorem, we also demonstrate that partitions in the fuzzy core are supportable by prices in L 1. © 2012 Elsevier B.V.
Open Access
ConceptMap: mining noisy web data for concept learning
(Springer, 2014-09) Gölge, Eren; Duygulu, Pınar
We attack the problem of learning concepts automatically from noisy Web image search results. The idea is based on discovering common characteristics shared among subsets of images by posing a method that is able to organise the data while eliminating irrelevant instances. We propose a novel clustering and outlier detection method, namely Concept Map (CMAP). Given an image collection returned for a concept query, CMAP provides clusters pruned from outliers. Each cluster is used to train a model representing a different characteristics of the concept. The proposed method outperforms the state-of-the-art studies on the task of learning from noisy web data for low-level attributes, as well as high level object categories. It is also competitive with the supervised methods in learning scene concepts. Moreover, results on naming faces support the generalisation capability of the CMAP framework to different domains. CMAP is capable to work at large scale with no supervision through exploiting the available sources. © 2014 Springer International Publishing.
Open Access
A contextualist analysis of insults
(Springer, 2017) Berkovski, Y. Sandy
For a predicate expression F contained in a sentence S (‘x is F’) to count as an insult, it should be used in a situation having a number of contextual elements. There should be an audience to whom the utterance of S is addressed. There should be a target of the insult, an individual who the speaker wishes to be shunned, excluded from certain, more or less salient, forms of social cooperation. The purpose of the utterance of S is to persuade the audience, by appeal to their emotions, to shun the target. Slurs have the canonical occasions of use structurally identical to the occasions of insults.
Open Access
Data imputation through the identification of local anomalies
(Institute of Electrical and Electronics Engineers Inc., 2015) Ozkan, H.; Pelvan, O. S.; Kozat, S. S.
We introduce a comprehensive and statistical framework in a model free setting for a complete treatment of localized data corruptions due to severe noise sources, e.g., an occluder in the case of a visual recording. Within this framework, we propose: 1) a novel algorithm to efficiently separate, i.e., detect and localize, possible corruptions from a given suspicious data instance and 2) a maximum a posteriori estimator to impute the corrupted data. As a generalization to Euclidean distance, we also propose a novel distance measure, which is based on the ranked deviations among the data attributes and empirically shown to be superior in separating the corruptions. Our algorithm first splits the suspicious instance into parts through a binary partitioning tree in the space of data attributes and iteratively tests those parts to detect local anomalies using the nominal statistics extracted from an uncorrupted (clean) reference data set. Once each part is labeled as anomalous versus normal, the corresponding binary patterns over this tree that characterize corruptions are identified and the affected attributes are imputed. Under a certain conditional independency structure assumed for the binary patterns, we analytically show that the false alarm rate of the introduced algorithm in detecting the corruptions is independent of the data and can be directly set without any parameter tuning. The proposed framework is tested over several well-known machine learning data sets with synthetically generated corruptions and experimentally shown to produce remarkable improvements in terms of classification purposes with strong corruption separation capabilities. Our experiments also indicate that the proposed algorithms outperform the typical approaches and are robust to varying training phase conditions. © 2015 IEEE.