• About
  • Policies
  • What is open access
  • Library
  • Contact
Advanced search
      View Item 
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Electrical and Electronics Engineering
      • View Item
      •   BUIR Home
      • Scholarly Publications
      • Faculty of Engineering
      • Department of Electrical and Electronics Engineering
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      An efficient and effective second-order training algorithm for LSTM-based adaptive learning

      Thumbnail
      View / Download
      2.5 Mb
      Author(s)
      Vural, N. Mert
      Ergüt, S.
      Kozat, Süleyman S.
      Date
      2021-04-07
      Source Title
      IEEE Transactions on Signal Processing
      Print ISSN
      1053-587X
      Electronic ISSN
      1941-0476
      Publisher
      IEEE
      Volume
      69
      Pages
      2541 - 2554
      Language
      English
      Type
      Article
      Item Usage Stats
      32
      views
      71
      downloads
      Abstract
      We study adaptive (or online) nonlinear regression with Long-Short-Term-Memory (LSTM) based networks, i.e., LSTM-based adaptive learning. In this context, we introduce an efficient Extended Kalman filter (EKF) based second-order training algorithm. Our algorithm is truly online, i.e., it does not assume any underlying data generating process and future information, except that the target sequence is bounded. Through an extensive set of experiments, we demonstrate significant performance gains achieved by our algorithm with respect to the state-of-the-art methods. Here, we mainly show that our algorithm consistently provides 10 to 45% improvement in the accuracy compared to the widely-used adaptive methods Adam, RMSprop, and DEKF, and comparable performance to EKF with a 10 to 15 times reduction in the run-time.
      Keywords
      Adaptive learning
      Online learning
      Truly online
      Long short term memory (LSTM)
      Kalman filtering
      Regression
      Stochastic gradient descent (SGD) EDICS Category: MLR-SLER
      MLR-DEEP
      Permalink
      http://hdl.handle.net/11693/76922
      Published Version (Please cite this version)
      https://doi.org/10.1109/TSP.2021.3071566
      Collections
      • Department of Electrical and Electronics Engineering 4011
      Show full item record

      Browse

      All of BUIRCommunities & CollectionsTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCoursesThis CollectionTitlesAuthorsAdvisorsBy Issue DateKeywordsTypeDepartmentsCourses

      My Account

      Login

      Statistics

      View Usage StatisticsView Google Analytics Statistics

      Bilkent University

      If you have trouble accessing this page and need to request an alternate format, contact the site administrator. Phone: (312) 290 2976
      © Bilkent University - Library IT

      Contact Us | Send Feedback | Off-Campus Access | Admin | Privacy