
      Exploiting interclass rules for focused crawling

      Author(s)
      Altingövde, I. S.
      Ulusoy, Özgür
      Date
      2004
      Source Title
      IEEE Intelligent Systems
      Print ISSN
      1541-1672
      Electronic ISSN
      1941-1294
      Publisher
      IEEE
      Volume
      19
      Issue
      6
      Pages
      66 - 73
      Language
      English
      Type
      Review
      Abstract
      A baseline crawler was developed at Bilkent University based on a focused-crawling approach. A focused crawler is an agent that targets a particular topic, visiting and gathering only the relevant, narrow segment of the Web while trying not to waste resources on irrelevant material. The rule-based Web-crawling approach uses linkage statistics among topics to improve the baseline focused crawler's harvest rate and coverage. The crawler also employs a canonical topic taxonomy to train a naïve Bayes classifier, which then helps determine the relevance of crawled pages.
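      The idea of using linkage statistics among topics can be illustrated with a small sketch. It is not the authors' implementation: the rule table `RULES`, the chaining depth `max_depth`, the toy link graph, and the stub `toy_classify` (standing in for the naïve Bayes classifier) are all assumptions made for illustration. Each rule is read as "a page of class A links to a page of class B with probability p", and a best-first frontier orders URLs by the best chained probability of reaching the target class.

```python
import heapq
from itertools import count

# Hypothetical interclass rules: RULES[a][b] is the assumed probability
# that a page of class a links to a page of class b.
RULES = {
    "department": {"course": 0.6, "department": 0.3, "other": 0.1},
    "course": {"course": 0.5, "department": 0.3, "other": 0.2},
    "other": {"other": 0.9, "department": 0.1},
}


def link_score(page_class, target, rules, max_depth=2):
    """Best product of rule probabilities over chains of at most
    max_depth rules leading from page_class to target."""
    if max_depth == 0:
        return 0.0
    best = 0.0
    for next_class, p in rules.get(page_class, {}).items():
        if next_class == target:
            best = max(best, p)
        else:
            best = max(best, p * link_score(next_class, target, rules, max_depth - 1))
    return best


class Frontier:
    """Best-first URL queue: highest score first, insertion order on ties."""

    def __init__(self):
        self._heap = []
        self._seen = set()
        self._tie = count()

    def push(self, url, score):
        if url not in self._seen:
            self._seen.add(url)
            # heapq is a min-heap, so the score is negated.
            heapq.heappush(self._heap, (-score, next(self._tie), url))

    def pop(self):
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)


def crawl(seeds, target, rules, fetch_links, classify, budget=10):
    """Visit up to `budget` URLs, scoring each out-link by the rule-based
    score of the page it was found on."""
    frontier = Frontier()
    for url in seeds:
        frontier.push(url, 1.0)
    visited = []
    while len(frontier) and len(visited) < budget:
        url = frontier.pop()
        visited.append(url)
        score = link_score(classify(url), target, rules)
        for out in fetch_links(url):
            frontier.push(out, score)
    return visited


# Toy Web graph and page classes (made up for this example).
GRAPH = {"seed": ["o1", "d1"], "o1": ["o2"], "d1": ["c1", "c2"], "c1": ["c3"]}
CLASSES = {"seed": "other", "o1": "other", "o2": "other", "d1": "department",
           "c1": "course", "c2": "course", "c3": "course"}


def toy_links(url):
    return GRAPH.get(url, [])


def toy_classify(url):
    # Stand-in for a trained classifier applied to the page content.
    return CLASSES[url]


if __name__ == "__main__":
    print(crawl(["seed"], "course", RULES, toy_links, toy_classify))
```

      In this toy setup, links found on a "department" page outrank links found on "other" pages even though the department page is itself off-topic, which is the kind of tunneling behaviour that chained rules are meant to allow.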
      Keywords
      Best First Search
      Breadth First Search
      Domain Name Systems (DNS)
      Web Crawling Approaches
      Classification (of information)
      Data Acquisition
      Indexing (of information)
      Knowledge Based Systems
      Network Protocols
      Online Searching
      Queueing Theory
      Websites
      Permalink
      http://hdl.handle.net/11693/38244
      Published Version (Please cite this version)
      http://dx.doi.org/10.1109/MIS.2004.62
      Collections
      • Department of Computer Engineering