Authors: Altingövde, I. S.; Ulusoy, Özgür
Dates: 2018-04-12; 2018-04-12; 2004
ISSN: 1541-1672; 1941-1294
URI: http://hdl.handle.net/11693/38244
Abstract: A baseline crawler was developed at Bilkent University based on a focused-crawling approach. A focused crawler is an agent that targets a particular topic and visits and gathers only a relevant, narrow segment of the Web while trying not to waste resources on irrelevant material. The rule-based Web-crawling approach uses linkage statistics among topics to improve the baseline focused crawler's harvest rate and coverage. The crawler also employs a canonical topic taxonomy to train a naïve-Bayesian classifier, which then helps determine the relevancy of crawled pages.
Language: English
Keywords: Best First Search; Breadth First Search; Domain Name Systems (DNS); Web Crawling Approaches; Classification (of information); Data Acquisition; Indexing (of information); Knowledge Based Systems; Network Protocols; Online Searching; Queueing Theory; Websites
Title: Exploiting interclass rules for focused crawling
Type: Review
DOI: 10.1109/MIS.2004.62
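
Illustrative sketch (not part of the original record): the abstract describes a focused crawler that visits the most promising pages first and uses a classifier to judge relevancy. The Python below is a minimal best-first crawl loop under stated assumptions, not the authors' implementation. The names `focused_crawl`, `fetch_links`, `relevance`, `TOY_WEB`, and `TOY_SCORES` are hypothetical; `relevance` stands in for a trained classifier, and the "Web" is an in-memory dictionary so the example runs without network access.

```python
import heapq
from typing import Callable, Dict, List, Set, Tuple


def focused_crawl(
    seeds: List[str],
    fetch_links: Callable[[str], List[str]],
    relevance: Callable[[str], float],
    max_pages: int = 100,
    threshold: float = 0.5,
) -> List[str]:
    """Best-first crawl: always visit the highest-priority URL next.

    Returns the visited URLs whose relevance score meets the threshold.
    """
    # heapq is a min-heap, so priorities are stored negated.
    frontier: List[Tuple[float, str]] = [(-1.0, url) for url in seeds]
    heapq.heapify(frontier)
    seen: Set[str] = set(seeds)
    harvested: List[str] = []
    visited = 0

    while frontier and visited < max_pages:
        _, url = heapq.heappop(frontier)
        visited += 1
        score = relevance(url)  # stand-in for a page classifier
        if score >= threshold:
            harvested.append(url)
        # Assumption: out-links inherit the parent's score as priority.
        for link in fetch_links(url):
            if link not in seen:
                seen.add(link)
                heapq.heappush(frontier, (-score, link))
    return harvested


# Hypothetical toy link graph and relevance scores.
TOY_WEB: Dict[str, List[str]] = {
    "seed": ["a", "x"],
    "a": ["b"],
    "b": [],
    "x": ["y"],
    "y": [],
}
TOY_SCORES: Dict[str, float] = {
    "seed": 1.0, "a": 0.9, "b": 0.8, "x": 0.1, "y": 0.2,
}

if __name__ == "__main__":
    result = focused_crawl(["seed"], TOY_WEB.__getitem__, TOY_SCORES.__getitem__)
    print(result)
```

In this sketch a link's priority is simply its parent's relevance score. A rule-based variant of the kind the abstract mentions would presumably replace that priority with one derived from linkage statistics among topics; the details are not given in this record, so they are left out here.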