Browsing by Subject "Web Crawling Approaches"
Item (Open Access): Exploiting interclass rules for focused crawling (IEEE, 2004)
Altingövde, I. S.; Ulusoy, Özgür
A baseline crawler was developed at Bilkent University based on a focused-crawling approach. A focused crawler is an agent that targets a particular topic, visiting and gathering only a relevant, narrow segment of the Web while trying not to waste resources on irrelevant material. The rule-based Web-crawling approach uses linkage statistics among topics to improve the baseline focused crawler's harvest rate and coverage. The crawler also employs a canonical topic taxonomy to train a naïve Bayes classifier, which then helps determine the relevance of crawled pages.
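The idea of using linkage statistics among topics can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes that interclass rules can be approximated as the observed fraction of links from pages of one topic to pages of another, and that each crawled page already carries a topic label (in the paper this role is played by the trained classifier). The topic names, the functions `learn_rules` and `link_priority`, and the `Frontier` class are all hypothetical names chosen for illustration.

```python
import heapq
from collections import Counter, defaultdict


def learn_rules(link_pairs):
    """Estimate P(child topic | parent topic) from (parent, child) topic pairs.

    Assumption: rules are plain relative frequencies over a training link set.
    """
    counts = defaultdict(Counter)
    for parent, child in link_pairs:
        counts[parent][child] += 1
    rules = {}
    for parent, children in counts.items():
        total = sum(children.values())
        rules[parent] = {child: n / total for child, n in children.items()}
    return rules


def link_priority(rules, page_topic, target_topic):
    """Score the out-links of a page by how often its topic links to the target."""
    return rules.get(page_topic, {}).get(target_topic, 0.0)


class Frontier:
    """Priority queue of URLs; higher scores are popped first."""

    def __init__(self):
        self._heap = []
        self._count = 0  # insertion counter used as a tie-breaker

    def push(self, url, score):
        heapq.heappush(self._heap, (-score, self._count, url))
        self._count += 1

    def pop(self):
        return heapq.heappop(self._heap)[2]


# Hypothetical training data: topic of the linking page, topic of the linked page.
training_pairs = [
    ("bikes", "bikes"), ("bikes", "bikes"), ("bikes", "sports"),
    ("clubs", "bikes"), ("clubs", "news"),
    ("news", "news"),
]
rules = learn_rules(training_pairs)

# Out-links inherit a priority from the topic of the page they were found on.
frontier = Frontier()
frontier.push("http://example.org/from-news", link_priority(rules, "news", "bikes"))
frontier.push("http://example.org/from-clubs", link_priority(rules, "clubs", "bikes"))
frontier.push("http://example.org/from-bikes", link_priority(rules, "bikes", "bikes"))
```

In this sketch, links found on off-topic pages (here, "clubs") still receive a nonzero priority when such pages tend to link to the target topic, which is one way a rule-based crawler could reach relevant pages that a crawler following only on-topic pages would miss.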