Characterizing, predicting, and handling web search queries that match very few or no results

buir.contributor.authorUlusoy, Özgür
buir.contributor.authorSarıgil, Erdem
dc.citation.epage270en_US
dc.citation.issueNumber2en_US
dc.citation.spage256en_US
dc.citation.volumeNumber69en_US
dc.contributor.authorSarıgil, Erdemen_US
dc.contributor.authorAltıngövde, I. S.en_US
dc.contributor.authorBlanco, R.en_US
dc.contributor.authorCambazoğlu, B. B.en_US
dc.contributor.authorÖzcan, R.en_US
dc.contributor.authorUlusoy, Özgüren_US
dc.date.accessioned2019-02-12T06:31:37Z
dc.date.available2019-02-12T06:31:37Z
dc.date.issued2018en_US
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractA non‐negligible fraction of user queries end up with very few or even no matching results in leading commercial web search engines. In this work, we provide a detailed characterization of such queries and show that search engines try to improve such queries by showing the results of related queries. Through a user study, we show that these query suggestions are usually perceived as relevant. Also, through a query log analysis, we show that the users are dissatisfied after submitting a query that match no results at least 88.5% of the time. As a first step towards solving these no‐answer queries, we devised a large number of features that can be used to identify such queries and built machine‐learning models. These models can be useful for scenarios such as the mobile‐ or meta‐search, where identifying a query that will retrieve no results at the client device (i.e., even before submitting it to the search engine) may yield gains in terms of the bandwidth usage, power consumption, and/or monetary costs. Experiments over query logs indicate that, despite the heavy skew in class sizes, our models achieve good prediction quality, with accuracy (in terms of area under the curve) up to 0.95.en_US
dc.embargo.release2019-01-08en_US
dc.identifier.doi10.1002/asi.23955en_US
dc.identifier.eissn2330-1643en_US
dc.identifier.issn2330-1635en_US
dc.identifier.urihttp://hdl.handle.net/11693/49282en_US
dc.language.isoEnglishen_US
dc.publisherJohn Wiley & Sonsen_US
dc.relation.isversionofhttps://doi.org/10.1002/asi.23955en_US
dc.source.titleJournal of the Association for Information Science and Technologyen_US
dc.titleCharacterizing, predicting, and handling web search queries that match very few or no resultsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Sarigil_et_al-2018-Journal_of_the_Association_for_Information_Science_and_Technology.pdf
Size:
773.31 KB
Format:
Adobe Portable Document Format
Description:
Full printable version

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: