DBTagger: multi-task learning for keyword mapping in NLIDBs using Bi-directional recurrent neural networks

buir.contributor.authorUsta, Arif
buir.contributor.authorKarakayali, Akifhan
buir.contributor.authorUlusoy, Özgür
buir.contributor.orcidUsta, Arif|0000-0001-6713-6621
buir.contributor.orcidKarakayali, Akifhan|0000-0003-0773-902X
dc.citation.epage821en_US
dc.citation.issueNumber5en_US
dc.citation.spage813en_US
dc.citation.volumeNumber14en_US
dc.contributor.authorUsta, Arif
dc.contributor.authorKarakayali, Akifhan
dc.contributor.authorUlusoy, Özgür
dc.date.accessioned2022-02-22T13:34:40Z
dc.date.available2022-02-22T13:34:40Z
dc.date.issued2021-01
dc.departmentDepartment of Computer Engineeringen_US
dc.description.abstractTranslating Natural Language Queries (NLQs) to Structured Query Language (SQL) in interfaces deployed in relational databases is a challenging task, which has been widely studied in database community recently. Conventional rule based systems utilize series of solutions as a pipeline to deal with each step of this task, namely stop word filtering, tokenization, stemming/lemmatization, parsing, tagging, and translation. Recent works have mostly focused on the translation step overlooking the earlier steps by using adhoc solutions. In the pipeline, one of the most critical and challenging problems is keyword mapping; constructing a mapping between tokens in the query and relational database elements (tables, attributes, values, etc.). We define the keyword mapping problem as a sequence tagging problem, and propose a novel deep learning based supervised approach that utilizes POS tags of NLQs. Our proposed approach, called DBTagger (DataBase Tagger), is an end-to-end and schema independent solution, which makes it practical for various relational databases. We evaluate our approach on eight different datasets, and report new state-of-the-art accuracy results, 92.4% on the average. Our results also indicate that DBTagger is faster than its counterparts up to 10000 times and scalable for bigger databases.en_US
dc.identifier.doi10.14778/3446095.3446103en_US
dc.identifier.eissn2150-8097en_US
dc.identifier.urihttp://hdl.handle.net/11693/77553en_US
dc.language.isoEnglishen_US
dc.publisherAssociation for Computing Machineryen_US
dc.relation.isversionofhttps://doi.org/10.14778/3446095.3446103en_US
dc.source.titleProceedings of the VLDB Endowmenten_US
dc.titleDBTagger: multi-task learning for keyword mapping in NLIDBs using Bi-directional recurrent neural networksen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DBTagger_multi-task_learning_for_keyword_mapping_in_NLIDBs_using_Bi-directional_recurrent_neural_networks.pdf
Size:
764.68 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.69 KB
Format:
Item-specific license agreed upon to submission
Description: