Improving scalability of inductive logic programming via pruning and best-effort optimisation

Kazmi, Mishal; Schuller, Peter; SAYGIN, YÜCEL

doi:10.1016/j.eswa.2017.06.013

Improving scalability of inductive logic programming via pruning and best-effort optimisation

Atıf İçin Kopyala

Kazmi M., Schuller P., SAYGIN Y.

EXPERT SYSTEMS WITH APPLICATIONS, cilt.87, ss.291-303, 2017 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 87
Basım Tarihi: 2017
Doi Numarası: 10.1016/j.eswa.2017.06.013
Dergi Adı: EXPERT SYSTEMS WITH APPLICATIONS
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.291-303
Anahtar Kelimeler: Answer Set Programming, Inductive logic programming, Natural Language Processing, Chunking, ANSWER, DEFINITIONS
Marmara Üniversitesi Adresli: Evet

Özet

Inductive Logic Programming (ILP) combines rule-based and statistical artificial intelligence methods, by learning a hypothesis comprising.a set of rules given background knowledge and constraints for the search space. We focus on extending the XHAIL algorithm for ILP which is based on Answer Set Programming and we evaluate our extensions using the Natural Language Processing application of sentence chunking. With respect to processing natural language, ILP can cater for the constant change in how we use language on a daily basis. At the same time, ILP does not require huge amounts of training examples such as other statistical methods and produces interpretable results, that means a set of rules, which can be analysed and tweaked if necessary. As contributions we extend XHAIL with (i) a pruning mechanism within the hypothesis generalisation algorithm which enables learning from larger datasets, (ii) a better usage of modern solver technology using recently developed optimisation methods, and (iii) a time budget that permits the usage of suboptimal results. We evaluate these improvements on the task of sentence chunking using three datasets from a recent SemEval competition. Results show that our improvements allow for learning on bigger datasets with results that are of similar quality to state-of-the-art Systems on the same task. Moreover, we compare the hypotheses obtained on datasets to gain insights on the structure of each dataset. (c) 2017 Elsevier Ltd. All rights reserved.