Query Expansion

Showing results in 'Publications'. Show all posts
Al-Chalabi, H., S. Ray, and K. Shaalan, "Semantic Based Query Expansion for Arabic Question Answering Systems", The International Conference on Arabic Computational Linguistics (ACLing), Cairo, Egypt, 17 April, 2015. Abstractqeacling2015.pdf

Question Answering Systems have emerged as a good alternative to search engines where they produce the desired information in a very precise way in the real time. However, one serious concern with the Question Answering system is that despite having answers of the questions in the knowledge base, they are not able to retrieve the answer due to mismatch between the words used by users and content creators. There has been a
lot of research in the field of English and some European language Question Answering Systems to handle this issue. However, Arabic Question Answering Systems could not match the pace due to some inherent difficulties with the language itself as well as due to lack of tools available to assist the researchers. In this paper, we are
presenting a method to add semantically equivalent keywords in the questions by using semantic resources. The experiments suggest that the proposed research can deliver highly accurate
answers for Arabic questions.

Shaalan, K., S. Al-Sheikh, and F. Oroumchian, "Query Expansion Based-on Similarity of Terms for Improving Arabic Information Retrieval", Intelligent Information Processing VI, vol. 385, Berlin Heidelberg, Springer, pp. 167-176, 2012. Abstractquery_arabic_iip_ifip.pdf

This research suggests a method for query expansion on Arabic Information Retrieval using Expectation Maximization (EM). We employ the EM algorithm in the process of selecting relevant terms for expanding the query and weeding out the non-related terms. We tested our algorithm on INFILE test collection of CLLEF2009, and the experiments show that query expansion that considers similarity of terms both improves precision and retrieves more relevant documents. The main finding of this research is that we can increase the recall while keeping the precision at the same level by this method.

Tourism