Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.12323/7791
Title: Fine-Tuning QurSim on Monolingual and Multilingual Models for Semantic Search
Authors: Afzal, Tania
Rauf, Sadaf Abdul
Abbas Malik, Muhammad Ghulam
Imran, Muhammad
Keywords: semantic similarity
Quranic verse classification
verse retrieval
semantic search
Issue Date: 23-Jan-2025
Publisher: MDPI
Series/Report no.: Vol. 16;Information, № 2
Abstract: Transformers have made a significant breakthrough in natural language processing. These models are trained on large datasets and can handle multiple tasks. We compare monolingual and multilingual transformer models for semantic relatedness and verse retrieval. We leveraged data from the original QurSim dataset (Arabic) and used authentic multi-author translations in 22 languages to create a multilingual QurSim dataset, which we released for the research community. We evaluated the performance of monolingual and multilingual LLMs for Arabic and our results show that monolingual LLMs give better results for verse classification and matching verse retrieval. We incrementally built monolingual models with Arabic, English, and Urdu and multilingual models with all 22 languages supported by the multilingual paraphrase-MiniLM-L12-v2 model. Our results show improvement in classification accuracy with the incorporation of multilingual QurSim.
URI: http://hdl.handle.net/20.500.12323/7791
ISSN: 2078-2489
Appears in Collections:Publications

Files in This Item:
File Description SizeFormat 
Fine-Tuning QurSim on Monolingual and Multilingual Models or Semantic Search.pdf292.89 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.