AR-Sanad 280K: A Novel 280K Artificial Sanads Dataset for Hadith Narrator Disambiguation
Determining hadith authenticity is vitally important in Islam because hadiths record the sayings and actions of Prophet Muhammad (PBUH) and are the second source of Islamic teachings after the Quran. When authenticating a hadith, scholars treat the reliability of its narrators as a major factor. However, many narrators share similar names, and narration chains usually do not give the narrators’ full names. Ambiguous narrators must therefore be identified before their reliability can be assessed. No available dataset addresses this narrator identification problem. Here, we present a new dataset of narration chains (sanads) with identified narrators. The AR-Sanad 280K dataset contains around 280K artificial sanads and can be used to identify 18,298 narrators. Using this dataset, we address narrator disambiguation in several experimental setups, modeling it as a multiclass classification problem with 18,298 class labels. We test different representations and models in our experiments. The best results were achieved by fine-tuning a BERT-based deep learning model (AraBERT): a Micro F1 score of 92.9 and a sanad error rate (SER) of 30.2 on the validation set of the artificial AR-Sanad 280K sanads. Furthermore, we extracted a real test set from the sanads of the six famous books of Islamic hadith. On this real test data, the best model achieved a Micro F1 score of 83.5 and a sanad error rate of 60.6.
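The two reported metrics measure errors at different granularities, which may help explain why a high Micro F1 can sit alongside a much higher sanad error rate. The sketch below is illustrative only and is not the authors' evaluation code. It assumes that each narrator mention receives exactly one predicted label (in which case Micro F1 equals mention-level accuracy) and that SER is the fraction of sanads containing at least one misidentified narrator; the abstract does not define SER, so that definition is an assumption. The function names and the toy narrator IDs are hypothetical.

```python
from typing import Sequence


def micro_f1(gold: Sequence[Sequence[int]], pred: Sequence[Sequence[int]]) -> float:
    """Micro F1 over all narrator mentions.

    With exactly one gold and one predicted label per mention, micro-averaged
    precision, recall, and F1 all reduce to plain accuracy.
    """
    total = sum(len(chain) for chain in gold)
    correct = sum(
        g == p
        for gold_chain, pred_chain in zip(gold, pred)
        for g, p in zip(gold_chain, pred_chain)
    )
    return correct / total


def sanad_error_rate(gold: Sequence[Sequence[int]], pred: Sequence[Sequence[int]]) -> float:
    """ASSUMED definition: share of sanads with at least one wrong narrator."""
    wrong = sum(list(g) != list(p) for g, p in zip(gold, pred))
    return wrong / len(gold)


# Toy data: each inner list is one sanad, each integer a hypothetical narrator ID.
gold = [[3, 17, 42], [5, 17], [8, 9, 10, 11]]
pred = [[3, 17, 42], [5, 99], [8, 9, 10, 12]]

print(f"Micro F1: {micro_f1(gold, pred):.3f}")
print(f"SER:      {sanad_error_rate(gold, pred):.3f}")
```

In this toy case seven of nine mentions are correct, yet two of three chains contain an error: under the assumed definition, a single wrong narrator counts against the whole sanad, so longer chains push SER up faster than they pull Micro F1 down.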
Related Results
AL-THULATHIYYAT DALAM KITAB INDUK HADITH
One of privilege of Islam brought by the Prophet Muhammad SAW is the existence of news linker to the followers. The news linker (riwayah) is sanad. The sanad condition in Islam get...
Konsep Hadith Mawdu‘i Menurut Perspektif Pengkaji Hadith Kontemporari: Antara Dirasah Al-Mawdu‘Iyyah min Al-Hadith dan Syarh Al-Mawdu‘I li Al-Hadith
The hadith mawdu’i (thematic hadith) terminology is widely used in contemporary hadith discourse. However, there are confusions in understanding its true concept since several othe...
Authority And Hadith Research Methodology
The purpose of examining a hadith and sanad is to determine the authenticity of the hadith. In the rules, a sanad or Mata is distinguished or must be able to include two rules, nam...
Kontribusi Ali Mustafa Yaqub (1952-2016) dalam Dinamika Kajian Hadis di Indonesia
This article discusses the contribution of Ali Mustafa Yaqub to the dynamics of hadith studies in Indonesia. He was one of the experts in the field of hadith. The hadiths that...
PEMIKIRAN KRITIK SANAD HADIS
Hadith sourced from the Prophet Muhammad in the form of words, deeds and takrir. Hadith can be accepted as dalil in Islam, if the...
Al-Hadith Al-Gharib in the Discourse of Hadith Studies; The Authenticity and The Authority
This article aims to examine the concept of al-hadith al-Gharib from the perspective of hadith scholars. The primary research questions address: (1) What defines al-hadith al-Ghari...
Tradisi Periwayatan Umat Islam
This study discusses the narration tradition of the Muslim community, particularly the fields in which the sanad is applied. There are at least three fields that are often accompanied by a sanad, ...
KUALITAS HADIS SHAHIH, HASAN, DHAIF SEBAGAI HUJJAH DALAM HUKUM ISLAM
As the second source of Islamic law after the Qur'an, Hadith is divided into three, namely Sahih, Hasan and Dhaif Hadith. Sahih hadith is a hadith that fulfills the conditions: San...

