MD2PR: A Multi-level Distillation based Dense Passage Retrieval Model
Abstract
The reranker and the retriever are two important components of information retrieval systems. The retriever typically adopts a dual-encoder model, in which queries and documents are fed separately into two pre-trained models and the resulting vectors are used to compute similarity. The reranker often uses a cross-encoder model, in which concatenated query-document pairs are fed into a single pre-trained model to obtain word similarities. However, because it encodes queries and documents independently, the dual-encoder lacks interaction between them, while the cross-encoder incurs a substantial computational cost for attention calculation, which makes real-time retrieval difficult. In this paper, we propose MD2PR, a dense retrieval model based on multi-level knowledge distillation: the knowledge learned by the cross-encoder is distilled into the dual-encoder at both the sentence level and the word level. Sentence-level distillation improves the dual-encoder's ability to capture the themes and emotions of sentences. Word-level distillation improves its analysis of word semantics and relationships. As a result, the dual-encoder can be used on its own for subsequent encoding and retrieval, avoiding the significant computational cost of involving the cross-encoder. Furthermore, we propose a dynamic false negative filtering method that updates the filtering threshold over multiple training iterations to identify false negatives effectively and thus obtain a more comprehensive semantic representation space. Experimental results on two standard datasets show that MD2PR outperforms 14 baseline models in terms of MRR and Recall.
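The abstract does not give the loss functions or the threshold update rule, so the following is only a minimal sketch of how sentence-level distillation and dynamic false negative filtering might look, not the authors' implementation. It assumes that the student score is a dot product of dual-encoder vectors, that the teacher (cross-encoder) supplies one relevance score per candidate passage, and that the two score distributions are matched with a KL divergence. The function names, the `temperature` and `momentum` parameters, and the moving-average threshold update are all hypothetical.

```python
import math


def log_softmax(scores, temperature=1.0):
    """Numerically stable log-softmax over a list of scores."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)
    log_total = peak + math.log(sum(math.exp(s - peak) for s in scaled))
    return [s - log_total for s in scaled]


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def sentence_level_distillation_loss(query_vec, passage_vecs,
                                     teacher_scores, temperature=1.0):
    """KL(teacher || student) over the candidate passages of one query.

    Assumption: the student scores each passage by a dot product of
    dual-encoder vectors; teacher_scores come from a cross-encoder.
    """
    student_scores = [dot(query_vec, p) for p in passage_vecs]
    log_teacher = log_softmax(teacher_scores, temperature)
    log_student = log_softmax(student_scores, temperature)
    return sum(math.exp(lt) * (lt - ls)
               for lt, ls in zip(log_teacher, log_student))


def filter_false_negatives(negative_ids, teacher_scores, threshold):
    """Keep only sampled negatives that the teacher scores below threshold.

    Negatives at or above the threshold are treated as likely false
    negatives and dropped. teacher_scores maps passage id -> score.
    """
    return [pid for pid in negative_ids if teacher_scores[pid] < threshold]


def update_threshold(threshold, positive_scores, momentum=0.9):
    """Hypothetical update rule: move the threshold toward the mean
    teacher score of the labelled positives seen in this iteration."""
    target = sum(positive_scores) / len(positive_scores)
    return momentum * threshold + (1.0 - momentum) * target


if __name__ == "__main__":
    query = [1.0, 0.0]
    passages = [[2.0, 0.0], [1.0, 0.0]]
    print(sentence_level_distillation_loss(query, passages, [1.0, 2.0]))
    scores = {"p1": 0.2, "p2": 0.95, "p3": 0.5}
    print(filter_false_negatives(["p1", "p2", "p3"], scores, threshold=0.9))
```

In a training loop under these assumptions, `update_threshold` would be called once per iteration and the filtered negatives would feed the retrieval loss alongside the distillation term. Word-level distillation is not sketched because the abstract does not say which word-level signals are matched.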

