Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

Classification of open source software bug report based on transfer learning

View through CrossRef
AbstractCurrently, the feature richness of text encoding vectors in the bug report classification model based on deep learning is limited by the size of the domain dataset and the quality of the text. However, it is difficult to further enrich the features of text encoding vectors. At the same time, most existing bug report classification methods ignore the submitter's personal information. To solve these problems, we construct nine personal information characteristics of bug report submitters in GitHub by survey. Then, we propose a GitHub bug report classification method named personal information fine‐tuning network (PIFTNet) based on transfer learning and the submitter's personal information. PIFTNet transfers the general text feature vectors in bidirectional encoder representation from transformers (BERT) to the domain of bug report classification by fine‐tuning the pre‐training parameters in BERT. It also combines the text characteristics and the characteristics of the submitter's personal information to construct the classification model. In addition, we propose a two‐stage training method to alleviate the catastrophic changes in the pre‐training parameters and loss of the initially learned knowledge caused by direct training of PIFTNet. We verify the proposed PIFTNet on the dataset extracted from GitHub and empirical results prove the effectiveness of PIFTNet.
Title: Classification of open source software bug report based on transfer learning
Description:
AbstractCurrently, the feature richness of text encoding vectors in the bug report classification model based on deep learning is limited by the size of the domain dataset and the quality of the text.
However, it is difficult to further enrich the features of text encoding vectors.
At the same time, most existing bug report classification methods ignore the submitter's personal information.
To solve these problems, we construct nine personal information characteristics of bug report submitters in GitHub by survey.
Then, we propose a GitHub bug report classification method named personal information fine‐tuning network (PIFTNet) based on transfer learning and the submitter's personal information.
PIFTNet transfers the general text feature vectors in bidirectional encoder representation from transformers (BERT) to the domain of bug report classification by fine‐tuning the pre‐training parameters in BERT.
It also combines the text characteristics and the characteristics of the submitter's personal information to construct the classification model.
In addition, we propose a two‐stage training method to alleviate the catastrophic changes in the pre‐training parameters and loss of the initially learned knowledge caused by direct training of PIFTNet.
We verify the proposed PIFTNet on the dataset extracted from GitHub and empirical results prove the effectiveness of PIFTNet.

Related Results

Hydatid Disease of The Brain Parenchyma: A Systematic Review
Hydatid Disease of The Brain Parenchyma: A Systematic Review
Abstarct Introduction Isolated brain hydatid disease (BHD) is an extremely rare form of echinococcosis. A prompt and timely diagnosis is a crucial step in disease management. This ...
Bug Report Summarization by Using Swarm Intelligence Approaches
Bug Report Summarization by Using Swarm Intelligence Approaches
Background: Bug reports are considered as a reference document, during the maintenance phase of the software development process. The developer's counsel them at whatever point the...
Effective Bug Triage With Software Reliability
Effective Bug Triage With Software Reliability
Programming associations spend in excess of 45 percent of cost in overseeing programming bugs. An inevitable progress of settling bugs is bug triage, which wants to precisely dole ...
Breast Carcinoma within Fibroadenoma: A Systematic Review
Breast Carcinoma within Fibroadenoma: A Systematic Review
Abstract Introduction Fibroadenoma is the most common benign breast lesion; however, it carries a potential risk of malignant transformation. This systematic review provides an ove...
A Comparative Study of Multilabel Classification Techniques for Analyzing Bug Report Dependencies
A Comparative Study of Multilabel Classification Techniques for Analyzing Bug Report Dependencies
Bug report dependency analysis entails identifying and examining the interrelations among software bug reports. Dependencies may indicate that bugs are interconnected, with one bug...
Software Bug Ontology Supporting Bug Search on Peer-to-Peer Networks
Software Bug Ontology Supporting Bug Search on Peer-to-Peer Networks
This paper presents a semantics-based bug search system that allows users to solve bugs by searching similar bug reports on peer-to-peer networks. This system uses a bug schema to ...
Hybrid Multiple Deep Learning Models to Boost Blocking Bug Prediction
Hybrid Multiple Deep Learning Models to Boost Blocking Bug Prediction
<p>A blocking bug (BB) is a severe bug that could prevent other bugs from being fixed in time and cost more effort to repair itself in software maintenance. Hence, early dete...
Using CNN to Predict the Resolution Status of Bug Reports
Using CNN to Predict the Resolution Status of Bug Reports
Abstract Bug tracking systems (BTS) are a resource for receiving bug reports that help to improve software applications. They usually contain reports reported by the...

Back to Top