Javascript must be enabled to continue!
TB-Free RTL Anomaly Detection for Early Chip Verification: A Reproducible Binary Benchmark and Baseline Study on CVDP
View through CrossRef
Early digital verification is dominated by manual RTL review because simulation testbenches and assertions are typically incomplete or unavailable during pre-verification. This work studies testbench-free (TB-free) anomaly detection for SystemVerilog RTL as a practical surrogate for early bug-risk spotting: given an RTL snippet, a model assigns a probability that the snippet contains a defect-like anomaly without executing the design. We construct a fully reproducible binary benchmark from the Comprehensive Verilog Design Problems (CVDP) corpus. From the non-agentic code comprehension tasks (v1.0.2) we extract 114 RTL files across 113 distinct problems, window them into 546 snippets (30 lines, stride 15), and create paired clean/buggy samples by injecting exactly one mutation per snippet from five mutation operators (constant flips, edge-sensitivity flips, equality/inequality flips, boolean operator flips, and nonblocking-to-blocking assignment changes). The resulting benchmark contains 1092 labeled samples with a 50/50 class balance. We evaluate seven TB-free baselines under a strict group split by problem id to prevent design leakage: a heuristic risk score, a character 5-gram language model, a structural-feature logistic regressor, word-level TF-IDF linear models, and character-level TF-IDF models with and without structural features. The best-performing baseline, a character TF-IDF linear SVM, achieves AUROC 0.816, AUPRC 0.838, accuracy 0.791, and F1 0.772 on a held-out test split, with bootstrap 95% confidence interval [0.743, 0.880] for AUROC. These results quantify how much risk-ranking signal is available from RTL text alone and establish a transparent evaluation scaffold for LLM-assisted RTL review that does not depend on a testbench.
Scientific Publication Center
Title: TB-Free RTL Anomaly Detection for Early Chip Verification: A Reproducible Binary Benchmark and Baseline Study on CVDP
Description:
Early digital verification is dominated by manual RTL review because simulation testbenches and assertions are typically incomplete or unavailable during pre-verification.
This work studies testbench-free (TB-free) anomaly detection for SystemVerilog RTL as a practical surrogate for early bug-risk spotting: given an RTL snippet, a model assigns a probability that the snippet contains a defect-like anomaly without executing the design.
We construct a fully reproducible binary benchmark from the Comprehensive Verilog Design Problems (CVDP) corpus.
From the non-agentic code comprehension tasks (v1.
2) we extract 114 RTL files across 113 distinct problems, window them into 546 snippets (30 lines, stride 15), and create paired clean/buggy samples by injecting exactly one mutation per snippet from five mutation operators (constant flips, edge-sensitivity flips, equality/inequality flips, boolean operator flips, and nonblocking-to-blocking assignment changes).
The resulting benchmark contains 1092 labeled samples with a 50/50 class balance.
We evaluate seven TB-free baselines under a strict group split by problem id to prevent design leakage: a heuristic risk score, a character 5-gram language model, a structural-feature logistic regressor, word-level TF-IDF linear models, and character-level TF-IDF models with and without structural features.
The best-performing baseline, a character TF-IDF linear SVM, achieves AUROC 0.
816, AUPRC 0.
838, accuracy 0.
791, and F1 0.
772 on a held-out test split, with bootstrap 95% confidence interval [0.
743, 0.
880] for AUROC.
These results quantify how much risk-ranking signal is available from RTL text alone and establish a transparent evaluation scaffold for LLM-assisted RTL review that does not depend on a testbench.
Related Results
کشمير مخالف نسل پرستی کی تعريف
کشمير مخالف نسل پرستی کی تعريف
<p dir="rtl">کشمير مخالف نسل پرستی ک ی تعريف</p><p dir="rtl">منجانب: ب نيش احمد I ترجمہ: ڈاکٹر فہد احمد</p><p dir="rtl">کشمير مخا لف نسل پرس تی در اصل...
کشمير مخالف نسل پرستی کی تعريف
کشمير مخالف نسل پرستی کی تعريف
<p dir="rtl">کشمير مخالف نسل پرستی ک ی تعريف</p><p dir="rtl">منجانب: ب نيش احمد I ترجمہ: ڈاکٹر فہد احمد</p><p dir="rtl">کشمير مخا لف نسل پرس تی در اصل...
Association between leukocyte telomere length and angiogenic cytokines in knee osteoarthritis
Association between leukocyte telomere length and angiogenic cytokines in knee osteoarthritis
AbstractAimThe aims of this study were to compare leukocyte relative telomere length (RTL) in knee osteoarthritis (OA) patients and healthy controls and to investigate associations...
Comparing genome-wide chromatin profiles using ChIP-chip or ChIP-seq
Comparing genome-wide chromatin profiles using ChIP-chip or ChIP-seq
AbstractMotivation: ChIP-chip and ChIP-seq technologies provide genome-wide measurements of various types of chromatin marks at an unprecedented resolution. With ChIP samples colle...
Abstract 4146122: Potential Protective Roles of Clonal Hematopoiesis of Indeterminate Potential in Angina Pectoris
Abstract 4146122: Potential Protective Roles of Clonal Hematopoiesis of Indeterminate Potential in Angina Pectoris
Introduction:
Clonal hematopoiesis of indeterminate potential (CHIP) poses strong relationship to the occurrence of cardiovascular diseases with the process of aging. I...
Shenzi 16-Inch Oil Export SCR CVA Verification
Shenzi 16-Inch Oil Export SCR CVA Verification
Abstract
In 2006 Enterprise developed a 16-inch oil export system from Shenzi field located in Green Canyon Block 653 in the Gulf of Mexico, approximately 120 nau...
A systematic survey: role of deep learning-based image anomaly detection in industrial inspection contexts
A systematic survey: role of deep learning-based image anomaly detection in industrial inspection contexts
Industrial automation is rapidly evolving, encompassing tasks from initial assembly to final product quality inspection. Accurate anomaly detection is crucial for ensuring the reli...
Platform Verification - Aview From Amember Of Industry
Platform Verification - Aview From Amember Of Industry
ABSTRACT
Concerns have been raised in many sectors regarding the safety and reliability of offshore platforms. In this paper, the history of offshore operations a...

