Search engine for discovering works of Art, research articles, and books related to Art and Culture
ShareThis
Javascript must be enabled to continue!

TB-Free RTL Anomaly Detection for Early Chip Verification: A Reproducible Binary Benchmark and Baseline Study on CVDP

View through CrossRef
Early digital verification is dominated by manual RTL review because simulation testbenches and assertions are typically incomplete or unavailable during pre-verification. This work studies testbench-free (TB-free) anomaly detection for SystemVerilog RTL as a practical surrogate for early bug-risk spotting: given an RTL snippet, a model assigns a probability that the snippet contains a defect-like anomaly without executing the design. We construct a fully reproducible binary benchmark from the Comprehensive Verilog Design Problems (CVDP) corpus. From the non-agentic code comprehension tasks (v1.0.2) we extract 114 RTL files across 113 distinct problems, window them into 546 snippets (30 lines, stride 15), and create paired clean/buggy samples by injecting exactly one mutation per snippet from five mutation operators (constant flips, edge-sensitivity flips, equality/inequality flips, boolean operator flips, and nonblocking-to-blocking assignment changes). The resulting benchmark contains 1092 labeled samples with a 50/50 class balance. We evaluate seven TB-free baselines under a strict group split by problem id to prevent design leakage: a heuristic risk score, a character 5-gram language model, a structural-feature logistic regressor, word-level TF-IDF linear models, and character-level TF-IDF models with and without structural features. The best-performing baseline, a character TF-IDF linear SVM, achieves AUROC 0.816, AUPRC 0.838, accuracy 0.791, and F1 0.772 on a held-out test split, with bootstrap 95% confidence interval [0.743, 0.880] for AUROC. These results quantify how much risk-ranking signal is available from RTL text alone and establish a transparent evaluation scaffold for LLM-assisted RTL review that does not depend on a testbench.
Title: TB-Free RTL Anomaly Detection for Early Chip Verification: A Reproducible Binary Benchmark and Baseline Study on CVDP
Description:
Early digital verification is dominated by manual RTL review because simulation testbenches and assertions are typically incomplete or unavailable during pre-verification.
This work studies testbench-free (TB-free) anomaly detection for SystemVerilog RTL as a practical surrogate for early bug-risk spotting: given an RTL snippet, a model assigns a probability that the snippet contains a defect-like anomaly without executing the design.
We construct a fully reproducible binary benchmark from the Comprehensive Verilog Design Problems (CVDP) corpus.
From the non-agentic code comprehension tasks (v1.
2) we extract 114 RTL files across 113 distinct problems, window them into 546 snippets (30 lines, stride 15), and create paired clean/buggy samples by injecting exactly one mutation per snippet from five mutation operators (constant flips, edge-sensitivity flips, equality/inequality flips, boolean operator flips, and nonblocking-to-blocking assignment changes).
The resulting benchmark contains 1092 labeled samples with a 50/50 class balance.
We evaluate seven TB-free baselines under a strict group split by problem id to prevent design leakage: a heuristic risk score, a character 5-gram language model, a structural-feature logistic regressor, word-level TF-IDF linear models, and character-level TF-IDF models with and without structural features.
The best-performing baseline, a character TF-IDF linear SVM, achieves AUROC 0.
816, AUPRC 0.
838, accuracy 0.
791, and F1 0.
772 on a held-out test split, with bootstrap 95% confidence interval [0.
743, 0.
880] for AUROC.
These results quantify how much risk-ranking signal is available from RTL text alone and establish a transparent evaluation scaffold for LLM-assisted RTL review that does not depend on a testbench.

Related Results

Verification of High Speed on Chip with VIP using System Verilog
Verification of High Speed on Chip with VIP using System Verilog
Abstract - The exploration work is addressing verification of High speed on chips protocol; we've used the system Verilog grounded test bench structure. I developed a system Verilo...
کشمير مخالف نسل پرستی کی تعريف
کشمير مخالف نسل پرستی کی تعريف
<p dir="rtl">کشمير مخالف نسل پرستی ک ی تعريف</p><p dir="rtl">منجانب: ب نيش احمد I ترجمہ: ڈاکٹر فہد احمد</p><p dir="rtl">کشمير مخا لف نسل پرس تی در اصل...
کشمير مخالف نسل پرستی کی تعريف
کشمير مخالف نسل پرستی کی تعريف
<p dir="rtl">کشمير مخالف نسل پرستی ک ی تعريف</p><p dir="rtl">منجانب: ب نيش احمد I ترجمہ: ڈاکٹر فہد احمد</p><p dir="rtl">کشمير مخا لف نسل پرس تی در اصل...
Fructose intake and its association with relative telomere length: an exploratory study among healthy Lebanese adults
Fructose intake and its association with relative telomere length: an exploratory study among healthy Lebanese adults
IntroductionShorter relative telomere length (RTL) has been associated with increased incidence of morbidity. Although still disputed, available evidence suggests that dietary fact...
Trace2PatchLR: Reproducible RTL Bug Localization and Template-Guided Repair for Verilog Debug
Trace2PatchLR: Reproducible RTL Bug Localization and Template-Guided Repair for Verilog Debug
Register-transfer level (RTL) debug dominates the turnaround time of modern chip development: after simulation or formal verification reports a failure, engineers must localize the...
Association between leukocyte telomere length and angiogenic cytokines in knee osteoarthritis
Association between leukocyte telomere length and angiogenic cytokines in knee osteoarthritis
AbstractAimThe aims of this study were to compare leukocyte relative telomere length (RTL) in knee osteoarthritis (OA) patients and healthy controls and to investigate associations...
Comparing genome-wide chromatin profiles using ChIP-chip or ChIP-seq
Comparing genome-wide chromatin profiles using ChIP-chip or ChIP-seq
AbstractMotivation: ChIP-chip and ChIP-seq technologies provide genome-wide measurements of various types of chromatin marks at an unprecedented resolution. With ChIP samples colle...

Back to Top