Javascript must be enabled to continue!
Bond-DQN: Deep Q-Learning of Lumped-Element Systems Design via Bond Graphs
View through CrossRef
Abstract
Lumped-element schematics (e.g. free-body diagrams, electrical circuit schematics) have been utilized for design synthesis in a wide variety of domains. They are invaluable for generalizing design synthesis in a process which consists of taking a set of available components and exploring the space of possible connectivity. The connectivity of a given lumped-element model (e.g. the netlist of an electrical schematic) represents a parameterized model which may be optimized using generalizable state space techniques and has the added benefit of being highly interpretable. Some recent machine learning models, particularly in the analog electrical circuit synthesis community, have attempted to leverage such lumped-element models in design synthesis or forward modeling tasks. They either learn faster neural network proxies for the forward dynamics of human-crafted lumped-element simulators or use hand crafted lumped-element representations in the networks themselves. An issue with these approaches is the models are difficult to generalize to different domains due to their reliance on a domain-specific dataset or simulator for design performance evaluation. In this work, we propose Bond-DQN, a Deep Q-Learning method to sequentially generate lumped-parameter designs for state-space based objectives. Specifically, our method uses a grammar based on bond graphs, which are an energy-based graph representation which have been historically used to simulate lumped-parameter models in a variety of domains. The combination of bond graphs with DQL allows us to accurately simulate and sample the large design space of multi-domain systems without retraining on external simulators or new datasets. We demonstrate Bond-DQN’s capability for generalizeable lumped-parameter design by benchmarking against a suspension optimization problem in which the previous SOTA utilizes the Harris Hawks Optimization algorithm, demonstrating a reduction in the peak vehicle translational and pitch acceleration of 20.5% and 4.1% respectively. Additionally, we showcase its ability for multi-domain design by applying to an electrical circuit voltage regulation problem.
American Society of Mechanical Engineers
Title: Bond-DQN: Deep Q-Learning of Lumped-Element Systems Design via Bond Graphs
Description:
Abstract
Lumped-element schematics (e.
g.
free-body diagrams, electrical circuit schematics) have been utilized for design synthesis in a wide variety of domains.
They are invaluable for generalizing design synthesis in a process which consists of taking a set of available components and exploring the space of possible connectivity.
The connectivity of a given lumped-element model (e.
g.
the netlist of an electrical schematic) represents a parameterized model which may be optimized using generalizable state space techniques and has the added benefit of being highly interpretable.
Some recent machine learning models, particularly in the analog electrical circuit synthesis community, have attempted to leverage such lumped-element models in design synthesis or forward modeling tasks.
They either learn faster neural network proxies for the forward dynamics of human-crafted lumped-element simulators or use hand crafted lumped-element representations in the networks themselves.
An issue with these approaches is the models are difficult to generalize to different domains due to their reliance on a domain-specific dataset or simulator for design performance evaluation.
In this work, we propose Bond-DQN, a Deep Q-Learning method to sequentially generate lumped-parameter designs for state-space based objectives.
Specifically, our method uses a grammar based on bond graphs, which are an energy-based graph representation which have been historically used to simulate lumped-parameter models in a variety of domains.
The combination of bond graphs with DQL allows us to accurately simulate and sample the large design space of multi-domain systems without retraining on external simulators or new datasets.
We demonstrate Bond-DQN’s capability for generalizeable lumped-parameter design by benchmarking against a suspension optimization problem in which the previous SOTA utilizes the Harris Hawks Optimization algorithm, demonstrating a reduction in the peak vehicle translational and pitch acceleration of 20.
5% and 4.
1% respectively.
Additionally, we showcase its ability for multi-domain design by applying to an electrical circuit voltage regulation problem.
Related Results
Control via Reinforcement Learning : Controle via Aprendizado por Reforço
Control via Reinforcement Learning : Controle via Aprendizado por Reforço
This work presents a comprehensive review and practical application of Reinforcement Learning (RL) algorithms in control engineering. The theoretical groundwork of RL is laid out, ...
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
CREATING LEARNING MEDIA IN TEACHING ENGLISH AT SMP MUHAMMADIYAH 2 PAGELARAN ACADEMIC YEAR 2020/2021
The pandemic Covid-19 currently demands teachers to be able to use technology in teaching and learning process. But in reality there are still many teachers who have not been able ...
The physics behind groundwater recession and hydrologically passive mixing volumes
The physics behind groundwater recession and hydrologically passive mixing volumes
Abstract. Transit time and water age characteristics are fundamental descriptors of catchment response, and their determination is vital for the implementation of sustainable strat...
Independent Set in Neutrosophic Graphs
Independent Set in Neutrosophic Graphs
New setting is introduced to study neutrosophic independent number and independent neutrosophic-number arising neighborhood of different vertices. Neighbor is a key term to have th...
Failed Independent Number in Neutrosophic Graphs
Failed Independent Number in Neutrosophic Graphs
New setting is introduced to study neutrosophic failed-independent number and failed independent neutrosophic-number arising neighborhood of different vertices. Neighbor is a key t...
Computing a Minimum Subset Feedback Vertex Set on Chordal Graphs Parameterized by Leafage
Computing a Minimum Subset Feedback Vertex Set on Chordal Graphs Parameterized by Leafage
Abstract
Chordal graphs are characterized as the intersection graphs of subtrees in a tree and such a representation is known as the tree model. Restricting the characteriz...
Multi-UAVs task allocation method based on MPSO-SA-DQN
Multi-UAVs task allocation method based on MPSO-SA-DQN
Multi-UAVs play an important role in the battlefield. Although many methods are proposed to solve the Multi-UAV task allocation, there still existing the problems of complex time c...
Reinforcement Learning for Climate Change Simulations: A DQN-Based Approach
Reinforcement Learning for Climate Change Simulations: A DQN-Based Approach
This research presents a novel reinforcement learning paradigm through application of a Deep Q-Network (DQN) to discover the optimal policies for adaptation to and mitigation of cl...

