Javascript must be enabled to continue!
Special parallel processor for lu decomposition of a large‐scale sparse matrix
View through CrossRef
AbstractThe analysis of a nonlinear network is reduced to the solution of a system of equations at each step of Newton's method. In the solution by the direct method, the highspeed realization of the LU decomposition of an large‐scale sparse matrix has become important with the development of the integrated technology. In the past, the applications of the array processor and the supercomputer have been considered. However, the array processor cannot utilize the sparsity of the matrix, resulting in an impractical system with too many processor cells. The efficiency of the vectorization is a problem in the super‐computer, when it is used for the Gauss elimination only for the nonzero elements in the matrix. This paper proposes a dedicated processor for LU decomposition of a large‐scale sparse matrix, which can utilize the sparsity of the matrix and can be realized with a practical number of processors. The processor is composed of q local units, corresponding to the nonzero elements in each row of the coefficient matrix, and those units operate in parallel. In the processing of a sparse matrix, the matching of the label is required in the Gauss elimination. It is realized with a high speed by the data‐shift operation by the inter‐register transfer. Furthermore, a processor with a hierarchical memory structure is proposed which can cope with the case where the number p of nonzero elements in a row exceeds q.
Title: Special parallel processor for lu decomposition of a large‐scale sparse matrix
Description:
AbstractThe analysis of a nonlinear network is reduced to the solution of a system of equations at each step of Newton's method.
In the solution by the direct method, the highspeed realization of the LU decomposition of an large‐scale sparse matrix has become important with the development of the integrated technology.
In the past, the applications of the array processor and the supercomputer have been considered.
However, the array processor cannot utilize the sparsity of the matrix, resulting in an impractical system with too many processor cells.
The efficiency of the vectorization is a problem in the super‐computer, when it is used for the Gauss elimination only for the nonzero elements in the matrix.
This paper proposes a dedicated processor for LU decomposition of a large‐scale sparse matrix, which can utilize the sparsity of the matrix and can be realized with a practical number of processors.
The processor is composed of q local units, corresponding to the nonzero elements in each row of the coefficient matrix, and those units operate in parallel.
In the processing of a sparse matrix, the matching of the label is required in the Gauss elimination.
It is realized with a high speed by the data‐shift operation by the inter‐register transfer.
Furthermore, a processor with a hierarchical memory structure is proposed which can cope with the case where the number p of nonzero elements in a row exceeds q.
Related Results
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
The Application of S‐transform Spectrum Decomposition Technique in Extraction of Weak Seismic Signals
AbstractIn processing of deep seismic reflection data, when the frequency band difference between the weak useful signal and noise both from the deep subsurface is very small and h...
Matrix Subgridding and Its Effects in Dual Porosity Simulators
Matrix Subgridding and Its Effects in Dual Porosity Simulators
Abstract
Naturally fractured reservoirs are found throughout the world and contain significant amounts of oil reserves. The so-called dual porosity model is one o...
Efficiency of Steamflooding in Naturally Fractured Reservoirs
Efficiency of Steamflooding in Naturally Fractured Reservoirs
Abstract
This study aims to identify the effective parameters on matrix heating and recovery, and the efficiencies of these processes while there is a continuous ...
Unified framework for multi‐scale decomposition and applications
Unified framework for multi‐scale decomposition and applications
Since real‐world digital images differ in thousands ways, an adaptive multi‐scale decomposition scheme adapting to images is increasingly urgently required for image analysis and a...
Design of RISCV processor using verilog
Design of RISCV processor using verilog
The main goal of this paper is to develop a 32-bit pipelined processor with several clock domains based on the RISCV (open source RV32I Version 2.0) ISA. To minimize the complexity...
Substrate type and discovery govern decomposition along a savanna rainfall gradient
Substrate type and discovery govern decomposition along a savanna rainfall gradient
Abstract
Decomposition is the process by which dead plant biomass is recycled and made available again for uptake by other plants. It is largely mediated by microbes and so...
Leaf litter diversity and structure of microbial decomposer communities modulate litter decomposition in aquatic systems
Leaf litter diversity and structure of microbial decomposer communities modulate litter decomposition in aquatic systems
AbstractLeaf litter decomposition is a major ecosystem process that can link aquatic to terrestrial ecosystems by flows of nutrients. Biodiversity and ecosystem functioning researc...
Generation of correlated pseudorandom varibales
Generation of correlated pseudorandom varibales
When Monte Carlo method is used to study many problems, it is sometimes necessary to sample correlated pseudorandom variables. Previous studies have shown that the Cholesky decompo...

