High-Speed VLSI Architecture Based on Massively Parallel Processor Arrays for Real-Time Remote Sensing Applications - Applications of Digital Signal Processing

Image Processing Reference

In-Depth Information

At this stage we obtain the vector q = S + u  . The operator S + operating on u  is mapped.

Thus, the result, q , can be interpreted as a rough estimate of b  = ( b - m b ) referred to as a

degraded image.

c. Third Step: Signal Reconstruction

At this stage we obtain the estimate



-1





of the unknown signal

referred to as the reconstructed image frame. The matrix A  -1 = ( S + S +  RSF I ) -1 operating on

q produces some form of inversion of the degradations embedded in the operator S + S . It is

important to note that in the case  = 0, we have

bAqSS



(



α )

RSF







qSu



, where matrix

(α = 0)



SSSS is recognized to be the pseudoinverse (i.e., the well known Moore-Penrouse

pseudoinverse) of the SFO matrix S .

d. Fourth Step: Restoration of the Trend

Having obtained the estimate b  and known the mean value m b , we can obtain the

optimum RSF estimate (20) simply by adding the prescribed mean value m b (referred to as

the non-zero trend) to the reconstructed image frame as b = m b +

(

b  .

3.1.2 (ii) Partitioning process of the computational tasks

One of the challenging problems of the HW/SW co-design is to perform an efficient

HW/SW partitioning of the computational tasks. The aim of the partitioning problem is to

find which computational tasks can be implemented in an efficient hardware architecture

looking for the best trade-offs among the different solutions. The solution to the problem

requires, first, the definition of a partitioning model that meets all the specification

requirements (i.e., functionality, goals and constraints).

Note that from the formal SW-level co-design point of view, such DEDR techniques (20), (21),

(22) can be considered as a properly ordered sequence of the vector-matrix multiplication

procedure that one can next perform in an efficient high performance computational fashion

following the proposed bit-level high-speed VLSI co-processor architecture. In particular, for

implementing the fixed-point DEDR RSF and RASF algorithms, we consider in this

partitioning stage to develop a high-speed VLSI co-processor for the computationally complex

matrix-vector SP operation in aggregation with a powerful FPGA reconfigurable architecture

via the HW/SW co-design technique. The rest of the reconstructive SP operations are

employed in SW with a 32 bits embedded processor (MicroBlaze).

This novel VLSI-FPGA platform represents a new paradigm for real time processing of

newer RS applications. Fig. 1 illustrates the proposed VLSI-FPGA architecture for the

implementation of the RSF/RASF algorithms.

Once the partitioning stage has been defined, the selected reconstructive SP sub-task is to be

mapped into the corresponding high-speed VLSI co-processor. In the HW design, the

precision of 32 bits for performing all fixed-point operations is used, in particular, 9-bit

integer and 23-bits decimal for the implementation of the co-processor. Such precision

guarantees numerical computational errors less than 10 -5 referring to the MATLAB Fixed

Point Toolbox (Matlab, 2011).

3.1.3 Aggregation of parallel computing techniques

This sub-section is focused in how to improve the performance of the complex RS

algorithms with the aggregation of parallel computing and mapping techniques onto HW-

level massively parallel processor arrays (MPPAs).

Applications of Digital Signal Processing

Search WWH ::

Custom Search

Home