Hardware Reference
In-Depth Information
■ Coherence Protocol Performance
■ Coherence Protocol Optimizations
Consider the distributed shared-memory system illustrated in Figure 5.37 . It consists of two
four-core chips. The processor in each chip share an L2 cache (L2$), and the two chips are
connected via a point-to-point interconnect. The system memory is distributed across the two
chips. Figure 5.38 zooms in on part of this system. Pi,j denotes processor i in chip j . Each pro-
cessor has a single direct-mapped L1 cache that holds two blocks, each holding two words.
Each chip has a single direct-mapped L2 cache that holds two blocks, each holding two words.
To simplify the illustration, the cache address tags contain the full address and each word
shows only two hex characters, with the least significant word on the right. The L1 cache states
are denoted M, S, and I for Modified, Shared, and Invalid. Both the L2 caches and memor-
ies have directories. The directory states are denoted DM, DS, and DI for Directory Modified,
Directory Shared, and Directory Invalid. The simple directory protocol is described in Figures
5.22 and 5.23 . The L2 directory lists the local sharers/owners and additionally records if a line
is shared externally in another chip; for example, P1,0;E denotes that a line is shared by local
processor P1,0 and is externally shared in some other chip. The memory directory has a list of
the chip sharers/owners of a line; for example, C0,C1 denotes that a line is shared in chips 0 and
1.
FIGURE 5.37 Multichip, multicore multiprocessor with DSM .
 
Search WWH ::




Custom Search