Fetch Directed Prefetching a Study CS 752 Project















- Slides: 15
Fetch Directed Prefetching - a Study CS 752 Project Gokul Nadathur Nitin Bahadur Sambavi Muthukrishnan Gokul, Nitin, Sambavi 1
Motivation n Execution engine limited by fetch bandwidth n n effect of memory latency on fetch correlation between i-cache stalls and branch predictor rate at which branch predictor and BTB can be cycled With increase in ILP, there is a need to increase fetch performance Gokul, Nitin, Sambavi 2
Fetch Directed Architecture Prefetch Instruction Queue Branch Predictor Prefetch Filtration Mechanism L 2 Cache Prefetch Buffer Instruction Fetch Target Buffer Fetch Target Queue Gokul, Nitin, Sambavi 3
Decoupled Branch Predictor n n n has its own PC runs independent of fetch pipeline stage makes a prediction each cycle unaffected by i-cache stalls Problem!!! n May not have updated branch history Gokul, Nitin, Sambavi 4
Fetch Target Buffer and Fetch Target Queue Fetch Target Buffer n n n Stores fall through and target address for taken branches Accessed with a prediction from branch predictor each cycle Fills in single/multiple cache line blocks into FTQ Fetch Target Queue n n Contains blocks of instruction addresses to be next executed FTQ entries are dequeued by fetch engine Gokul, Nitin, Sambavi 5
Prefetch Filter and Prefetch Instruction Queue n n Contains queue of cache blocks to be prefetched Prefetch mechanism dequeues PIQ and performs the prefetching Prefetch Filter n n Takes entries from FTQ, filters them and inserts them into PIQ Enables intelligent prefetching ! Gokul, Nitin, Sambavi 6
Stream Buffers L 1 I-cache L 2 I-cache Tag and comparator Stream buffer Cache block Tag Cache block Head Tail FIFO Gokul, Nitin, Sambavi 7
Prefetching in the Fetch Directed Architecture n n Similar to stream buffers Addresses given by PIQ Gokul, Nitin, Sambavi 8
Simulation Results Gokul, Nitin, Sambavi 9
Simulation Results Gokul, Nitin, Sambavi 10
Simulation Results Gokul, Nitin, Sambavi 11
Simulation Results Gokul, Nitin, Sambavi 12
Simulation Results Gokul, Nitin, Sambavi 13
Simulation Results Gokul, Nitin, Sambavi 14
Conclusions n n n Prefetching definitely helps Fetch directed architecture aids prefetching Optimal results require sophisticated memory hierarchy Gokul, Nitin, Sambavi 15