
4. (40 points) Assume that for arithmetic, load/store, and branch instructions, a processor has CPIs of 1, 12, and 5, respectively. Also assume that on a single processor a program requires the execution of 2.56E9 arithmetic instructions, 1.28E9 load/store instructions, and 256 million branch instructions. Assume each processor has a 2 GHz clock frequency. Assume that, as the program is parallelized to run over multiple cores, the number of arithmetic and load/store instructions per processor is divided by 0.7 × p (where p is the number of processors), but the number of branch instructions per processor remains the same.

a. Find the total execution time for this program on 1, 4, and 8 processors, and show the relative speedup of the 4- and 8-processor results relative to the single-processor result.

b. If the CPI of the arithmetic instructions was doubled, what would the impact be on the execution time of the program on 1, 2, 4, and 8 processors?

c. To what should the CPI of load/store instructions be reduced in order for a single processor to match the performance of four processors using the original CPI values?
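One way to check the arithmetic for parts a, b, and c is a short script. This is a minimal sketch, not the posted answer: the names `exec_time` and `matching_loadstore_cpi` are made up for illustration, and it assumes the 0.7 × p divisor applies only when p > 1, so that the single-processor case uses the instruction counts exactly as given.

```python
CLOCK_HZ = 2e9  # 2 GHz clock, from the problem statement

# Single-processor instruction counts and CPIs, from the problem statement.
COUNTS = {"arith": 2.56e9, "loadstore": 1.28e9, "branch": 256e6}
CPI = {"arith": 1.0, "loadstore": 12.0, "branch": 5.0}


def exec_time(p, cpi=CPI):
    """Execution time in seconds on p processors.

    Assumption: for p == 1 the counts are used as given; for p > 1 the
    arithmetic and load/store counts per processor are divided by 0.7 * p.
    Branch counts per processor never change.
    """
    scale = 1.0 if p == 1 else 0.7 * p
    parallel_cycles = (
        COUNTS["arith"] * cpi["arith"] + COUNTS["loadstore"] * cpi["loadstore"]
    ) / scale
    branch_cycles = COUNTS["branch"] * cpi["branch"]
    return (parallel_cycles + branch_cycles) / CLOCK_HZ


def matching_loadstore_cpi(target_p=4):
    """Load/store CPI that lets one processor match target_p processors (part c)."""
    target_cycles = exec_time(target_p) * CLOCK_HZ
    other_cycles = (
        COUNTS["arith"] * CPI["arith"] + COUNTS["branch"] * CPI["branch"]
    )
    return (target_cycles - other_cycles) / COUNTS["loadstore"]


if __name__ == "__main__":
    # Part a: times and speedups relative to one processor.
    base = exec_time(1)
    for p in (1, 4, 8):
        t = exec_time(p)
        print(f"a) p={p}: {t:.2f} s, speedup {base / t:.2f}")

    # Part b: arithmetic CPI doubled from 1 to 2.
    doubled = dict(CPI, arith=2.0)
    for p in (1, 2, 4, 8):
        print(f"b) p={p}: {exec_time(p, doubled):.2f} s (was {exec_time(p):.2f} s)")

    # Part c: load/store CPI needed on one processor.
    print(f"c) load/store CPI: {matching_loadstore_cpi(4):.2f}")
```

Under that assumption, part a comes out to 9.6 s, 3.84 s, and 2.24 s on 1, 4, and 8 processors (speedups of 2.5 and about 4.29). Doubling the arithmetic CPI in part b gives about 10.88 s, 7.95 s, 4.30 s, and 2.47 s on 1, 2, 4, and 8 processors. For part c, the load/store CPI would need to drop from 12 to 3.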