Computers and Technology

Assume a GPU architecture that contains 10 SIMD processors. Each SIMD instruction has a width of 32 (it operates on 32 threads), and each SIMD processor contains 8 lanes for single-precision (SP) arithmetic and load/store instructions, meaning that each non-diverged SIMD instruction can produce 32 results every 4 cycles. Assume a kernel with divergent branches that cause, on average, 80% of threads to be active. Also assume that 70% of all instructions are SP arithmetic and 20% are load/store. Because not all memory latencies are covered, assume an average SIMD instruction issue rate of 0.85. Assume the GPU has a clock speed of 1.5 GHz. Compute the throughput, in GFLOP/sec, for this code on this GPU.
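
One way to approach the arithmetic is sketched below in Python. It is an illustration under stated assumptions, not an official solution: it assumes each SP arithmetic result counts as one floating-point operation, that load/store and the remaining instructions contribute no FLOPs, and that the 80% active-thread fraction and the 0.85 issue rate both scale throughput linearly. The function name and parameter names are made up for this example.

```python
def gpu_throughput_gflops(
    simd_processors=10,
    lanes_per_processor=8,
    clock_ghz=1.5,
    active_thread_fraction=0.80,
    sp_arithmetic_fraction=0.70,
    issue_rate=0.85,
):
    """Estimate sustained SP throughput in GFLOP/sec (hypothetical helper).

    Assumption: one SP arithmetic result equals one FLOP, and only
    SP arithmetic instructions contribute FLOPs.
    """
    # 32 results every 4 cycles is 8 results per cycle per processor,
    # which is one result per lane per cycle.
    peak_results_per_cycle = simd_processors * lanes_per_processor

    # Fraction of the peak that actually produces useful SP results:
    # divergence (active threads), instruction mix, and issue stalls.
    efficiency = active_thread_fraction * sp_arithmetic_fraction * issue_rate

    # GHz * results per cycle gives giga-results per second.
    return clock_ghz * peak_results_per_cycle * efficiency


throughput = gpu_throughput_gflops()
print(f"{throughput:.2f} GFLOP/sec")
```

Under these assumptions the product is 1.5 × 10 × 8 × 0.80 × 0.70 × 0.85, or about 57.12 GFLOP/sec, against a peak of 120 GFLOP/sec when every factor is 1.0.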

