Uploaded by zjzqdpvfxyplxytbdo

Electrical and electronics homework 1 Gujarat Technological University

advertisement
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/344311102
Computer Architecture A Quantitative Approach by John L. Hennessy
Experiment Findings · September 2020
CITATIONS
READS
0
7,714
1 author:
Mohsan Ali
University of the Aegean
12 PUBLICATIONS 24 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Location Based Rumor stance classification in Social Media View project
ODECO- Towards a sustainable Open Data ECOsystem View project
All content following this page was uploaded by Mohsan Ali on 19 September 2020.
The user has requested enhancement of the downloaded file.
Assignment_01
Date: OCT 18, 2019
Course: ACA
Submitted by:
Student Name
Students Reg. ID
Email
Mohsan Ali
181484
181484@students.au.edu.pk
Ayesha Ejaz
181482
181482@students.au,edu.pk
Submitted To:
Dr. Shaharyar Kamal (Assistant Professor Au.edu.pk)
Advanced computer architecture-Assignment_01
2|Page
Contents
Case study 1:
1.
Chip Fabrication Cost ......................................................................................................................3
Question 1.1 .....................................................................................................................................................3
Statement: ...................................................................................................................................................3
Solution: ......................................................................................................................................................3
2.
Question 1.2 .....................................................................................................................................................4
Statement: ....................................................................................................................................................4
Solution: .......................................................................................................................................................4
Case Study 2: Power Consumption in Computer Systems .............................................................................................7
3.
Question 1.4 .....................................................................................................................................................7
Statement: .....................................................................................................................................................7
Solution:........................................................................................................................................................7
4.
Question 1.6 .....................................................................................................................................................8
Statement: ....................................................................................................................................................8
Solution: ........................................................................................................................................................9
Exercise ..........................................................................................................................................................................9
5.
Question 1.9 .....................................................................................................................................................9
Statement: .....................................................................................................................................................9
Solution:........................................................................................................................................................9
6.
Question 1.10 .................................................................................................................................................10
Statement: ..................................................................................................................................................10
Solution: ......................................................................................................................................................10
7.
Question 1.11 .................................................................................................................................................12
Statement: ...................................................................................................................................................12
Solution:......................................................................................................................................................12
8.
Question 1.13 .................................................................................................................................................14
Statement: ..................................................................................................................................................14
Solution: .....................................................................................................................................................14
9.
Question 1.15 .................................................................................................................................................15
Statement: ...................................................................................................................................................15
Solution:......................................................................................................................................................15
10.
Question 1.16 .............................................................................................................................................16
Statement: ....................................................................................................................................................16
Solution:.......................................................................................................................................................16
Advanced computer architecture-Assignment_01
3|Page
Case study 1:
Chip Fabrication Cost
1. Question 1.1
Statement:
<Table 1> gives the relevant chip statistics that influence the cost of several
current chips. In the next few exercises, you will be exploring the effect of different
possible design decisions for the IBM Power5.
Table 1: Manufacturing cost factors for several modern processors
Chips
Die
size(mm2)
IBM Power5
Sun Niagara
AMD Opteron
389
380
199
Estimated
defect rate
(per cm2)
.30
.75
.75
Manufacturing
size (nm)
Transistors
(million)
130
90
90
276
279
233
Solution:
A. What is the yield for the IBM Power5?
Answer:
Given data:
Defects per area= 0.30cm2
Die size or die area= 389 mm2
N=4
Yield =0.36approximately
B. Why does the IBM Power5 have a lower defect rate than the Niagara and Opteron?
Answer:
There are many reasons can be considered to do this question but the company
itself matter a lot in production of IBM Power5. It is possible that the age of IBM
Power5 firm more than the age of Niagara and Opteron. The die age also matter a lot. As
the die age increase the chance of defects in the die decrease.
Advanced computer architecture-Assignment_01
4|Page
2. Question 1.2
Statement:
It costs $1 billion to build a new fabrication facility. You will be selling a range of
chips from that factory, and you need to decide how much capacity to dedicate to each chip. Your
Woods chip will be 150 mm2 and will make a profit of $20 per defect-free chip. Your Markon chip will be
250 mm2 and will make a profit of $25 per defect-free chip. Your fabrication facility will be identical to
that for the Power5. Each wafer has a 300 mm diameter.
Solution:
A. How much profit do you make on each wafer of Woods chip?
Answer:
Given information:
Markon-profit per defect-free chip= $25
Woods-Profit per defect-free chip=$20
Each Wafer diameter=300mm
Woods Die Area= 1.5cm2
Markon Die Area=2.5cm2
The given below formula can be used to get required answer
Advanced computer architecture-Assignment_01
5|Page
BY computing LCM
By re-arranging the equation
Profit of Phonix chip= Yield * dies per wafer * price of each wafer
Profit of each wafer= 0.65 * 416* 20$
Profit of each Woods Wafer= 5408$
B. How much profit do you make on each wafer of Markon chip?
Answer:
perform same task as above for the Markon chip
Advanced computer architecture-Assignment_01
6|Page
BY computing LCM
By re-arranging the equation
Profit of Markon chip= Yield * dies per wafer * price of each wafer
Profit of each wafer of Markon= 0.50 * 240* 25$
Profit of each wafer of Markon = $3000
C. Which chip should you produce in this facility?
Answer: there are different parameters to produce the best chips. According to the defect
rate, yield and profit Woods chip is most suitable for more benefits.
D. What is the profit on each new Power5 chip? If your demand is 50,000 Woods chips
per month and 25,000 Markon chips per month, and your facility can fabricate 150
wafers a month, how many wafers should you make of each chip?
Answer:
We have two kind of chips one is Markon and one is Woods chips.
As earlier we know that the number of dies per wafer in Markon and Woods
Number of dies per wafer in Woods= 416
Number of dies per wafer in Markon= 240
Wafers needed for Woods:
= total chips per month/chips per wafer
= 50000/416 =120 wafers
Wafers needed for Markon:
= total chips per month/ dies per wafer
=25000/ 240= 104 wafers
Advanced computer architecture-Assignment_01
7|Page
The above statistics shows that if Woods Company wants to make 50000
chips/dies a month then 416 dies/chips needed to make one wafer so total number
of wafers that can be made with 50k dies are 120. The same manner we can
explain the Markon Company.
Case Study 2: Power Consumption in Computer Systems
3. Question 1.4
Statement:
Figure 1.23 presents the power consumption of several computer system
components. In this exercise, we will explore how the hard drive affects power
consumption for the system.
Solution:
A. Assuming the maximum load for each component, and a power supply efficiency of
80%, what wattage must the server’s power supply deliver to a system with an Intel
Pentium 4 chip, 2 GB 240-pin Kingston DRAM, and one 7200 rpm hard drive?
Answer:
Given information:
Peak power of intel P4=66
Power Supply efficiency=80%
1gb Kingston D2N3 requires 240-pin and 2.3W
For 2GB Kingston D2N3 (2.3*2)
7200RPM hard drive needs 7.9W to run smoothly
Power efficiency of load= sum of all the loads
0.8x=(2.3*2)+66+7.9
x=4.6+66+7.9
x=98.125
This is the required power supply for all the three devices.
B. How much power will the 7200 rpm disk drive consume if it is idle roughly 60% of the
time?
Answer:
Advanced computer architecture-Assignment_01
8|Page
The given information first:
7200rpm hard drive consumes 4.0W in Idle situation
Now find the power consumed by 7200rpm in idle and running condition:
60% when idle+ 40 % when busy= (0.6* 4W+ 0.4*7.9W)
=3.16+2.4
Total power Consumed=5.56
C. Given that the time to read data off a 7200 rpm disk drive will be roughly 75% of a
5400 rpm disk, at what idle time of the 7200 rpm disk will the power consumption be
equal, on average, for the two disks?
Answer:
Given Information:
The time to read data off a 7200 rpm disk drive will be roughly 75% of a 5400 rpm disk
seek7200 = .75 × seek5400
The total seek and idle 7200rpm
seek7200 + idle7200 = 100
The total seeks and idle 5400rpm
seek5400 + idle5400 = 100
At what idle time of the 7200rpm
seek7200 × 7.9W + idle7200 × 4W = seek5400 × 7W + idle5400 × 2.9W
idle7200 = 29.8%
4. Question 1.6
Statement:
Figure 1.24 gives a comparison of power and performance for several
benchmarks comparing two servers: Sun Fire T2000 (which uses Niagara) and IBM x346
(using Intel Xeon processors). This information was reported on a Sun Web site. There
are two pieces of information reported: power and speed on two benchmarks. For the
results shown, the Sun Fire T2000 is clearly superior. What other factors might be
Advanced computer architecture-Assignment_01
9|Page
important and thus cause someone to choose the IBM x346 if it were superior in those
areas?
Solution:
There are number of options available to evaluate the servers. Few of them is listed below





Reliability of server
Benchmark for comparison
Space for deployment
Physical structure
Price of server
If we were running applications that didn’t have any benchmark for characteristics then
IBMx346 might be faster. At the same time we don’t know those characteristics which
are unknown to us to decide about the IBMx346 is either good or bad.
Exercise
5. Question 1.9
Statement:
You are designing a system for a real-time application in which specific deadlines
must be met. Finishing the computation faster gains nothing. You find that your system can
execute the necessary code, in the worst case, twice as fast as necessary.
Solution:
A. How much energy do you save if you execute at the current speed and turn off
the system when the computation is complete?
Answer:
This is the natural phenomena that if after completing the task system
should be turned off to save the resources provide the benefits directly or indirectly.
According the statement in question:
Let’s suppose our system complete the whole task in half time because it is
twice faster. So you can save 50% of the energy in worst time. So, in best and
average time you also can save the 50% energy.
Energy Saving=50%
B. How much energy do you save if you set the voltage and frequency to be half as
much?
Answer: The original energy equation is given below
Advanced computer architecture-Assignment_01
10 | P a g e
We didn’t use the frequency term because changing the frequency does not
affect energy but only power.
So the new energy is
Now we evaluate the effect of power term
(
)
According to above equation Energy saved is ¼ of the old energy.
6. Question 1.10
Statement:
Server farms such as Google and Yahoo! provide enough compute capacity for the
highest request rate of the day. Imagine that most of the time these servers operate at only 60%
capacity. Assume further that the power does not scale linearly with the load; that is, when the
servers are operating at 60% capacity, they consume 90% of maximum power. The servers could
be turned off, but they would take too long to restart in response to more load. A new system has
been proposed that allows for a quick restart but requires 20% of the maximum power while in
this “barely alive” state.
Solution:
A. How much power savings would be achieved by turning off 60%
of the servers?
Answer:
The old power consumption is:
To find the old power let’s suppose power is represented by P and number
of available servers are N.
Then 90% of the servers were using the maximum power is
Old power=0.9NP
If we reduce the power consumption from 0.9NP to 0.4NP because 60% of servers
are at rest.
Advanced computer architecture-Assignment_01
11 | P a g e
Then 55% power approximately can be saved.
B. How much power savings would be achieved by placing 60% of the servers in
the “barely alive” state?
Answer:
The power consumption is given below:
The description of the variable used in this part is explained in a part of this
question.
C. How much power savings would be achieved by reducing the voltage by 20%
and frequency by 40%?
Answer:
The voltage is going to be decrease by 20% and remaining voltage is 80%
and the frequency is going to be decrease by 40% and remaining frequency will be
60% or 0.6.
Advanced computer architecture-Assignment_01
12 | P a g e
D. How much power savings would be achieved by placing 30% of the servers in
the “barely alive” state and 30% off?
Answer:
Let’s assume that other 40% the system have to operate with full capacity
and consume the 90% of maximum power
The power saving in new system can be calculated that 30%(0.3N) of servers are off
and 30% are in barely state. And second term shows that 40% of the systems are running with 90% of
maximum power.
0.3N*0.3P + 0.4N*0.9P= 0.45NP
Let’s power consumption suppose is equal to 0.9NP.
Power saving = (0.9NP – 0.45NP) / 0.9NP  0.309= 30.9%
7. Question 1.11
Statement:
Availability is the most important consideration for designing servers,
followed closely by scalability and throughput.
Solution:
A. We have a single processor with a failure in time (FIT) of 100. What is the
mean time to failure (MTTF) for this system?
Answer:
Given information:
Failure time= 100
MTTF=?
As a metric, MTTF represents how long a product can reasonably be expected
to perform in the field based on specific testing.
109 represent billion. Generally reported as failures per billion hours of
operation or Failure in Time (FIT)
MTTF=107
B. If it takes one day to get the system running again, what is the availability of the
system?
Advanced computer architecture-Assignment_01
13 | P a g e
Answer: the calculated mean time to failure is 107 then we can calculate the
availability of the system.
107/107 + 24 = 1 failure
Mean Time to Failure is a reliability measure; the reciprocal of MTTF is a rate of
failures. Generally reported as failures per billion hours of operation or Failure In
Time (FIT).
C. Imagine that the government, to cut costs, is going to build a supercomputer out
of inexpensive computers rather than expensive, reliable computers. What is the
MTTF for a system with 1000 processors? Assume that if one fails, they all fail.
Answer: MTTF have a complete focus on the hardware side rather than software and
applications of the system. It is compulsory that if you are going to develop the large
system on which a city or country is relying. You must focus on the mean time
failure. The availabity is the main thing in the development of any system.if a system
contains 1k processors then the system failure time will be less than the system
which has more nodes installed.the below graph shows the relationship between
them.
Advanced computer architecture-Assignment_01
14 | P a g e
8. Question 1.13
Statement:
Your company is trying to choose between purchasing the Opteron or Itanium 2. You
have analyzed your company’s applications, and 60% of the time it will be running
applications similar to wupwise, 20% of the time applications similar to ammp, and 20%
of the time applications similar to apsi. Fig. 1.17
Solution:
A. If you were choosing just based on overall SPEC performance, which would you
choose and why?
Answer: the wupwise application consumes less energy than the apsi and ammp.
Similarly there are other attributes on the basis which we can decide about Opteron
or Itanium 2. The SPEC table shown in this question tells us that the execution time
of Itanium2 is less than the Opteron. One more point is that the SpecRatio of
Opteron is (20.87) is less than the Itanium 2(27.8). So, Itanium 2 is the best option.
B. What is the weighted average of execution time ratios for this mix of
applications for the Opteron and Itanium 2?
Answer: keeping in view the main statement of question which is telling about 60%,
10% and 30% of different applications.
The formula to compute the weighted average of execution of time ratios for
this mix of applications is:
0.6*(
) + 0.1*(
0.6*(
)+0.1(
) + 0.3*
)+0.3(
)
The weighted average execution time ratio is = 0.649
Weighted time=64.9%
C. What is the speedup of the Opteron over the Itanium 2?
Answer: to calculate the speedup of Opteron over the Itanium we need the
weighted execution time of both applications.
The weighted execution time for the Opteron is:
Keep in view the main statement of question which is telling
about 60%, 10% and 30% of different applications.
The formula to compute the weighted average of execution of time ratios for
this mix of applications is:
Advanced computer architecture-Assignment_01
15 | P a g e
0.6*(
) + 0.1*(
) + 0.3*
0.6*(51.5)+0.1(136)+0.3(150)
The weighted average execution time for Opteron = 89.5 seconds
Similarly calculate weighted average time for the Itanium 2
0.6*(
) + 0.1*(
) + 0.3*
0.6*(56.1)+0.1(132)+0.3(230)
The weighted average execution time for Itanium = 116.16 seconds
Now,
Speed Up of Opteron Over the Itanium 2=
=
Speed Up of Opteron Over the Itanium 2= 1.30 Approxi.
9. Question 1.15
Statement:
Assume that we make an enhancement to a computer that improves some mode of
execution by a factor of 10. Enhanced mode is used 50% of the time, measured as a percentage of
the execution time when the enhanced mode is in use. Recall that Amdahl’s law depends on the
fraction of the original, unenhanced execution time that could make use of enhanced mode. Thus,
we cannot directly use this 50% measurement to compute speedup with Amdahl’s law.
Solution:
A. What is the speedup we have obtained from fast mode?
Answer: if we enhance the execution by 10 factors. This means
50% is enhanced mode will be 500%
Without enhancement, the accelerated phase would have taken just as long(50%) ,
but the accelerated phase would take 10 times as long i.e 500%. So the relative
execution time without the enhancement would be
50%+500%=550%
Therefore,
Overall speed up is:
=
Advanced computer architecture-Assignment_01
16 | P a g e
Overall speed up=5.5
B. What percentage of the original execution time has been converted to fast
mode?
Answer: the fast mode is also called accelerated.
Amdahl’s law can be used to construct equation for this problem
The 90.9% speedup is achieved by turning the accelerated on.
10.Question 1.16
Statement:
When making changes to optimize part of a processor, it is often the case that
speeding up one type of instruction comes at the cost of slowing down something else. For
example, if we put in a complicated fast floating point unit, that takes space, and something might
have to be moved farther away from the middle to accommodate it, adding an extra cycle in delay
to reach that unit. The basic Amdahl’s law equation does not take into account this trade-off.
Solution:
A. What is the speedup If the new fast floating-point unit speeds up floating-point
operations by, on average, 2×, and floating-point operations take 20% of the
original program’s execution time, what is the overall speedup (ignoring the
penalty to any other instructions)?
Answer:
20% is original program’s execution time
Then remaining program execution is 80%
2 x speeds up for 20% is divide up
The overall speedup = 1 / (0.8 + 0.2 / 2) = 1/0.9
= 1.1 will be the speed up
B. What is the speedup Now assume that speeding up the floating-point unit slowed
down data cache accesses, resulting in a 1.5× slowdown (or 2/3 speedup). Data
Cache accesses consume 10% of the execution time. What is the overall speedup
now?
Answer:
Advanced computer architecture-Assignment_01
17 | P a g e
20% is original program’s execution time
Data cache access consumes 10% of the execution time
Then remaining program execution is 60%
2 x speeds up for 20% is divide up
The overall speedup = 1 / (0.6 + 0.2 / 2 + 0.1 × 1.5) = 1/0.9
= 1/(0.6+0.1+0.15)
=1/0.85
=1.17
C. After implementing the new floating-point operations, what percentage of
execution time is spent on floating-point operations? What percentage is spent
on data cache accesses?
Answer:
There are two questions in the above statement one is for floating-point operations
and one is for cache access operation.
First we compute:
The percentage of execution time spent on floating-point operations is
(0.3 / 2) / (0.6 + 0.3 / 2 + 0.1 × 1.5) = 0.167 = 16.7%
The percentage spent on data cache accesses is
(0.1 × 1.5) / (0.6 + 0.3 / 2 + 0.1 × 1.5) = 0.167 = 16.7%
Advanced computer architecture-Assignment_01
View publication stats
Download