HWSW codesign > 3 Once upon a time(r) > 3.5 - Exercises

3.5 - Exercises

Practice makes perfect

Below are a number of programming exercises.

Exercise 301

Repeat the example of the 2x2 matrix multiplication on you own implementation.

Exercise 302

ASSIGNMENT

Extend the implementation to (exclusively) support 3x3 matrices: 1) verify correct working, 2) find the latency, and 3) determine the cost (#LUT, #FF, imem-size)

Example

	3x3 matrix multiplication SW-only	3x3 matrix multiplication with M extension	Relative
# CC¹	1887	285	15%
T (ns)	25	36	144%
f_clk (MHz)	40	27.78	70%
Latency (µs)	47.175	10.260	22%
imem size (bytes)	2441	1908	78%
# LUT	1553	3501	225%
# FF	1156	1156	100%

Througput (matrix mult/s)	1/47175e-9 = 21k mm/s	1/10260e-9 = 97k mm /s	462%

¹ This counts real clock cycles. The 33% duty cycle induced by the chip enable is taken into account. The non-optimal usage of the processor is hence reflected in the outcome.