Efika 5200B Project
Review and Support, Application optimization

in category Other
proposed by Matthias Muench on 4th June 2006 (accepted on 7th October 2006)

Blog Entry

  Stream Benchmarks
posted by Matthias Muench on 10th April 2007

My next step was to run some benchmarks under Linux to check RAM performance. I ran the STREAM benchmark on the EFIKA and compared the results against Pegasos and Sam440ep boards.

EFIKA Linux 2.6.19

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 2 microseconds.
Each test below will take on the order of 183141 microseconds.
(= 91570 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function     Rate (MB/s)    Avg time    Min time    Max time
Copy:           192.4569      0.1978      0.1663      0.3896
Scale:          187.5988      0.1855      0.1706      0.2254
Add:            195.1688      0.2569      0.2459      0.2920
Triad:          188.3092      0.3100      0.2549      0.4959
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
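
The Rate column can be reproduced from the Min time column. A minimal sketch, assuming the usual STREAM accounting (Copy and Scale touch two arrays per element, Add and Triad touch three, and the rate is taken from the best run); the function and dictionary names are illustrative and not part of STREAM itself:

```python
# Arrays touched per element, as STREAM counts them (assumption stated above):
# Copy and Scale read one array and write one; Add and Triad read two and write one.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}
BYTES_PER_WORD = 8        # "8 bytes per DOUBLE PRECISION word"
ARRAY_SIZE = 2_000_000    # "Array size = 2000000"


def stream_rate_mb_s(kernel, min_time_s, n=ARRAY_SIZE):
    """Return the bandwidth in MB/s (10^6 bytes/s) for one kernel's best run."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * n
    return 1.0e-6 * bytes_moved / min_time_s


# Min times copied from the EFIKA run above.
efika_min_times = {"Copy": 0.1663, "Scale": 0.1706, "Add": 0.2459, "Triad": 0.2549}

for kernel, t in efika_min_times.items():
    print(f"{kernel:6s} {stream_rate_mb_s(kernel, t):8.1f} MB/s")
```

The recomputed values differ slightly from the printed rates because the Min time column is rounded to four decimal places.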

Pegasos II G4/1GHz Linux 2.6.17

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 90864 microseconds.
(= 90864 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function     Rate (MB/s)    Avg time    Min time    Max time
Copy:           376.0548      0.0869      0.0851      0.0907
Scale:          361.1246      0.0907      0.0886      0.0929
Add:            359.3918      0.1359      0.1336      0.1376
Triad:          361.1680      0.1357      0.1329      0.1394
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
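
The header figures in the runs above follow from simple arithmetic. A rough sketch, assuming that STREAM allocates three arrays, that "MB" in the memory line means 2^20 bytes, and that the tick count is the estimated test time divided by the timer granularity and truncated; the helper names are made up for illustration:

```python
BYTES_PER_WORD = 8       # "8 bytes per DOUBLE PRECISION word"
ARRAY_SIZE = 2_000_000   # "Array size = 2000000"
NUM_ARRAYS = 3           # assumption: STREAM works on three arrays


def total_memory_mib(n=ARRAY_SIZE):
    """Memory needed for all arrays, in units of 2^20 bytes."""
    return NUM_ARRAYS * BYTES_PER_WORD * n / 2**20


def clock_ticks(test_time_us, granularity_us):
    """Timer ticks covered by one test, truncated to an integer."""
    return int(test_time_us / granularity_us)


print(f"Total memory required = {total_memory_mib():.1f} MB")
print("EFIKA ticks:", clock_ticks(183141, 2))
print("Pegasos II G4 ticks:", clock_ticks(90864, 1))
```

All runs here are far above the 20-tick minimum that the STREAM output asks for, so the array size is large enough for the timers involved.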

Pegasos II G3/600MHz

-------------------------------------------------------------
STREAM version $Revision: 5.6 $
-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 2 microseconds.
Each test below will take on the order of 251755 microseconds.
(= 125877 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function     Rate (MB/s)    Avg time    Min time    Max time
Copy:           132.6981      0.2416      0.2411      0.2421
Scale:          112.7054      0.2849      0.2839      0.2859
Add:            118.6606      0.4054      0.4045      0.4063
Triad:          118.7202      0.4056      0.4043      0.4065
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------


AMCC 440EP 533 MHz Evaluation Board Linux 2.6.18

-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 2000000, Offset = 0
Total memory required = 45.8 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Printing one line per active thread....
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 112887 microseconds.
(= 112887 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function     Rate (MB/s)    Avg time    Min time    Max time
Copy:           262.6503      0.1221      0.1218      0.1223
Scale:          245.8928      0.1302      0.1301      0.1303
Add:            253.5979      0.1893      0.1893      0.1893
Triad:          251.2734      0.1911      0.1910      0.1912
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
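
To put the four runs side by side, the Triad rates can be expressed relative to the EFIKA. A small sketch using only the figures printed above; the choice of Triad as the comparison kernel and the names `triad_mb_s` and `relative_to` are illustrative assumptions:

```python
# Triad rates in MB/s, copied from the four STREAM runs above.
triad_mb_s = {
    "EFIKA": 188.3092,
    "Pegasos II G4/1GHz": 361.1680,
    "Pegasos II G3/600MHz": 118.7202,
    "AMCC 440EP 533 MHz": 251.2734,
}


def relative_to(baseline, rates):
    """Return each board's rate as a multiple of the baseline board's rate."""
    base = rates[baseline]
    return {board: rate / base for board, rate in rates.items()}


# Print the boards from fastest to slowest, with the EFIKA as 1.00x.
ratios = relative_to("EFIKA", triad_mb_s)
for board, ratio in sorted(ratios.items(), key=lambda item: item[1], reverse=True):
    print(f"{board:22s} {triad_mb_s[board]:9.4f} MB/s  {ratio:4.2f}x EFIKA")
```

The kernel versions differ between the boards (2.6.17 to 2.6.19, and none is given for the G3 run), so these ratios are only a rough guide.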

For further tests, application optimizations, and guides, I am now waiting for a MorphOS release.

PowerDeveloper.org: Copyright © 2004-2010, Genesi USA, Inc.