LINPACK benchmark code

The LINPACK floating-point benchmark in Forth

Benchmark results

The page at http://hpwww.epfl.ch/bench/bench.FAQ.html lists pointers to interesting benchmark source code. I ported the "C" LINPACK benchmark by Al Aburto (aburto@nosc.mil) to Forth. Some of the other benchmarks mentioned there are covered on related pages of this site.
| The program (clinpack.c) and latest Double Precision (DP) table of
| results (clinpack.dpr for 'ROLL' option and clinpack.dpu for 'UNROLL'
| option) are available via anonymous ftp from 'ftp.nosc.mil' in
| directory 'pub/aburto'.  The ftp.nosc.mil IP address is: 128.49.192.51
| CLINPACK tests a few BLAS and LAPACK routines in the context of solving
| large sets of linear equations.
The Forth LINPACK benchmark shows how the FPU code generated by your Forth compiler compares with what an optimizing "C" compiler would produce. Results for a large number of C compilers on a variety of systems are available in the .dpu file.
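
To make concrete what the benchmark spends its time on, here is a minimal Python sketch of the two phases that the result listings call dgefa (factor the matrix) and dgesl (solve with the factors). It is not the Forth or C source: the names lu_factor, lu_solve and ones_problem are hypothetical, the matrix is stored as a list of rows, and whole rows are swapped during pivoting, so the storage layout is an assumption of this sketch. The right-hand side is set to the row sums so that the exact solution is all ones, which is what the x[0]-1 and x[n-1]-1 columns in the listings appear to check.

```python
import random

def lu_factor(a):
    """Factor the square matrix a (list of row lists) in place with partial pivoting.

    Returns the pivot row chosen at each elimination step."""
    n = len(a)
    pivots = []
    for k in range(n - 1):
        # Choose the row with the largest magnitude in column k.
        p = max(range(k, n), key=lambda i: abs(a[i][k]))
        if a[p][k] == 0.0:
            raise ZeroDivisionError("matrix is singular")
        pivots.append(p)
        a[k], a[p] = a[p], a[k]          # swap whole rows, multipliers included
        for i in range(k + 1, n):
            m = a[i][k] / a[k][k]
            a[i][k] = m                  # keep the multiplier below the diagonal
            for j in range(k + 1, n):
                a[i][j] -= m * a[k][j]   # axpy-style update of row i
    if a[n - 1][n - 1] == 0.0:
        raise ZeroDivisionError("matrix is singular")
    return pivots

def lu_solve(a, pivots, b):
    """Solve A x = b in place, given the factored matrix and pivots from lu_factor."""
    n = len(a)
    for k, p in enumerate(pivots):       # apply the recorded row swaps to b
        b[k], b[p] = b[p], b[k]
    for k in range(n - 1):               # forward elimination with the multipliers
        for i in range(k + 1, n):
            b[i] -= a[i][k] * b[k]
    for k in range(n - 1, -1, -1):       # back substitution with the upper triangle
        s = sum(a[k][j] * b[j] for j in range(k + 1, n))
        b[k] = (b[k] - s) / a[k][k]
    return b

def ones_problem(n, seed=1):
    """Random matrix whose right-hand side is the row sums, so x should be all ones."""
    rng = random.Random(seed)
    a = [[rng.uniform(-0.5, 0.5) for _ in range(n)] for _ in range(n)]
    b = [sum(row) for row in a]
    return a, b

a, b = ones_problem(50)
pivots = lu_factor(a)
x = lu_solve(a, pivots, b)
print("x[0]-1 = %.3e   x[n-1]-1 = %.3e" % (x[0] - 1.0, x[-1] - 1.0))
```

The triple loop in lu_factor does on the order of n^3 floating-point operations while lu_solve does on the order of n^2, which is consistent with dgefa dominating the timings below.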

Intel Pentium P54C 166 MHz, 48 MB, iForth 1.11, NT 4.0

	  norm. resid        resid          machep          x[0]-1        x[n-1]-1
	       1        8.39914076E-14  2.22044605E-16 -6.22835117E-14 -4.16333634E-14
	times are reported for matrices of order 100
	      dgefa      dgesl      total       kflops      unit      ratio
	times for array with leading dimension of 201
	      0.031      0.001      0.032       21458      0.093      0.571
	      0.031      0.001      0.032       21458      0.093      0.571
	      0.031      0.001      0.032       21458      0.093      0.571
	      0.030   8.000E-4      0.031       22008      0.091      0.557
	times for array with leading dimension of 200
	      0.031      0.001      0.032       21458      0.093      0.571
	      0.031      0.001      0.032       21458      0.093      0.571
	      0.030      0.001      0.031       22150      0.090      0.554
	      0.030   9.000E-4      0.031       21938      0.091      0.559
	Unrolled, DP Precision, 21938 Kflops; 100 Reps
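
The kflops, unit and ratio columns can be recomputed from the matrix order and the total time. The sketch below assumes the usual LINPACK operation count of 2n^3/3 + 2n^2 and a reference time of 0.056 seconds for the ratio column; both are assumptions on my part, chosen because they reproduce the printed rows, and the names derived_columns and CRAY_SECONDS are hypothetical.

```python
CRAY_SECONDS = 0.056  # assumed reference time; it reproduces the printed ratio column

def derived_columns(n, total_seconds):
    """Return (kflops, unit, ratio) for a matrix of order n solved in total_seconds."""
    ops = 2.0 * n**3 / 3.0 + 2.0 * n**2       # assumed flop count for factor + solve
    kflops = ops / (1.0e3 * total_seconds)    # thousands of flops per second
    unit = 2.0e3 / kflops                     # microseconds per two flops
    ratio = total_seconds / CRAY_SECONDS      # time relative to the reference time
    return kflops, unit, ratio

# Order 100 is the Pentium listing above; order 180 is the Athlon listing further down.
for n, total in [(100, 0.032), (180, 0.013)]:
    kflops, unit, ratio = derived_columns(n, total)
    print("n=%d total=%.3f kflops=%d unit=%.3f ratio=%.3f"
          % (n, total, kflops, unit, ratio))
```

With a total of 0.032 at order 100 this gives 21458, 0.093 and 0.571, and with 0.013 at order 180 it gives 304061, 0.007 and 0.232, matching the corresponding rows. Rows whose printed times are rounded (for example 8.000E-4 for dgesl) will only match approximately, because the program works with the unrounded total.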

AMD Athlon 900 MHz, 128 MB, iForth 1.11, Windows 2000

	  norm. resid        resid          machep          x[0]-1        x[n-1]-1
	       3        2.58238634E-13  2.22044605E-16  7.77156117E-15 -2.17603713E-14
	times are reported for matrices of order 180
	      dgefa      dgesl      total       kflops      unit      ratio
	times for array with leading dimension of 301
	      0.013      0.001      0.014      282342      0.007      0.250
	      0.013      0.000      0.013      304061      0.007      0.232
	      0.013      0.000      0.013      304061      0.007      0.232
	      0.013   2.200E-4      0.013      300821      0.007      0.235
	times for array with leading dimension of 300
	      0.013      0.000      0.013      304061      0.007      0.232
	      0.013      0.000      0.013      304061      0.007      0.232
	      0.012      0.001      0.013      304061      0.007      0.232
	      0.013   2.200E-4      0.013      300821      0.007      0.230
	Unrolled, DP Precision, 300822 Kflops; 100 Reps
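
The machep column in both listings reads 2.22044605E-16, which is the spacing between 1.0 and the next larger IEEE double, 2^-52. As a quick sanity check that a system really computes in double precision, the value can be reproduced with a few lines of Python. This is a generic halving loop, not necessarily the method the benchmark itself uses, and the name machine_epsilon is hypothetical.

```python
import sys

def machine_epsilon():
    """Smallest power of two eps for which 1.0 + eps is still distinguishable from 1.0."""
    eps = 1.0
    while 1.0 + eps / 2.0 > 1.0:
        eps /= 2.0
    return eps

eps = machine_epsilon()
print("%.8E" % eps)                     # same format as the machep column
print(eps == sys.float_info.epsilon)    # compare with the value Python reports
```

A noticeably smaller value here would suggest that intermediate results are kept in extended precision, which is worth knowing when comparing residuals between systems.
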
The LINPACK result puts iForth in the following company (clinpack.dpu):
Results as of 01 Apr 1997:

    System               OS                  CPU/FPU   CPU      MFLOPS REF
                                                       (MHz) DP,UNROLL
### -------------------- ------------------- --------- ----- --------- ---
--------------------------------------------------------------------------
    AMD Athlon           Windows 2000        Athlon    900.0   300.822  xx
--------------------------------------------------------------------------
001 SGI Indigo2          Irix 6.2            R10000    195.0    76.606  26
002 SGI Indigo2          Irix 6.2            R10000    195.0    72.421  26
003 SGI O2               Irix 6.3            R10000    175.0    68.729  30
004 SGI Origin 200       Irix 6.4            R10000    180.0    66.562  32
005 SGI O2               Irix 6.3            R10000    175.0    65.716  30
006 Dell XPS Pro 200n    NT 3.51             PentiumP6 200.0    47.77   28
007 Brett Station ATX    Linux 2.0.0         PentiumP6 180.0    46.396  34
008 Enterprise 4000/10   Solaris 2.5.1       UltrSPARC 250.0    44.880  38
009 HP 9000/J210XC       HP-UX 10.20         7200_2CPU 120.0    34.680  36
010 Enterprise 4000/10   Solaris 2.5.1       UltrSPARC 250.0    34.666  38
011 Sun Ultra 4000       Solaris 2.5.1       UltrSPARC 167.0    33.911  27
012 HP 9000/J210         HP-UX 10.01         PA-RISC   120.0    32.855  29
013 Aspen Alpine 275XS   OpenVMS 6.2         DEC21064A 275.0    29.598  24
014 HP 9000/712          HP-UX 9.03          PA-7100LC 100.0    26.309  25
015 SPARCstation 20/HS21 Solaris 2.4         HyprSPARC 125.0    24.524  20
--------------------------------------------------------------------------
    Pentium P54C         NT 4.0              Pentium   166.0    21.619
--------------------------------------------------------------------------
016 SGI Onyx             Irix 6.2            R8000      75.0    20.549  33
017 HP 9000/712          HP-UX 10.20         PA7100LC  100.0    20.078  36
018 SGI Indy             Irix 6.2            R5000     150.0    18.761  32
019 Mac PowerPC 604      MacOS 7.5.2         PwrPC 604 120.0    18.559  19
020 SGI Challenge S      Irix 6.2            R4400     200.0    18.294  31
021 Mac PowerPC 604      MacOS 7.5.2         PwrPC 604 120.0    18.150  19
022 DEC 3000/500 AXP     OSF/1 T1.3-3        DEC 21064 150.0    17.994   5
023 Mac PowerPC 604      MacOS 7.5.2         PwrPC 604 120.0    17.095  21
024 Dell XPS Pro 200n    NT 3.51   No opt    PentiumP6 200.0    16.91   28