SimpleScalar Tool Set

Command lines for SPEC CPU2000

Integer benchmarks

Benchmark: 164.gzip
Command Line: gzip00.peak.ev6 input.source 60

Benchmark: 175.vpr
Command Line: vpr00.peak.ev6 net.in arch.in place.in route.out -nodisp -route_only -route_chan_width 15 -pres_fac_mult 2 -acc_fac 1 -first_iter_pres_fac 4 -initial_pres_fac 8

Benchmark: 176.gcc
Command Line: gcc00.peak.ev6 166.i -o 166_2.s

Benchmark: 181.mcf
Command Line: mcf00.peak.ev6 inp.in

Benchmark: 186.crafty
Command Line: crafty00.peak.ev6

Benchmark: 197.parser
Command Line: parser00.peak.ev6 2.1.dict -batch

Benchmark: 252.eon
Command Line: eon00.peak.ev6 chair.control.cook chair.camera chair.surfaces chair.cook.ppm ppm pixels_out.cook

Benchmark: 253.perlbmk
Command Line: perlbmk00.peak.ev6 diffmail.pl 2 550 15 24 23 100

Benchmark: 254.gap
Command Line: gap00.peak.ev6 -l . -q -m 64M

Benchmark: 255.vortex
Command Line: vortex00.peak.ev6 lendian1.raw

Benchmark: 256.bzip2
Command Line: bzip200.peak.ev6 input.source 58

Benchmark: 300.twolf
Command Line: twolf00.peak.ev6 ref

Floating point benchmarks

Benchmark: 168.wupwise
Command Line: wupwise00.peak.ev6

Benchmark: 171.swim
Command Line: swim00.peak.ev6

Benchmark: 172.mgrid
Command Line: mgrid00.peak.ev6

Benchmark: 173.applu
Command Line: applu00.peak.ev6

Benchmark: 177.mesa
Command Line: mesa00.peak.ev6 -frames 1000 -meshfile mesa.in -ppmfile mesa.ppm

Benchmark: 178.galgel
Command Line: galgel00.peak.ev6

Benchmark: 179.art
Command Line: art00.peak.ev6 -scanfile c756hel.in -trainfile1 a10.img -trainfile2 hc.img -stride 2 -startx 110 -starty 200 -endx 160 -endy 240 -objects 10

Benchmark: 183.equake
Command Line: equake00.peak.ev6

Benchmark: 187.facerec
Command Line: facerec00.peak.ev6

Benchmark: 188.ammp
Command Line: ammp00.peak.ev6

Benchmark: 189.lucas
Command Line: lucas00.peak.ev6

Benchmark: 191.fma3d
Command Line: fma3d00.peak.ev6

Benchmark: 200.sixtrack
Command Line: sixtrack00.peak.ev6

Benchmark: 301.apsi
Command Line: apsi00.peak.ev6

Notes on using SimpleScalar

(1) Read the SimpleScalar Tutorial.

(2) Download 631ssAlpha.tgz (for Linux, 20,033,842 bytes) or 631ssAlpha-Cygwin.tgz (for Cygwin, 19,844,137 bytes) and unpack it (tar xvzf 631ssAlpha.tgz).
This archive includes all the necessary simulators from the SimpleScalar tool suite and Alpha binaries of the SPEC CPU2000 benchmarks. Some of the simulators have been modified by our research group members; for example, sim-cache now lets you skip a specified number of instructions.
If you have a PC running Linux, you may want to install the full SimpleScalar suite, which includes a program development environment for the PISA (MIPS-like) and ARM instruction set architectures. Links are on the course Web site.

(3) Be sure that you have SPEC CPU2000 (the SED contact person is Mr. David Austin). You can install it or just copy it. Let’s say that the home directory of SPEC CPU2000 is $SPEC_HOME.

(4) Steps to do:

# create a working directory

mkdir work

cd work

mkdir 172.mgrid # e.g., you want to simulate 172.mgrid application

cd 172.mgrid

# now you can copy inputs for this application
# into your working directory;

# with Cygwin you can use Explorer to move necessary input file mgrid.in

cp $SPEC_HOME/spec_cpu2000/benchspec/CFP2000/172.mgrid/data/ref/input/mgrid.in .

# let’s say $SS_HOME is where you unzipped 631ssAlpha

# to run the simulation, type the following (as a single command line)

$SS_HOME/631ssAlpha/mysimplesim_pff_log/sim-cache -fastfwd 500000000 -max:inst 500000000 -redir:sim u2_32KB.txt -cache:il1 il1:512:64:1:f -cache:dl1 dl1:512:64:1:f -cache:il2 none -cache:dl2 ul2:2048:64:4:l $SS_HOME/631ssAlpha/spec2000binaries/mgrid00.peak.ev6 < mgrid.in

# this runs the sim-cache functional cache simulator on the mgrid00 SPEC CPU application;
# the input for this application is given in the file mgrid.in

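# cache size = sets x block size x associativity, so this command line gives
# 32KB L1I, 32KB L1D (512 sets x 64 B, direct mapped), and 512KB L2U (2048 sets x 64 B x 4 ways);
# an 8KB direct-mapped L1 would need 128 sets, and a 32KB 4-way L2U would need 128 sets;
# the first 500M instructions are skipped, and then 500M are simulated.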

# you can prepare a command file, e.g., 172mgrid.sh to include command lines for
# all runs for your homework (u2 is 32KB, 64KB, ...).
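
The command file mentioned above could look like the following sketch. It assumes SS_HOME is set as in the steps above, keeps the L1 settings from the command line shown earlier, and derives the number of L2 sets from the desired size as size / (block size x associativity). The helper name l2_sets, the list of sizes, and the DRY_RUN switch are illustrative assumptions and are not part of the course archive. By default the script only prints the commands, so you can check them before running anything.

```shell
#!/bin/sh
# 172mgrid.sh: sketch of a command file for an L2 size sweep (hypothetical).
# Assumes SS_HOME is set as in the steps above.

SIM="$SS_HOME/631ssAlpha/mysimplesim_pff_log/sim-cache"
BIN="$SS_HOME/631ssAlpha/spec2000binaries/mgrid00.peak.ev6"

# number of sets = size in bytes / (block size * associativity)
l2_sets() {   # usage: l2_sets <size_KB> <block_bytes> <assoc>
    echo $(( $1 * 1024 / ($2 * $3) ))
}

for kb in 32 64 128 256; do
    # one run per L2 size; the output file name records the size
    cmd="$SIM -fastfwd 500000000 -max:inst 500000000 -redir:sim u2_${kb}KB.txt -cache:il1 il1:512:64:1:f -cache:dl1 dl1:512:64:1:f -cache:il2 none -cache:dl2 ul2:$(l2_sets "$kb" 64 4):64:4:l $BIN"
    if [ "${DRY_RUN:-1}" = 1 ]; then
        echo "$cmd < mgrid.in"      # only show what would be run
    else
        $cmd < mgrid.in             # actually run the simulation
    fi
done
```

Run it as "sh 172mgrid.sh" to list the commands, or as "DRY_RUN=0 sh 172mgrid.sh" from the 172.mgrid working directory to execute them.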

Example 1:

$SS_HOME/631ssAlpha/arAlpha/mysimplesim_pff_log/sim-cache -max:inst 2000000000 -redir:sim crafty_cache_f2b_l.txt -cache:il1 il1:256:64:1:f -cache:dl1 dl1:128:32:8:r -cache:il2 dl2 -cache:dl2 ul2:256:64:16:l $SS_HOME/631ssAlpha/spec2000binaries/crafty00.peak.ev6 < crafty.in

This command line runs the sim-cache simulator for 2 billion instructions and stores the output in the file crafty_cache_f2b_l.txt. There are two levels of cache. L1 consists of IL1 (256 sets, 64 B block size, direct mapped, FIFO replacement policy, 16 KB total) and DL1 (128 sets, 32 B block size, 8-way set associative, random replacement policy, 32 KB total). L2 is a unified cache: -cache:il2 dl2 points the instruction L2 at the data L2, which has 256 sets, 64 B block size, 16-way set associativity, and LRU replacement policy, for a total size of 256 KB.
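
The total sizes quoted for Example 1 follow from sets x block size x associativity. As a quick check, the sketch below parses a cache specification of the form name:nsets:bsize:assoc:repl and prints the size in KB. The helper name cache_kb is hypothetical and is not part of SimpleScalar.

```shell
# cache_kb: total size in KB of a cache given as
# <name>:<nsets>:<bsize>:<assoc>:<repl> (helper name is hypothetical)
cache_kb() {
    spec=$1
    # split the specification on ':' into its five fields
    IFS=: read -r name nsets bsize assoc repl <<EOF
$spec
EOF
    echo $(( nsets * bsize * assoc / 1024 ))
}

cache_kb il1:256:64:1:f     # IL1 of Example 1, prints 16
cache_kb dl1:128:32:8:r     # DL1 of Example 1, prints 32
cache_kb ul2:256:64:16:l    # L2 of Example 1, prints 256
```

The same helper can be used to check any -cache: argument before starting a long simulation.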

Example 2:

$SS_HOME/631ssAlpha/arAlpha/mysimplesim_pff_log/sim-outorder -redir:sim Current-outorder.txt -cache:il1 il1:64:8:32:l -cache:dl1 dl1:64:8:32:l -fetch:ifqsize 2 -bpred nottaken -decode:width 1 -issue:width 1 -issue:inorder true -res:ialu 1 -res:fpalu 1 -res:fpmult 1 -cache:dl2 none -cache:il2 none -mem:width 4 -mem:lat 12 1 $SS_HOME/631ssAlpha/spec2000binaries/gcc00.peak.ev6 scilab.i -o scilab.s

This command line runs the sim-outorder simulator and sends the output to the file Current-outorder.txt. IL1 has 64 sets, 8 B block size, 32-way set associativity, and least recently used replacement policy, for a total size of 16 KB. DL1 is the same as IL1. Instruction fetch queue size: 2 instructions. Branch prediction scheme: not-taken. Instruction decode bandwidth: 1 instruction per cycle. Instruction issue bandwidth: 1 instruction per cycle. Issue is in order. There is one INT ALU unit, one FP ALU unit, and one FP multiplier. There are no L2 instruction or data caches. Memory access bus width: 4 B. Memory latency is 12 cycles for the first chunk (first_chunk) and 1 cycle for each following chunk (inter_chunk).
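
To see what the -mem:width and -mem:lat settings mean for a cache miss, the sketch below estimates the memory latency of one block fill. It assumes the usual first-chunk/inter-chunk model: the block is transferred in bus-width chunks, the first chunk costs first_chunk cycles, and each remaining chunk costs inter_chunk cycles. The helper name mem_latency is hypothetical.

```shell
# mem_latency <block_bytes> <bus_bytes> <first_chunk> <inter_chunk>
# (hypothetical helper; assumes the first-chunk/inter-chunk model)
mem_latency() {
    chunks=$(( ($1 + $2 - 1) / $2 ))      # round up to whole bus transfers
    echo $(( $3 + $4 * (chunks - 1) ))
}

mem_latency 8 4 12 1    # 8 B block of Example 2 over a 4 B bus, prints 13
```

With the 8 B blocks of Example 2, a miss needs two 4 B transfers, so under this model it takes 12 + 1 = 13 cycles.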

Example 3:

$SS_HOME/631ssAlpha/arAlpha/mysimplesim_pff_log/sim-cheetah -redir:sim sim-cheetah.txt -R opt -C sa -a 5 -b 14 -l 4 -n 2 $SS_HOME/631ssAlpha/spec2000binaries/parser00.peak.ev6 ./2.1.dict -batch < ref.in

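This command line runs the sim-cheetah simulator with the optimal replacement policy and set-associative caches, and sends the output to the file sim-cheetah.txt. sim-cheetah reads these size options as base-2 logarithms: the number of sets ranges from 2^5 = 32 to 2^14 = 16384 (-a 5 -b 14), the block size is 2^4 = 16 B (-l 4), and the associativity ranges from direct mapped up to 2^2 = 4-way set associative (-n 2).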

SimpleScalar resources

Web page: http://www.simplescalar.com
Mailing list: http://ord.eecs.umich.edu/ss_archives
SimpleScalar Version 4.0 Test Releases: http://www.simplescalar.com/v4test.html
SimpleScalar Documentation: Documentation
SimpleScalar users guide: users_guide_v2.pdf

Benchmarks:

MiBench Embedded Benchmark Suite: http://www.eecs.umich.edu/mibench/
Standard Performance Evaluation Corporation (SPEC): http://www.spec.org/
Inputs for SPEC CPU applications: http://www.cag.lcs.mit.edu/~kbarr/cag/spec2000-commandlines.html
