https://github.com/spcl/riscv-fp-tracer

RISC-V FP instruction tracer + Python emulator for RISC-V FP32/FP64 traces
https://github.com/spcl/riscv-fp-tracer

Last synced: 11 months ago
JSON representation

RISC-V FP instruction tracer + Python emulator for RISC-V FP32/FP64 traces

Host: GitHub
URL: https://github.com/spcl/riscv-fp-tracer
Owner: spcl
Created: 2024-01-31T12:01:11.000Z (over 2 years ago)
Default Branch: main
Last Pushed: 2024-01-31T12:11:36.000Z (over 2 years ago)
Last Synced: 2025-04-13T14:13:44.573Z (about 1 year ago)
Language: C
Homepage:
Size: 83 KB
Stars: 1
Watchers: 6
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

## Floating Point Instruction Trace Analysis
------
This repository contains
- An QEMU TCG plugin that traces all FP instructions of a RISC-V executable
- A Python emulator that converts collected RISC-V FP instruction traces in FP32 and FP64 to FP16. At the same time, it can also produce metrics such as the number of arithmetic instructions that cause an overflow.

### Quick Start
------
#### Dependencies
- Clone QEMU from https://github.com/qemu/qemu/
- Enter the following commands to build QEMU
```console
> cd fp-trace-src
> cp api.c /plugins/
> cp qemu-plugins.symbols /plugins
> cp qemu-plugin.h /include/qemu/
> cp riscv.c /disas/
> cd
> ./configure --enable-plugins --target-list=riscv64-linux-user
> make -j8
```
- Make sure to have the [RISC-V GNU Toolchain](https://github.com/riscv-collab/riscv-gnu-toolchain) cloned and installed. The specific version of the Toolchain which we used corresponds with the commit __89268de2af0957d571cf5ad0e0b894a15b147b25__.

To collect the floating point traces of applications simply
use the `trace.sh` script followed by the binary executable of
the application that you want to trace. For instance, to trace LULESH, the following command can be entered, and the collected
trace can be found in `trace.out`.
```console
> ./trace.sh "lulesh2.0 -i 100 -s 8"
```
To perform overflow analysis on the collected floating point instruction traces, `cd` into the `trace-converter-src` directory and run
```console
> python3 main.py -i ../trace.out -s
```
to obtain the percentage of all FP instructions that trigger an overflow. This command will also produce another trace file `fp16_trace.out` that contains FP instructions that have been converted to FP16.

Add the `--ignore-ex-prop` flag to count only the FP arithmetic instructions that are the root causes of overflows. For instance, if `--ignore-ex-prop` is enabled, assuming that a FP instruction `a` leads to an overflow, and instructions `b` and `c` both depend on the data produced by `a`, then only `a` will be counted as an overflow instruction while the propagation of exception is ignored.

Note that the trace converter also incorporates other experimental functionalities, such as the analysis of catastrophic cancellation, that are currently disabled. They should be used with re-enabled in the source with caution.

### Application Analysis
------
To produce FP instruction traces for the following list of applications, make sure to compile them with the RISC-V GCC compiler, and execute the `trace.sh` script followed by their corresponding `command`.

- [LULESH2.0](https://github.com/LLNL/LULESH/tree/master)
- Command: `lulesh2.0 -i 100 -s 4`
- [OpenCV](https://github.com/opencv/opencv) (4.x)
- Application: Square detection in the `sample` directory
- Command: `square`
- Images tested:
1. `home.jpg`
2. `apple.jpg`
3. `pic1.png`
4. `pic2.png`
5. `pic3.png`
6. `pic4.png`
7. `pic5.png`
8. `pic6.png`
9. `orange.jpg`
10. `lena.jpg`
11. `mask.png`
12. `stuff.jpg`
13. `HappyFish.jpg`
14. `Blender_Suzanne1.jpg`
15. `notes.png`
- [HPCG](https://github.com/hpcg-benchmark/hpcg) (V3.1)
- Command: `xhpcg 16 16 16`
- [Kripke](https://github.com/LLNL/Kripke) (V1.2.7)
- Command: `kripke.exe --zones 8,8,8 --niter 3 --groups 2`
- [Nyx](https://github.com/AMReX-Astro/Nyx?tab=readme-ov-file) (V 21.02.1-184-gfb5216c7cb87)
- Change the following fields in `Exec/LyA/inputs.rt` in the build directory:
- `max_step = 3`
- `amr.n_cell = 8 8 8`
- `amr.max_grid_size = 8`
- Make sure to place `trace.sh` in the `Exec/LyA` repository
- Command: `nyx_LyA inputs.rt`
- [LAMMPS](https://www.lammps.org/download.html) (V 2Aug2023 Stable)
- After compilation move `lmp_serial` to the `bench` directory. Before tracing, make sure to have `in.eam` and `Cu_u3.eam` in the same directory as `trace.sh`.
- Command: `lmp_serial -in in.eam`
- The configuration file `in.eam` is as follows:
```
# bulk Cu lattice

variable x index 1
variable y index 1
variable z index 1

variable xx equal 10*$x
variable yy equal 10*$y
variable zz equal 10*$z

units metal
atom_style atomic

lattice fcc 3.615
region box block 0 ${xx} 0 ${yy} 0 ${zz}
create_box 1 box
create_atoms 1 box

pair_style eam
pair_coeff 1 1 Cu_u3.eam

velocity all create 1600.0 376847 loop geom

neighbor 1.0 bin
neigh_modify every 1 delay 5 check yes

fix 1 all nve

timestep 0.005
thermo 50

run 5
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/spcl/riscv-fp-tracer

Awesome Lists containing this project

README