mirror of https://github.com/QMCPACK/qmcpack.git
improved readme
This commit is contained in:
parent
9ae0d9222e
commit
001afba76a
|
@ -7,7 +7,7 @@ With CUDA:
|
||||||
* Running: With the performance tests set up.
|
* Running: With the performance tests set up.
|
||||||
``` shell
|
``` shell
|
||||||
[epd@oxygen ]$ export QMCPACK_APP=/path/to/your/instrumented/bin/qmcpack
|
[epd@oxygen ]$ export QMCPACK_APP=/path/to/your/instrumented/bin/qmcpack
|
||||||
[epd@oxygen ]$ cd /performace/NiO/sample/dmc-a4-e48-gpu
|
[epd@oxygen ]$ cd /your_scratch/performace/NiO/sample/dmc-a4-e48-gpu
|
||||||
[epd@oxygen ]$ OMP_NUM_THREADS=12 XRAY_OPTIONS="patch_premain=true xray_mode=xray-basic" nvprof -o simul.nvprof $QMCPACK_APP NiO-fcc-S1-dmc.xml
|
[epd@oxygen ]$ OMP_NUM_THREADS=12 XRAY_OPTIONS="patch_premain=true xray_mode=xray-basic" nvprof -o simul.nvprof $QMCPACK_APP NiO-fcc-S1-dmc.xml
|
||||||
[epd@oxygen ]$ llvm-xray account xray-log.qmcpack.yrxHG1 -instr_map=$QMCPACK_APP -sort=med -top=10 -sortorder=dsc
|
[epd@oxygen ]$ llvm-xray account xray-log.qmcpack.yrxHG1 -instr_map=$QMCPACK_APP -sort=med -top=10 -sortorder=dsc
|
||||||
|
|
||||||
|
@ -33,8 +33,17 @@ If you also have a new version of llvm installed you can use the newer xray tool
|
||||||
|
|
||||||
Then convert into chrome viewable html doc using the [catapult](https://github.com/catapult-project/catapult) tool.
|
Then convert into chrome viewable html doc using the [catapult](https://github.com/catapult-project/catapult) tool.
|
||||||
|
|
||||||
```
|
``` shell
|
||||||
~/codes/catapult/tracing/bin/trace2html dmc-a4-e48-gpu.trace -o dmc-a4-e48-gpu.html
|
~/codes/catapult/tracing/bin/trace2html dmc-a4-e48-gpu.trace -o dmc-a4-e48-gpu.html
|
||||||
```
|
```
|
||||||
|
|
||||||
[example -- only works with chrome](http://cdash-minimal.ornl.gov/profiling/dmc-a4-e48-gpu.html)
|
[example -- only works with chrome](http://cdash-minimal.ornl.gov/profiling/dmc-a4-e48-gpu.html)
|
||||||
|
|
||||||
|
You can then download the nvprof trace:
|
||||||
|
``` shell
|
||||||
|
$ scp /your_scratch/performace/NiO/sample/dmc-a4-e48-gpu/simul.nvprof ./
|
||||||
|
$ nvvp
|
||||||
|
```
|
||||||
|
You'll need to import simul.nvprof to a new session in nvvp.
|
||||||
|
Then you'll be able look at the traces from the CPU and GPU point of view at the same time.
|
||||||
|
|
||||||
|
|
Loading…
Reference in New Issue