Coyote/sw/examples/perf_fpga
zhe 91a2241052 add support for multiple card and strm axi 2023-06-02 09:12:13 +02:00
..
README.md renaming examples. 2022-11-03 15:30:12 +01:00
main.cpp add support for multiple card and strm axi 2023-06-02 09:12:13 +02:00

README.md

Example benchmarking base read and write operations initiated by the FPGA

This performance benchmark measures the throughput and latency of the FPGA initiated read and write transfers.

The results should (hopefully) point to the advantage the FPGA initiated transfers have over their CPU initiated counterparts. As all user logic in Coyote is virtualized, these transfers can use the same virtual address ranges available in the user space.

Note

On Alveo platforms completion events fired by XDMA signal only the completion within the DMA engine (weird design choice within an XDMA core...). This doesn't measure the time it takes to actually write the data. To measure this, writeback memory should be polled. For this reason, the results will show a much smaller latency for write operations.

This is not the case in Enzian where measurements are actually measuring the correct completion.

TODO

  • Poll on writeback
  • Printout results