Add: benchmark scene tests for dispatch and graph topologies #414
Open
chenshengxin2026 wants to merge 1 commit into hw-native-sys:main
Code Review
This pull request introduces a comprehensive suite of performance and topology tests for the tensormap_and_ringbuffer runtime, including benchmarks for independent and serial dispatch, as well as various graph structures such as linear chains, diamonds, fan-ins, and fan-outs. Additionally, it expands the benchmark_bgemm test cases to include sweeps for tile size, batch groups, and dimensions, and integrates these new tests into the benchmark tools. The review feedback correctly identifies several discrepancies between the docstrings and the actual implemented test cases in the golden scripts, providing specific suggestions to align the documentation with the code.
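For readers unfamiliar with the topology names, the four graph shapes mentioned in the review can be sketched as plain dependency edge lists. This is an illustrative sketch only: the helper names (`chain`, `fan_out`, `fan_in`, `diamond`, `longest_path_tasks`) and the task numbering are assumptions made for this example and are not taken from the PR's golden scripts or the runtime API.

```python
from collections import defaultdict


def chain(n):
    """Linear chain of n tasks: task i + 1 depends on task i."""
    return [(i, i + 1) for i in range(n - 1)]


def fan_out(n):
    """One producer (task 0) feeding n independent consumers (tasks 1..n)."""
    return [(0, i) for i in range(1, n + 1)]


def fan_in(n):
    """n independent producers (tasks 0..n-1) feeding one consumer (task n)."""
    return [(i, n) for i in range(n)]


def diamond():
    """Task 0 fans out to tasks 1 and 2, which both feed task 3."""
    return [(0, 1), (0, 2), (1, 3), (2, 3)]


def longest_path_tasks(edges):
    """Count the tasks on the longest dependency path (needs >= 1 edge)."""
    successors = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for src, dst in edges:
        successors[src].append(dst)
        indegree[dst] += 1
        nodes.update((src, dst))

    # Kahn-style traversal: a task is processed once all its inputs are done.
    depth = {node: 1 for node in nodes}
    ready = [node for node in nodes if indegree[node] == 0]
    while ready:
        node = ready.pop()
        for nxt in successors[node]:
            depth[nxt] = max(depth[nxt], depth[node] + 1)
            indegree[nxt] -= 1
            if indegree[nxt] == 0:
                ready.append(nxt)
    return max(depth.values(), default=0)


if __name__ == "__main__":
    for name, edges in [
        ("chain_4", chain(4)),
        ("fanout_4", fan_out(4)),
        ("fanin_4", fan_in(4)),
        ("diamond", diamond()),
    ]:
        print(name, longest_path_tasks(edges))
```

The longest path is a rough proxy for how much of a graph must run serially: a chain is fully serial, while fan-in and fan-out expose parallel work behind a single dependency level.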
tests/st/a2a3/tensormap_and_ringbuffer/Graph-fanout_N/golden.py
ad594c7 to 2cf31d2
2cf31d2 to 9a9cf64
Summary
- Add test scenes for the `tensormap_and_ringbuffer` runtime covering dispatch overhead and graph topology benchmarks
- Restructure `benchmark_bgemm` test cases with descriptive parameter sweep naming (Tile, Batch, K, Loop sweeps) replacing generic Case0-4
- Integrate the new tests into `benchmark_rounds.sh`

Test plan
- Run `Dispatch-independent` and `Dispatch-serial` on device to verify dispatch overhead measurement
- Run `Graph-chain_N`, `Graph-fanin_N`, `Graph-fanout_N`, `Graph-diamond` on device to verify graph topology tests
- Run `benchmark_bgemm` with new case names to verify restructured parameter sweeps
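The move from generic `Case0-4` names to descriptive sweep names can be illustrated with a small sketch. Everything here is hypothetical: the `sweep_cases` helper, the parameter keys (`tile`, `batch`, `k`), the values, and the `Tile_16`-style name format are assumptions for illustration, not the actual case definitions in `benchmark_bgemm`.

```python
def sweep_cases(prefix, param, values, base):
    """Build named cases that vary one parameter and hold the rest at base.

    Hypothetical helper: the case name encodes which parameter is swept
    and its value, so a result row is self-describing without a lookup.
    """
    cases = {}
    for value in values:
        params = {**base, param: value}  # copy base, override the swept key
        cases[f"{prefix}_{value}"] = params
    return cases


if __name__ == "__main__":
    # Hypothetical baseline configuration and sweep values.
    base = {"tile": 64, "batch": 1, "k": 128}
    cases = {}
    cases.update(sweep_cases("Tile", "tile", [16, 32, 64], base))
    cases.update(sweep_cases("Batch", "batch", [1, 2, 4], base))
    for name, params in cases.items():
        print(name, params)
```

Compared with positional names such as `Case0`, a name like `Tile_32` tells the reader of a benchmark log which dimension changed without cross-referencing the test file.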