Skip to content

[ERROR] client reporting Error CQE, when running gpunetio_verbs_put_bw example on two node. #2

@foraxe

Description

@foraxe

[ERROR] client reporting Error CQE, when running gpunetio_verbs_put_bw example on two node.
Hi, everyone.

[Description]
Run commands:
Server:

cd /workspace/gpunetio/examples/gpunetio_verbs_put_bw
LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/workspace/gpunetio/lib DOCA_GPUNETIO_LOG=6 ./gpunetio_verbs_put_bw -g 08:00.0 -d mlx5_bond_0

Client:

cd /workspace/gpunetio/examples/gpunetio_verbs_put_bw
LD_LIBRARY_PATH=${LD_LIBRARY_PATH}:/workspace/gpunetio/lib DOCA_GPUNETIO_LOG=6 ./gpunetio_verbs_put_bw -g 08:00.0 -d mlx5_bond_0 -c x.x.x.x

When run on single node in tmux, the example returns Sample finished successfully and give the BW results.

But when run on two node, the client return Error CQE. And i also tried -p 1 mode, same errors.

ps: I am trying this example for solving the issue in NIXL gpunetio backend ai-dynamo/nixl#952

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions