Skip to content

windows 11 intel npu - nexa sdk BUG #937

@paoIpao

Description

@paoIpao

Pre-checks

  • I searched existing issues
  • I’m using the latest NexaSDK release

What happened?

I install nexa sdk.
when I execute (or any other models): nexa infer NexaAI/llama3.2-3B-intel-npu
I write something and works and response.
Then closed and when reopen the windows and execute: nexa infer NexaAI/llama3.2-3B-intel-npu
then I write something and the response is seems blank (or some character that I not see).

Then if close, wait some time and re-open window then it's start work againt.

If I execute: nexa infer NexaAI/deepSeek-r1-distill-qwen-7B-intel-npu
the problem on first time that I write something start report some random characters and then infinite "cryptocryptocrypto"...
Hen if close and reopen and write something the response is seems blanks (or some characters that I not see).

Steps to reproduce

1.Install nexa sdk on windows 11 - intel npu
2.execute: nexa infer NexaAI/llama3.2-3B-intel-npu (or nexa infer NexaAI/deepSeek-r1-distill-qwen-7B-intel-npu)
3. write something and see response
4. bug/problem on response with invalid characters/problem decode or other similar.

Logs & stack traces


Model(s) and quantization

deepSeek-r1-distill-qwen-7B-intel-npu

NexaSDK version

NexaSDK Bridge Version: v1.0.38-rc5 NexaSDK CLI Version: v0.2.65

Install method

from source

OS and version

Windows 11

Hardware / accelerator

NPU - Yoga Slim 7 14ILL10 - Type 83JX

Additional context

Metadata

Metadata

Labels

🐞 bugSomething isn't working🔚 wontfixThis will not be worked on

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions