Hi team! Thank you for the great project.
I found that `CORRECTNESS_PROMPT` in `tests/prompts.py` does not use `user_question`, `report`, or `answer`:
open_deep_research/tests/prompts.py, lines 95 to 104 at b419df8:

```python
CORRECTNESS_PROMPT = """You are evaluating the correctness of a research report that was generated by a research agent.
You will be provided with the question, the report, and the answer from an independent authority.
Score the report from 1-5 on how well it mirrors the answer from the authority.
We expect the report to contain more information that is not in the answer, that's perfectly okay.
They likely won't be perfectly the same, but they should have the same themes and ideas to get a high score.
Use your best judgement when comparing the answer to the report!
"""
```
However, the `eval_correctness` function in `tests/evaluators.py` formats the prompt with these fields:
open_deep_research/tests/evaluators.py, line 112 at b419df8:

```python
user_input_content = CORRECTNESS_PROMPT.format(user_question=query, report=final_report, answer=answer, today=get_today_str())
```
As a result, the `format()` call has no effect, and the evaluator receives a prompt that lacks the actual question, the generated report, and the authoritative answer.
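(For context: `str.format()` silently ignores keyword arguments that have no matching placeholder, so the call succeeds but returns the template unchanged. A quick illustration:)

```python
>>> "Static prompt text with no placeholders.".format(
...     user_question="q", report="r", answer="a", today="2024-01-01"
... )
'Static prompt text with no placeholders.'
```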
Could you please confirm whether this is a bug?
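If it is, a minimal fix might be to add the missing placeholders to the prompt so they match the keyword arguments passed in `eval_correctness`. The sketch below is only a suggestion; the exact wording and placement of the fields is up to you:

```python
# Sketch: same instructions as today, plus placeholders for the fields
# that eval_correctness already passes (user_question, report, answer, today).
CORRECTNESS_PROMPT = """You are evaluating the correctness of a research report that was generated by a research agent.
You will be provided with the question, the report, and the answer from an independent authority.
Score the report from 1-5 on how well it mirrors the answer from the authority.
We expect the report to contain more information that is not in the answer, that's perfectly okay.
They likely won't be perfectly the same, but they should have the same themes and ideas to get a high score.
Use your best judgement when comparing the answer to the report!

Today's date: {today}

<question>
{user_question}
</question>

<report>
{report}
</report>

<authoritative_answer>
{answer}
</authoritative_answer>
"""
```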