Add article quality validation gate to generate-news-enhanced.ts pipeline by Copilot · Pull Request #469 · Hack23/riksdagsmonitor

Copilot · 2026-02-23T07:25:56Z

Articles were being written to news/ with no quality gate, resulting in committed files with 100% Unknown (Unknown) authors, 21+ untranslated data-translate spans, and sub-300-word document lists with zero analytical structure.

Changes

`scripts/generate-news-enhanced.ts`

validateArticleQuality(html, lang, articleType): ArticleQualityReport — 100-point scoring across 4 dimensions (25pts each): word count, unknown-author ratio, untranslated spans (non-sv only), analytical <h2> sections
writeSingleArticle calls validation before writing and logs a per-article quality report to console
--quality-threshold=N CLI flag (default: 40); exit code 2 when all articles in a run score below threshold (soft monitoring failure)
ArticleQualityReport interface with readonly properties exported for external use

📊 Article Quality Report: motions-en
   - Word count: 450 ✅
   - Unknown authors: 10/10 ⚠️
   - Untranslated spans: 21 > 10 ⚠️
   - Analytical sections: 1/3 ⚠️
   - Quality Score: 15/100 — BELOW THRESHOLD

`tests/article-quality.test.ts`

17 unit tests covering all scoring dimensions, partial-score boundaries, Swedish span exemption, pass/fail threshold, and articleId format.

`tests/generate-news-enhanced-mcp-abort.test.ts`

Fixed missing beforeEach import causing TS2304 TypeScript compile error.

Original prompt

This section details on the original issue you should resolve

<issue_title>Add article quality validation gate in generate-news-enhanced.ts pipeline</issue_title>
<issue_description>## 📋 Issue Type
Feature / Content Quality

🎯 Objective

Add a post-generation validation and enhancement step to scripts/generate-news-enhanced.ts that checks article quality before writing files, and rejects or enhances articles that don't meet minimum quality standards.

📊 Current State

The generate-news-enhanced.ts pipeline:

Fetches data from MCP

Generates HTML via data-transformers.ts functions

Writes files directly to news/ directory

No quality validation between generation and writing

Result: Low-quality list-style articles are written without any checks:

Articles with 100% "Filed by: Unknown (Unknown)" entries are committed

Articles with 21+ untranslated data-translate spans are committed

Articles under 500 words of actual analysis are committed

Articles that are pure document lists without analysis are committed

🚀 Desired State

Add a validateArticleQuality(html, lang, articleType) function that:
Checks for disqualifying patterns:

Count of "Unknown (Unknown)" > 50% of entries → WARN

Count of data-translate="true" in non-Swedish > 10 → WARN

Word count < 300 → REJECT

No analytical sections (h2 headers for analysis) → WARN
Reports quality metrics in console:
📊 Article Quality Report: motions-en
- Word count: 450 ✅
- Unknown authors: 10/10 ⚠️
- Untranslated spans: 21 ⚠️
- Analytical sections: 1/3 ⚠️
- Quality Score: 35/100 — BELOW THRESHOLD
Quality threshold: If score < 40, log a warning but still write the file (don't block generation, but make quality visible)

Exit code: If ALL articles in a run score < 40, exit with code 2 (soft failure for monitoring)
🔧 Implementation Approach

Target file: scripts/generate-news-enhanced.ts (add validation step)

Create validateArticleQuality(html, lang, articleType) function

Call it after HTML generation but before file write

Log quality metrics for each article

Track aggregate quality score across all articles in the run

Add --quality-threshold CLI flag (default: 40) for adjustable minimum

✅ Acceptance Criteria

Quality validation runs after each article generation

Console shows quality metrics for each article

Known quality issues (Unknown authors, untranslated spans) are detected

Quality scores are logged for monitoring

Soft failure exit code (2) when ALL articles below threshold

--quality-threshold flag for CI/CD integration

Unit tests verify quality validation logic

📚 References

scripts/generate-news-enhanced.ts (article generation pipeline)

Known quality issues in news/2026-02-22-* articles

.github/skills/editorial-standards/SKILL.md

🤖 Recommended Agent

Copilot coding agent — add quality validation to generate-news-enhanced.ts</issue_description>

<agent_instructions>Focus ONLY on scripts/generate-news-enhanced.ts. Add a new function validateArticleQuality(html, lang, articleType) that checks for: (1) 'Unknown (Unknown)' count, (2) data-translate span count in non-Swedish, (3) word count, (4) analytical section count (h2 headers). Call this function after HTML generation but before file writing. Log quality metrics to console. Add a --quality-threshold CLI flag (default 40). Exit with code 2 if ALL articles score below threshold. Add tests in a new test file tests/article-quality.test.ts. Do NOT modify data-transformers.ts, mcp-client.ts, or any workflow files to avoid merge conflicts.</agent_instructions>

Comments on the Issue (you are @copilot in this section)

Fixes Add article quality validation gate in generate-news-enhanced.ts pipeline #468

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

…line Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

github-actions · 2026-02-23T10:49:41Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

This pull request adds a comprehensive article quality validation gate to the generate-news-enhanced.ts pipeline to prevent low-quality articles from being committed. The implementation introduces a scoring system that evaluates four key dimensions (word count, unknown authors, untranslated spans, and analytical structure) and provides visibility into article quality through console reports and CI/CD-compatible exit codes.

Changes:

Added validateArticleQuality() function with a 100-point scoring system across 4 dimensions (25 points each)
Integrated quality validation into the writeSingleArticle() function with detailed console reporting
Added --quality-threshold CLI flag (default: 40) for CI/CD configuration
Implemented exit code 2 for soft failures when all articles score below threshold
Added comprehensive unit tests (17 test cases) covering all scoring dimensions and edge cases

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
`scripts/generate-news-enhanced.ts`	Added validateArticleQuality function, integrated quality reporting into writeSingleArticle, added CLI flag parsing for quality threshold, implemented exit code 2 logic for quality gate failures
`tests/article-quality.test.ts`	Added comprehensive unit tests covering word count scoring, unknown author detection, untranslated span detection, analytical section counting, and pass/fail thresholds

Copilot · 2026-02-23T10:53:06Z

scripts/generate-news-enhanced.ts

+  articleId: string;
+  wordCount: number;
+  unknownAuthorCount: number;
+  totalEntryCount: number;
+  untranslatedSpanCount: number;
+  analyticalSectionCount: number;
+  score: number;
+  passed: boolean;
+  issues: string[];


The ArticleQualityReport interface properties should be marked as readonly for consistency with the test interface and immutability best practices. The test file defines these properties as readonly (lines 19-27), but the implementation does not (lines 421-429).

Suggested change

articleId: string;

wordCount: number;

unknownAuthorCount: number;

totalEntryCount: number;

untranslatedSpanCount: number;

analyticalSectionCount: number;

score: number;

passed: boolean;

issues: string[];

readonly articleId: string;

readonly wordCount: number;

readonly unknownAuthorCount: number;

readonly totalEntryCount: number;

readonly untranslatedSpanCount: number;

readonly analyticalSectionCount: number;

readonly score: number;

readonly passed: boolean;

readonly issues: string[];

Fixed in c36cb05 — all 9 ArticleQualityReport properties are now readonly.

Copilot · 2026-02-23T10:53:06Z

scripts/generate-news-enhanced.ts

+ *
+ * @param html - Full HTML content of the article
+ * @param lang - Language code (e.g. 'en', 'sv')
+ * @param articleType - Article type label for reporting (e.g. 'motions-en')


The JSDoc example for the articleType parameter is misleading. It shows 'motions-en' but the actual usage (lines 717, 804, 888, 972) passes just the article type without language code (e.g., 'motions', 'propositions'). The language code is appended internally at line 447 to form the articleId. The example should be updated to 'motions' instead of 'motions-en'.

Suggested change

* @param articleType - Article type label for reporting (e.g. 'motions-en')

* @param articleType - Article type label for reporting (e.g. 'motions')

Fixed in c36cb05 — JSDoc example updated to 'motions'.

pethers · 2026-02-23T11:50:45Z

@copilot apply changes based on the comments in this thread

github-actions · 2026-02-23T11:52:05Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

…ample Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

Copilot · 2026-02-23T12:31:07Z

@copilot apply changes based on the comments in this thread

Done in c36cb05 — both review suggestions applied: readonly modifiers on all ArticleQualityReport properties, and the JSDoc @param articleType example corrected from 'motions-en' to 'motions'.

github-actions · 2026-02-23T12:37:07Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated no new comments.

…t.ts Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

github-actions · 2026-02-23T14:46:34Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 6 comments.

Copilot · 2026-02-23T14:51:13Z

scripts/generate-news-enhanced.ts

+  // Use list items as a proxy for total document entries
+  const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;
+  const totalEntryCount = Math.max(listItemCount, unknownAuthorCount);
+


totalEntryCount is derived from a raw <li> count, but the generated articles use <div class="*-entry"> blocks and class="document-link" anchors for documents (and <li> also appears in the event calendar/watch lists). This will skew the unknown-author ratio (e.g., known entries may not be counted at all, or unrelated <li>s may dilute the ratio), leading to incorrect scores/exit-code behavior. Consider counting document entries via a more stable marker like class="document-link" (or per-articleType entry class) instead of <li>.

Suggested change

// Use list items as a proxy for total document entries

const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;

const totalEntryCount = Math.max(listItemCount, unknownAuthorCount);

// Prefer explicit document links as a proxy for total document entries

const documentLinkCount = (html.match(/class=["']document-link["']/g) ?? []).length;

// Fallback: use list items when no explicit document links are present

const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;

const baseEntryCount = documentLinkCount > 0 ? documentLinkCount : listItemCount;

// Ensure total entries are never less than the number of unknown-author entries

const totalEntryCount = Math.max(baseEntryCount, unknownAuthorCount);

Copilot · 2026-02-23T14:51:13Z

scripts/generate-news-enhanced.ts

+  }
+
+  const score = wordScore + unknownScore + untranslatedScore + analyticalScore;
+  const passed = score >= qualityThreshold;


Articles flagged with Word count: < 300 — REJECT can still end up with passed=true if the other dimensions score high, because passed only checks score >= qualityThreshold. This contradicts the “REJECT” semantics and can prevent the run-level soft-failure from triggering. Consider forcing passed=false (and optionally treating the score as below threshold) whenever the reject condition is hit (e.g., wordCount < 300).

Suggested change

const passed = score >= qualityThreshold;

const hardRejected = wordCount < 300;

const passed = !hardRejected && score >= qualityThreshold;

Copilot · 2026-02-23T14:51:14Z

scripts/generate-news-enhanced.ts

+const qualityThreshold: number = qualityThresholdArg ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10) : 40;



--quality-threshold= parsing can yield NaN (e.g., empty value --quality-threshold= or non-numeric input). A NaN threshold makes all score >= qualityThreshold comparisons false and also prevents the every(s < qualityThreshold) soft-failure check from ever triggering. Consider validating the parsed value (e.g., Number.isFinite) and falling back to the default or exiting with a clear error when invalid.

Suggested change

const qualityThreshold: number = qualityThresholdArg ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10) : 40;

const DEFAULT_QUALITY_THRESHOLD = 40;

const qualityThresholdRaw: string | undefined = qualityThresholdArg ? qualityThresholdArg.split('=')[1] : undefined;

let qualityThreshold: number = DEFAULT_QUALITY_THRESHOLD;

if (qualityThresholdRaw !== undefined && qualityThresholdRaw !== '') {

const parsedQuality = Number(qualityThresholdRaw);

if (Number.isFinite(parsedQuality)) {

qualityThreshold = parsedQuality;

} else {

console.error(

`Invalid --quality-threshold value "${qualityThresholdRaw}". Using default ${DEFAULT_QUALITY_THRESHOLD}.`,

);

}

}

Copilot · 2026-02-23T14:51:14Z

scripts/generate-news-enhanced.ts

+export function validateArticleQuality(html: string, lang: string, articleType: string): ArticleQualityReport {
+  const articleId = `${articleType}-${lang}`;
+
+  // Word count: strip tags and count whitespace-separated tokens
+  const textContent = html.replace(/<[^>]+>/g, ' ').replace(/\s+/g, ' ').trim();
+  const wordCount = textContent.length === 0 ? 0 : textContent.split(' ').filter(w => w.length > 0).length;
+
+  // Count "Unknown (Unknown)" sentinel entries (used when author/party is missing)
+  const unknownAuthorCount = (html.match(/Unknown \(Unknown\)/g) ?? []).length;
+
+  // Use list items as a proxy for total document entries
+  const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;
+  const totalEntryCount = Math.max(listItemCount, unknownAuthorCount);
+
+  // Count untranslated spans — only relevant for non-Swedish content
+  const untranslatedSpanCount = lang !== 'sv'
+    ? (html.match(/data-translate="true"/g) ?? []).length
+    : 0;
+
+  // Count analytical h2 sections (structural quality indicator)
+  const analyticalSectionCount = (html.match(/<h2[^>]*>/g) ?? []).length;
+
+  const issues: string[] = [];
+
+  // Word count score: 25 pts
+  let wordScore = 0;
+  if (wordCount >= 500) {
+    wordScore = 25;
+  } else if (wordCount >= 300) {
+    wordScore = 15;
+  } else {
+    issues.push(`Word count: ${wordCount} < 300 — REJECT`);
+  }
+
+  // Unknown authors score: 25 pts
+  let unknownScore = 0;
+  const unknownRatio = totalEntryCount > 0 ? unknownAuthorCount / totalEntryCount : 0;
+  if (unknownRatio === 0) {
+    unknownScore = 25;
+  } else if (unknownRatio <= 0.5) {
+    unknownScore = Math.round(25 * (1 - unknownRatio));
+    issues.push(`Unknown authors: ${unknownAuthorCount}/${totalEntryCount} ⚠️`);
+  } else {
+    issues.push(`Unknown authors: ${unknownAuthorCount}/${totalEntryCount} ⚠️`);
+  }
+
+  // Untranslated spans score: 25 pts
+  let untranslatedScore = 0;
+  if (lang === 'sv' || untranslatedSpanCount === 0) {
+    untranslatedScore = 25;
+  } else if (untranslatedSpanCount <= 10) {
+    untranslatedScore = Math.round(25 * (1 - untranslatedSpanCount / 10));
+    issues.push(`Untranslated spans: ${untranslatedSpanCount} ⚠️`);
+  } else {
+    issues.push(`Untranslated spans: ${untranslatedSpanCount} > 10 ⚠️`);
+  }
+
+  // Analytical sections score: 25 pts
+  let analyticalScore = 0;
+  if (analyticalSectionCount >= 3) {
+    analyticalScore = 25;
+  } else if (analyticalSectionCount >= 1) {
+    analyticalScore = Math.round(25 * analyticalSectionCount / 3);
+    issues.push(`Analytical sections: ${analyticalSectionCount}/3 ⚠️`);
+  } else {
+    issues.push(`Analytical sections: 0/3 ⚠️`);
+  }
+
+  const score = wordScore + unknownScore + untranslatedScore + analyticalScore;
+  const passed = score >= qualityThreshold;
+


validateArticleQuality is exported but its passed value depends on the module-level qualityThreshold parsed from CLI args at import time. This makes the function’s behavior context-dependent for external callers (and for tests/tools importing the module), which is surprising for a reusable API. Consider making the threshold an explicit parameter (or returning score/metrics only and letting callers decide pass/fail).

Copilot · 2026-02-23T14:51:14Z

scripts/generate-news-enhanced.ts

+async function writeSingleArticle(html: string, slug: string, lang: Language, articleType?: string): Promise<string> {
+  // Validate article quality before writing
+  const report = validateArticleQuality(html, lang, articleType ?? slug);
+  const unknownRatioStr = report.totalEntryCount > 0
+    ? `${report.unknownAuthorCount}/${report.totalEntryCount}`
+    : `${report.unknownAuthorCount}/0`;
+  const unknownIcon = report.unknownAuthorCount > 0 ? '⚠️' : '✅';
+  const untranslatedIcon = report.untranslatedSpanCount > 0 ? '⚠️' : '✅';
+  const analyticalIcon = report.analyticalSectionCount >= 3 ? '✅' : '⚠️';
+  console.log(`\n  📊 Article Quality Report: ${report.articleId}`);
+  console.log(`     - Word count: ${report.wordCount} ${report.wordCount >= 300 ? '✅' : '❌'}`);
+  console.log(`     - Unknown authors: ${unknownRatioStr} ${unknownIcon}`);
+  console.log(`     - Untranslated spans: ${report.untranslatedSpanCount} ${untranslatedIcon}`);
+  console.log(`     - Analytical sections: ${report.analyticalSectionCount}/3 ${analyticalIcon}`);
+  console.log(`     - Quality Score: ${report.score}/100 — ${report.passed ? 'ABOVE THRESHOLD' : 'BELOW THRESHOLD'}`);
+  if (report.issues.length > 0) {
+    report.issues.forEach(issue => console.warn(`     ⚠️  ${issue}`));
+  }
+  qualityScores.push(report.score);
+
  const filename: string = `${slug}-${lang}.html`;
  await writeArticle(html, filename);


Quality validation currently runs only via writeSingleArticle, but several article types in this pipeline are generated by news-types/* modules that call the provided writeArticle(html, filename) callback directly (e.g., month-ahead/weekly-review/monthly-review/breaking). Those outputs won’t be validated and won’t contribute to qualityScores, so the new gate/exit-code behavior is incomplete relative to the PR’s stated goal. Consider moving validation into writeArticle (deriving lang/articleType from the filename/slug) or providing a validating wrapper callback to those generators.

Copilot · 2026-02-23T14:51:15Z

tests/article-quality.test.ts

+/** Shape of the quality report returned by validateArticleQuality */
+interface ArticleQualityReport {
+  readonly articleId: string;
+  readonly wordCount: number;
+  readonly unknownAuthorCount: number;
+  readonly totalEntryCount: number;
+  readonly untranslatedSpanCount: number;
+  readonly analyticalSectionCount: number;
+  readonly score: number;
+  readonly passed: boolean;
+  readonly issues: string[];
+}
+
+/** Partial shape of the generate-news-enhanced module we need for these tests */
+interface GenerateNewsEnhancedModule {
+  readonly validateArticleQuality: (html: string, lang: string, articleType: string) => ArticleQualityReport;
+}


This test re-declares the ArticleQualityReport shape locally instead of importing the exported type from generate-news-enhanced. That can drift over time (tests still compiling while the real interface changes). Prefer import type { ArticleQualityReport } from the module (or ReturnType<typeof validateArticleQuality>), and use it in GenerateNewsEnhancedModule.

pethers · 2026-02-23T16:01:02Z

@copilot apply changes based on the comments in this thread

github-actions · 2026-02-23T16:02:31Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

github-actions · 2026-02-23T17:49:55Z

🔍 Lighthouse Performance Audit

Category	Score	Status
Performance	85/100	🟡
Accessibility	95/100	🟢
Best Practices	90/100	🟢
SEO	95/100	🟢

📥 Download full Lighthouse report

Budget Compliance: Performance budgets enforced via budget.json

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated no new comments.

Initial plan

f962c83

Copilot AI assigned Copilot and pethers Feb 23, 2026

Copilot started work on behalf of pethers February 23, 2026 07:36 View session

Add article quality validation gate in generate-news-enhanced.ts pipe…

4f89246

…line Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Add article quality validation gate in generate-news-enhanced pipeline~~ Add article quality validation gate to generate-news-enhanced.ts pipeline Feb 23, 2026

Copilot AI requested a review from pethers February 23, 2026 07:51

Copilot finished work on behalf of pethers February 23, 2026 07:51

Copilot started work on behalf of pethers February 23, 2026 09:20 View session

Copilot finished work on behalf of pethers February 23, 2026 09:27

Merge branch 'main' into copilot/add-article-quality-validation

d170c95

pethers requested a review from Copilot February 23, 2026 10:48

github-actions bot added testing Test coverage refactor Code refactoring size-l Large change (250-1000 lines) labels Feb 23, 2026

Copilot started reviewing on behalf of pethers February 23, 2026 10:48 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

Merge branch 'main' into copilot/add-article-quality-validation

c47743b

Copilot started work on behalf of pethers February 23, 2026 12:03 View session

Apply review feedback: readonly interface properties and fix JSDoc ex…

c36cb05

…ample Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

Copilot finished work on behalf of pethers February 23, 2026 12:32

pethers requested a review from Copilot February 23, 2026 12:35

Copilot started reviewing on behalf of pethers February 23, 2026 12:36 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

Copilot started work on behalf of pethers February 23, 2026 13:03 View session

Fix missing beforeEach import in generate-news-enhanced-mcp-abort.tes…

6e69b92

…t.ts Co-authored-by: pethers <1726836+pethers@users.noreply.github.com>

Copilot finished work on behalf of pethers February 23, 2026 13:12

Merge branch 'main' into copilot/add-article-quality-validation

78a6649

pethers requested a review from Copilot February 23, 2026 14:45

Copilot started reviewing on behalf of pethers February 23, 2026 14:46 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

Merge branch 'main' into copilot/add-article-quality-validation

1fb96c3

Copilot started work on behalf of pethers February 23, 2026 16:18 View session

Copilot finished work on behalf of pethers February 23, 2026 17:17

Merge branch 'main' into copilot/add-article-quality-validation

07e4738

pethers requested a review from Copilot February 23, 2026 17:48

Copilot started reviewing on behalf of pethers February 23, 2026 17:49 View session

Copilot AI reviewed Feb 23, 2026

View reviewed changes

pethers marked this pull request as ready for review February 23, 2026 18:12

pethers merged commit eef669e into main Feb 23, 2026
19 checks passed

pethers deleted the copilot/add-article-quality-validation branch February 23, 2026 18:12

Copilot AI mentioned this pull request Feb 23, 2026

Add article quality validation gate to news generation pipeline #467

Draft

	* @param articleType - Article type label for reporting (e.g. 'motions-en')
	* @param articleType - Article type label for reporting (e.g. 'motions')

-  // Use list items as a proxy for total document entries
-  const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;
-  const totalEntryCount = Math.max(listItemCount, unknownAuthorCount);
+  // Prefer explicit document links as a proxy for total document entries
+  const documentLinkCount = (html.match(/class=["']document-link["']/g) ?? []).length;
+  // Fallback: use list items when no explicit document links are present
+  const listItemCount = (html.match(/<li[^>]*>/g) ?? []).length;
+  const baseEntryCount = documentLinkCount > 0 ? documentLinkCount : listItemCount;
+  // Ensure total entries are never less than the number of unknown-author entries
+  const totalEntryCount = Math.max(baseEntryCount, unknownAuthorCount);

	const passed = score >= qualityThreshold;
	const hardRejected = wordCount < 300;
	const passed = !hardRejected && score >= qualityThreshold;

		const qualityThreshold: number = qualityThresholdArg ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10) : 40;

-const qualityThreshold: number = qualityThresholdArg ? parseInt(qualityThresholdArg.split('=')[1] ?? '40', 10) : 40;
+const DEFAULT_QUALITY_THRESHOLD = 40;
+const qualityThresholdRaw: string | undefined = qualityThresholdArg ? qualityThresholdArg.split('=')[1] : undefined;
+let qualityThreshold: number = DEFAULT_QUALITY_THRESHOLD;
+if (qualityThresholdRaw !== undefined && qualityThresholdRaw !== '') {
+  const parsedQuality = Number(qualityThresholdRaw);
+  if (Number.isFinite(parsedQuality)) {
+    qualityThreshold = parsedQuality;
+  } else {
+    console.error(
+      `Invalid --quality-threshold value "${qualityThresholdRaw}". Using default ${DEFAULT_QUALITY_THRESHOLD}.`,
+    );
+  }
+}

Comments

Conversation

Copilot AI commented Feb 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

scripts/generate-news-enhanced.ts

tests/article-quality.test.ts

tests/generate-news-enhanced-mcp-abort.test.ts

🎯 Objective

📊 Current State

🚀 Desired State

🔧 Implementation Approach

✅ Acceptance Criteria

📚 References

🤖 Recommended Agent

Comments on the Issue (you are @copilot in this section)

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

pethers commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 23, 2026

Choose a reason for hiding this comment

Uh oh!

pethers commented Feb 23, 2026

Uh oh!

github-actions bot commented Feb 23, 2026

🔍 Lighthouse Performance Audit

Copilot AI commented Feb 23, 2026 •

edited

Loading

`scripts/generate-news-enhanced.ts`

`tests/article-quality.test.ts`

`tests/generate-news-enhanced-mcp-abort.test.ts`