Skip to content

fix: replace delisted x-ai/grok-4-fast default with the live openai/gpt-4.1-mini#246

Closed
Fearvox wants to merge 668 commits into
EverMind-AI:mainfrom
Fearvox:fix/llm-model-default-delisted-grok
Closed

fix: replace delisted x-ai/grok-4-fast default with the live openai/gpt-4.1-mini#246
Fearvox wants to merge 668 commits into
EverMind-AI:mainfrom
Fearvox:fix/llm-model-default-delisted-grok

Conversation

@Fearvox
Copy link
Copy Markdown
Collaborator

@Fearvox Fearvox commented Jun 3, 2026

What

The out-of-box LLM model default is a delisted OpenRouter model, so a new
user who copies env.template, fills in a real OPENROUTER_API_KEY, and runs
EverCore fails on the first LLM call.

x-ai/grok-4-fast is no longer served on OpenRouter:

  • It is absent from the OpenRouter model catalog (GET https://openrouter.ai/api/v1/models — 343 models; the only grok ids are
    x-ai/grok-build-0.1, x-ai/grok-4.3, x-ai/grok-4.20,
    x-ai/grok-4.20-multi-agent).
  • GET …/models/x-ai/grok-4-fast/endpoints returns 0 serving endpoints, so
    a chat-completions request with it gets no providers and fails.

src/memory_layer/llm/llm_provider.py:45 forwards LLM_MODEL verbatim, so the
template value reaches OpenRouter unchanged.

This PR replaces the dead id with openai/gpt-4.1-mini — which is the code's
own hardcoded DEFAULT_LLM_MODEL (llm_provider.py:9) and has live OpenRouter
endpoints — in the three places the dead id is shipped:

  • methods/EverCore/env.template (the runtime default)
  • methods/EverCore/docs/dev_docs/getting_started.md (setup example)
  • methods/EverCore/docs/usage/CONFIGURATION_GUIDE.md (the cost-effective-model example)

After this change the template default agrees with the code default, so a
fresh key-only setup works on the first call.

Why this id

openai/gpt-4.1-mini is already DEFAULT_LLM_MODEL in the code, is live on
OpenRouter, and is cost-effective — so template, docs, and code all agree.

Scope

Three files, one dead-id → live-id swap each. No code change. Intentionally
not touched: tests/test_llm_switching_e2e.py uses x-ai/grok-4-fast
deliberately as a disallowed model to assert white-list rejection — a delisted
id is still a valid "disallowed" example there, so its meaning is preserved.

Verification

OpenRouter catalog checked live (2026-06-02): x-ai/grok-4-fast not present;
openai/gpt-4.1-mini present. A running EverCore configured with
openai/gpt-4.1-mini serves normally.

Co-Authored-By: Claude Opus 4.8 (1M context) noreply@anthropic.com

🤖 Generated with Claude Code

libin.zhang and others added 30 commits December 31, 2025 18:23
Use self-deployed embedding and rerank APIs by default

See merge request npc-work/aic/ai/evermemos-opensource!64
vLLM Rerank API adopts an instruction-tuned approach

See merge request npc-work/aic/ai/evermemos-opensource!65
feat: metrics client and rerank/vectorize/retrieve metrics

See merge request npc-work/aic/ai/evermemos-opensource!66
fix:update episode prompt

See merge request npc-work/aic/ai/evermemos-opensource!68
feat: add rerank metrics

See merge request npc-work/aic/ai/evermemos-opensource!69
cyfyifanchen and others added 25 commits April 27, 2026 03:42
* chore: rename project from evermemos to EverCore

This commit renames the project directory and updates all internal references from "evermemos" to "EverCore". The changes include:
- Renaming the main directory from `methods/evermemos` to `methods/EverCore`
- Updating all import paths and module references
- Maintaining the same code structure and functionality
- Adding new configuration files (.vscode/settings.json, .pylintrc, pyrightconfig.json)
- Updating Dockerfile and project metadata

* docs: update references from evermemos to EverCore

Update documentation files to reflect the renaming of the 'evermemos' directory to 'EverCore'. This includes fixing clone commands, directory paths, and documentation links across multiple files to ensure consistency and correct navigation for users.

* chore: rename EverMemOS to EverCore across codebase

This is a project-wide rebranding from EverMemOS to EverCore. The changes include:
- Update project name in source files, documentation, and configuration
- Rename API references, environment variables, and service names
- Modify demo descriptions and benchmark configurations
- Update URLs and citations to reflect new project identity

All functionality remains identical; only naming has changed to align with the new project branding.

* docs: update README with EverCore focus and restructured TOC

- Add line break before Table of Contents for better visual separation
- Rewrite project description to highlight EverCore as the central component
- Reorder directory tree to prioritize benchmarks and methods over use-cases
- Update use-cases list with more examples and clarify they are templates
- Improve flow from Quick Start to use-cases to benchmarks

* docs: update README with clearer methods description and benchmarks

Add benchmark numbers directly in the method summaries for better visibility.
Clarify introductory text to emphasize choice and composition of methods.

* docs: fix markdown formatting in README table of contents

Adjust whitespace and line breaks to ensure proper rendering of the collapsible table of contents section.
…d-AI#204)

- Replace specific EverMemBench-Dynamic badge with general EverMind-AI HuggingFace badge
- Remove redundant License badge
- Change "Methods" section heading to "Architecture Methods"
- Update sub-section headings from h4 (####) to h3 (###) for better hierarchy
…rMind-AI#208)

* docs: restructure README and add AGENTS.md for better navigation

- Reorder sections to emphasize architecture methods and use cases
- Move use cases section before quick start for better flow
- Rename "Methods" to "Architecture Methods" for clarity
- Add AGENTS.md with quick commands and key entry points
- Update section headers to improve document hierarchy
- Maintain all existing content while improving organization

* docs: add community and contribution files

* docs: reorder README directory tree for logical grouping

* docs: move community files to .github/ and update references

* ci: change deploy workflow trigger from feature branch to main
* docs: restructure README and add subdirectory guides

Move the directory tree from the main README to new dedicated README files for each top-level folder (use-cases, methods, benchmarks). Add detailed introductions and tables to guide users to the appropriate subprojects. This improves navigation and provides clear entry points for different use cases.

* docs: expand showcase section with new projects and links

Add six new project entries to the README showcase, each with a banner image, description, and code/plugin link. Also update an existing benchmark entry to include a dataset link. This enhances the repository's demonstration of real-world applications and available resources.
* docs(readme): update project links and formatting

* docs(use-cases): enhance README with visual catalogue of demos

Expand the use cases section from a simple table to a detailed visual catalogue with project banners, descriptions, and links. This improves user engagement and provides a better showcase of community integrations and demos.

* docs: update READMEs and add validation for use-case links
* docs: update plugin repository link in README

* docs(readme): update banner gif link
)

* docs(readme): update code example link to pinned commit

pin the reference to the voice assistant example code to a specific commit hash and fix folder name capitalization

* docs: update voice assistant demo link in README
* docs(readme): add four new use case entries

* docs(readme): update outdated banner links to correct github repos
…e-demo-content-payload

Fix EverCore demo memory payload
…actions-hygiene

Harden GitHub Actions workflows
…adme-quickstart

docs: verify EverCore quickstart path
…I#236)

Delete deprecated EvoAgentBench, EverMemBench benchmark suites and
HyperMem memory system implementation, including all associated
configurations, scripts, and supporting assets.
…pt-4.1-mini

The shipped default LLM model id is delisted on OpenRouter, so a fresh setup
(copy env.template, add a real OPENROUTER_API_KEY, run) fails on the first LLM
call. x-ai/grok-4-fast is absent from the OpenRouter catalog and its endpoints
list is empty (0 serving providers); llm_provider.py forwards LLM_MODEL verbatim,
so the dead default reaches OpenRouter unchanged.

Replace it with openai/gpt-4.1-mini, which is the code's own DEFAULT_LLM_MODEL
(llm_provider.py:9) and is live on OpenRouter, in the three places the dead id is
shipped: env.template (runtime default), docs/dev_docs/getting_started.md (setup
example), and docs/usage/CONFIGURATION_GUIDE.md (cost-effective-model example).
Template default now agrees with the code default.

Leaves tests/test_llm_switching_e2e.py untouched: it uses x-ai/grok-4-fast on
purpose as a disallowed model to assert white-list rejection, and a delisted id
is still a valid "disallowed" example there.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Copilot AI review requested due to automatic review settings June 3, 2026 06:58
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.