refactor: consolidate e2e tests and add unit tests by k82cn · Pull Request #435 · xflops/flame

k82cn · 2026-05-06T13:21:59Z

Summary

Consolidate e2e Python tests from 14 to 7 files for better maintainability
Add unit tests for session_manager storage layer, executor_manager, object_cache, and SDK

E2E Test Consolidation

New File	Merged From
`test_session.py`	`test_session_management.py` + `test_open_session.py` + `test_batch.py`
`test_cache.py`	`test_cache.py` + `test_cache_lru.py`
`test_runner.py`	`test_runner.py` + `test_flmrun.py` + `test_get_data.py` + `test_drf.py`
`test_failure_recovery.py`	`test_failure_recovery.py` + `test_shim_selection.py`
`test_core.py`	Kept as-is
`test_agent.py`	Kept as-is
`test_application.py`	Kept as-is

New Unit Tests

session_manager/src/storage/*_tests.rs - Storage layer unit tests
executor_manager/src/executor.rs - Executor unit tests
executor_manager/src/manager.rs - Manager unit tests
object_cache/src/cache.rs - Cache unit tests
sdk/python/tests/test_*.py - Python SDK tests
sdk/rust/src/client/mod.rs - Rust SDK tests

- Consolidate e2e Python tests from 14 to 7 files: - test_session.py: merged session_management, open_session, batch tests - test_cache.py: merged cache and cache_lru tests - test_runner.py: merged runner, flmrun, get_data, drf tests - test_failure_recovery.py: merged failure_recovery and shim_selection tests - Add unit tests for session_manager storage layer - Add unit tests for executor_manager - Add unit tests for object_cache - Add unit tests for SDK (Python and Rust)

Fix clippy field_reassign_with_default error in application_tests.rs

gemini-code-assist

Code Review

This pull request introduces several new end-to-end tests for the Flame system, covering LRU cache eviction policies, task failure recovery, session management, and shim selection logic. The review feedback highlights the need for more robust test configurations, such as using environment variable overrides for cache limits instead of relying on defaults, replacing hardcoded sleep intervals with polling mechanisms to reduce flakiness, and replacing magic numbers with constants for better maintainability.

gemini-code-assist · 2026-05-06T13:32:21Z

+class TestLRUEviction:
+    """Test suite for LRU eviction policy behavior."""
+
+    def test_basic_eviction_on_memory_limit(self):


The test test_basic_eviction_on_memory_limit does not explicitly configure the cache memory limit, making it dependent on the environment's default configuration. Please use monkeypatch.setenv("FLAME_CACHE_MAX_MEMORY", "1M") to ensure the test consistently triggers eviction.

def test_basic_eviction_on_memory_limit(self, monkeypatch): monkeypatch.setenv("FLAME_CACHE_MAX_MEMORY", "1M")

gemini-code-assist · 2026-05-06T13:32:21Z

+            refreshed_task = session.get_task(task.id)
+
+            # Check events for error information
+            error_events = [e for e in refreshed_task.events if e.code == TaskState.FAILED or e.code == 3]


The magic number 3 is used to check for task failure. Please define a constant for this error code to improve maintainability.

gemini-code-assist · 2026-05-06T13:32:21Z

+        task1 = session.create_task(serialize_request(request1))
+        task1_id = task1.id
+
+        time.sleep(3)


Hardcoded time.sleep(3) can lead to flaky tests. Please replace this with a polling mechanism that checks for the expected state change.

Suggested change

time.sleep(3)

# Wait for task to be pending

for _ in range(10):

if session.get_task(task1_id).state == TaskState.PENDING:

break

time.sleep(0.5)

codecov · 2026-05-06T13:47:46Z

Codecov Report

❌ Patch coverage is 39.58333% with 29 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
sdk/rust/src/client/mod.rs	0.00%	8 Missing ⚠️
session_manager/src/apiserver/frontend.rs	0.00%	4 Missing ⚠️
flmctl/src/create.rs	0.00%	3 Missing ⚠️
flmctl/src/list.rs	0.00%	3 Missing ⚠️
session_manager/src/model/mod.rs	40.00%	3 Missing ⚠️
session_manager/src/scheduler/plugins/priority.rs	70.00%	3 Missing ⚠️
common/src/apis/types.rs	0.00%	1 Missing ⚠️
flmexec/src/client.rs	0.00%	1 Missing ⚠️
flmping/src/client.rs	0.00%	1 Missing ⚠️
session_manager/src/storage/engine/filesystem.rs	75.00%	1 Missing ⚠️
... and 1 more

📢 Thoughts on this report? Let us know!

The TestFlmrunApplication tests require /opt/e2e which only exists in Docker E2E environment.

- Fix Rust SDK serde_message::deserialize to handle null values for common_data field by deserializing to Option<String> - Export ResourceRequirement in flamepy package for Python E2E tests

…ility The fairshare plugin's is_available check requires ssn.slots == exec.slots, but the SDK sets slots=0 when resreq is provided. This causes sessions with explicit resreq to never match any executor. Skip this test until the fairshare plugin is updated to handle resreq-based allocation.

- Remove fairshare scheduler plugin (slots-based allocation) - Replace with DRF (Dominant Resource Fairness) for resreq-based allocation - Update create_executor to use session's resreq directly when specified - Update all config files to use drf instead of fairshare - Update DEFAULT_POLICIES to [priority, drf, gang] - Update tests to reflect new behavior (all executors available by default) - Re-enable test_session_with_resreq test This simplifies the scheduling model by using resource requirements directly instead of the abstract slots concept. DRF provides fair multi-resource allocation based on actual CPU/memory/GPU requirements.

- Regenerate Python protobuf files for types.proto changes (removed slots, resreq now required) - Fix ruff lint: import sorting in agent/client.py - Fix ruff lint: add noqa comments for mock method names matching protobuf interface - Fix ruff format: 3 test files reformatted - Remove backend.proto from Python SDK (internal use only)

k82cn added 2 commits May 6, 2026 21:21

fix: use struct initialization instead of field reassignment

fb33761

Fix clippy field_reassign_with_default error in application_tests.rs

gemini-code-assist Bot reviewed May 6, 2026

View reviewed changes

k82cn added 6 commits May 6, 2026 21:49

fix: skip TestFlmrunApplication in BareMetal environment

8b20c3b

The TestFlmrunApplication tests require /opt/e2e which only exists in Docker E2E environment.

fix: handle null in serde_message and export ResourceRequirement

8594c62

- Fix Rust SDK serde_message::deserialize to handle null values for common_data field by deserializing to Option<String> - Export ResourceRequirement in flamepy package for Python E2E tests

fix: use int for cpu in ResourceRequirement test

969996a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refactor: consolidate e2e tests and add unit tests#435

refactor: consolidate e2e tests and add unit tests#435
k82cn wants to merge 8 commits intoxflops:mainfrom
k82cn:flm_test_0506

k82cn commented May 6, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 6, 2026

Uh oh!

gemini-code-assist Bot May 6, 2026

Uh oh!

gemini-code-assist Bot May 6, 2026

Uh oh!

codecov Bot commented May 6, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

-        time.sleep(3)
+        # Wait for task to be pending
+        for _ in range(10):
+            if session.get_task(task1_id).state == TaskState.PENDING:
+                break
+            time.sleep(0.5)

Conversation

k82cn commented May 6, 2026

Summary

E2E Test Consolidation

New Unit Tests

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 6, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 6, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 6, 2026

Choose a reason for hiding this comment

Uh oh!

codecov Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

codecov Bot commented May 6, 2026 •

edited

Loading