feat: Enable prediction-map generation for final linear classifier test runs by vojtech-cifka · Pull Request #14 · RationAI/tissue-classification

vojtech-cifka · 2026-05-18T10:35:36Z

This PR enables TIFF prediction-map generation for the final linear classifier test configs and cleans up the related ML config structure.

Changes include:

Add TiffPredictionMapWriter to final LBFGS/AdamW test runs.
Store final checkpoint run IDs directly in test/predict configs instead of passing them manually from submit scripts.
Add the Virchow2 whole-tissue embedding run ID to the tissue-tile prediction config.
Remove error-mask generation from prediction maps.
Add flushed progress logs while writing prediction maps.
Reorganize configs/ml into clearer task, data, model, and trainer groups.
Move optimizer defaults out of the shared model config and into experiment configs.
Keep prediction-map artifacts under:
prediction_maps_tiff/pred/
prediction_maps_tiff/prob//

Summary by CodeRabbit

New Features
- Predict mode for unlabeled tissue tiles with per-class probability outputs and TIFF/Parquet prediction map export.
- TIFF prediction map writer producing slide-aligned class maps and per-class probability maps.
Enhancements
- Tile coordinate tracking included in prediction outputs for precise spatial mapping.
- Improved checkpoint resolution for artifact-backed checkpoints.
- Data module supports separate train and evaluation batch sizes; new experiment presets for final/k-fold training and tests.

Use the `label` and `fold` columns produced by the upstream k-fold split instead of deriving labels from coverage columns and randomly splitting val. Memory-mapped via HuggingFace datasets so the full embedding parquet no longer has to fit in numpy. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

…ddings Datamodule downloads embeddings + kfold artifacts from MLflow, joins on (slide_id, x, y) via pyarrow, applies class mapping, tissue/class coverage filters, and exposes per-fold splits via set_val_fold(). Training script loops folds in a single run and logs per-fold + aggregate metrics. Probe adds per-class F1, confusion matrix figures, optional input L2-norm and class weights. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The experiment file was declaring /class_mapping as a fresh default while configs/ml/linear_probe.yaml already had one, which Hydra rejects as a duplicate. Mark it as an override so the experiment replaces the base default. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

ml/train.py uses @with_cli_args(["+ml=linear_probe"]), so the decorator already injects that arg. Passing it again on the command line caused Hydra to load configs/ml/linear_probe.yaml twice and reject duplicate defaults. Rely on the decorator and pass only +experiment=... Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

@Package

…ng refs Two interpolation problems prevented Hydra from resolving the linear-probe config: 1. configs/ml.yaml uses ${random_seed:} and configs/ml/linear_probe.yaml uses ${len:...}, but neither resolver is registered anywhere. Register both at module import time in ml/train.py. 2. The class_mapping yamls use # @Package _global_, so class_mapping, class_indices, and class_names land at the config root. The references in linear_probe.yaml were doubly nested (e.g. class_mapping.class_mapping). Drop the prefix. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

The filtered tiles parquet collapses ROI columns at tiling time, so kfold writes canonical names ("Epithelium", etc.) directly into `label`. The raw→canonical lookup built from the BB-suffixed YAML lists matched none of these and dropped the entire 1.1M-tile dataset under drop_unmapped=True. Extend _raw_to_canonical with identity entries for every canonical class so modern parquets pass through while legacy un-collapsed labels still collapse correctly. "background" stays unmapped → dropped, as intended. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

- Add EmbeddingsDataModule.compute_class_weights("balanced"|"inverse") using sklearn-style weights from the current train fold. - train.py resolves class_weights="balanced"/"inverse" via the datamodule and passes the resulting list to LinearProbe at instantiate time (per-fold, since splits change). - Bump class_coverage_min from 0.0 to 0.5 to drop mosaic tiles. - Drop the redundant /class_mapping default from configs/ml/linear_probe.yaml; experiment files now own the choice. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Extract derive_labels logic to shared preprocessing/_labels.py, then use it in both split/kfold_split.py and the new embedding_dataset pipeline. The new pipeline joins k-fold (train) / filter_tiles (test) tile metadata with precomputed embeddings after applying tissue + per-dominant-class ROI thresholds, and emits a SlidesTilesLoader-compatible Parquet dataset as an MLflow artifact. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ataset

Joining 1M+ rows of list<double> embeddings was either OOMing on to_pandas() or hitting int32 list-offset overflow inside take(). The fix: - read embeddings into Arrow only and cast each chunk to large_list so take() concatenation uses int64 offsets; - run the join on keys plus a synthetic row index because Acero refuses list columns in non-key fields, then pull embeddings via take(); - combine_chunks() before take() for an O(N) single-pass copy; - write the parquet straight from Arrow, never materialising the embedding column in pandas. Also bumps the kube job memory to 64Gi to give the combined-chunks + take() peak some headroom, and trims the verbose [timing] prints down to one progress line per split. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Without this guard a malformed train artifact would crash deep inside apply_thresholds with a confusing KeyError. Surface a clear error that points at the expected upstream artifact instead. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

setup(stage="fit") replaces criterion with class-weighted CrossEntropyLoss, adding a criterion.weight buffer that gets saved to checkpoints. At test, Lightning restores the checkpoint before setup() runs, so the model still has the unweighted criterion from __init__ and strict load fails with "Unexpected key(s) in state_dict: criterion.weight". Affected both adamw and lbfgs test runs. Initialize criterion with a placeholder ones-weight sized num_classes so the criterion.weight key always exists; setup(fit) still overrides it with the real class-balanced weights. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (4)

configs/ml/data/final_embedding_tiles.yaml (1)
3-8: 💤 Low value

Consider adding eval_batch_size for evaluation-phase batching.

According to the review stack context, the DataModule now accepts an optional eval_batch_size parameter for separate evaluation-phase batching. While the code will default to using train_batch_size for evaluation, specifying a larger eval_batch_size could improve memory efficiency during validation and testing since gradients are not computed.
💡 Suggested addition
 data:
   train_batch_size: 1024
+  eval_batch_size: 2048
   num_workers: 4
   train_shuffle: true
   train_drop_last: false
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@configs/ml/data/final_embedding_tiles.yaml` around lines 3 - 8, Add an
eval_batch_size field under the top-level data mapping to allow the DataModule
to use a separate, larger batch size during evaluation (instead of defaulting to
train_batch_size); update the YAML key "eval_batch_size" (alongside
"train_batch_size", "num_workers", etc.) and set a value appropriate for
validation/testing (e.g., a larger number to improve memory efficiency when
gradients are off).
configs/experiment/ml/linear_classifier_adamw_stratified_kfold.yaml (1)
10-13: ⚡ Quick win

Verify that weight_decay: 0.0 is intentional for AdamW.

AdamW's primary feature is decoupled weight decay for better regularization. Setting weight_decay to 0.0 disables this, which may be suboptimal for generalization. Other AdamW configs in this PR use weight_decay: 1.0e-3.

If this is intentional for testing an unregularized baseline, consider documenting it. Otherwise, consider using a non-zero weight decay value.
💡 Suggested fix if weight decay is desired
 model:
   optimizer: adamw
   learning_rate: 1.0e-4
-  weight_decay: 0.0
+  weight_decay: 1.0e-3
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@configs/experiment/ml/linear_classifier_adamw_stratified_kfold.yaml` around
lines 10 - 13, The YAML sets model.optimizer to "adamw" but model.weight_decay
is 0.0 which disables AdamW's decoupled regularization; either change
model.weight_decay to a non-zero value (e.g., 1.0e-3) to match other AdamW
configs or add a comment/field documenting that 0.0 is intentional for an
unregularized baseline; update the config entry for model.weight_decay and/or
add a nearby comment explaining the rationale so reviewers know this is
deliberate.
ml/data/datasets/embedding_tiles.py (2)
200-201: 💤 Low value

Guard against empty first chunk when computing embedding dimension.

If the first chunk in emb_col happens to be empty (possible with certain partitioned parquet files), this division will raise ZeroDivisionError. Consider finding the first non-empty chunk or using a defensive check.
🛡️ Suggested defensive handling
-    first_chunk = emb_col.chunks[0]
-    embedding_dim = len(first_chunk.values) // len(first_chunk)
+    for chunk in emb_col.chunks:
+        if len(chunk) > 0:
+            embedding_dim = len(chunk.values) // len(chunk)
+            break
+    else:
+        raise RuntimeError("embedding column has no data")
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@ml/data/datasets/embedding_tiles.py` around lines 200 - 201, The code assumes
emb_col.chunks[0] is non-empty when computing embedding_dim, which can raise
ZeroDivisionError; modify the logic in embedding_tiles.py to locate the first
non-empty chunk (iterate emb_col.chunks until len(chunk) > 0) or check
len(first_chunk) before dividing, and if no non-empty chunk exists raise a clear
ValueError or return a sensible default; update the computation of embedding_dim
(using the found non-empty chunk) so embedding_dim = len(chunk.values) //
len(chunk) is only executed on a non-empty chunk.
252-258: ⚡ Quick win

Unreachable column validation due to read_parquet behavior.

The check at lines 255-258 is dead code. When tissue_column doesn't exist in the parquet file, pd.read_parquet(..., columns=[..., tissue_column]) raises an error before reaching your custom validation. Users will see a less descriptive pyarrow error instead of your helpful message.
♻️ Suggested fix: read all columns first, then validate
     def _filter_metadata(
         metadata_uri: str | Path,
         tissue_column: str,
         tissue_min: float,
     ) -> pd.DataFrame:
         local = _resolve_uri(metadata_uri)
-        columns = ["slide_id", "x", "y", tissue_column]
-        df = pd.read_parquet(local, columns=columns)
+        required_cols = ["slide_id", "x", "y"]
+        df = pd.read_parquet(local)
         if tissue_column not in df.columns:
             raise ValueError(
                 f"metadata parquet has no {tissue_column!r} column; cannot filter"
             )
-        df = df.loc[df[tissue_column] > tissue_min, ["slide_id", "x", "y"]]
+        df = df.loc[df[tissue_column] > tissue_min, required_cols]
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@ml/data/datasets/embedding_tiles.py` around lines 252 - 258, The explicit
check for tissue_column is unreachable because pd.read_parquet(...,
columns=[..., tissue_column]) will raise beforehand; update embedding_tiles.py
to first load the parquet without a columns projection (or catch the pyarrow
error), validate that tissue_column exists on the resulting df (or inspect
df.columns), then select/assign columns = ["slide_id","x","y", tissue_column]
and subset df accordingly; refer to _resolve_uri(metadata_uri), tissue_column,
and the df variable when making the change so the ValueError with your custom
message is raised from your own validation rather than from pyarrow.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@ml/data/data_module.py`:
- Line 29: Replace the truthy fallback with an explicit None check for the eval
batch-size: in the data module where self.eval_batch_size is set (use the
constructor parameter names eval_batch_size and train_batch_size), change the
expression self.eval_batch_size = eval_batch_size or train_batch_size to use a
conditional that only falls back when eval_batch_size is None (e.g.,
self.eval_batch_size = train_batch_size if eval_batch_size is None else
eval_batch_size) so falsey but invalid values like 0 are not silently masked.

---

Nitpick comments:
In `@configs/experiment/ml/linear_classifier_adamw_stratified_kfold.yaml`:
- Around line 10-13: The YAML sets model.optimizer to "adamw" but
model.weight_decay is 0.0 which disables AdamW's decoupled regularization;
either change model.weight_decay to a non-zero value (e.g., 1.0e-3) to match
other AdamW configs or add a comment/field documenting that 0.0 is intentional
for an unregularized baseline; update the config entry for model.weight_decay
and/or add a nearby comment explaining the rationale so reviewers know this is
deliberate.

In `@configs/ml/data/final_embedding_tiles.yaml`:
- Around line 3-8: Add an eval_batch_size field under the top-level data mapping
to allow the DataModule to use a separate, larger batch size during evaluation
(instead of defaulting to train_batch_size); update the YAML key
"eval_batch_size" (alongside "train_batch_size", "num_workers", etc.) and set a
value appropriate for validation/testing (e.g., a larger number to improve
memory efficiency when gradients are off).

In `@ml/data/datasets/embedding_tiles.py`:
- Around line 200-201: The code assumes emb_col.chunks[0] is non-empty when
computing embedding_dim, which can raise ZeroDivisionError; modify the logic in
embedding_tiles.py to locate the first non-empty chunk (iterate emb_col.chunks
until len(chunk) > 0) or check len(first_chunk) before dividing, and if no
non-empty chunk exists raise a clear ValueError or return a sensible default;
update the computation of embedding_dim (using the found non-empty chunk) so
embedding_dim = len(chunk.values) // len(chunk) is only executed on a non-empty
chunk.
- Around line 252-258: The explicit check for tissue_column is unreachable
because pd.read_parquet(..., columns=[..., tissue_column]) will raise
beforehand; update embedding_tiles.py to first load the parquet without a
columns projection (or catch the pyarrow error), validate that tissue_column
exists on the resulting df (or inspect df.columns), then select/assign columns =
["slide_id","x","y", tissue_column] and subset df accordingly; refer to
_resolve_uri(metadata_uri), tissue_column, and the df variable when making the
change so the ValueError with your custom message is raised from your own
validation rather than from pyarrow.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 22daf217-348c-4bd3-aa28-fa9c2b3c3a8b

📥 Commits

Reviewing files that changed from the base of the PR and between ee9d2da and 2ba0562.

📒 Files selected for processing (19)

configs/experiment/ml/linear_classifier_adamw_stratified_group_kfold.yaml
configs/experiment/ml/linear_classifier_adamw_stratified_kfold.yaml
configs/experiment/ml/linear_classifier_final_lbfgs.yaml
configs/experiment/ml/linear_classifier_lbfgs_stratified_group_kfold.yaml
configs/experiment/ml/linear_classifier_lbfgs_stratified_kfold.yaml
configs/experiment/ml/linear_classifier_predict_tissue_tiles.yaml
configs/experiment/ml/linear_classifier_test_adamw.yaml
configs/experiment/ml/linear_classifier_test_lbfgs.yaml
configs/ml.yaml
configs/ml/data/final_embedding_tiles.yaml
configs/ml/data/kfold_embedding_tiles.yaml
configs/ml/task/final_linear_classifier.yaml
configs/ml/task/kfold_linear_classifier.yaml
ml/__main__.py
ml/callbacks/tiff_prediction_map_writer.py
ml/data/data_module.py
ml/data/datasets/embedding_tiles.py
ml/meta_arch.py
scripts/submit_train_linear_probe.py

✅ Files skipped from review due to trivial changes (4)

configs/ml/data/kfold_embedding_tiles.yaml
configs/ml.yaml
configs/experiment/ml/linear_classifier_lbfgs_stratified_group_kfold.yaml
configs/experiment/ml/linear_classifier_adamw_stratified_group_kfold.yaml

🚧 Files skipped from review as they are similar to previous changes (7)

configs/experiment/ml/linear_classifier_final_lbfgs.yaml
configs/experiment/ml/linear_classifier_test_lbfgs.yaml
configs/experiment/ml/linear_classifier_predict_tissue_tiles.yaml
configs/ml/task/final_linear_classifier.yaml
configs/experiment/ml/linear_classifier_test_adamw.yaml
ml/main.py
ml/callbacks/tiff_prediction_map_writer.py

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (2)

scripts/submit_test_linear.py (1)

12-12: ⚡ Quick win

Avoid hardcoding a feature branch in submit script.

Line 12 ties runtime to feature/ml-test-mode; if that branch is deleted/renamed, submissions fail. Prefer a stable ref (e.g., master) or parameterize the ref.

Suggested change

-        "git clone --branch feature/ml-test-mode https://github.com/RationAI/tissue-classification.git workdir",
+        "git clone --branch master https://github.com/RationAI/tissue-classification.git workdir",

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@scripts/submit_test_linear.py` at line 12, The git clone command in
scripts/submit_test_linear.py hardcodes the feature branch
"feature/ml-test-mode", causing failures if that branch is removed; change it to
use a stable ref or a parameterized variable (e.g., use a default
BRANCH="master" or accept BRANCH from env/args) and interpolate that variable
into the clone string instead of the literal "feature/ml-test-mode" so the clone
command becomes dynamic and resilient.

submit_report.py (1)

17-17: ⚡ Quick win

Pin the report dependency to an immutable revision.

Installing from a mutable feature branch makes the job non-reproducible and fragile. Please pin this to a tag or commit SHA so reruns keep using the same reporter code.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@submit_report.py` at line 17, The dependency string currently installs the
report package from a mutable feature branch
("git+ssh://git@gitlab.ics.muni.cz/.../report.git@feature/force-wsi-service-protocol");
change that to pin to an immutable revision by replacing the branch ref with a
tag or commit SHA (e.g., use ".../report.git@vX.Y.Z" or
".../report.git@<commit-sha>") in the list where the string appears in
submit_report.py so reruns use the exact same reporter code.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@ml/meta_arch.py`:
- Around line 368-370: The artifact_file passed to mlflow.log_table is using a
.parquet extension which violates mlflow.log_table's JSON contract; update the
call to mlflow.log_table (the mlflow.log_table invocation that writes
"per_slide/test_tile_accuracy.parquet") to use
"per_slide/test_tile_accuracy.json" instead, and if you actually need Parquet
output, write the DataFrame to a .parquet via DataFrame.to_parquet and use
mlflow.log_artifact(s) to upload it rather than mlflow.log_table.

In `@submit_report.py`:
- Around line 6-21: The module currently calls submit_job(...) at import time
(submit_job with job_name, username, cpu, memory, gpu, public, script, storage),
causing side effects; wrap that logic into a function (e.g., def main() or
submit_report()) and move the submit_job(...) invocation inside it, then call
that function only under a main guard (if __name__ == "__main__": main()),
keeping the same submit_job parameters and behavior so imports no longer trigger
job submission.

---

Nitpick comments:
In `@scripts/submit_test_linear.py`:
- Line 12: The git clone command in scripts/submit_test_linear.py hardcodes the
feature branch "feature/ml-test-mode", causing failures if that branch is
removed; change it to use a stable ref or a parameterized variable (e.g., use a
default BRANCH="master" or accept BRANCH from env/args) and interpolate that
variable into the clone string instead of the literal "feature/ml-test-mode" so
the clone command becomes dynamic and resilient.

In `@submit_report.py`:
- Line 17: The dependency string currently installs the report package from a
mutable feature branch
("git+ssh://git@gitlab.ics.muni.cz/.../report.git@feature/force-wsi-service-protocol");
change that to pin to an immutable revision by replacing the branch ref with a
tag or commit SHA (e.g., use ".../report.git@vX.Y.Z" or
".../report.git@<commit-sha>") in the list where the string appears in
submit_report.py so reruns use the exact same reporter code.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 9a00defe-f089-426a-b475-4e2610e87941

📥 Commits

Reviewing files that changed from the base of the PR and between 2ba0562 and e370417.

📒 Files selected for processing (3)

ml/meta_arch.py
scripts/submit_test_linear.py
submit_report.py

coderabbitai

Actionable comments posted: 1

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@scripts/submit_test_linear.py`:
- Line 6: The submission script uses a hardcoded username ("vcifka") instead of
the repository convention of using the Ellipsis placeholder; update the
parameter assignment for username in scripts/submit_test_linear.py (the
username= entry) to use the placeholder username=... so it matches the existing
pattern (e.g., +experiment=...) used elsewhere.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 5c16cc67-14d5-42a5-b99b-e46cbfe76b84

📥 Commits

Reviewing files that changed from the base of the PR and between e370417 and 597e348.

📒 Files selected for processing (3)

ml/callbacks/tiff_prediction_map_writer.py
ml/meta_arch.py
scripts/submit_test_linear.py

🚧 Files skipped from review as they are similar to previous changes (2)

ml/meta_arch.py
ml/callbacks/tiff_prediction_map_writer.py

Clear the batch buffer only on rank!=0 or after a successful write so the on_test_end fallback no longer hits an always-empty buffer. Add diagnostic prints to the silent early-return guards and an idempotency flag so the two write hooks cooperate. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

vojtech-cifka and others added 30 commits May 7, 2026 21:43

feat: create ml pipeline for linear probe

24668c3

fix: sort only tiles parquet

894c27b

fix: log join types of tile keys

fc824ad

fix: remove embeddings from the join

11931d1

fix: remove label column

fb6b320

fix: prevent overflow

7434ae9

Merge remote-tracking branch 'origin/master' into feature/linear-probe

1b18daa

feat: add class tresholds and run ids

911bec2

fix: wrong run id

1a02395

Merge remote-tracking branch 'origin/master' into feature/embedding-d…

08d7ba5

…ataset

feat: add timing

b38465e

refactor: use pyarrow to avoid to pandas conversion

bfc9578

fix: join on keys only

eb213c6

fix: typing

c92d9a1

fix: add prints

01cc394

refactor: use combine chunks

cad0d37

chore: remove time

3b0137f

feat: add timing

8df47aa

chore: revert to the previous state

926753d

feat: add prints

b0e9ba4

vejtek previously approved these changes May 18, 2026

View reviewed changes

chore: deduplicate, apply safety nets

8b3a82d

vojtech-cifka dismissed vejtek’s stale review via 8b3a82d May 18, 2026 10:55

vojtech-cifka requested a review from vejtek May 18, 2026 10:56

Adames4 requested changes May 18, 2026

View reviewed changes

Comment thread ml/data/data_module.py Outdated

Comment thread ml/meta_arch.py Outdated

Comment thread ml/data/datasets/embedding_tiles.py Outdated

Comment thread ml/data/datasets/embedding_tiles.py Outdated

vojtech-cifka added 4 commits May 18, 2026 18:06

fix: pytorch checkpoint loading

e16426e

chore: remove redundancy, rename variables

fd3fdd6

chore: remove username and branch

c401015

refactor: rename configs

847c3cc

vojtech-cifka requested a review from Adames4 May 18, 2026 16:49

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

Comment thread ml/data/data_module.py

Adames4 previously approved these changes May 18, 2026

View reviewed changes

fix: criterion weight

e370417

vojtech-cifka dismissed Adames4’s stale review via e370417 May 18, 2026 19:15

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

Comment thread ml/meta_arch.py Outdated

Comment thread submit_report.py Outdated

vojtech-cifka added 3 commits May 18, 2026 21:56

fix: keep space in MUG prediction masks names

632a8f6

fix: log test accuracy as jsons

3cd0243

chore: remove username from the submission script

76e4194

vojtech-cifka requested a review from Adames4 May 18, 2026 20:05

fix: force the entering of the write phase of the prediction maps

597e348

coderabbitai Bot reviewed May 18, 2026

View reviewed changes

Comment thread scripts/submit_test_linear.py Outdated

vojtech-cifka and others added 3 commits May 18, 2026 22:47

fix: remove username

3829ebd

fix: preserve original wsi name

79b47a2

vejtek approved these changes May 19, 2026

View reviewed changes

Adames4 approved these changes May 19, 2026

View reviewed changes

vojtech-cifka merged commit c655ad2 into master May 19, 2026
3 checks passed

vojtech-cifka deleted the feature/ml-test-mode branch May 19, 2026 08:16

coderabbitai Bot mentioned this pull request May 19, 2026

feat: Add Prov-GigaPath linear probe test workflows and prediction-map utilities #15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Enable prediction-map generation for final linear classifier test runs#14

feat: Enable prediction-map generation for final linear classifier test runs#14
vojtech-cifka merged 113 commits into
masterfrom
feature/ml-test-mode

vojtech-cifka commented May 18, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

vojtech-cifka commented May 18, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

vojtech-cifka commented May 18, 2026 •

edited by coderabbitai Bot

Loading