Add end-to-end tutorial for feature extraction and evaluation#8
Open
prachitbhike wants to merge 1 commit intoYalaLab:mainfrom
Open
Add end-to-end tutorial for feature extraction and evaluation#8prachitbhike wants to merge 1 commit intoYalaLab:mainfrom
prachitbhike wants to merge 1 commit intoYalaLab:mainfrom
Conversation
Adds a tutorial that walks through the full pipeline: generating synthetic data, extracting embeddings (CLI and Python API), running evaluation, and adding a custom dataset. - tutorial/setup_tutorial.py: generates a synthetic chest X-ray image and dummy labels JSON so the DummyDataset works out of the box - TUTORIAL.md: four-part guide covering extraction, Python API, evaluation, and custom datasets - README.md: link to tutorial This also provides a workaround for the missing rve package (YalaLab#5) and explains how the model converts images to embeddings (YalaLab#7). Closes YalaLab#2 Addresses YalaLab#5, YalaLab#7
305852b to
ffe685e
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
TUTORIAL.md) walking through the full pipeline: synthetic data setup, feature extraction (CLI + Python API), evaluation, and adding a custom datasettutorial/setup_tutorial.pyto generate the synthetic image and labels needed byDummyDatasetREADME.mdIssues addressed
rve-free path viaDummyDatasetand documents the workaround in the troubleshooting sectionWhat's included
tutorial/setup_tutorial.pyassets/CXR145_IM-0290-1001.png(synthetic 1024x1024 grayscale image) andtutorial/dummy_labels.json(100 studies, 3 binary findings). Uses only PIL + stdlib.TUTORIAL.mdREADME.md## Tutorialsection with link toTUTORIAL.mdTest plan
python tutorial/setup_tutorial.pyand verify both files are created--max-samples 10to verify extraction worksdiscover_questions_from_jsonexpectations