Skip to content

Latest commit

 

History

History
87 lines (78 loc) · 3.16 KB

File metadata and controls

87 lines (78 loc) · 3.16 KB

Experiment 1:

  • 3D autoencoder baseline
  • GH200 GPU
  • 100 patches
  • 5 epochs
  • MSE loss
  • Upsample + Conv3D decoder
  • Loss decreased successfully
  • Reconstruction captures global morphology but remains blurry
  • Reconstruction captures large-scale morphology and intensity distribution
  • Fine-grained microstructural details remain blurred
  • Low MSE loss does not necessarily correspond to high perceptual fidelity
  • Possible causes: shallow architecture, limited epochs, MSE smoothing effects
  • Final training loss: 0.000804

Experiment 2:

  • Increased training duration from 5 epochs to 25 epochs
  • Same 3D autoencoder baseline architecture
  • GH200 GPU
  • 100 patches
  • MSE reconstruction loss
  • Upsample + Conv3D decoder
  • Training loss decreased significantly over epochs
  • Final MSE loss reached very low values (~0.0008)
  • Reconstruction remained blurry despite lower numerical loss
  • Global morphology captured successfully
  • Fine-grained microstructural details remain blurred
  • Low MSE loss does not necessarily correspond to high perceptual fidelity
  • Possible causes: shallow architecture, limited dataset size, MSE smoothing effects
  • Demonstrated limitation of pixel-wise reconstruction objective

Experiment 3:

  • Increased raw dataset from 100 TIFF slices to 500 TIFF slices
  • Generated 700 3D patches of size 64×64×64
  • GH200 GPU
  • 25 epochs
  • MSE reconstruction loss
  • Upsample + Conv3D decoder

Results:

  • Training completed on larger patch dataset
  • Loss curve and reconstruction figure saved
  • Checkpoint saved

Observation:

  • Compare reconstruction sharpness and loss behaviour against Experiment 2

Experiment 4:

  • Added LPIPS perceptual loss to baseline 3D autoencoder

  • Combined loss function: Total Loss = MSE + 0.1 × LPIPS

  • LPIPS applied slice-wise on middle 2D slices extracted from 3D volumes

  • Grayscale slices converted to 3-channel format for LPIPS compatibility

  • Dataset size: 700 patches generated from 500 TIFF slices

  • Training epochs: 25

  • GPU: NVIDIA GH200 120GB

  • Decoder: Upsample + Conv3D architecture

  • MSE loss continued decreasing steadily

  • LPIPS loss decreased from ~0.21 to ~0.04

  • Reconstruction quality improved noticeably compared to pure MSE training

  • Fine structural textures and local morphology became more visible

  • Reconstruction still slightly smooth but preserves significantly more microstructural detail

  • Demonstrates perceptual-loss-based improvement over standard reconstruction objective

  • Final losses:

    • Total Loss ≈ 0.00449
    • MSE Loss ≈ 0.00053
    • LPIPS Loss ≈ 0.03968

    Experiment 5:

  • Increased dataset from 700 patches to 1200 patches

  • Generated from 800 TIFF slices

  • 3D autoencoder with deeper encoder-decoder architecture

  • Combined MSE + LPIPS perceptual loss

  • GH200 GPU training

  • 25 epochs

  • Reconstruction quality improved noticeably compared to earlier experiments

  • Fine-grained textures and structural patterns are better preserved

  • LPIPS loss decreased from ~0.20 to ~0.03 during training

  • Reconstruction remains slightly blurry but captures morphology more accurately

  • Larger dataset improved generalization and perceptual fidelity

  • Final losses:

    • Total Loss ≈ 0.006
    • MSE ≈ 0.0007
    • LPIPS ≈ 0.056