ft42 commited on
Commit
6c62721
·
verified ·
1 Parent(s): 599a397

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -7
README.md CHANGED
@@ -48,13 +48,13 @@ Medical imaging datasets are increasingly available, yet abnormal and annotation
48
  The overall pipeline for organ, body, and nodule segmentation with alignment is shown below:
49
 
50
  <p align="center">
51
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/workflow.png" alt="Segmentation Pipeline"/>
52
  </p>
53
 
54
  **Workflow** for constructing the **NoMAISI** development dataset. The pipeline includes **(1)** organ segmentation using AI models, **(2)** body segmentation with algorithmic methods, **(3)** nodule segmentation through AI-assisted and ML-based refinement, and **(4)** segmentation alignment to integrate organs, body, and nodules segmentations into anatomically consistent volumes.
55
 
56
  <p align="center">
57
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/NoMAISI_train_and_infer.png" alt="NoMAISI_train_and_infer"/>
58
  </p>
59
 
60
  **Overview** of our flow-based latent diffusion model with ControlNet conditioning for AI-based CT generation. The pipeline consists of three stages: **(top) Pretrained VAE** for image compression, where CT images are encoded into latent features using a frozen VAE; **(middle)** Model fine-tuning, where a **Rectified Flow ODE sampler**, conditioned on segmentation masks and voxel spacing through a **fine-tuned ControlNet**, predicts velocity fields in latent space and is optimized with a region-specific contrastive loss emphasizing ROI sensitivity and background consistency; and **(bottom) Inference**, where segmentation masks and voxel spacing guide latent sampling along the ODE trajectory to obtain a clean latent representation, which is then decoded by the VAE into full-resolution AI-generated CT images conditioned by body and lesion masks.
@@ -110,7 +110,7 @@ Fréchet Inception Distance (FID) of the **MAISI-v2** baseline and **NoMAISI** m
110
  ### 📉 FID Parity Plot
111
 
112
  <p align="left">
113
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/GanAI_fid_scatter_marker_legend.png" alt="Parity comparison of FID for real↔real vs AI-generated CT across datasets" width="500">
114
  </p>
115
 
116
  **Comparison of Fréchet Inception Distance (FID) between real↔real and AI-generated CT datasets.** Each point represents a clinical dataset (**LNDbv4, NSCLC-R, LIDC-IDRI, DLCS24, Intgmultiomics, LUNA25**) under different generative models (**MAISI-V2, NoMAISI**).The x-axis shows the **median FID** computed between real datasets, while the y-axis shows the **FID of AI-generated data** compared to real.
@@ -124,13 +124,13 @@ The dashed diagonal line denotes **parity (y = x)**, where AI-generated fidelity
124
  - **Yellow boxes** highlight lung nodule regions for comparison.
125
 
126
  <p align="center">
127
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/DLCS_1419_ann0_slice134_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
128
  </p>
129
  <p align="center">
130
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/DLCS_1508_ann0_slice46_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
131
  </p>
132
  <p align="center">
133
- <img src="https://github.com/fitushar/NoMAISI/blob/main/doc/images/DLCS_1453_ann0_slice204_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
134
  </p>
135
 
136
 
@@ -204,7 +204,7 @@ python -m scripts.infer_testV2_controlnet \
204
 
205
  ## 🔬 Downstream Task: Cancer vs. No-Cancer Classification
206
 
207
- ![Cancer/No-Cancer Classification Results](https://github.com/fitushar/NoMAISI/blob/main/doc/images/TaskCls.png)
208
 
209
  **Shown.** AUC vs. the **% of clinical data retained** (x-axis: **100%**, **50%**, **20%**, **10%**).
210
  **Curves (additive augmentation — we **add** AI-generated nodules; we never replace clinical samples):**
 
48
  The overall pipeline for organ, body, and nodule segmentation with alignment is shown below:
49
 
50
  <p align="center">
51
+ <img src="doc/images/workflow.png" alt="Segmentation Pipeline"/>
52
  </p>
53
 
54
  **Workflow** for constructing the **NoMAISI** development dataset. The pipeline includes **(1)** organ segmentation using AI models, **(2)** body segmentation with algorithmic methods, **(3)** nodule segmentation through AI-assisted and ML-based refinement, and **(4)** segmentation alignment to integrate organs, body, and nodules segmentations into anatomically consistent volumes.
55
 
56
  <p align="center">
57
+ <img src="doc/images/NoMAISI_train_and_infer.png" alt="NoMAISI_train_and_infer"/>
58
  </p>
59
 
60
  **Overview** of our flow-based latent diffusion model with ControlNet conditioning for AI-based CT generation. The pipeline consists of three stages: **(top) Pretrained VAE** for image compression, where CT images are encoded into latent features using a frozen VAE; **(middle)** Model fine-tuning, where a **Rectified Flow ODE sampler**, conditioned on segmentation masks and voxel spacing through a **fine-tuned ControlNet**, predicts velocity fields in latent space and is optimized with a region-specific contrastive loss emphasizing ROI sensitivity and background consistency; and **(bottom) Inference**, where segmentation masks and voxel spacing guide latent sampling along the ODE trajectory to obtain a clean latent representation, which is then decoded by the VAE into full-resolution AI-generated CT images conditioned by body and lesion masks.
 
110
  ### 📉 FID Parity Plot
111
 
112
  <p align="left">
113
+ <img src="doc/images/GanAI_fid_scatter_marker_legend.png" alt="Parity comparison of FID for real↔real vs AI-generated CT across datasets" width="500">
114
  </p>
115
 
116
  **Comparison of Fréchet Inception Distance (FID) between real↔real and AI-generated CT datasets.** Each point represents a clinical dataset (**LNDbv4, NSCLC-R, LIDC-IDRI, DLCS24, Intgmultiomics, LUNA25**) under different generative models (**MAISI-V2, NoMAISI**).The x-axis shows the **median FID** computed between real datasets, while the y-axis shows the **FID of AI-generated data** compared to real.
 
124
  - **Yellow boxes** highlight lung nodule regions for comparison.
125
 
126
  <p align="center">
127
+ <img src="doc/images/DLCS_1419_ann0_slice134_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
128
  </p>
129
  <p align="center">
130
+ <img src="doc/images/DLCS_1508_ann0_slice46_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
131
  </p>
132
  <p align="center">
133
+ <img src="doc/images/DLCS_1453_ann0_slice204_triple.png" alt="Comparison of MAISI-V2 vs NoMAISI on lung CT with input masks" width="1000">
134
  </p>
135
 
136
 
 
204
 
205
  ## 🔬 Downstream Task: Cancer vs. No-Cancer Classification
206
 
207
+ ![Cancer/No-Cancer Classification Results](doc/images/TaskCls.png)
208
 
209
  **Shown.** AUC vs. the **% of clinical data retained** (x-axis: **100%**, **50%**, **20%**, **10%**).
210
  **Curves (additive augmentation — we **add** AI-generated nodules; we never replace clinical samples):**