Skip to content

Commit d17fcac

Browse files
added theoretical introduction to WSI DICOM format
1 parent 4e02ad4 commit d17fcac

File tree

2 files changed

+15
-20
lines changed

2 files changed

+15
-20
lines changed
116 KB
Loading

notebooks/pathomics/getting_started_with_digital_pathology.ipynb

Lines changed: 15 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@
2323
"Besides radiology, slide microscopy is the second major imaging modality in the IDC. Slide microscopy images show **thin sections of tissue samples** (e.g., from a resected tumor) at microscopic resolution. They provide a unique glimpse into cellular architecture and function that is essential for diagnosing complex diseases like cancer. Computerized analysis makes the assessment of slide microscopy images more reproducible and less time consuming and it enables the extraction of novel digital biomarkers from tissue images.\n",
2424
"\n",
2525
"This tutorial provides a **15-minute** introduction to the way slide microscopy data is organized within the IDC and demonstrates how to **examine available data and build a data cohort** for further analysis.\n",
26-
"It also provides an example of the **application of deep learning (DL)** to computational pathology analysis. In particular, a pre-trained model made available in the context of the publication [The NCI Imaging Data Commons as a platform for reproducible research in computational pathology](https://doi.org/10.1016/j.cmpb.2023.107839) is used to **classify lung tissue regions** into either non-neoplastic (normal), adenocarcinoma (LUAD), or squamous cell carcinoma (LSCC) tissue. A quite similar notebook was part of a [tutorial given at ISBI 2024](https://github.com/ImagingDataCommons/IDC-Tutorials/wiki/ISBI2024) and can be found [here](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/labs/idc_isbi2024.ipynb).\n",
26+
"It also provides an example of the **application of deep learning (DL)** to computational pathology analysis. In particular, a pre-trained model made available in the context of the publication [The NCI Imaging Data Commons as a platform for reproducible research in computational pathology](https://doi.org/10.1016/j.cmpb.2023.107839) is used to **classify lung tissue regions** into either non-neoplastic (normal), adenocarcinoma (LUAD), or squamous cell carcinoma (LSCC) tissue. A quite similar notebook with an acompanying slide deck was part of a tutorial given at ISBI 2024 and can be found [here](https://github.com/ImagingDataCommons/IDC-Tutorials/wiki/ISBI2024). \n",
2727
"\n",
2828
"For a more comprehensive tutorial including training of a tissue classification model on IDC-hosted slide microscopy data, see the Github repository [idc-comppath-reproducibility](https://github.com/ImagingDataCommons/idc-comppath-reproducibility). \n",
2929
" \n",
@@ -183,28 +183,23 @@
183183
},
184184
{
185185
"cell_type": "markdown",
186-
"metadata": {
187-
"id": "Px14ELwJXj9q"
188-
},
186+
"metadata": {},
189187
"source": [
190-
"# Determinism\n",
191-
"Even in smaller experiments it is useful to ensure reproducibility of operations. An important part to this is setting random seeds and enabling TensorFlow operations to be deterministic.\n",
192-
"\n",
193-
"To learn more about reproducible computational pathology workflows using the IDC, you can read our publication [The NCI Imaging Data Commons as a platform for reproducible research in computational pathology](https://doi.org/10.1016/j.cmpb.2023.107839)."
188+
"# Part 1: Introduction to whole slide imaging with DICOM"
194189
]
195190
},
196191
{
197-
"cell_type": "code",
198-
"execution_count": 5,
199-
"metadata": {
200-
"id": "wlRZ3mjuXkIB"
201-
},
202-
"outputs": [],
192+
"cell_type": "markdown",
193+
"metadata": {},
194+
"source": [
195+
"The IDC uses the [DICOM standard](https://www.dicomstandard.org/) for data representation. As shown in the figure below, a WSI pyramid corresponds to a `Series` of DICOM (SOP) instances, that each represent the slide at a different resolution. Each DICOM instance is stored as a separate DICOM file. Multiple DICOM `Series` can be organized in a DICOM `Study`, and similarly multiple DICOM `Studies` can belong to one `Patient`. "
196+
]
197+
},
198+
{
199+
"cell_type": "markdown",
200+
"metadata": {},
203201
"source": [
204-
"random.seed(0)\n",
205-
"np.random.seed(0)\n",
206-
"tf.random.set_seed(0)\n",
207-
"tf.config.experimental.enable_op_determinism()"
202+
"<img src=\"DICOM_WSI_overview.png\" alt=\"Description\" width=\"1000\"/>"
208203
]
209204
},
210205
{
@@ -213,7 +208,7 @@
213208
"id": "OaLcUqjY39FU"
214209
},
215210
"source": [
216-
"# Part 1: How to search whole-slide data on IDC"
211+
"# Part 2: How to search whole slide data on IDC"
217212
]
218213
},
219214
{
@@ -224,7 +219,7 @@
224219
"source": [
225220
"For most computational pathology experiments, the first step is to select a cohort of WSIs by filtering for the desired metadata attributes.\n",
226221
"\n",
227-
"The IDC uses the [DICOM standard](https://www.dicomstandard.org/) for data representation. Here, a WSI corresponds to a `Series` of DICOM image objects, each representing the slide at a different resolution. Each DICOM object is stored as a separate DICOM file. Cohort selection is done easiest by executing SQL-like statements using the Python package `idc-index` against different index tables, which list all available DICOM `Series` or files (rows) with the corresponding available metadata atrributes (columns). For getting started with the idc-index, we refer to this [introductory notebook](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part2_searching_basics.ipynb).\n",
222+
"Cohort selection is done easiest by executing SQL-like statements using the Python package `idc-index` against different index tables, which list all available DICOM `Series` or files (rows) with the corresponding available metadata atrributes (columns). For getting started with the idc-index, we refer to this [introductory notebook](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part2_searching_basics.ipynb).\n",
228223
"\n",
229224
"If you can't find your attribute of interest in the idc-index, check out [this notebook](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part3_exploring_cohorts.ipynb) to learn how to search **all** of the metadata accompanying IDC using Google BigQuery.\n",
230225
"\n",

0 commit comments

Comments
 (0)