You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+32-9Lines changed: 32 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -4,7 +4,7 @@ This repository contains tutorial materials (for the most part, as Python notebo
4
4
5
5
If this is the first time you hear about IDC, you may want to check out our [Getting Started documentation page](https://learn.canceridc.dev/getting-started-with-idc). Here are some highlights about what IDC has to offer:
6
6
7
-
***>93 TB of data**: IDC contains radiology, brightfield (H&E) and fluorescence slide microscopy images, along with image-derived data (annotations, segmentations, quantitative measurements) and accompanying clinical data
7
+
***~100 TB of data**: IDC contains radiology, brightfield (H&E) and fluorescence slide microscopy images, along with image-derived data (annotations, segmentations, quantitative measurements) and accompanying clinical data
8
8
9
9
***free**: all of the data in IDC is publicly available: no registration, no access requests
10
10
@@ -16,22 +16,40 @@ If this is the first time you hear about IDC, you may want to check out our [Get
16
16
17
17
The tutorial notebooks are located in the [notebooks](https://github.com/ImagingDataCommons/IDC-Tutorials/tree/master/notebooks), and are organized in the following folders.
18
18
19
+
## Recommended Learning Paths
20
+
21
+
**Beginner (no GCP account needed):**
22
+
1.[Basics of using IDC data programmatically](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part2_searching_basics.ipynb) - start here!
3.[BigQuery prerequisites and setup](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part1_prerequisites.ipynb) - set up Google Cloud for BigQuery access
27
+
4.[Advanced searching using BigQuery](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part3_exploring_cohorts.ipynb)
28
+
5.[Working with IDC clinical data](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/clinical_data_intro.ipynb)
29
+
30
+
**Domain-specific paths:**
31
+
***Radiology**: [NLST exploration](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/collections_demos/nlst_exploration.ipynb) → [TotalSegmentator features](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/collections_demos/TotalSegmentator_CT_Segmentations_features_extraction.ipynb) → [MedSAM on IDC](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/analysis/MedSAM_with_IDC.ipynb)
32
+
***Digital Pathology**: [Getting started with digital pathology](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/getting_started_with_digital_pathology.ipynb) → [Slide microscopy metadata](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/slide_microscopy_metadata_search.ipynb) → [DICOM annotations](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/microscopy_dicom_ann_intro.ipynb)
"Getting Started" python notebooks are intended to introduce the users to IDC.
37
+
"Getting Started" python notebooks are intended to introduce the users to IDC.
22
38
23
-
*[Basics of using IDC data programmatically](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part2_searching_basics.ipynb): learn how to use `idc-index` python package to programmatically search and download IDC data, visualize images and annotations, build cohorts and checking acknowledgments and liceses for the data included in your cohort.
39
+
*[Basics of using IDC data programmatically](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part2_searching_basics.ipynb): learn how to use `idc-index` python package to programmatically search and download IDC data, visualize images and annotations, build cohorts and checking acknowledgments and licenses for the data included in your cohort.
24
40
*[Searching clinical data](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/exploring_clinical_data.ipynb): identify clinical and other non-imaging data accompanying imaging collections in IDC using `idc-index` python package and `duckdb`.
41
+
*[BigQuery prerequisites and setup](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part1_prerequisites.ipynb): set up the prerequisites for working with IDC data via Google BigQuery. This notebook is only needed if you plan to use BigQuery for advanced metadata queries (Part 3 and beyond).
25
42
*[Advanced searching using BigQuery](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/getting_started/part3_exploring_cohorts.ipynb): access all of the metadata to build comprehensive queries and detailed cohort selection criteria.
Notebooks in this folder focus on topics that will require understanding of the basics, and aim to address more narrow use cases of IDC usage.
46
+
Notebooks in this folder focus on topics that will require understanding of the basics, and aim to address more narrow use cases of IDC usage.
30
47
31
48
*[Searching DICOM private tags](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/dicom_private_tags_intro.ipynb): all of DICOM attributes for the imaging data in IDC are searchable using BigQuery. DICOM private tags often contain critical information, such as diffusion b-values, but are a bit more tricky to access from BigQuery. In this tutorial you will learn how to accomplish this.
32
49
*[Working with IDC clinical data](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/clinical_data_intro.ipynb): this tutorial will help you become familiar with how clinical and other tabular data accompanying images in IDC is organized, and how you can use it to build cohorts.
33
50
*[Direct access to tags/frames from GCS/AWS buckets](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/gcs_aws_direct_access.ipynb): learn how to access individual frames or tags of large DICOM files from the bucket without having to download the entire file (this notebook accompanies documentation article here: https://learn.canceridc.dev/data/downloading-data/direct-loading)
34
51
*[Using DICOMweb to access IDC data](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/idc_dicomweb_access.ipynb): both metadata and pixel data can be access using DICOMweb, which is particularly important while working with digital pathology, as it enables granular access to the individual pyramid tiles (frames)
52
+
*[IDC on AWS](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/advanced_topics/idc_on_aws/idc-on-aws-tutorial.ipynb): learn how to work with IDC data using AWS services, including AWS HealthImaging.
*[Setting up Google Healthcare DICOM store](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/viewers_deployment/Creating_Google_Healthcare_DICOM_store.ipynb): once you have your viewers deployed, you can use this tutorial to create a DICOM store with your data, which you can then access from the viewers deployed
This folders contains notebooks that demonstrate the usage of the data in the specific IDC collections. The notebooks in this folder will always have the prefix of the `collection_id` they correspond to, for easier navigation.
64
+
This folder contains notebooks that demonstrate the usage of the data in the specific IDC collections. The notebooks in this folder will always have the prefix of the `collection_id` they correspond to, for easier navigation.
47
65
48
66
*[Using hiplot for exploring prostate MRI in IDC](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/collections_demos/prostate-MRI_hiplot_experiments.ipynb): this notebook demonstrates how [`hiplot`](https://facebookresearch.github.io/hiplot/), an open source package for high-dimensional parameter visualization, for examining various MRI acquisition parameters for the prostate MRI images available in IDC.
49
67
*[Visible Human Project exploration](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/collections_demos/nlm_visible_human_project.ipynb): demonstration of searching and visualizing images from the National Library of Medicine Visible Human Project available on IDC.
@@ -56,25 +74,30 @@ This folders contains notebooks that demonstrate the usage of the data in the sp
56
74
This folder is dedicated to the notebooks focused on the digital pathology (pathomics) applications. The use of DICOM standard is relatively new in digital pathology, and this field is being actively developed, thus a dedicated folder for this.
57
75
*[Getting started with pathology images in IDC](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/getting_started_with_digital_pathology.ipynb): all of the pathology images in IDC are in DICOM Slide Microscopy format; this notebook will help you get started with using this representation and also searching IDC pathology images.
58
76
*[Exploring IDC slide microscopy images metadata](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/slide_microscopy_metadata_search.ipynb): introduction to the key metadata accompanying IDC slide microscopy images that can be used for subsetting data and building cohorts.
59
-
*[Working with DICOM Microscopy Bulk Simple annotations](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/Tutorial_MicroscopyBulkSimpleAnnotations.ipynb): introduction to working with bulk annotations encoded in DICOM using the example of nuclei annotations.
77
+
*[Working with DICOM Microscopy Bulk Simple annotations](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/pathomics/microscopy_dicom_ann_intro.ipynb): introduction to working with bulk annotations encoded in DICOM using the example of nuclei annotations.
Demonstrations/examples of analyses of images from IDC.
64
82
*[MedSAM on IDC](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/analysis/MedSAM_with_IDC.ipynb): learn how to experiment with MedSAM on the images available from IDC.
65
83
*[MHub.ai with IDC data](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/analysis/mhubai_tutorial.ipynb): [MHub.ai](https://mhub.ai) is a platform for Deep Learning models in medical imaging, which are interoperable with IDC and can be applied directly to the IDC DICOM images. Learn how to get started from this notebook!
*[How to use the IDC V2 API](https://github.com/ImagingDataCommons/IDC-Tutorials/blob/master/notebooks/idc_api/How_to_use_the_IDC_V2_API.ipynb): introduction to the IDC REST API v2 for programmatic access to IDC data, including cohort creation, manifest generation, and data download.
Here you will find an archive of the notebooks that were used in tutorials, which at times may demonstrate experimental features. By design, the notebooks presented at specific events may not be updated after the event, and are stored in this folder for archival purposes.
IDC is an actively evolving resource. As we develop new and improved capabilities, we improve our recommended usage practices, and may deprecate notebooks that are no longer maintained and may no longer work. You will find thse in the `deprecated` folder.
96
+
IDC is an actively evolving resource. As we develop new and improved capabilities, we improve our recommended usage practices, and may deprecate notebooks that are no longer maintained and may no longer work. You will find these in the `deprecated` folder.
74
97
75
98
## `testing`
76
99
77
-
This directory is used for the maintenance of the repository to support testing of the actively supported notebooks.
100
+
This directory is used for the maintenance of the repository to support testing of the actively supported notebooks.
0 commit comments