AI System Documentation Introduction
This site provides general information on the AI research servers for CT image analysis.
The main documents are accessed by clicking on the links in the sidebar to the left.
Sidebar Documents
Name |
Description |
|
Basic
conventions for labeling and image indexing |
The three image formats for 3D CT images |
|
Curated image datasets are best
accessed by the vdimage command which translates
between image-codes. DICOM series-ids, and file-system image locations. |
|
|
The system standard image
analysis algorithms that can be applied to curated image datasets |
|
Information on how to run image
analysis algorithms on the system that involve dockers. |
|
Relevant Server details |
|
Image datasets are curated to make all dataset images conveniently available |
|
A tutorial on how to use the on-line image viewer. |
Location
The CT image datasets and v4 software is installed on all research servers.
The remainder of this introduction provides general background information on the system
Accessing Research dataset images
Image Referencing
A unique naming system is used for every image (CT scan) in the research datasets. While the DICOM unique series identifier (DSID) is often used for this, the system used here is much shorter, directly provides dataset and case identification and deals with image datasets in which the DSID is not available.
Image Reference Format the image-code
The syntax for an image-code that identifies an image within a curated image dataset is:
<dataset-id><case>-<image>
Where:
<dataset-id> is a one- or two-character identifier for for the research dataset
<case> is the dataset case identifier
<image> is a unique identifier for each image within a case
For example, the code “NL00023-s1” refers to the dataset NL (NLST_Test) dataset case 000023
and image labeled s1 within that case.
Accessing Dataset Image data
A system command vdimage is used for image access. The basic function of vdimage is to determine the full image path name when provided with an image-code. For data set images that have DICOM unique series identifiers, vdimage will do a reverse look-up and return the image code associated with that series id.
Curated Research Datasets
The following image datasets are currently available on the research servers
Dataset Code |
Cases |
Images |
Dataset Contents |
W |
50 |
50 |
The VIA-ELCAP public nodule dataset. cases have all lung nodules documented |
NT |
2493 |
6715 |
The NLST_Test dataset. This contains images selected from the NLST dataset to test the Sybil cancer prediction model |
UN |
3405 |
40698 |
Pamplona image dataset up to 2015. |