AI System Documentation                          Introduction

This site provides general information on the AI research servers for CT image analysis.

The main documents are accessed by clicking on the links in the sidebar to the left.

Sidebar Documents

Name               Description

Conventions        Basic conventions for labeling and image indexing.

Medical Images     The three image formats for 3D CT images.

Image Access       Curated image datasets are best accessed with the vdimage command, which translates between image-codes, DICOM series-ids, and file-system image locations.

Image Analysis     The system-standard image analysis algorithms that can be applied to curated image datasets.

Docker Access      Information on how to run image analysis algorithms on the system that involve Docker containers.

Servers            Relevant server details.

Dataset Curation   Image datasets are curated to make all dataset images conveniently available.

Vsimba             A tutorial on how to use the online image viewer.

 

Location

The CT image datasets and v4 software are installed on all research servers.

The remainder of this introduction provides general background information on the system.

 

Accessing Research dataset images

Image Referencing

A unique naming system is used for every image (CT scan) in the research datasets. While the DICOM unique series identifier (DSID) is often used for this purpose, the system used here is much shorter, directly provides dataset and case identification, and also covers image datasets in which the DSID is not available.

Image Reference Format: the image-code

The syntax for an image-code that identifies an image within a curated image dataset is:

 <dataset-id><case>-<image>

Where:

<dataset-id> is a one- or two-character identifier for the research dataset

<case> is the dataset case identifier

<image> is a unique identifier for each image within a case

For example, the code “NL00023-s1” refers to dataset NL (NLST_Test), case 00023, and the image labeled s1 within that case.
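The image-code syntax above can be split into its three parts with a short parser. The following is a minimal Python sketch, not part of the system software. It assumes that the dataset-id consists of one or two uppercase letters, that the case identifier is purely numeric, and that the image label contains no whitespace; none of these constraints is stated by the documentation, and the names ImageCode and parse_image_code are hypothetical.

```python
import re
from typing import NamedTuple


class ImageCode(NamedTuple):
    """The three parts of an image-code: <dataset-id><case>-<image>."""
    dataset_id: str
    case: str
    image: str


# Assumptions (not confirmed by the documentation):
#   - dataset-id is one or two uppercase letters
#   - case is a run of digits
#   - image is any non-empty label without whitespace
_IMAGE_CODE = re.compile(r"^([A-Z]{1,2})(\d+)-(\S+)$")


def parse_image_code(code: str) -> ImageCode:
    """Split an image-code into dataset-id, case, and image label."""
    match = _IMAGE_CODE.match(code.strip())
    if match is None:
        raise ValueError(f"not a valid image-code: {code!r}")
    return ImageCode(match.group(1), match.group(2), match.group(3))


if __name__ == "__main__":
    parsed = parse_image_code("NL00023-s1")
    print(parsed.dataset_id, parsed.case, parsed.image)
```

Keeping the case identifier as a string preserves its leading zeros, which would be lost if it were converted to an integer.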

Accessing Dataset Image data

The system command vdimage is used for image access. The basic function of vdimage is to determine the full image path name when provided with an image-code. For dataset images that have DICOM unique series identifiers, vdimage can also do a reverse look-up and return the image-code associated with a given series-id.

Curated Research Datasets

The following image datasets are currently available on the research servers:

Dataset Code   Cases   Images   Dataset Contents

W              50      50       The VIA-ELCAP public nodule dataset. Cases have all lung nodules documented.

NT             2493    6715     The NLST_Test dataset. This contains images selected from the NLST dataset to test the Sybil cancer prediction model.

UN             3405    40698    Pamplona image dataset up to 2015.