Are 16 bit images supported by Caffe? If not, how to implement support?

Question

Background information: I need to load some 16 bit grayscale PNGs.

Does Caffe support loading 16 bit images through the ImageDataLayer?

After some googling, the answer seems it doesn't. The ImageDataLayer relies on this io routine

cv::Mat ReadImageToCVMat(const string& filename,
    const int height, const int width, const bool is_color) {
  cv::Mat cv_img;
  int cv_read_flag = (is_color ? CV_LOAD_IMAGE_COLOR :
    CV_LOAD_IMAGE_GRAYSCALE);
  cv::Mat cv_img_origin = cv::imread(filename, cv_read_flag);
  if (!cv_img_origin.data) {
    LOG(ERROR) << "Could not open or find file " << filename;
    return cv_img_origin;
  }
  if (height > 0 && width > 0) {
    cv::resize(cv_img_origin, cv_img, cv::Size(width, height));
  } else {
    cv_img = cv_img_origin;
  }
  return cv_img;
}

Which uses opencv's cv::imread function. This function will read the input as 8bits unless the appropiate flag is set

CV_LOAD_IMAGE_ANYDEPTH - If set, return 16-bit/32-bit image when the input has the corresponding depth, otherwise convert it to 8-bit.

Simply adding the appropriate flag will not work because later in the code [io.cpp] they check for 8bit depth:

void CVMatToDatum(const cv::Mat& cv_img, Datum* datum) {
  CHECK(cv_img.depth() == CV_8U) << "Image data type must be unsigned byte";
... }

I could just remove the check but I'm afraid it's there for a reason and unpredictable results might happen. Can anybody shine light on this issue?

Dzugaru · Accepted Answer

You can patch ImageDataLayer to read 16bit images like this:

Add appropriate flag as you mentioned (io.cpp):

after

int cv_read_flag = (is_color ? CV_LOAD_IMAGE_COLOR :
    CV_LOAD_IMAGE_GRAYSCALE);

add

cv_read_flag |= CV_LOAD_IMAGE_ANYDEPTH;

Modify the check you mentioned (data_transformer.cpp):

this

CHECK(cv_img.depth() == CV_8U) << "Image data type must be unsigned byte";

becomes

CHECK(cv_img.depth() == CV_8U || cv_img.depth() == CV_16U) << "Image data type must be uint8 or uint16";
bool is16bit = cv_img.depth() == CV_16U;

Modify the way DataTransformer reads cv::Mat like this (same function below):

add pointer of uint16_t type to:

const uchar* ptr = cv_cropped_img.ptr(h);

like this

const uint16_t* ptr_16 = cv_cropped_img.ptr(h);

Then read using appropriate pointer:

Dtype pixel = static_cast(ptr[img_index++]);

becomes

Dtype pixel;
if(is16bit)
    pixel = static_cast(ptr_16[img_index++]);
else
    pixel = static_cast(ptr[img_index++]);

Are 16 bit images supported by Caffe? If not, how to implement support?

Answers (2)

Related Questions