Benjamin

Reputation: 10393

Count of bytes vs Count of Chars

Some APIs require a count of characters (cch).

// Why did they choose cch for these functions?
HRESULT StringCchCopyW(
  __out  LPWSTR pszDest,
  __in   size_t cchDest,
  __in   LPCWSTR pszSrc
);

errno_t wcscpy_s(
   wchar_t *strDestination,
   size_t numberOfElements,
   const wchar_t *strSource 
);

DWORD WINAPI GetCurrentDirectoryW(
  __in   DWORD nBufferLength, // Count of Chars
  __out  LPWSTR lpBuffer
);  

And some APIs require a count of bytes (cb).

// Which do you prefer: the cch functions or the cb functions?
// Are the cch functions usually the more useful ones?
HRESULT StringCbCopyW(
  __out  LPWSTR pszDest,
  __in   size_t cbDest,
  __in   LPCWSTR pszSrc
);

BOOL WINAPI ReadFile(
  __in         HANDLE hFile,
  __out        LPVOID lpBuffer,
  __in         DWORD nNumberOfBytesToRead,
  __out_opt    LPDWORD lpNumberOfBytesRead,
  __inout_opt  LPOVERLAPPED lpOverlapped
);

// Why did they choose cb for these structures?
// Because some APIs use cb, I always have to check MSDN.
typedef struct _LSA_UNICODE_STRING {
  USHORT Length; // Count of bytes.
  USHORT MaximumLength; // Count of bytes.
  PWSTR  Buffer;
} UNICODE_STRING, *PUNICODE_STRING;

typedef struct _FILE_RENAME_INFO {
  BOOL   ReplaceIfExists;
  HANDLE RootDirectory;
  DWORD  FileNameLength; // Count of bytes.
  WCHAR  FileName[1];
} FILE_RENAME_INFO, *PFILE_RENAME_INFO;
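
For example, this is roughly the code I end up writing (just a sketch; the helper name is made up), and I have to remember that FileNameLength is in bytes, not characters:

#include <windows.h>
#include <wchar.h>
#include <string.h>

// Illustrative helper (not a real API). Assumes the caller already allocated
// 'info' with enough room after the struct to hold the name.
void FillRenameInfo(FILE_RENAME_INFO *info, LPCWSTR newName)
{
    info->ReplaceIfExists = TRUE;
    info->RootDirectory   = NULL;
    info->FileNameLength  = (DWORD)(wcslen(newName) * sizeof(WCHAR)); // cb, not cch!
    memcpy(info->FileName, newName, info->FileNameLength);
}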

When you design a function or a data structure, how do you decide between cb and cch, and why?
What should I know about this in order to design a better API for callers?

Upvotes: 0

Views: 2681

Answers (2)

Mac

Reputation: 14791

If you notice, the first group of functions you mention are all ASCII functions, and so in that case there is no difference - the count of bytes is the count of characters. That is because (generally, anyway) a single ASCII character is exactly one byte in size.

The second group are Unicode functions/structs. In this case, the characters are not guaranteed to be only a single byte in size: in UTF-16 they'll be two bytes wide, in UTF-32 they'll be four, and in UTF-8 they'll (typically) be anywhere from one to four bytes wide.

Particularly in the case of UTF-8 data, when you create a buffer you usually set aside a certain number of bytes, which, depending on character sizes, could correspond to quite a range of character counts. I'm not overly familiar with most of the functions/structs you've presented, but it wouldn't surprise me if that has something to do with it.

To answer your question: if you're working with ASCII, you can use either approach, as it makes no difference. If you're working with variable-length encodings, however (such as UTF-8), which one you use depends on whether you're interested in just the characters involved or whether you also need to take their encoding into account.
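
To put rough numbers on that (a standalone sketch, not tied to any of the APIs above), the same four-character word gives different byte counts depending on the encoding, which is exactly why a count of bytes and a count of characters can disagree:

#include <stdio.h>
#include <string.h>
#include <wchar.h>

int main(void)
{
    // "café": 4 characters, but the byte count depends on the encoding.
    const char    *utf8 = "caf\xC3\xA9";   // UTF-8: 5 bytes
    const wchar_t *wide = L"caf\xE9";      // UTF-16 on Windows: 4 wchar_t, 8 bytes

    printf("UTF-8 : %u bytes\n", (unsigned)strlen(utf8));
    printf("UTF-16: %u chars, %u bytes\n",
           (unsigned)wcslen(wide), (unsigned)(wcslen(wide) * sizeof(wchar_t)));
    return 0;
}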

Upvotes: 0

user541686

Reputation: 210445

If the data returned is a string, you should return the count of chars, since the number of bytes is often useless. But if it's generic binary data (and not specifically a string), then obviously the number of chars doesn't make any sense, so use the number of bytes.
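
To make that concrete, here is a rough sketch (the two wrapper names are made up; only StringCchCopyW and ReadFile are real APIs):

#include <windows.h>
#include <strsafe.h>

// String result: the buffer size is a count of characters (cch).
HRESULT CopyNameToBuffer(LPCWSTR pszName, LPWSTR pszDest, size_t cchDest)
{
    return StringCchCopyW(pszDest, cchDest, pszName);      // cch: number of WCHARs in pszDest
}

// Raw binary result: the buffer size is a count of bytes (cb).
BOOL ReadBlob(HANDLE hFile, BYTE *pbDest, DWORD cbDest, DWORD *pcbRead)
{
    return ReadFile(hFile, pbDest, cbDest, pcbRead, NULL);  // cb: number of bytes in pbDest
}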

As to why:

I believe the reason LSA_UNICODE_STRING holds a number of bytes is that it's meant to be compatible with UNICODE_STRING, which in turn is used by NtCreateFile. But NtCreateFile accepts a FILE_OPEN_BY_FILE_ID flag that actually treats the UNICODE_STRING as pointing to a LONGLONG value, not a string... so the number of bytes made more sense there, although I'd say it was overall a poor design:

FILE_OPEN_BY_FILE_ID: The file name that is specified by the ObjectAttributes parameter includes the 8-byte file reference number for the file.
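
Either way, it means that whenever you fill a UNICODE_STRING yourself you have to do the cch-to-cb conversion by hand. A sketch (the real RtlInitUnicodeString does essentially this for you):

#include <windows.h>
#include <winternl.h>   // UNICODE_STRING
#include <wchar.h>

// Sketch of initializing a UNICODE_STRING: both length fields are byte counts.
void InitUnicodeStringByHand(UNICODE_STRING *us, PWSTR s)
{
    size_t cch = wcslen(s);
    us->Buffer        = s;
    us->Length        = (USHORT)(cch * sizeof(WCHAR));         // cb, without the NUL
    us->MaximumLength = (USHORT)((cch + 1) * sizeof(WCHAR));   // cb, including the NUL
}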

Upvotes: 3
