Reputation: 4876
I am building a table for tracking the history of particular objects in my database. Currently I have the following columns:
HistoryId int IDENTITY(1,1) NOT NULL
HistoryDate datetimeoffset(7) NOT NULL
HistoryTypeId int NOT NULL
HistoryDetails nvarchar(max) NULL
For the most part each history item will be self-explanatory through the HistoryTypeId so HistoryDetails will either be Null or very small. But for a couple of History Types, the details data will be large. Is it ok to go with nvarchar(max) for all of the records or should I break it apart and have an extra column for the History Types that will require more than 64 characters (see below)? A rough estimate is that 80%-90% of records will not require more than 64 characters of detail information and there will be millions of records in the table.
HistoryId int IDENTITY(1,1) NOT NULL
HistoryDate datetimeoffset(7) NOT NULL
HistoryTypeId int NOT NULL
HistoryDetails nvarchar(64) NULL
HistoryDetailsMore nvarchar(max) NULL
Upvotes: 4
Views: 3356
Reputation: 425411
You cannot make NVARCHAR(MAX) part of a key in a plain B-tree index (you can still use it as an included column in an index).
Otherwise, the storage will be the same as long as the data in the column does not exceed the row size threshold.
Since you're probably not going to index this field anyway, it's a good idea to create it as NVARCHAR(MAX).
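As a rough sketch of that single-column design (the table and constraint names here are assumptions, based on the column list in the question):

```sql
-- Hypothetical DDL for the NVARCHAR(MAX)-only design.
-- Short values are stored in-row, so they cost the same as they
-- would in an NVARCHAR(64) column.
CREATE TABLE History
(
    HistoryId      int IDENTITY(1,1) NOT NULL
        CONSTRAINT PK_History PRIMARY KEY,
    HistoryDate    datetimeoffset(7) NOT NULL,
    HistoryTypeId  int NOT NULL,
    HistoryDetails nvarchar(max) NULL
);
```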
Even if you still want to index it (say, to do prefix searches using LIKE), you can create a computed NVARCHAR(450) column, create an index on that column, and add it to your queries for coarse filtering.
See this entry in my blog for more details.
If you are going to do exact searches for the small columns only, create a computed column, index it and query like this:
ALTER TABLE History ADD HistoryDetailsIndex AS SUBSTRING(HistoryDetails, 1, 50)
CREATE INDEX ix_mytable_typeid_details ON History (HistoryTypeId, HistoryDetailsIndex) INCLUDE (HistoryDetails)
SELECT COUNT(*)
FROM History
WHERE HistoryTypeId = 123
AND HistoryDetailsIndex LIKE 'string_prefix_up_to_50_characters%'
AND HistoryDetails = 'string_prefix_up_to_50_characters_plus_everything_after_it'
This will include only the first 50 characters of your HistoryDetails in the index key (which is what the LIKE condition searches), and the full value in the included column.
If you are absolutely sure you will never search for a string longer than 50 characters, you can omit the included column and just use this:
SELECT COUNT(*)
FROM History
WHERE HistoryTypeId = 123
AND HistoryDetailsIndex = 'string_prefix_up_to_50_characters'
This will make the index shorter. However, this query will find nothing if you provide a string longer than 50 characters, so use it only if you are absolutely sure you will never search for long strings.
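A sketch of the narrower index this variant relies on, reusing the column and naming convention from the earlier example (the index name is an assumption):

```sql
-- Key covers only the computed 50-character prefix column;
-- no INCLUDE clause, so the index stays smaller, but equality
-- searches are limited to strings of at most 50 characters.
CREATE INDEX ix_history_typeid_details_short
    ON History (HistoryTypeId, HistoryDetailsIndex);
```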
Upvotes: 5
Reputation: 1603
First, be aware that varchar(max) can store up to 2 GB. Behind the scenes, values that do not fit in the row are moved off-row to LOB (overflow) storage, much like the old text type, and reading them costs more processing than an in-row varchar(8000) or smaller.
If you store mostly small data in a varchar(max) column, the values are kept in-row and treated like a normal varchar; only once a value exceeds roughly 8,000 bytes is it pushed off-row and handled as a large object.
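You can see that a short value costs the same regardless of the declared length, and (if you prefer) force max-typed values off-row with a real table option. The table name here comes from the question:

```sql
-- DATALENGTH returns the storage size in bytes of the value itself,
-- which is identical whether the column is declared nvarchar(64)
-- or nvarchar(max) for short strings (2 bytes per character).
DECLARE @short nvarchar(max) = N'short detail text';
SELECT DATALENGTH(@short) AS bytes_used;  -- 34 bytes (17 characters)

-- Optionally store max-typed values off-row regardless of size:
EXEC sp_tableoption 'History', 'large value types out of row', 1;
```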
Is the column indexed, or do you want to index it? If so, steer clear of varchar(max).
I would just pick a higher limit, say varchar(255), and force users to fit your database design rather than the other way around.
Upvotes: 0
Reputation: 2206
Since you are using nvarchar, you are more than likely already paying the variable-length record overhead (unless SQL Server optimizes it away for small values). The space on disk for short records should not differ between nvarchar(64) and nvarchar(max): each value takes only as much space as its data needs. The declared length is normally just a constraint on the data, so if you don't want to constrain it, there is no storage penalty for nvarchar(max) that you aren't already paying.
Upvotes: 0