Overhead of varchar(max) columns with small data


As part of a bulk load of data from an external source, the staging table is defined with varchar(max) columns. The idea being that each column will be able to hold whateve


5 answers

离开以前

2021-01-17 10:52

The storage overhead is the same for varchar(n) and varchar(max). The storage size is the actual length of the data entered + 2 bytes.

MSDN Reference
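To see the "actual length of data" part of that claim for yourself, a minimal T-SQL sketch follows. The variable names and the sample string are made up for illustration. Note that DATALENGTH reports only the data bytes, so the extra 2 bytes of overhead do not show up in its result.

```sql
-- Hypothetical variables: the same short string held as varchar(50) and as varchar(max).
DECLARE @fixed varchar(50);
DECLARE @unbounded varchar(max);

SET @fixed = 'staging value';
SET @unbounded = 'staging value';

-- DATALENGTH returns the number of bytes used by the value itself.
-- Both columns of the result should report the same byte count,
-- regardless of the declared maximum length.
SELECT DATALENGTH(@fixed)     AS bytes_varchar_50,
       DATALENGTH(@unbounded) AS bytes_varchar_max;
```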

Check out these similar SO questions:

https://stackoverflow.com/questions/166371/varcharmax-versus-varcharn-in-ms-sql-server

Are there any disadvantages to always using nvarchar(MAX)?

耶瑟儿～

2021-01-17 10:57

As far as I know, the overhead that you are probably thinking about (storing the data out-of-row in the same way a TEXT or BINARY value is stored in SQL Server) only applies if the data size exceeds 8000 bytes. So there shouldn't be a problem using this with smaller columns for ETL processes.
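If you want to check whether any staged values actually cross that 8000-byte mark, something like the following should do it. This is only a sketch: dbo.StagingImport and RawValue are hypothetical names standing in for your own staging table and one of its varchar(max) columns.

```sql
-- Hypothetical table and column names; substitute your own staging objects.
SELECT MAX(DATALENGTH(RawValue)) AS longest_value_bytes,
       -- Count the values that are too large to be kept in-row.
       SUM(CASE WHEN DATALENGTH(RawValue) > 8000 THEN 1 ELSE 0 END) AS values_over_8000_bytes
FROM dbo.StagingImport;
```

If values_over_8000_bytes comes back as 0, none of the loaded values exceed the size this answer is concerned about.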

野的像风

2021-01-17 11:03

VARCHAR(MAX) column values will be stored in the table row, space permitting. So if you have a single VARCHAR(MAX) field and it's 200 or 300 bytes, chances are it'll be stored inline with the rest of your data. No problem or additional overhead here.

Only when the entire data of a single row can no longer fit on a single SQL Server page (8K) will SQL Server move VARCHAR(MAX) data into overflow pages.

So all in all, I think you get the best of both worlds - inline storage when possible, overflow storage when necessary.
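One way to check where the data for a table actually ended up is to look at its allocation units. The query below is a rough sketch using sys.dm_db_index_physical_stats; the table name dbo.StagingImport is hypothetical, and DETAILED mode scans every page, so it can be slow on a large table.

```sql
-- Hypothetical table name; run this in the database that holds the staging table.
SELECT index_id,
       alloc_unit_type_desc,   -- e.g. IN_ROW_DATA, ROW_OVERFLOW_DATA, LOB_DATA
       page_count,
       record_count
FROM sys.dm_db_index_physical_stats(
         DB_ID(),
         OBJECT_ID(N'dbo.StagingImport'),
         NULL,        -- all indexes (or the heap)
         NULL,        -- all partitions
         'DETAILED')
ORDER BY index_id, alloc_unit_type_desc;
```

If only the IN_ROW_DATA allocation unit shows any pages, the values are being kept inline with the rest of the row.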

Marc

PS: As Mitch points out, this default behaviour can be turned off - I don't see any compelling reasons to do so, however....
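For reference, the switch in question is presumably the 'large value types out of row' table option, set through sp_tableoption. A minimal sketch, again assuming a hypothetical table named dbo.StagingImport:

```sql
-- Hypothetical table name. 1 = always store varchar(max) values out of row.
EXEC sp_tableoption N'dbo.StagingImport', 'large value types out of row', 1;

-- Check the current setting (1 = out of row, 0 = in row when the value fits).
SELECT name, large_value_types_out_of_row
FROM sys.tables
WHERE name = N'StagingImport';

-- Back to the default: keep values in the row when they fit.
EXEC sp_tableoption N'dbo.StagingImport', 'large value types out of row', 0;
```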

滥情空心

2021-01-17 11:03

Well, I want to say that there shouldn't be that big an overhead, because I don't think that SQL Server automatically assigns an allotted amount of space for nvarchar; instead it only allots what is needed for what is inserted. But I don't have anything to prove or back up that idea.

面向向阳花

2021-01-17 11:13

If you use a varchar(max) or varbinary(max) in MSSQL2005, SSIS creates a temporary file for each column in your record. This can drop your performance and become a big problem. MS claims that they solved this issue in MSSQL2008.
