c# - Is data “lost” when using binary data in a string?

后端 未结 9 1754
佛祖请我去吃肉
佛祖请我去吃肉 2021-01-16 13:58

I\'ve tried reading a JPG file using the StreamReader class\' ReadToEnd() method which returns a string.

For some reason though, when I wri

相关标签:
9条回答
  • 2021-01-16 14:28

    I notice that nobody has answered the actual questions.

    Is something lost when reading data into a string?

    A JPEG file contains a picture rather than words. This bicture has a binary representation as a sequence of bytes. Some of those bytes have the value 0x00 also represented as NUL. In a string, a byte containing with this value is interpreted as marking the end of string. Data past the end of string is treated as unused buffer and is ignored.

    When you write the string out to a file, nothing past the first NUL is included. As a result, the file is not a complete binary image and is rejected by the validation logic of software trying to interpret it as a JPEG.

    So data generally is lost when you load a string with non-textual data. The problem here is that you have effectively made an invalid typecast, but neither compiler nor the runtime has stopped you, and the result is data corruption.

    What is it really good for?

    Several thing. As others have said, strings are designed to contain text. In .NET, strings support encodings other than plain old ASCII. There is also extensive support for text manipulation. Look up format specifiers in help for a spectacular example of string manipulation.

    Why do C# strings use NUL for end of string?

    This is a legacy thing. NUL isn't much good for anything else and doing so simplifies marshalling strings in and out of managed code. BSTR does the same thing for the same reasons.

    0 讨论(0)
  • 2021-01-16 14:30

    Strings use Unicode encoding by default, unicode uses the NUL character as a control character, double NUL is used for termination, a single NUL is used to represent an ASCII character before it.

    Its for this reason that binary data cannot be loaded into a string.

    0 讨论(0)
  • 2021-01-16 14:36

    String is designed for holding unicode characters; not binary. For binary, use a byte[] or Stream. Or an Image etc for more specialized image handling.

    Despite the name, StreamReader is actually a specialized TextReader - i.e. it is a TextReader that reads from a Stream. Images are not text, so this isn't the right option.

    0 讨论(0)
  • 2021-01-16 14:36

    You just can't do it this way.... Use FileStream instead.

    You cant use string to read binary files, some characters won't make its way as far as I know.

    0 讨论(0)
  • 2021-01-16 14:39

    As all Real Programmers know, the only useful data structure is the Array. Strings, Lists, Structures, Sets-- these are all special cases of arrays and can be treated that way just as easily without messing up your programming language with all sorts of complications. The worst thing about fancy data types is that you have to declare them, and Real Programming Languages, as we all know, have implicit typing based on the first letter of the (six character) variable name.

    Besides, the determined Real Programmer can write Fortran programs in any language.


    Whoever modded this down clearly has either no sense of humour or no knowledge of folklore. The above is excerpted from a very famous 1983 letter to the editor of Datamation, by Ed Post of Tektronix. The letter is titled Real Programmers Don't Use Pascal.

    0 讨论(0)
  • 2021-01-16 14:42

    Unfortunately there is a serious problem with class names in System.IO namespace. StreamReader is designed to read\write from\to text files. You should use FileStream for binary files as @goodwill suggested

    0 讨论(0)
提交回复
热议问题