Could somebody point me in the right direction of how I could read a binary file that is defined by a C struct? It has a few #define inside of the struct, which makes me thing t
There are some bad ideas and good ideas:
int
, long
, short
, etc, type in C/C++. You can use stuff like int16_t
, but these are available only on modern compilers.read
syscall and parsing a buffer then — or vice versa, reading whole thing at once instead of more granular, lazy reads where it's applicable).There are several tools available to do that:
You have to find out the endiannes of the machine where the file was written so you can interpret integers properly. Look out for ILP32 vs LP64 mismatch. The original structure packing/alignment might also be important.
Using C++ I/O library:
#include <fstream>
using namespace std;
ifstream ifs("file.dat", ios::binary);
Format f;
ifs.get(&f, sizeof f);
Using C I/O library:
#include <cstdio>
using namespace std;
FILE *fin = fopen("file.dat", "rb");
Format f;
fread(&f, sizeof f, 1, fin);
You can also use unions to do this parsing if you have the data you want to parse already in memory.
union A {
char* buffer;
Format format;
};
A a;
a.buffer = stuff_you_want_to_parse;
// You can now access the members of the struct through the union.
if (a.format.str_name == "...")
// do stuff
Also remember that long could be different sizes on different platforms. If you are depending on long being a certain size, consider using the types defined int stdint.h such as uint32_t.
Reading a binary defined by a struct is easy.
Format myFormat;
fread(&myFormat, sizeof(Format), 1, fp);
the #defines don't affect the structure at all. (Inside is an odd place to put them, though).
However, this is not cross-platform safe. It is the simplest thing that will possibly work, in situations where you are assured the reader and writer are using the same platform.
The better way would be to re-define your structure as such:
struct Format {
Uint32 str_totalstrings; //assuming unsigned long was 32 bits on the writer.
Uint32 str_name;
unsigned char stuff[4];
};
and then have a 'platform_types.h" which typedefs Uint32 correctly for your compiler. Now you can read directly into the structure, but for endianness issues you still need to do something like this:
myFormat.str_totalstrings = FileToNative32(myFormat.str_totalstrings);
myFormat.str_name = FileToNative32(str_name);
where FileToNative is either a no-op or a byte reverser depending on platform.