I just start learning C and found some confusion about the string pointer and string(array of char). Can anyone help me to clear my head a bit?
// source code
ch
You can initialize an array when you declare it, like this:
int n[5] = { 0, 1, 2, 3, 4 };
char c[5] = { 'd', 'a', 't', 'a', '\0' };
But since we typically treat char arrays as strings, C allows a special case:
char c[5] = "data"; // Terminating null character is added.
However, once you've declared an array, you can't reassign it. Why? Consider an assignment like
char *my_str = "foo"; // Declare and initialize a char pointer.
my_str = "bar"; // Change its value.
The first line declares a char pointer and "aims" it at the first letter in foo
. Since foo
is a string constant, it resides somewhere in memory with all the other constants. When you reassign the pointer, you're assigning a new value to it: the address of bar
. But the original string, foo
remains unchanged. You've moved the pointer, but haven't altered the data.
When you declare an array, however, you aren't declaring a pointer at all. You're reserving a certain amount of memory and giving it a name. So the line
char c[5] = "data";
starts with the string constant data
, then allocates 5 new bytes, calls them c
, and copies the string into them. You can access the elements of the array exactly as if you'd declared a pointer to them; arrays and pointers are (for most purposes) interchangeable in that way.
But since arrays are not pointers, you cannot reassign them.
You can't make c
"point" anywhere else, because it's not a pointer; it's the name of an area of memory.
You can change the value of the string, either one character at a time, or by calling a function like strcpy()
:
c[3] = 'e'; // Now c = "date", or 'd', 'a', 't', 'e', '\0'
strcpy(c, "hi"); // Now c = 'h', 'i', '\0', 'e', '\0'
strcpy(c, "too long!") // Error: overflows into memory we don't own.
Efficiency Tip
Note, also, that initializing an array generally makes a copy of the data: the original string is copied from the constant area to the data area, where your program can change it. Of course, this means you're using twice as much memory as you may have expected. You can avoid the copy and generally save memory by declaring a pointer instead. That leaves the string in the constant area and allocates only enough memory for a pointer, regardless of the length of the string.