Why can't I static_cast between char * and unsigned char *?
C++CharReinterpret CastStatic CastUnsigned CharC++ Problem Overview
Apparently the compiler considers them to be unrelated types and hence reinterpret_cast
is required. Why is this the rule?
C++ Solutions
Solution 1 - C++
They are completely different types see standard:
> 3.9.1 Fundamental types [basic.fundamental]
>
> 1 Objects declared as characters char) shall be large enough to
> store any member of the implementation's basic character set. If a
> character from this set is stored in a character object, the integral
> value of that character object is equal to the value of the single
> character literal form of that character. It is
> implementation-defined whether a char object can hold negative
> values. Characters can be explicitly declared unsigned or
> signed. Plain char, signed char, and unsigned char are
> three distinct types. A char, a signed char, and an unsigned char
> occupy the same amount of storage and have the same alignment
> requirements (basic.types); that is, they have the same object
> representation. For character types, all bits of the object
> representation participate in the value representation. For unsigned
> character types, all possible bit patterns of the value representation
> represent numbers. These requirements do not hold for other types. In
> any particular implementation, a plain char object can take on either
> the same values as a signed char or an unsigned char; which one is
> implementation-defined.
So analogous to this is also why the following fails:
unsigned int* a = new unsigned int(10);
int* b = static_cast<int*>(a); // error different types
a
and b
are completely different types, really what you are questioning is why is static_cast so restrictive when it can perform the following without problem
unsigned int a = new unsigned int(10);
int b = static_cast<int>(a); // OK but may result in loss of precision
and why can it not deduce that the target types are the same bit-field width and can be represented? It can do this for scalar types but for pointers, unless the target is derived from the source and you wish to perform a downcast then casting between pointers is not going to work.
Bjarne Stroustrop states why static_cast
's are useful in this link: http://www.stroustrup.com/bs_faq2.html#static-cast but in abbreviated form it is for the user to state clearly what their intentions are and to give the compiler the opportunity to check that what you are intending can be achieved, since static_cast
does not support casting between different pointer types then the compiler can catch this error to alert the user and if they really want to do this conversion they then should use reinterpret_cast
.
Solution 2 - C++
you're trying to convert unrelated pointers with a static_cast. That's not what static_cast is for. Here you can see: Type Casting.
With static_cast you can convert numerical data (e.g. char to unsigned char should work) or pointer to related classes (related by some inheritance). This is both not the case. You want to convert one unrelated pointer to another so you have to use reinterpret_cast.
Basically what you are trying to do is for the compiler the same as trying to convert a char * to a void *.
Ok, here some additional thoughts why allowing this is fundamentally wrong. static_cast can be used to convert numerical types into each other. So it is perfectly legal to write the following:
char x = 5;
unsigned char y = static_cast<unsigned char>(x);
what is also possible:
double d = 1.2;
int i = static_cast<int>(d);
If you look at this code in assembler you'll see that the second cast is not a mere re-interpretation of the bit pattern of d but instead some assembler instructions for conversions are inserted here.
Now if we extend this behavior to arrays, the case where simply a different way of interpreting the bit pattern is sufficient, it might work. But what about casting arrays of doubles to arrays of ints? That's where you either have to declare that you simple want a re-interpretation of the bit patterns - there's a mechanism for that called reinterpret_cast, or you must do some extra work. As you can see simple extending the static_cast for pointer / arrays is not sufficient since it needs to behave similar to static_casting single values of the types. This sometimes needs extra code and it is not clearly definable how this should be done for arrays. In your case - stopping at \0 - because it's the convention? This is not sufficient for non-string cases (number). What will happen if the size of the data-type changes (e.g. int vs. double on x86-32bit)?
The behavior you want can't be properly defined for all use-cases that's why it's not in the C++ standard. Otherwise you would have to remember things like: "i can cast this type to the other as long as they are of type integer, have the same width and ...". This way it's totally clear - either they are related CLASSES - then you can cast the pointers, or they are numerical types - then you can cast the values.
Solution 3 - C++
Aside from being pointers, unsigned char *
and char *
have nothing in common (EdChum already mentioned the fact that char
, signed char
and unsigned char
are three different types). You could say the same thing for Foo *
and Bar *
pointer types to any dissimilar structures.
static_cast
means that a pointer of the source type can be used as a pointer of the destination type, which requires a subtype relationship. Hence it cannot be used in the context of your question; what you need is either reinterpret_cast
which does exactly what you want or a C-style cast.