i am trying to construct a (test) WideString
of:
á (U+00E1 Small Letter Latin A with acute)
but using it\'s decomp
Did you try #$0301#$0061 (i.e. diacritic first)?
OK.
So #$.... only handles ASCII 8 bits constants in this version.
You can just use a workaround using memory level:
type
TWordArray = array[1..MaxInt div SizeOf(word)-2] of word;
// start at [1], just as WideStrings
// or: TWordArray = array[0..MaxInt div SizeOf(word)-1] of word;
PWordArray = ^TWordArray;
var
test: WideString;
begin
test := '12'; // or SetLength(test,2);
PWordArray(test)[1] := $61;
PWordArray(test)[2] := $301;
MessageBoxW(0, pointer(test), 'Character with diacratic', MB_ICONINFORMATION or MB_OK);
end;
This will always work since you don't play with chars/widechars and such.
And it will also work as expected with Unicode version of Delphi.
This works in Delphi 5/7:
var
test: WideString;
begin
test := WideChar($0061);
test := test + WideChar($0301);
MessageBoxW(0, PWideChar(test), 'Character with diacratic', MB_ICONINFORMATION or MB_OK);
end;
In short:
#$xxxx
form literals.#
doesn't seem to work as you'd expect for unicode literals.
You can't just add two or more widechars in a single expression, like this:
test := WideChar(a)+WideChar(b); // won't compile in D5/D7.
Best answer:
const
n: WideString = ''; //n=Nothing
s := n+#$0061+#$0301;
This fixes all cases i have below that otherwise fail.
The only variant that works is to declare it as a constant:
AccentAcute: WideString = #$0301;
AccentAcute: WideString = WideChar($0301);
AccentAcute: WideString = WideChar(#$0301);
AccentAcute: WideString = WideString(#$0301);
Sample Usage:
s := 'Pasta'+AccentAcute;
AccentAcute: WideString = $0301;
AccentAcute: WideString = #0301;
AccentAcute: WideString = WideString($0301);
AccentAcute: WideString = WideString(#$0301);
AccentAcute: WideChar = WideChar(#0301);
gives Pastai
AccentAcute: WideChar = WideChar($0301);
gives Pasta´
'Pasta'+WideChar($0301)
Pasta´
'Pasta'+#$0301
Pasta´
WideString('Pasta')+#$0301
Summary of all constant based syntaxes i found think up:
AccentAcute: WideString = #$0301; //works
AccentAcute: WideString = WideChar(#$0301); //works
AccentAcute: WideString = WideString(#$0301); //works
AccentAcute: WideString = $0301; //incompatble types
AccentAcute: WideString = WideChar($0301); //works
AccentAcute: WideString = WideString($0301); //invalid typecast
AccentAcute: WideChar = #$0301; //fails, gives Pasta´
AccentAcute: WideChar = WideChar(#$0301); //fails, gives Pasta´
AccentAcute: WideChar = WideString(#$0301); //incompatible types
AccentAcute: WideChar = $0301; //incompatible types
AccentAcute: WideChar = WideChar($0301); //fails, gives Pasta´
AccentAcute: WideChar = WideString($0301); //invalid typecast
WideChar
can work, as long as you only append to a variable//Works
t := '0123401234012340123';
t := t+WideChar(#$D840);
t := t+WideChar(#$DC00);
//fails
t := '0123401234012340123'+WideChar(#$D840);
t := t+WideChar(#$DC00);
//fails
t := '0123401234012340123'+WideChar(#$D840)+WideChar(#$DC00);
//works
t := '0123401234012340123';
t := t+WideChar(#$D840)+WideChar(#$DC00);
//works
t := '';
t := t+WideChar(#$D840)+WideChar(#$DC00);
//fails; gives junk
t := ''+WideChar(#$D840)+WideChar(#$DC00);
//crashes compiler
t := WideString('')+WideChar(#$D840)+WideChar(#$DC00);
//doesn't compile
t := WideChar(#$D840)+WideChar(#$DC00);
Definitely hitting against compiler nonsense; cases that weren't tested tested fully. Yes, i know David, we should upgrade.