I'd like to test the Unicode handling of my code. Is there anything I can put in random.choice() to select from the entire Unicode range, preferably without an external module?
The following code prints every printable Unicode character:
print(''.join(chr(i) for i in range(32, 0x110000) if chr(i).isprintable()))
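All printable characters are included above, even those that the current font cannot display. The clause and not chr(i).isspace() can be added to the condition to filter out whitespace characters (in practice this removes only the ASCII space, since isprintable() already rejects all other whitespace).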
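To tie this back to random.choice(), the same filter can build a pool of characters once and then sample from it. This is only a minimal sketch: the names PRINTABLE and random_printable_string are made up for illustration, and building the list walks the whole code point range, so it takes a moment.

```python
import random

# Build the pool once: every code point that Python considers printable.
# (PRINTABLE is a hypothetical name, not part of any library.)
PRINTABLE = [chr(i) for i in range(32, 0x110000) if chr(i).isprintable()]

def random_printable_string(length, rng=random):
    """Return a string of `length` characters drawn from PRINTABLE."""
    return ''.join(rng.choice(PRINTABLE) for _ in range(length))

sample = random_printable_string(20)
print(sample)
```

Passing a seeded random.Random instance as rng should make a failing test input reproducible.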
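Since Unicode is just a range of, well, codes, what about using unichr() (chr() in Python 3) to get the Unicode string corresponding to a random number between 0 and 0xFFFF? Note that this covers only the Basic Multilingual Plane; the full Unicode range extends to 0x10FFFF.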
(Of course that would give just one codepoint, so iterate as required)
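A rough Python 3 sketch of that idea, using chr() (the Python 3 counterpart of unichr()) and iterating to build a string. The function name and the max_code_point parameter are assumptions for illustration. Surrogate code points (0xD800 to 0xDFFF) are skipped because they cannot be encoded as UTF-8 on their own; the result may still contain unassigned or control code points.

```python
import random

def random_code_point_string(length, max_code_point=0x10FFFF, rng=random):
    """Return `length` random characters with code points in 0..max_code_point."""
    chars = []
    while len(chars) < length:
        cp = rng.randint(0, max_code_point)
        if 0xD800 <= cp <= 0xDFFF:
            continue  # skip lone surrogates, which are not valid characters
        chars.append(chr(cp))
    return ''.join(chars)

bmp_sample = random_code_point_string(10, max_code_point=0xFFFF)  # BMP only
full_sample = random_code_point_string(10)                        # all planes
```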