Ranter
Join devRant
Do all the things like
++ or -- rants, post your own rants, comment on others' rants and build your customized dev avatar
Sign Up
Pipeless API
From the creators of devRant, Pipeless lets you power real-time personalized recommendations and activity feeds using a simple API
Learn More
Comments
-
I actually got through every unicode Character which could represent an ASCII Character and noted them down in one Java method. It's not actually that much to write because they are mostly in one big Block and you can just map them to real ASCII.
-
devios157706y@EaZyCode Unicode agrees with ASCII on the lower 7 bits. That means UTF-8 and ASCII are identical for all standard ASCII characters.
-
@devios1 i mean stuff like greek alphabet (Α) and latin alphabet (A) which look almost identical. Or you can add 0xFEE0 to any ASCII character and almost always get a similar looking Character.
-
@devios1 everywhere, where someone can write Unicodes and you want to process some text. For example if you want devRant statistics and you search for certain words, you would never get this rant. If you pre-process it you get this rant in normal ASCII and it would work. I used it for an Anti-Advertisment program (just a small fun project)
And the use case of using these tricks would be to bypass such filters without that pre-processing.
Related Rants
The thing about UNICODE is the ability to make it hard to parse what humans automatically see as ASCII
rant
encoding
ascii
unicode