System(s) for encoding data using a limited set of characters

扶醉桌前 提交于 2019-12-12 05:08:11

问题


up-front clarification of exactly what I need: I would like to know - are there any standards or conventions for encoding information like this within a url that I might try to use to read what it says?

I'm sure there are established methods for encoding binary information using non-binary character sets. In this case each location can be occupied by one of 66 different characters, so each "byte" is slightly longer than six bits if you look at it logarithmically.

For example, a link in a mail message I received from the IEEE was surprisingly long:

http://links.mail.spectrum.ieee.org/mpss/c/4wA/vCMaAA/t.1xp/MW0ThjonTUa... ...6Bg3tXEzxb1xlZ4zyB3umN-2BxkLQw6LtGSGH3ghSDqpqiUmxfSVUprd0HnHYipmddjyE-3D

876 characters in toto (I've deleted most of it here) - over half a kilobyte!

The encoding must conform to legal characters within a url, the coding in the url I received uses 66 ASCII characters only - see below.

Probably - of course - it is proprietary and there's little hope, but I'd like to know if there do exist some standards that allow these to be readable if so desired.

My math:

841 * (log(26+26+10+4) / log(2)) / 8 / 1024

~0.62 kb

26 uppers, 26 lowers, 10 digits and 4 punctuation characters for this estimate. ['-', '/', '.', '_']

Since urls have a de facto limit of 2000 characters, a crafted url can carry over a kilobyte of "individualized" information.

来源:https://stackoverflow.com/questions/37640137/systems-for-encoding-data-using-a-limited-set-of-characters

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!