simon-svn: charset: simon

Commits to Tartarus CVS repository. tartarus-commits at lists.tartarus.org
Wed Jul 18 23:52:00 BST 2012


SVN root:       svn://svn.tartarus.org/sgt
Changes by:     simon
Revision:       9581
Date:           2012-07-18 23:52:00 +0100 (Wed, 18 Jul 2012)

Log message (8 lines):
A slightly silly new utility: 'confuse'. You provide it with some
Unicode values (typically two of them), and it finds cases in which
the provided characters are all encoded as the same thing in different
charsets and prints those charsets. So if you encounter, for example,
some piece of text which has U+0153 LATIN SMALL LIGATURE OE where you
might have expected U+00A3 POUND SIGN, simply run 'confuse 153 a3' and
it'll tell you which character sets the sender and receiver of the
text might have got confused between.

Modified files:
_U  charset/
U   charset/Makefile
A   charset/confuse.c

Links:
http://svn.tartarus.org/sgt/?rev=9581&view=rev
http://svn.tartarus.org/sgt/charset/Makefile?rev=9581&r1=9580&r2=9581
http://svn.tartarus.org/sgt/charset/confuse.c?rev=9581&view=markup



More information about the tartarus-commits mailing list