simon-git: charset (master): Simon Tatham

Commits to Tartarus hosted VCS tartarus-commits at lists.tartarus.org
Thu Feb 22 20:44:18 GMT 2018


TL;DR:
  fa3ef41 csshow: option to control size of Unicode range displayed.
  2e0d760 csshow: allow showing non-single-byte charsets.
  dda40e3 csshow: add a TODO thought about MBCS subranges.

Repository:     https://git.tartarus.org/simon/charset.git
On the web:     https://git.tartarus.org/?p=simon/charset.git
Branch updated: master
Committer:      Simon Tatham <anakin at pobox.com>
Date:           2018-02-22 20:44:18

commit fa3ef41b74f0618268b5a7920b24523921a53ce3
web diff https://git.tartarus.org/?p=simon/charset.git;a=commitdiff;h=fa3ef41b74f0618268b5a7920b24523921a53ce3;hp=0a81212ae48131db761890fb058111ae2f2ce59f
Author: Simon Tatham <anakin at pobox.com>
Date:   Thu Feb 22 20:39:22 2018 +0000

    csshow: option to control size of Unicode range displayed.
    
    If you've given a U+NNNN code point as the argument to csshow,
    specifying the start of some range of Unicode, then you can also give
    another code point as a second argument which marks the end of the
    range to display, or +NNN for a number of code points to include. This
    allows you to list the whole of a block longer than 256 characters, or
    to stop at the end of a shorter one.

 csshow.c | 42 ++++++++++++++++++++++++++++++++++++++++--
 1 file changed, 40 insertions(+), 2 deletions(-)

commit 2e0d7602d57759cd6e4e364682c01a2c983e1faa
web diff https://git.tartarus.org/?p=simon/charset.git;a=commitdiff;h=2e0d7602d57759cd6e4e364682c01a2c983e1faa;hp=fa3ef41b74f0618268b5a7920b24523921a53ce3
Author: Simon Tatham <anakin at pobox.com>
Date:   Thu Feb 22 20:41:28 2018 +0000

    csshow: allow showing non-single-byte charsets.
    
    Some charsets, like Shift-JIS or ISO 6937, are not wholly single-byte
    but do have a subset of single-byte encodings large enough that one
    might want to view it. So now we don't reject non-SBCS names as an
    argument; instead, I came up with a way to identify multibyte-
    character introducer bytes, so as to exclude them from display.
    
    (And if and when I get round to the TODO item to colour or otherwise
    highlight the various different reasons why a character can't be
    shown, then I'll have MULTIBYTE_INTRODUCER ready to be distinguished
    from all the other reasons.)

 csshow.c | 33 ++++++++++++++++++++++++---------
 1 file changed, 24 insertions(+), 9 deletions(-)

commit dda40e37c45d8491491a393a3d706e0cd659a466
web diff https://git.tartarus.org/?p=simon/charset.git;a=commitdiff;h=dda40e37c45d8491491a393a3d706e0cd659a466;hp=2e0d7602d57759cd6e4e364682c01a2c983e1faa
Author: Simon Tatham <anakin at pobox.com>
Date:   Thu Feb 22 20:43:43 2018 +0000

    csshow: add a TODO thought about MBCS subranges.

 csshow.c | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)



More information about the tartarus-commits mailing list