String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP


NAME

String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP


SYNOPSIS

    use String::Multibyte;
    $eucjp = String::Multibyte->new('EUC_JP');
    $eucjp_length = $eucjp->length($eucjp_string);


DESCRIPTION

String::Multibyte::EUC_JP is used for manipulation of strings in EUC-JP family encodings.

Byte range of single-byte characters: 0x00..0x7F.

First and second byte range of double-byte characters: 0xA1..0xFE.

Legal byte range for two bytes preceded by SS3: 0xA1..0xFE.

Character order (invalid code points are excluded): 0x00..0x7F, 0x8EA1..0x8EFE, 0xA1A1..0xFEFE, 0x8FA1A1..0x8FFEFE.

                                       EUC-JP         EUC-JISX0213
  1.  0x00..0x7F         C0/G0      ASCII          ASCII
  2.  0x8EA1..0x8EFE       G2       JIS kana       JIS kana
  3.  0xA1A1..0xFEFE       G1       JIS X 0208     JIS X 0213 plane-1
  4.  0x8FA1A1..0x8FFEFE   G3       JIS X 0212     JIS X 0213 plane-2


CAVEAT

C1 controls other than SS2/SS3 (0x80..0x8D and 0x90..0x9E) are not supported.

Unassigned G2 (0x8EE0..0x8EFE) are now included.


SEE ALSO

the String::Multibyte manpage

 String::Multibyte::EUC_JP - internally used by String::Multibyte for EUC-JP