java.lang.Object
org.jcodings.Encoding
org.jcodings.AbstractEncoding
org.jcodings.MultiByteEncoding
org.jcodings.CanBeTrailTableEncoding
org.jcodings.specific.BaseBIG5Encoding
- All Implemented Interfaces:
Cloneable
- Direct Known Subclasses:
BIG5Encoding
,Big5HKSCSEncoding
,Big5UAOEncoding
-
Field Summary
FieldsModifier and TypeFieldDescription(package private) static final boolean[]
private static final int[][]
private final int[]
Fields inherited from class org.jcodings.CanBeTrailTableEncoding
CanBeTrailTable
-
Constructor Summary
ConstructorsModifierConstructorDescriptionprotected
BaseBIG5Encoding
(String name, int[] EncLen, int transIndex) -
Method Summary
Modifier and TypeMethodDescriptionint
codeToMbc
(int code, byte[] bytes, int p) Extracts code point into it's multibyte representationint
codeToMbcLength
(int code) Returns character length given a code point Oniguruma equivalent:code_to_mbclen
int[]
ctypeCodeRange
(int ctype, IntHolder sbOut) Returns code range for a given character type Oniguruma equivalent:get_ctype_code_range
boolean
isCodeCType
(int code, int ctype) Perform a check whether given code is of given character type (e.g.int
length
(byte[] bytes, int p, int end) Returns character length given stream, character position and stream end returns1
for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwiseint
mbcCaseFold
(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) onigenc_ascii_mbc_case_foldint
mbcToCode
(byte[] bytes, int p, int end) Returns code point for a character Oniguruma equivalent:mbc_to_code
Methods inherited from class org.jcodings.CanBeTrailTableEncoding
isReverseMatchAllowed, leftAdjustCharHead
Methods inherited from class org.jcodings.MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLength
Methods inherited from class org.jcodings.AbstractEncoding
applyAllCaseFold, asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, caseFoldCodesByString, isCodeCTypeInternal, isNewLine, propertyNameToCType
Methods inherited from class org.jcodings.Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getCharsetName, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Details
-
TransBase
private final int[] TransBase -
BIG5_CAN_BE_TRAIL_TABLE
static final boolean[] BIG5_CAN_BE_TRAIL_TABLE -
BIG5Trans
private static final int[][] BIG5Trans
-
-
Constructor Details
-
BaseBIG5Encoding
-
-
Method Details
-
length
public int length(byte[] bytes, int p, int end) Description copied from class:Encoding
Returns character length given stream, character position and stream end returns1
for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwise -
mbcToCode
public int mbcToCode(byte[] bytes, int p, int end) Description copied from class:Encoding
Returns code point for a character Oniguruma equivalent:mbc_to_code
-
codeToMbcLength
public int codeToMbcLength(int code) Description copied from class:Encoding
Returns character length given a code point Oniguruma equivalent:code_to_mbclen
- Specified by:
codeToMbcLength
in classEncoding
-
codeToMbc
public int codeToMbc(int code, byte[] bytes, int p) Description copied from class:Encoding
Extracts code point into it's multibyte representation -
mbcCaseFold
Description copied from class:AbstractEncoding
onigenc_ascii_mbc_case_fold- Overrides:
mbcCaseFold
in classAbstractEncoding
- Parameters:
flag
- case fold flagpp
- anIntHolder
that points at character headlower
- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
isCodeCType
public boolean isCodeCType(int code, int ctype) Description copied from class:Encoding
Perform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCType
in classEncoding
- Parameters:
code
- a code point of a characterctype
- a character type to check against Oniguruma equivalent:is_code_ctype
-
ctypeCodeRange
Description copied from class:Encoding
Returns code range for a given character type Oniguruma equivalent:get_ctype_code_range
- Specified by:
ctypeCodeRange
in classEncoding
-