public class BidiFormatter extends BidiFormatterBase
1. BiDi Wrapping: When text in one language is mixed into a document in another, opposite-direction language, e.g. when an English business name is embedded in a Hebrew web page, both the inserted string and the text following it may be displayed incorrectly unless the inserted string is explicitly separated from the surrounding text in a "wrapper" that declares its direction at the start and then resets it back at the end. This wrapping can be done in HTML mark-up (e.g. a 'span dir=rtl' tag) or - only in contexts where mark-up cannot be used - in Unicode BiDi formatting codes (LRE|RLE and PDF). Optionally, the mark-up can be inserted even when the direction is the same, in order to keep the DOM structure more stable. Providing such wrapping services is the basic purpose of the BiDi formatter.
2. Direction estimation: How does one know whether a string about to be inserted into surrounding text has the same direction? Well, in many cases, one knows that this must be the case when writing the code doing the insertion, e.g. when a localized message is inserted into a localized page. In such cases there is no need to involve the BiDi formatter at all. In some other cases, it need not be the same as the context, but is either constant (e.g. urls are always LTR) or otherwise known. In the remaining cases, e.g. when the string is user-entered or comes from a database, the language of the string (and thus its direction) is not known a priori, and must be estimated at run-time. The BiDi formatter can do this automatically.
3. Escaping: When wrapping plain text - i.e. text that is not already HTML or HTML-escaped - in HTML mark-up, the text must first be HTML-escaped to prevent XSS attacks and other nasty business. This of course is always true, but the escaping can not be done after the string has already been wrapped in mark-up, so the BiDi formatter also serves as a last chance and includes escaping services.
Thus, in a single call, the formatter will escape the input string as specified, determine its direction, and wrap it as necessary. It is then up to the caller to insert the return value in the output.
Modifier and Type | Class and Description |
---|---|
(package private) static class |
BidiFormatter.Factory |
BidiFormatterBase.Format
Modifier and Type | Method and Description |
---|---|
java.lang.String |
dirAttr(java.lang.String str)
Like
dirAttr(String, boolean) , but assumes isHtml is
false. |
java.lang.String |
dirAttr(java.lang.String str,
boolean isHtml)
Returns "dir=ltr" or "dir=rtl", depending on
str 's estimated
direction, if it is not the same as the context direction. |
java.lang.String |
endEdge()
Returns "left" for RTL context direction.
|
static BidiFormatter |
getInstance(boolean rtlContext)
Factory for creating an instance of BidiFormatter given the context
direction.
|
static BidiFormatter |
getInstance(boolean rtlContext,
boolean alwaysSpan)
Factory for creating an instance of BidiFormatter given the context
direction and the desired span wrapping behavior (see below).
|
static BidiFormatter |
getInstance(HasDirection.Direction contextDir)
Factory for creating an instance of BidiFormatter given the context
direction.
|
static BidiFormatter |
getInstance(HasDirection.Direction contextDir,
boolean alwaysSpan)
Factory for creating an instance of BidiFormatter given the context
direction and the desired span wrapping behavior (see below).
|
static BidiFormatter |
getInstanceForCurrentLocale()
Factory for creating an instance of BidiFormatter whose context direction
matches the current locale's direction.
|
static BidiFormatter |
getInstanceForCurrentLocale(boolean alwaysSpan)
Factory for creating an instance of BidiFormatter whose context direction
matches the current locale's direction, and given the desired span wrapping
behavior (see below).
|
java.lang.String |
knownDirAttr(HasDirection.Direction dir)
Returns "dir=ltr" or "dir=rtl", depending on the given direction, if it is
not the same as the context direction.
|
java.lang.String |
mark()
Returns the Unicode BiDi mark matching the context direction (LRM for LTR
context direction, RLM for RTL context direction), or the empty string for
default / unknown context direction.
|
java.lang.String |
markAfter(java.lang.String str)
Like
markAfter(String, boolean) , but assumes isHtml is
false. |
java.lang.String |
markAfter(java.lang.String str,
boolean isHtml)
Returns a Unicode BiDi mark matching the context direction (LRM or RLM) if
either the direction or the exit direction of
str is opposite to
the context direction. |
java.lang.String |
spanWrap(java.lang.String str)
|
java.lang.String |
spanWrap(java.lang.String str,
boolean isHtml)
Like
spanWrap(String, boolean, boolean) , but assumes dirReset is true. |
java.lang.String |
spanWrap(java.lang.String str,
boolean isHtml,
boolean dirReset)
Formats a string of unknown direction for use in HTML output of the context
direction, so an opposite-direction string is neither garbled nor garbles
what follows it.
|
java.lang.String |
spanWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str)
Like
spanWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean) ,
but assumes isHtml is false and dirReset is true. |
java.lang.String |
spanWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str,
boolean isHtml)
Like
spanWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean) ,
but assumes dirReset is true. |
java.lang.String |
spanWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str,
boolean isHtml,
boolean dirReset)
Formats a string of given direction for use in HTML output of the context
direction, so an opposite-direction string is neither garbled nor garbles
what follows it.
|
java.lang.String |
startEdge()
Returns "right" for RTL context direction.
|
java.lang.String |
unicodeWrap(java.lang.String str)
|
java.lang.String |
unicodeWrap(java.lang.String str,
boolean isHtml)
Like
unicodeWrap(String, boolean, boolean) , but assumes dirReset is true. |
java.lang.String |
unicodeWrap(java.lang.String str,
boolean isHtml,
boolean dirReset)
Formats a string of unknown direction for use in plain-text output of the
context direction, so an opposite-direction string is neither garbled nor
garbles what follows it.
|
java.lang.String |
unicodeWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str)
Like
unicodeWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean) ,
but assumes isHtml is false and dirReset is true. |
java.lang.String |
unicodeWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str,
boolean isHtml)
Like
unicodeWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean) ,
but assumes dirReset is true. |
java.lang.String |
unicodeWrapWithKnownDir(HasDirection.Direction dir,
java.lang.String str,
boolean isHtml,
boolean dirReset)
Formats a string of given direction for use in plain-text output of the
context direction, so an opposite-direction string is neither garbled nor
garbles what follows it.
|
dirAttrBase, endEdgeBase, estimateDirection, estimateDirection, getAlwaysSpan, getContextDir, isRtlContext, knownDirAttrBase, markAfterBase, markBase, spanWrapBase, spanWrapWithKnownDirBase, startEdgeBase, unicodeWrapBase, unicodeWrapWithKnownDirBase
public static BidiFormatter getInstance(boolean rtlContext)
spanWrap(java.lang.String)
and its variations is
set to avoid span wrapping unless it's necessary ('dir' attribute needs to
be set).rtlContext
- Whether the context direction is RTL.
In one simple use case, the context direction would simply be the
locale direction, which can be retrieved using
LocaleInfo.getCurrentLocale().isRTL()
public static BidiFormatter getInstance(boolean rtlContext, boolean alwaysSpan)
rtlContext
- Whether the context direction is RTL. See an example of
a simple use case at getInstance(boolean)
alwaysSpan
- Whether spanWrap(java.lang.String)
(and its variations) should
always use a 'span' tag, even when the input direction is neutral
or matches the context, so that the DOM structure of the output
does not depend on the combination of directionspublic static BidiFormatter getInstance(HasDirection.Direction contextDir)
spanWrap(java.lang.String)
and its variations is
set to avoid span wrapping unless it's necessary ('dir' attribute needs to
be set).contextDir
- The context direction. See an example of a simple use
case at getInstance(boolean)
. Note: Direction.DEFAULT
indicates unknown context direction. Try not to use it, since it
is impossible to reset the direction back to the context when it
is unknownpublic static BidiFormatter getInstance(HasDirection.Direction contextDir, boolean alwaysSpan)
contextDir
- The context direction. See an example of a simple use
case at getInstance(boolean)
. Note: Direction.DEFAULT
indicates unknown context direction. Try not to use it, since it
is impossible to reset the direction back to the context when it
is unknownalwaysSpan
- Whether spanWrap(java.lang.String)
(and its variations) should
always use a 'span' tag, even when the input direction is neutral
or matches the context, so that the DOM structure of the output
does not depend on the combination of directionspublic static BidiFormatter getInstanceForCurrentLocale()
spanWrap(java.lang.String)
and its variations is set to avoid span wrapping unless it's
necessary ('dir' attribute needs to be set).public static BidiFormatter getInstanceForCurrentLocale(boolean alwaysSpan)
alwaysSpan
- Whether spanWrap(java.lang.String)
(and its variations) should
always use a 'span' tag, even when the input direction is neutral
or matches the context, so that the DOM structure of the output
does not depend on the combination of directionspublic java.lang.String dirAttr(java.lang.String str)
dirAttr(String, boolean)
, but assumes isHtml
is
false.str
- String whose direction is to be estimatedpublic java.lang.String dirAttr(java.lang.String str, boolean isHtml)
str
's estimated
direction, if it is not the same as the context direction. Otherwise,
returns the empty string.str
- String whose direction is to be estimatedisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String endEdge()
public java.lang.String knownDirAttr(HasDirection.Direction dir)
dir
- Given directionpublic java.lang.String mark()
public java.lang.String markAfter(java.lang.String str)
markAfter(String, boolean)
, but assumes isHtml
is
false.str
- String after which the mark may need to appearpublic java.lang.String markAfter(java.lang.String str, boolean isHtml)
str
is opposite to
the context direction. Otherwise returns the empty string.str
- String after which the mark may need to appearisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String spanWrap(java.lang.String str)
str
- The input stringpublic java.lang.String spanWrap(java.lang.String str, boolean isHtml)
spanWrap(String, boolean, boolean)
, but assumes dirReset
is true.str
- The input stringisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String spanWrap(java.lang.String str, boolean isHtml, boolean dirReset)
The algorithm: estimates the direction of input argument str
. In
case its direction doesn't match the context direction, wraps it with a
'span' tag and adds a "dir" attribute (either 'dir=rtl' or 'dir=ltr').
If setAlwaysSpan(true)
was used, the input is always wrapped with
'span', skipping just the dir attribute when it's not needed.
If dirReset
, and if the overall direction or the exit direction of
str
are opposite to the context direction, a trailing unicode BiDi
mark matching the context direction is appended (LRM or RLM).
If !isHtml
, HTML-escapes str
regardless of wrapping.
str
- The input stringisHtml
- Whether str
is HTML / HTML-escapeddirReset
- Whether to append a trailing unicode bidi mark matching the
context direction, when needed, to prevent the possible garbling
of whatever may follow str
public java.lang.String spanWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str)
spanWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean)
,
but assumes isHtml
is false and dirReset
is true.dir
- str
's directionstr
- The input stringpublic java.lang.String spanWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str, boolean isHtml)
spanWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean)
,
but assumes dirReset
is true.dir
- str
's directionstr
- The input stringisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String spanWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str, boolean isHtml, boolean dirReset)
The algorithm: estimates the direction of input argument str
. In
case its direction doesn't match the context direction, wraps it with a
'span' tag and adds a "dir" attribute (either 'dir=rtl' or 'dir=ltr').
If setAlwaysSpan(true)
was used, the input is always wrapped with
'span', skipping just the dir attribute when it's not needed.
If dirReset
, and if the overall direction or the exit direction of
str
are opposite to the context direction, a trailing unicode BiDi
mark matching the context direction is appended (LRM or RLM).
If !isHtml
, HTML-escapes str
regardless of wrapping.
dir
- str
's directionstr
- The input stringisHtml
- Whether str
is HTML / HTML-escapeddirReset
- Whether to append a trailing unicode bidi mark matching the
context direction, when needed, to prevent the possible garbling
of whatever may follow str
public java.lang.String startEdge()
public java.lang.String unicodeWrap(java.lang.String str)
str
- The input stringpublic java.lang.String unicodeWrap(java.lang.String str, boolean isHtml)
unicodeWrap(String, boolean, boolean)
, but assumes dirReset
is true.str
- The input stringisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String unicodeWrap(java.lang.String str, boolean isHtml, boolean dirReset)
spanWrap(java.lang.String)
, this makes use of
Unicode BiDi formatting characters. In HTML, its *only* valid use is inside
of elements that do not allow mark-up, e.g. an 'option' tag.
The algorithm: estimates the direction of input argument str
. In
case it doesn't match the context direction, wraps it with Unicode BiDi
formatting characters: RLE+str
+PDF for RTL text, or LRE+ str
+PDF for LTR text.
If opt_dirReset
, and if the overall direction or the exit direction
of str
are opposite to the context direction, a trailing unicode
BiDi mark matching the context direction is appended (LRM or RLM).
Does *not* do HTML-escaping regardless of the value of isHtml
.
str
- The input stringisHtml
- Whether str
is HTML / HTML-escapeddirReset
- Whether to append a trailing unicode bidi mark matching the
context direction, when needed, to prevent the possible garbling
of whatever may follow str
public java.lang.String unicodeWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str)
unicodeWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean)
,
but assumes isHtml
is false and dirReset
is true.dir
- str
's directionstr
- The input stringpublic java.lang.String unicodeWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str, boolean isHtml)
unicodeWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, String, boolean, boolean)
,
but assumes dirReset
is true.dir
- str
's directionstr
- The input stringisHtml
- Whether str
is HTML / HTML-escapedpublic java.lang.String unicodeWrapWithKnownDir(HasDirection.Direction dir, java.lang.String str, boolean isHtml, boolean dirReset)
spanWrapWithKnownDir(com.google.gwt.i18n.client.HasDirection.Direction, java.lang.String)
, this
makes use of unicode BiDi formatting characters. In HTML, its *only* valid
use is inside of elements that do not allow mark-up, e.g. an 'option' tag.
The algorithm: estimates the direction of input argument str
. In
case it doesn't match the context direction, wraps it with Unicode BiDi
formatting characters: RLE+str
+PDF for RTL text, or LRE+ str
+PDF for LTR text.
If opt_dirReset
, and if the overall direction or the exit direction
of str
are opposite to the context direction, a trailing unicode
BiDi mark matching the context direction is appended (LRM or RLM).
Does *not* do HTML-escaping regardless of the value of isHtml
.
dir
- str
's directionstr
- The input stringisHtml
- Whether str
is HTML / HTML-escapeddirReset
- Whether to append a trailing unicode bidi mark matching the
context direction, when needed, to prevent the possible garbling
of whatever may follow str