I learnt from Google that Internationalization is the process by which I can make my web application to use all languages. I want to understand Unicode for the process of i
In case of a basic JSP/Servlet webapplication, the basic approach would be using JSTL fmt taglib in combination with resource bundles. Resource bundles contain key-value pairs where the key is a constant which is the same for all languages and the value differs per language. Resource bundles are usually properties files which are loaded by ResourceBundle API. This can however be customized so that you can load the key-value pairs from for example a database.
Here's an example how to internationalize the login form of your webapplication with properties file based resource bundles.
Create the following files and put them in some package, e.g. com.example.i18n
(in case of Maven, put them in the package structure inside src/main/resources
).
text.properties
(contains key-value pairs in the default language, usually English)
login.label.username = Username login.label.password = Password login.button.submit = Sign in
text_nl.properties
(contains Dutch (nl
) key-value pairs)
login.label.username = Gebruikersnaam login.label.password = Wachtwoord login.button.submit = Inloggen
text_es.properties
(contains Spanish (es
) key-value pairs)
login.label.username = Nombre de usuario login.label.password = Contraseña login.button.submit = Acceder
The resource bundle filename should adhere the following pattern name_ll_CC.properties
. The _ll
part should be the lowercase ISO 693-1 language code. It is optional and only required whenever the _CC
part is present. The _CC
part should be the uppercase ISO 3166-1 Alpha-2 country code. It is optional and often only used to distinguish between country-specific language dialects, like American English (_en_US
) and British English (_en_GB
).
If not done yet, install JSTL. If you're running on a Servlet 2.5 container or newer (Tomcat 6.0 and so on) and your web.xml
is declared conform the Servlet 2.5 specification, then just put jstl-1.2.jar in webapp's /WEB-INF/lib
folder.
Create the following example JSP file and put it in web content folder.
login.jsp
<%@ page pageEncoding="UTF-8" %>
<%@ taglib prefix="c" uri="http://java.sun.com/jsp/jstl/core" %>
<%@ taglib prefix="fmt" uri="http://java.sun.com/jsp/jstl/fmt" %>
JSP/JSTL i18n demo
The
manages the current language. If the language was supplied as request parameter (by language dropdown), then it will be set. Else if the language was already previously set in the session, then stick to it instead. Else use the user supplied locale in the request header.
The
sets the locale for resource bundle. It's important that this line is before the
.
The
initializes the resource bundle by its base name (that is, the full qualified package name until with the sole name without the _ll_CC
specifier).
The
retrieves the message value by the specified bundle key.
The informs the searchbots what language the page is in so that it won't be marked as duplicate content (thus, good for SEO).
The language dropdown will immediately submit by JavaScript when another language is chosen and the page will be refreshed with the newly chosen language.
You however need to keep in mind that properties files are by default read using ISO-8859-1 character encoding. You would need to escape them by unicode escapes. This can be done using the JDK-supplied native2ascii.exe
tool. See also this article section for more detail.
A theoretical alternative would be to supply a bundle with a custom Control to load those files as UTF-8, but that's unfortunately not supported by the basic JSTL fmt
taglib. You would need to manage it all yourself with help of a Filter
. There are (MVC) frameworks which can handle this in a more transparent manner, like JSF, see also this article.