i18n-checker | Internationalization Activity Blog


Tag(s): i18n-checker

Posts

New version of Internationalization Checker released

The W3C Internationalization Checker is a free service for web authors and developers that checks web pages and provides:

  • a table listing key international settings for a page, such as character encoding, language declarations, and text direction.
  • a list of errors, warnings and helpful suggestions about the page, with pointers to resources where you can learn more.

Version 2 of the checker moves away from checking against particular specifications to checking how a page will work in a browser. For the most part, it assumes that pages will be parsed using an HTML5-compliant parser. Pages served as application/xhtml+xml, however, differ significantly with regard to character encoding and language declarations, and the checker takes these differences into account if it detects that the page being checked is served as XML.

See the change log for detailed information about changes. In summary, 18 new checks were added, and the messages for 11 checks were significantly updated.

In addition, the following new rows were added to the information table:

  • All language tags: lists all language tags used in the page. If you click on any of the language tags listed, you are taken to the Language Subtag Lookup tool, which provides information about validity of the subtags used, lists their meaning, and provides additional usage tips.
  • Unicode control codes: lists directional controls used in the document, with a frequency count for each. The list is split into literal characters, numeric character references, and named character references.
  • Notable attributes: lists attributes used that are typically associated with features needed by an international audience.
  • Notable elements: the same, but for elements.
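To make the first two rows more concrete, here is a minimal sketch of how such a summary could be gathered with Python's standard `html.parser` module. It is not the checker's actual code: the `PageSummary` class, the `summarize` helper, and the selection of directional controls are assumptions made for illustration.

```python
from collections import Counter
from html.entities import name2codepoint
from html.parser import HTMLParser

# Directional control characters to look for (an illustrative selection).
DIRECTIONAL_CONTROLS = {
    "\u200e": "LRM", "\u200f": "RLM",
    "\u202a": "LRE", "\u202b": "RLE", "\u202c": "PDF",
    "\u202d": "LRO", "\u202e": "RLO",
    "\u2066": "LRI", "\u2067": "RLI", "\u2068": "FSI", "\u2069": "PDI",
}


class PageSummary(HTMLParser):
    """Hypothetical collector for language tags and directional controls."""

    def __init__(self):
        # Keep character references unconverted so they can be counted
        # separately from literal characters.
        super().__init__(convert_charrefs=False)
        self.language_tags = []
        self.controls = {
            "character": Counter(),
            "numeric": Counter(),
            "named": Counter(),
        }

    def handle_starttag(self, tag, attrs):
        # Record each distinct lang / xml:lang value in document order.
        for name, value in attrs:
            if name in ("lang", "xml:lang") and value:
                if value not in self.language_tags:
                    self.language_tags.append(value)

    def handle_data(self, data):
        # Literal control characters appearing in text content.
        for ch in data:
            if ch in DIRECTIONAL_CONTROLS:
                self.controls["character"][DIRECTIONAL_CONTROLS[ch]] += 1

    def handle_charref(self, name):
        # Numeric references arrive as "8207" or "x200F".
        try:
            code = int(name[1:], 16) if name[0] in "xX" else int(name)
            ch = chr(code)
        except (ValueError, OverflowError):
            return
        if ch in DIRECTIONAL_CONTROLS:
            self.controls["numeric"][DIRECTIONAL_CONTROLS[ch]] += 1

    def handle_entityref(self, name):
        # Named references such as &lrm; and &rlm;.
        code = name2codepoint.get(name)
        if code is not None and chr(code) in DIRECTIONAL_CONTROLS:
            self.controls["named"][DIRECTIONAL_CONTROLS[chr(code)]] += 1


def summarize(page_source):
    """Parse the page source and return the populated PageSummary."""
    parser = PageSummary()
    parser.feed(page_source)
    parser.close()
    return parser
```

Calling `summarize(source)` on a page's source returns an object whose `language_tags` list and `controls` counters roughly correspond to the "All language tags" and "Unicode control codes" rows. Only text content is scanned here; controls inside attribute values are not counted.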

Please let us know about bugs and missing features using the feedback form.

W3C HTML5 Validator enhanced with language detection functionality

The W3C HTML5 Validator has been enhanced with functionality that detects the overall language of a page. The validator can currently detect a little over 50 languages, but more will be added over time.

This makes it possible to compare the language of the content in a page with its language declarations, and to issue a warning if the lang attribute does not match the language of the content, if no lang attribute is given at all, or if a language using a right-to-left script is detected but a dir attribute is missing from the html tag.
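The three warning conditions can be sketched as follows. This is only an illustration of the logic described above, not the validator's implementation: language detection itself is out of scope, so the detected language is passed in as a parameter, and the function name, the warning labels, and the short list of right-to-left languages are all assumptions.

```python
from html.parser import HTMLParser

# Illustrative subset of languages usually written in right-to-left scripts.
RTL_LANGUAGES = {"ar", "fa", "he", "ur"}


class HtmlTagAttributes(HTMLParser):
    """Records the attributes of the first <html> start tag."""

    def __init__(self):
        super().__init__()
        self.attrs = None

    def handle_starttag(self, tag, attrs):
        if tag == "html" and self.attrs is None:
            self.attrs = dict(attrs)


def language_warnings(page_source, detected_language):
    """Return hypothetical warning labels for the page.

    detected_language is the primary language subtag (for example "fr")
    that a separate, unspecified detector reported for the content.
    """
    parser = HtmlTagAttributes()
    parser.feed(page_source)
    attrs = parser.attrs or {}
    detected = detected_language.lower()

    warnings = []
    declared = (attrs.get("lang") or "").strip()
    if not declared:
        # No lang attribute at all (or an empty one).
        warnings.append("missing-lang")
    elif declared.split("-")[0].lower() != detected:
        # Compare primary language subtags only, so "en-GB" matches "en".
        warnings.append("lang-mismatch")

    if detected in RTL_LANGUAGES and "dir" not in attrs:
        # Right-to-left content without a dir attribute on the html tag.
        warnings.append("missing-dir")
    return warnings
```

Comparing only the primary subtag is a simplification chosen for this sketch; the validator may apply different matching rules.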

For more information on the lang attribute, see the Why use the language attribute? article, or Declaring the overall language of a page in the technique index.

New version of the Internationalization Checker released

The ‘i18n checker’ is a free service by W3C that provides information about internationalization-related aspects of your HTML page, and advice on how to improve your use of markup, where needed, to support the multilingual Web.

This latest release uses a new user interface and redesigned source code. It also adds a number of new tests, a file upload facility, and support for HTML5.

This is still a ‘pre-final’ release and development continues. There are already plans to add further tests, translate the user interface, support XHTML5 and polyglot documents, and integrate with the W3C Unicorn checker, among other features. At this stage we are particularly interested in receiving user feedback.

Try the checker and let us know if you find any bugs or have any suggestions.


Copyright © 2023 World Wide Web Consortium.
W3C® liability, trademark and permissive license rules apply.

Questions or comments? ishida@w3.org