HTML 4 Tests

US-ASCII Base Case (Control Tests)

Control tests to check whether a certain functionality is supported with URIs (US-ASCII only). If this test does not work for a certain functionality, the other tests will fail because the functionality is not supported, not because IRIs are not supported.

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
English us-ascii baseCase TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Legacy Human

These tests are for direct use by humans, directly showing failure (e.g. with red or the word 'wrong') for some failure cases (not converting to UTF-8).

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
French iso-8859-1 résumé TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-1 Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-1 Bücher TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-2 Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Hungarian iso-8859-2 előírás TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Chinese (simp.) gb2312 词典 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese shift_jis 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese euc-jp 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Korean euc-kr 소설 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic iso-8859-6 كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic windows-1256 كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian iso-8859-5 перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian koi8-r перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian windows-1251 перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Legacy Machine

These tests are for use with a machine, they will always lead to a 402 not found (or something equivalent for a protocol other than HTTP).

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
French iso-8859-1 résumé TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-1 Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-1 Bücher TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German iso-8859-2 Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Hungarian iso-8859-2 előírás TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Chinese (simp.) gb2312 词典 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese shift_jis 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese euc-jp 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Korean euc-kr 소설 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic iso-8859-6 كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic windows-1256 كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian iso-8859-5 перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian koi8-r перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian windows-1251 перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

UTF-8 Crosscheck

UTF-8 only, checking whether URI/IRI processing is 8-bit clean only.

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
French UTF-8 résumé TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German UTF-8 Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German UTF-8 Bücher TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Hungarian UTF-8 előírás TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Chinese (simp.) UTF-8 词典 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese UTF-8 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Korean UTF-8 소설 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic UTF-8 كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian UTF-8 перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Vietnamese UTF-8 GiấyChứngNhận TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Numeric Character References (decimal)

Numeric Character References, checking correct conversion to UTF-8.

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
French decimal NCRs résumé TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German decimal NCRs Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German decimal NCRs Bücher TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Hungarian decimal NCRs előírás TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Chinese (simp.) decimal NCRs 词典 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese decimal NCRs 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Korean decimal NCRs 소설 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic decimal NCRs كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian decimal NCRs перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Vietnamese decimal NCRs GiấyChứngNhận TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Numeric Character References (Hexadecimal)

Numeric Character References, checking correct conversion to UTF-8.

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
French hexadecimal NCRs résumé TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German hexadecimal NCRs Übersetzung TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
German hexadecimal NCRs Bücher TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Hungarian hexadecimal NCRs előírás TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Chinese (simp.) hexadecimal NCRs 词典 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Japanese hexadecimal NCRs 日記 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Korean hexadecimal NCRs 소설 TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Arabic hexadecimal NCRs كتب TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Russian hexadecimal NCRs перевод TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST
Vietnamese hexadecimal NCRs GiấyChứngNhận TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Normalization Human (Exploratory Tests)

These tests are Exploratory tests to check for normalization of non-Unicode encodings using NFC. They are for direct use by humans, directly showing failure (e.g. with red or the word 'wrong') for some failure cases (not using NFC or not converting to UTF-8).

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
Vietnamese windows-1258 GiấyChứngNhận TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST

Normalization Machine (Exploratory Tests)

These tests are Exploratory tests to check for normalization of non-Unicode encodings using NFC. They are for use with a machine, they will always lead to a 402 not found (or something equivalent for a protocol other than HTTP).

Language Encoding Word a@hrefarea@hreflink@hrefimg@srcparam@valueobject@database@hrefscript@srcform@actioninput@src
Vietnamese windows-1258 GiấyChứngNhận TEST TEST TEST TEST TEST TEST TEST TEST TEST TEST