Character encoding
Latest revision as of 15:07, 19 January 2011
Definition
This article focuses on character encoding.
See also: Codec (encoding/decoding of compression formats for simple files, archive files, or files with multiple contents, e.g. in multimedia container formats).
See also: webdings and wingdings
Links
Specifications
(some)
- Unicode Home Page (includes, for example, code charts and the "Unicode and the Web" FAQ)
- Character Model for the World Wide Web 1.0: Fundamentals (W3C Recommendation 2005).
- Unicode in XML and other Markup Languages (W3C Technical Report)
- W3C I18N GEO Working Group
Charts
- IANA Character Sets table for the Internet
- Unicode 4.1.0 Chart
- Unicode Character Code Charts (PDF files)
HTML specific
- HTML entities table
- Character Converter (Iain Tucker)
- ISO 8859-1 (Latin-1) (Hex/Dec/Entities)
- ISO 8859-1 Table
- Table of character entity references in HTML 4
- ASCII - ISO 8859-1 (Latin-1) Table with HTML Entity Names
- The ISO 8859 Alphabet Soup
See HTML links for other HTML-related links.
Online converters
- Text to UTF-8 or HTML Entities Tool
- Unicode (UTF-8) to HTML entity online converter
- UTF Converter
- Converter for funny characters into the proper HTML
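What such converters do can be sketched in a few lines of Python. This is only a minimal illustration using the standard library, not a description of how any of the tools listed above are implemented; the sample string is an assumption chosen for the example.

```python
import html

# Sample text (an assumption for illustration): two non-ASCII characters.
text = "café €"

# Encode the text to UTF-8 bytes; each non-ASCII character takes several bytes.
utf8_bytes = text.encode("utf-8")

# Replace every non-ASCII character with a numeric HTML character reference,
# so the result can be stored or sent as plain ASCII.
numeric_refs = text.encode("ascii", errors="xmlcharrefreplace").decode("ascii")

# Go the other way: turn named HTML entities back into characters.
decoded = html.unescape("caf&eacute; &euro;")

print(utf8_bytes)
print(numeric_refs)
print(decoded)
```

Numeric references such as `&#233;` work regardless of the encoding the page itself is served in, which is why converters of this kind are handy for "funny characters".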
Tutorials
- The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!) by Joel Spolsky, October 08, 2003
- On the Goodness of Unicode by Tim Bray
- Character encodings in HTML and CSS (W3C tutorial)
- Characters and encodings by Jukka "Yucca" Korpela, very good reading. See e.g. A tutorial on character code issues and On the use of some MS Windows characters in HTML
- HTML and Browsers: character encoding, entity references and UTF-8. Good short tutorial.
Some Wikipedia entries regarding Wikipedia contents
Wikipedia is a good example of how a modern website can handle most character sets.
- Help:Multilingual support
- Indic scripts (as an example)
More general Wikipedia entries
- Wikipedia Character Encoding
- Wikipedia Unicode
- Wikipedia UTF-8
URL encoding
- URL Encoding (or: 'What are those "%20" codes in URLs?') by Brian Wilson
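As a small illustration of where codes such as `%20` come from, the following sketch percent-encodes a file name with Python's standard `urllib.parse` module. The file name is a made-up example; the point is only that a space becomes `%20` and a non-ASCII character is first encoded as UTF-8 and then written byte by byte.

```python
from urllib.parse import quote, unquote

# Hypothetical file name containing a space and a non-ASCII character.
name = "my file é.html"

# quote() percent-encodes everything that is not safe in a URL path.
# By default it encodes the text as UTF-8 first.
encoded = quote(name)

# unquote() reverses the transformation.
decoded = unquote(encoded)

print(encoded)
print(decoded)
```

Here the space becomes `%20` and "é" becomes `%C3%A9`, the two bytes of its UTF-8 encoding.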
Internationalisation and Localisation
Internationalisation (EduTech Wiki in French)