Key-Code Conversion
Introduction:
This page lets you encode text into alternative formats, such as HTML entities or URL encoding.
Online character code charts can be found at http://www.unicode.org/charts/
An index of character names, with links to the corresponding charts, can be found at http://www.unicode.org/charts/charindex.html
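As a rough illustration of the two formats mentioned above, the following sketch encodes a string as HTML numeric character references and as URL (percent) encoding. The function names toHtmlEntities and toUrlEncoding are hypothetical, and the rule for which characters get escaped is an assumption; this is not the code used by this page.

```javascript
// Hypothetical helper: replace HTML-special and non-ASCII characters
// with decimal numeric character references (e.g. "é" becomes "&#233;").
// Which characters to escape is an assumption made for this sketch.
function toHtmlEntities(text) {
  let out = "";
  // for...of iterates by code point, so characters outside the BMP
  // are not split into their two surrogate halves.
  for (const ch of text) {
    const cp = ch.codePointAt(0);
    const isSpecial = ch === "&" || ch === "<" || ch === ">" || ch === '"' || ch === "'";
    if (cp > 126 || isSpecial) {
      out += "&#" + cp + ";";
    } else {
      out += ch;
    }
  }
  return out;
}

// Hypothetical helper: percent-encode the UTF-8 bytes of the text
// using the built-in encodeURIComponent.
function toUrlEncoding(text) {
  return encodeURIComponent(text);
}

console.log(toHtmlEntities("café <b>"));   // caf&#233; &#60;b&#62;
console.log(toUrlEncoding("café & tea"));  // caf%C3%A9%20%26%20tea
```

Note that encodeURIComponent encodes a space as %20 rather than +, and throws a URIError if the input contains an unpaired surrogate.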
Caveats:
If you are reading this paragraph, then JavaScript is probably not available or not enabled in your browser.
At present, the calculators on this site depend entirely on JavaScript.
Given enough time and money, it is hoped that a future version will be able to operate without it.
Because JavaScript runs on your own computer rather than on this server, it has been possible to set up this site much more quickly than if a server-side language had been used, and to serve far more pages with the available bandwidth.
If you think that you can assist with this project in any way, please visit the Support section and leave a message.
Definitions:
- Char-Set
- A collection of characters in which a numeric code is assigned to each character so that it can be represented on a computer. Most traditional character sets contain characters from only one or two scripts. See the IANA charset assignments for more information.
- ACR
- Abstract Character Repertoire
- ASCII
- American Standard Code for Information Interchange
- BE
- Big-endian (most significant byte first)
- BMP
- Basic Multilingual Plane, the first 65,536 code points of ISO/IEC 10646
- BOCU
- Byte Ordered Compression for Unicode
- CCS
- Coded Character Set
- CCSID
- Coded Character Set Identifier
- CDRA
- Character Data Representation Architecture from IBM
- CEF
- Character Encoding Form
- CES
- Character Encoding Scheme
- CM
- Character Map
- CP
- Code Page
- DBCS
- Double-Byte Character Set
- ECMA
- European Computer Manufacturers Association
- EBCDIC
- Extended Binary Coded Decimal Interchange Code
- EUC
- Extended Unix Code
- GCGID
- Graphic Character Global Identifier
- IANA
- Internet Assigned Numbers Authority
- IEC
- International Electrotechnical Commission
- IETF
- Internet Engineering Task Force
- ISO
- International Organization for Standardization
- JIS
- Japanese Industrial Standard
- JTC1
- Joint Technical Committee 1 (responsible for ISO/IEC IT Standards)
- LE
- Little-endian (least significant byte first)
- MBCS
- Multiple-Byte Character Set (1 to n bytes per code point)
- MIME
- Multipurpose Internet Mail Extensions
- RCSU
- Reuters Compression Scheme for Unicode (precursor to SCSU)
- SBCS
- Single-Byte Character Set
- SCSU
- Standard Compression Scheme for Unicode
- TES
- Transfer Encoding Syntax
- UCS
- Universal Character Set; Universal Multiple-Octet Coded Character Set - the repertoire and encoding represented by ISO/IEC 10646-1:1993 and its amendments.
- UDC
- User-defined Character
- UTF
- Unicode (or UCS) Transformation Format
- Unicode
- The latest version of the Unicode Standard
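Several of the terms above (BMP, BE, LE, UTF) are easier to see with concrete bytes. The sketch below serialises a string as UTF-16 in either byte order and checks whether a code point lies in the Basic Multilingual Plane. The helper names isInBmp, utf16Bytes and toHex are hypothetical and exist only for this illustration.

```javascript
// Hypothetical helper: true if the code point lies in the
// Basic Multilingual Plane (U+0000 to U+FFFF).
function isInBmp(codePoint) {
  return codePoint >= 0 && codePoint <= 0xFFFF;
}

// Hypothetical helper: serialise a string as UTF-16 bytes.
// littleEndian = false gives UTF-16BE, true gives UTF-16LE.
function utf16Bytes(text, littleEndian) {
  const bytes = [];
  // JavaScript strings are sequences of 16-bit code units, so
  // charCodeAt already yields UTF-16 code units; characters outside
  // the BMP appear as a surrogate pair (two code units).
  for (let i = 0; i < text.length; i++) {
    const unit = text.charCodeAt(i);
    const high = unit >> 8;    // most significant byte
    const low = unit & 0xFF;   // least significant byte
    if (littleEndian) {
      bytes.push(low, high);
    } else {
      bytes.push(high, low);
    }
  }
  return bytes;
}

// Hypothetical helper: format bytes as two-digit uppercase hex.
function toHex(bytes) {
  return bytes.map(b => b.toString(16).toUpperCase().padStart(2, "0")).join(" ");
}

console.log(toHex(utf16Bytes("€", false))); // 20 AC (big-endian)
console.log(toHex(utf16Bytes("€", true)));  // AC 20 (little-endian)
console.log(isInBmp("€".codePointAt(0)));   // true
console.log(isInBmp("😀".codePointAt(0)));  // false
```

The euro sign is U+20AC, so its single code unit is written most significant byte first in big-endian order and least significant byte first in little-endian order.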
Instructions:
- Type, or copy and paste, the text to be encoded into the large text box below.
- Press the 'Encode' button, just below the input box, to encode the text.
- The encoded text will be displayed below the input area.
- To encode more text, simply enter it in the input field; all existing encoded output will be retained.
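The steps above describe output that accumulates across presses of the 'Encode' button. One way to model that behaviour is sketched below. The createEncoder function and its encode method are hypothetical, and joining results with a newline is an assumption; the page's actual implementation is not shown here.

```javascript
// Hypothetical model of the page's behaviour: each call to encode()
// stands in for one press of the 'Encode' button.
function createEncoder(encodeFn) {
  const results = []; // previously encoded output is kept here
  return {
    encode(input) {
      results.push(encodeFn(input));
      // Assumption: earlier results are shown above the newest one,
      // one result per line.
      return results.join("\n");
    },
  };
}

const encoder = createEncoder(encodeURIComponent);
console.log(encoder.encode("a b")); // a%20b
console.log(encoder.encode("c&d")); // a%20b, then c%26d on the next line
```

Passing the encoding function as a parameter keeps the accumulation logic separate from the choice of output format.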
© Copyright Mike Brockington 2004 - 2024 All Rights Reserved