Contents Index Search Related Documents Previous Next
2.1 Character Set
1/2
The character repertoire
for the text of an Ada program consists of the collection of characters
described by the ISO/IEC 10646:2003 Universal Multiple-Octet Coded Character
Set. This collection is organized in
planes, each plane comprising
65536 characters.
.
Syntax
2/2
This paragraph was deleted.
3/2
This paragraph was deleted.
3.1/2
A character
is any character defined within ISO/IEC 10646:2003 other than those whose
relative code position in their plane is 16#FFFE# or 16#FFFF#.
Static Semantics
4/2
The coded representation for characters is implementation
defined (it need not be a representation defined within ISO/IEC 10646:2003).
A character whose relative code position in its plane is 16#FFFE# or
16#FFFF# is not allowed anywhere in the text of a program.
4.1/2
The semantics of an Ada program whose text
is not in Normalization Form KC (as defined by section 24 of ISO/IEC
10646:2003) is implementation defined.
5/2
The description of the language definition in
this International Standard uses the character properties General Category,
Simple Uppercase Mapping, Uppercase Mapping, and Special Case Condition
of the documents referenced by the note in section 1 of ISO/IEC 10646:2003.
The actual set of graphic symbols used by an implementation for the visual
representation of the text of an Ada program is not specified.
6/2
Characters are
categorized as follows:
7/2
- This paragraph was deleted.
8/2
- letter_uppercase
-
Any character whose General Category is defined to be “Letter,
Uppercase”.
9/2
- letter_lowercase
-
Any character whose General Category is defined to be “Letter,
Lowercase”.
9.1/2
- letter_titlecase
-
Any character whose General Category is defined to be “Letter,
Titlecase”.
9.2/2
- letter_modifier
-
Any character whose General Category is defined to be “Letter,
Modifier”.
9.3/2
- letter_other
-
Any character whose General Category is defined to be “Letter,
Other”.
9.4/2
- mark_non_spacing
-
Any character whose General Category is defined to be “Mark, Non-Spacing”.
9.5/2
- mark_spacing_combining
-
Any character whose General Category is defined to be “Mark, Spacing
Combining”.
10/2
- number_decimal_digit
-
Any character whose General Category is defined to be “Number,
Decimal Digit”.
10.1/2
- number_letter
-
Any character whose General Category is defined to be “Number,
Letter”.
10.2/2
- punctuation_connector
-
Any character whose General Category is defined to be “Punctuation,
Connector”.
10.3/2
- other_format
-
Any character whose General Category is defined to be “Other, Format”.
11/2
- separator_space
-
Any character whose General Category is defined to be “Separator,
Space”.
12/2
- separator_line
-
Any character whose General Category is defined to be “Separator,
Line”.
12.1/2
- separator_paragraph
-
Any character whose General Category is defined to be “Separator,
Paragraph”.
13/2
- format_effector
-
The characters whose code positions are 16#09# (CHARACTER TABULATION),
16#0A# (LINE FEED(LF)), 16#0B# (LINE TABULATION), 16#0C# (FORM FEED(FF)),
16#0D# (CARRIAGE RETURN(CR)), 16#85# (NEXT LINE(NEL)), and the characters
in categories separator_line and
separator_paragraph. The names mentioned
in parentheses in this list are not defined by ISO/IEC 10646:2003; they
are only used for convenience in this International Standard.
13.1/2
- other_control
-
Any character whose General Category is defined to be “Other, Control”,
and which is not defined to be a format_effector.
13.2/2
- other_private_use
-
Any character whose General Category is defined to be “Other, Private
Use”.
13.3/2
- other_surrogate
-
Any character whose General Category is defined to be “Other, Surrogate”.
14/2
- graphic_character
-
Any character which is not in the categories other_control,
other_private_use, other_surrogate,
format_effector, and whose relative
code position in its plane is neither 16#FFFE# nor 16#FFFF#.
15/2
The following
names are used when referring to certain characters (the first name is
that given in ISO/IEC 10646:2003):
graphic symbol | name | graphic symbol | name |
|
| | | |
|
" | quotation mark | : | colon |
|
# | number sign | ; | semicolon |
|
& | ampersand | < | less-than sign |
|
' | apostrophe, tick | = | equals sign |
|
( | left parenthesis | > | greater-than sign |
|
) | right parenthesis | _ | low line, underline |
|
* | asterisk, multiply | | | vertical line |
|
+ | plus sign | / | solidus, divide |
|
, | comma | ! | exclamation point |
|
– | hyphen-minus, minus | % | percent sign |
|
. | full stop, dot, point | | |
|
Implementation Permissions
16/2
This paragraph was deleted.
17/2
1 The characters in categories
other_control, other_private_use,
and other_surrogate are only allowed
in comments.
18
2 The language does not
specify the source representation of programs.
Contents Index Search Related Documents Previous Next Legal