SMS extended characters now supported
We have expanded the set of characters that can be used when sending SMS messages to devices from the Soracom User Console or using the Soracom API/CLI.
Previously, when specifying GSM-7 (GSM 7-bit default alphabet) encoding, SMS messages only supported characters in the GSM-7 Basic Character Set. With this update, SMS messages can now include additional characters such as `{`, `}`, `[`, `]`, `|`, `^`, and `~`. The full list of supported characters is shown in the two tables below.
GSM-7 Encoding Basic Character Set
These are standard characters that are transmitted using 7 bits per character.
| | 0x0_ | 0x1_ | 0x2_ | 0x3_ | 0x4_ | 0x5_ | 0x6_ | 0x7_ |
|---|---|---|---|---|---|---|---|---|
| 0x_0 | @ | Δ | SP | 0 | ¡ | P | ¿ | p |
| 0x_1 | £ | _ | ! | 1 | A | Q | a | q |
| 0x_2 | $ | Φ | " | 2 | B | R | b | r |
| 0x_3 | ¥ | Γ | # | 3 | C | S | c | s |
| 0x_4 | è | Λ | ¤ | 4 | D | T | d | t |
| 0x_5 | é | Ω | % | 5 | E | U | e | u |
| 0x_6 | ù | Π | & | 6 | F | V | f | v |
| 0x_7 | ì | Ψ | ' | 7 | G | W | g | w |
| 0x_8 | ò | Σ | ( | 8 | H | X | h | x |
| 0x_9 | Ç | Θ | ) | 9 | I | Y | i | y |
| 0x_A | LF | Ξ | * | : | J | Z | j | z |
| 0x_B | Ø | | + | ; | K | Ä | k | ä |
| 0x_C | ø | Æ | , | < | L | Ö | l | ö |
| 0x_D | CR | æ | - | = | M | Ñ | m | ñ |
| 0x_E | Å | ß | . | > | N | Ü | n | ü |
| 0x_F | å | É | / | ? | O | § | o | à |
LF, CR, and SP represent the following control characters (with their Unicode code points):
- LF: Line Feed (U+000A)
- CR: Carriage Return (U+000D)
- SP: Space (U+0020)
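To make the table layout and the "7 bits per character" figure concrete, here is a minimal Python sketch. It assumes the usual reading of the table (the column header supplies the upper bits and the row header the lower four bits) and the septet packing described in 3GPP TS 23.038. The names `table_code` and `pack_septets` are hypothetical and are shown for illustration only; you do not need to pack messages yourself when sending from the User Console or the API/CLI.

```python
def table_code(column: int, row: int) -> int:
    """Combine a column header (0x0_ to 0x7_) and a row header (0x_0 to 0x_F)
    into the 7-bit code of the character in that cell."""
    if not (0 <= column <= 0x7 and 0 <= row <= 0xF):
        raise ValueError("column must be 0-7 and row must be 0-15")
    return (column << 4) | row


def pack_septets(septets) -> bytes:
    """Pack 7-bit codes into octets, least significant bits first
    (an illustrative take on the packing in 3GPP TS 23.038)."""
    out = bytearray()
    acc = 0      # bit accumulator
    nbits = 0    # number of valid bits currently held in acc
    for septet in septets:
        acc |= (septet & 0x7F) << nbits
        nbits += 7
        while nbits >= 8:
            out.append(acc & 0xFF)
            acc >>= 8
            nbits -= 8
    if nbits:
        out.append(acc & 0xFF)  # final partial octet, zero-padded
    return bytes(out)


# "hello": for example, h sits in column 0x6_, row 0x_8, so its code is 0x68.
codes = [
    table_code(0x6, 0x8),  # h
    table_code(0x6, 0x5),  # e
    table_code(0x6, 0xC),  # l
    table_code(0x6, 0xC),  # l
    table_code(0x6, 0xF),  # o
]
print(pack_septets(codes).hex())  # e8329bfd06
```

Because each character needs only 7 bits, eight basic characters fit into seven octets.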
GSM-7 Encoding Basic Character Set Extension
Note that each of these characters is sent using 14 bits: the escape character 0x1B followed by the 7-bit code of the extended character.
| | 0x0_ | 0x1_ | 0x2_ | 0x3_ | 0x4_ | 0x5_ | 0x6_ | 0x7_ |
|---|---|---|---|---|---|---|---|---|
| 0x_0 | | | | | \| | | | |
| 0x_1 | | | | | | | | |
| 0x_2 | | | | | | | | |
| 0x_3 | | | | | | | | |
| 0x_4 | | ^ | | | | | | |
| 0x_5 | | | | | | | € | |
| 0x_6 | | | | | | | | |
| 0x_7 | | | | | | | | |
| 0x_8 | | | { | | | | | |
| 0x_9 | | | } | | | | | |
| 0x_A | FF | | | | | | | |
| 0x_B | | | | | | | | |
| 0x_C | | | | [ | | | | |
| 0x_D | | | | ~ | | | | |
| 0x_E | | | | ] | | | | |
| 0x_F | | | \ | | | | | |
- FF represents the form feed control character (U+000C).
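Because each extension character takes two 7-bit units instead of one, a message that uses them is longer on the wire than its character count suggests. The following Python sketch estimates that length. `GSM7_EXTENSION` and `septet_count` are hypothetical names, and the function assumes that every character outside the extension table belongs to the basic table.

```python
# Characters from the extension table: form feed, ^ { } \ [ ~ ] | and €.
GSM7_EXTENSION = set("\f^{}\\[~]|€")


def septet_count(text: str) -> int:
    """Count the 7-bit units needed for text.

    Extension characters cost 2 (0x1B escape + code); everything else is
    assumed to be a basic-table character and costs 1. No validation is done.
    """
    return sum(2 if ch in GSM7_EXTENSION else 1 for ch in text)


message = '{"t":21}'
print(len(message))           # 8 characters
print(septet_count(message))  # 10 septets, because { and } each count twice
```

Small JSON payloads are a typical case: the braces and brackets are now accepted, but each one counts double toward the message length.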
Please note that the interpretation and display of these characters on the device receiving the SMS message may vary depending on the device's implementation.
If you need to send characters that are not listed in the tables, consider using UCS-2 encoding.
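One way to decide between the two encodings before sending is to check the message against both tables. The Python sketch below does this under stated assumptions: `GSM7_BASIC`, `GSM7_EXTENSION`, and `suggest_encoding` are hypothetical names, the returned strings are plain labels rather than Soracom API parameter values, and the character sets are transcribed from the tables above.

```python
# Basic Character Set, transcribed row by row (0x1B, the escape code, is omitted).
GSM7_BASIC = (
    "@£$¥èéùìòÇ\nØø\rÅå"
    "Δ_ΦΓΛΩΠΨΣΘΞÆæßÉ"
    " !\"#¤%&'()*+,-./"
    "0123456789:;<=>?"
    "¡ABCDEFGHIJKLMNOPQRSTUVWXYZÄÖÑÜ§"
    "¿abcdefghijklmnopqrstuvwxyzäöñüà"
)
# Basic Character Set Extension: form feed, ^ { } \ [ ~ ] | and €.
GSM7_EXTENSION = "\f^{}\\[~]|€"

GSM7_ALLOWED = set(GSM7_BASIC) | set(GSM7_EXTENSION)


def suggest_encoding(text: str):
    """Return ("GSM-7", []) if every character is in the tables,
    otherwise ("UCS-2", [characters that are not in the tables])."""
    unsupported = sorted({ch for ch in text if ch not in GSM7_ALLOWED})
    if unsupported:
        return "UCS-2", unsupported
    return "GSM-7", []


print(suggest_encoding("temp=21~"))    # ('GSM-7', [])
print(suggest_encoding("a`b"))         # ('UCS-2', ['`'])
print(suggest_encoding("こんにちは")[0])  # UCS-2
```

Returning the offending characters makes it easier to see why a message cannot be sent as GSM-7; the backtick, for instance, is an ASCII character that appears in neither table.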