MULTILINGUAL DOMAIN NAMES Hirofumi Hotta JOINT ITU/WIPO SYMPOSIUM

advertisement
JOINT ITU/WIPO SYMPOSIUM
MULTILINGUAL DOMAIN NAMES
December 6, 2001
Hirofumi Hotta
hotta@jprs.jp
http://日本レジストリサービス.jp
1
1,2
ASCII characters in the Internet
• The Internet as the birth
– Born in the United States
– Research and academic users
• Limited resources in computers and
communication devices
– Only ASCII codes are used for the core of the Internet
Only ASCII characters have been used
even by people using non-ASCII characters in social life
2
Ex) non-English characters in e-mail
• Step1
– Phonetic mapping in e-mail texts
• Step2
– Native language characters in e-mail texts
• Step3
– Native language characters in “Subject” fields
• Step4 ?
– Native language characters in “To” and “From” field
• Names such as company names and personal names in the
social relevant context should be presented in their native
language
3
What is a domain name ?
• Human readable identifier of an entity within
the Internet : Ex) www.itu.int
• Substitute of an IP address
www.itu.int
domain name
top level domain label
second level domain label
third level domain label
each label <= 63 characters
domain name =< 255 characters
4
Domain name structure
“”
com
org
co
jp
or
.. uk
go
au
ad
itu
yahoo sun
nic
jprs
int
..
wipo
WWW
WWW
5
Characters in domain names
Computer Engineers
LDH (letters-digits-hyphen)
Wider users
• Consumers
• ASCII character set users
– Natively
– In transliterated form
???
ex) English
ex) Malay
• Non-ASCII character set users
– Chinese, Japanese, Korean, Arabic, Tamil, ….
6
Demands on multilingual domain names
• Rapid growth of the Internet
– More non-English speakers are becoming Internet users
• People using non-ASCII characters
• Undesirable unification in LDH world
– 博文, 博史, 宏史, …..are all “hirofumi”s in ASCII space
– Apostrophe, accents, umlauts, ….. cannot be used in
ASCII space
Demand on multilingual domain names
7
3
History (technology)
• Late 1990s
– Multilingual domain names were developed at the
National University of Singapore
• July 1998
– Asia Pacific Networking Group
• iDNS Working group : development of the experimental
implementation of an Internationalized multilingual
multiscript Domain Names Service
– Why shouldn’t domain names be internationalized too, now that
the Internet has grown to reach almost every corner of the
world using different languages?
• iDomain Working Group : creation of an iDNS testbed in Asia
Pacific countries
• China, Japan, Korea, Hong Kong, Singapore, Taiwan,
Thailand, ….
8
History (technology) - continued
• 1998-1999
– Prototypes demonstrated in international conferences
– BoFs held in international conferences
• APRICOT
• INET
– Singapore, China, Hong Kong, Korea, Japan, …
expressed interests in implementation
• Nov. 1999
– BoF in IETF
– IETF Mailing list discussion
• Jan. 2000 – IDN (Internationalized Domain Name) Working
Group in IETF
9
History (deployment)
• End of 1999
– Several companies began commercialization of the
multilingual domain name technology
– Several testbeds emerged
• July 2000 – MINC (Multilingual Domain Names Consortium)
• promotion of the multilingualization of Internet names,
including Internet domain names and keywords, the
internationalization of Internet names standards and
protocols, technical coordination, and liaison with other
international bodies
– Country/regional organizations
•
•
•
•
AINC (Arabic Internet Names Consortium)
CDNC (Chinese Domain Name Consortium )
INFITT (International Forum for IT in Tamil )
JDNA (Japanese Domain Names Association )
10
History (policy)
• March 2001
– Internationalized Domain Name (IDN) Working
Group in ICANN Board
• Fact finding survey concerning technical, policy, and service
aspects
• Survey report published in Sept. 2001
– Market demand shown
– List of issues elaborated
– GAC (Governmental Advisory Committee) of ICANN
• communiqué expressing GAC’s support for multilingual
domain names
– With regard to international domain names, the GAC confirms
the importance and interests of this development to the benefit
of Internet users worldwide
• Sept. 2001
– IDN Committee
• Will recommend solutions of non-technical issues
11
4
How domain names are resolved
managed
by IANA
Internet
managed
by IANA
on behalf
of .int
managed
by ITU
root server
IP address of .com name server
IP address of .int name server
IP address of .jp name server
:
.int name server
IP address of itu.int name server
IP address of wipo.int name server
:
itu.int name server
IP address of www.itu.int
IP address of intra.itu.int
:
A server having www.itu.int
http port
intend to browse
http://www.itu.int
156.106.192.32
12
Where multilingual domain names are
recognized
Internet
application
user
server
application
client
application
DNS
PC
currently
ASCII domain names
in the future
option 1 multilingual domain names
option 2
ASCII domain names
multilingual domain names
13
Key technical questions
• How should non-ASCII codes be represented ? <Q1>
• Where should non-ASCII codes be recognized ? <Q2>
– in the client application / in the DNS server
• What is the technical mechanism that maps
multilingual domain names to current DNS
technology ?
<Q3>
14
Basic technical requirements
• Preservation of compatibility with current
domain names
• Preservation of uniqueness of domain name
space
• The Internet must not be divided into islands
Required by IAB (Internet Architecture Board)
15
Character codes of multilingual domain
names
<Q1>
• Current : proprietary (local) standard
– in PCs
– in PDAs
– in Internet-enabled phones
• Best current solution may be
– UNICODE
– Specification of code sets of many languages
• Additional issues
– traditional Chinese characters / simplified Chinese
characters
• Are they same characters in domain names ?
• Is this a local code issue or universal protocol issue ?
16
Client-side vs. Server-side solutions
• Client-side solution
<Q2>
– Translation between multilingual script and ASCIIcompatible representation is performed in the user
application
– Domain names are processed as ASCII domain
names all over the Internet
• Server-side solution
– Domain names are sent over the Internet in local
encoding
– Applications and services communicate with each
other using non-ASCII domain names all the way
user
application
DNS
ASCII domain names
client-side multilingual domain names
solution
multilingual domain names
server-side
solution
17
Client-side solution > Server-side solution
• IETF is moving towards client-side solution
• Stability
– DNS is a huge distributed database
– DNS is working on a delicate balance
– Substantial change of DNS is dangerous
• Deployment speed
– Changing all the servers takes long time
• Consistency
– Partial deployment of server-side solution may lead to
separation of the Internet
18
How multilingual string is converted to
ASCII
ex)
ABCカンパニー.JP
ABCカンパニー.JP
ABCカンハ゜ニー.JP
original
string
<Q3>
NAMEPREP
unification of the strings considered to be the same
normalized
string
ex)
ABCカンパニー.JP
ACE
conversion to an ASCII string
Internet
(based on ASCII)
ASCII
string
ex)
BQ--GD7UD72C75B2X46RZP6A.JP
19
NAMEPREP and ACE
• NAMEPREP (Preparation of Internationalized Host Names)
– Multilingual string representations which should be
regarded as the same string are converted into one
representation
• Case fold
• Normalize
• Prohibit
• ACE (ASCII Compatible Encoding)
– Multilingual representation is converted into an
appropriate ASCII domain name
– Ex) ACE algorithm
• RACE
– BQ--3BS6KZZMRKPDBSJQ4EYKIMHTKQGU7CY
• AMC-ACE-Z
– ZQ--ECKWD4C7CU47R2WFQW7A0ECL32K
20
Issues in using ACE
• Subspace is used by multilingual domain names
ACE
Multilingual
decode
Domain Names
ACE-ed
Multilingual
Domain Names
ASCII Domain Names
• Issues
– Reservation of the subspace
– Length limitation is severer
• Domain label
• Domain name
21
IDNA
(Internationalizing Host Names in Applications)
local
user
application
end system
UI
internal
representation
resolver
international DNS servers
To/From
to/from Unicode
NAMEPREP
To/From
to/from ACE
API
application servers
22
5
Impact on the DNS structure
• Alternate root
multilingualized
Name
Server
Name
Server
Name
Server
multilingual
domain name
space
multilingual
domain name
space
authoritative root
Hierarchy
overseen
by ICANN
TLDs not authorized by ICANN
23
Impact on the DNS structure (continued)
• Inclusive root (variation of alternate root)
multilingualized
Name
Server
Hierarchy
overseen
by ICANN
multilingual
domain name
space
unauthorized TLDs can be
seen together with ICANN’s
24
Impact on the DNS structure (continued)
• Pseudo-root (zero level domain)
銀行.企業
Name
Server
Hierarchy
overseen
by ICANN
append “.jp” to “銀行.企業”
jp
企業
銀行
multilingual
domain name
space
25
Defining a multilingual top level domain
• Current implementation of multilingual domain
names
– Second level domain or under
• Allowed by current DNS architecture and technology
– Top level domain
• Alternate root
• Inclusive root
• Pseudo-root
• Above are only to satisfy commercial drive or
users’ demands on early deployment of
multilingual domain names
• It is important for ICANN to define a
multilingual top level domain creation policy
26
6
Issues in various TLDs
• {non-ASCII-string}.{ASCII-ccTLD}
• {non-ASCII-string}.{ASCII-gTLD}
– Organizations already being authorized are responsible for
the domain name space
• {any-string}.{non-ASCII-ccTLD}
– One organization from the relevant country is named to be
responsible for the domain name space
– If a country has more than 1 official language,
• What is the language for non-ASCII-ccTLD, or
• How many non-ASCII-ccTLDs are given to the country
• {any-string}.{non-ASCII-gTLD}
– No one can tell whether top level domain “.企業” is
Chinese or Japanese
– Difficulty in choosing a responsible organization
• who in what country
27
Other political issues
• What are the languages that constitute multilingual
domain names
– Some languages have 2 or more kinds of scripts
– Traditional Chinese/simplified Chinese
• Who is the language authority for multilingual
domain names
– Should rules be the same even under different TLDs?
• A single domain name registry should not be the ultimate authority
of for the rules
• Is such rule definition an international issue?
– Language rules are known to only people using the
language
• To what extent does the solution need international standard or local
coordination?
• Each language stakeholders should coordinate among themselves 28
7
Implementations
• VGRS (VeriSign Global Registry Services)
• JPNIC/JPRS (Japan Network Information Center /
Japan Registry Service)
• i-DNS.net
• CNNIC (China Internet Network Information
Center)
• Walid
• Neteka
• NativeNames
:
:
29
8
Future Issues
growth of the number of multilingual
domain names and their users
deployment of
name servers synergy
with multilingual
domain names
applications
with
multilingual
domain name
facilities
policy and coordination of registration and management rules
technology standardization and development
30
Download