WHOIS was created in the 1980s as a service to identify network operators on the Internet. Since this time, the Internet has changed far beyond expectations, evolving from a research network into a global commercial network that is integrated into everyday life. The usage of WHOIS has changed along with the evolution of the Internet, but the protocol has changed very little.
Consequently, the WHOIS protocol as it exists today has several deficiencies that are becoming increasingly apparent to the community. Some of the pressing issues include:
data accuracy and reliability
accessibility for users whose local language cannot be represented in ASCII
data protection and privacy concerns
Despite a series of task forces, working groups, workshops, surveys and studies over the last 15 years, the various issues with WHOIS policy and protocol still need to be addressed. The entire community of stakeholders are affected by WHOIS, bringing a wide variety of diverse concerns and interests to the table for discussion.
In 2010, ICANN formed the WHOIS Review Team, guided by the Affirmation of Commitments to review the are effectiveness of ICANN’s WHOIS policy and implementation, as well as if it meet the needs of the law enforcement community and promotes consumer trust. In response to the Review Team’s Final Report (2012), the ICANN Board passed a resolution to launch the Expert Working Group on gTLD Registration Directory Services (EWG) to “redefine the purpose of collecting, maintaining and providing access to gTLD registration data, and consider safeguards for protecting data” and to propose a new model for Registry Directory Services (RDS) that addresses the issues outlined above. Additionally, the Board’s resolution called for a GNSO Policy Development Process, to be informed by and address the issues in the EWG’s Final Report.
The EWG released its final report in 2014, initiating the process of structuring the Next-Generation gTLD Registration Directory Service to replace WHOIS PDP WG (Next-Gen RDS PDP WG). The PDP will be a three step process:
Establish gTLD registration data requirements to determine if and why a next-gen RDS is needed.
Design policies that detail functions that must be in place to support those requirements.
Provide guidance for how the next-gen RDS should implement those policies, including coexisting with and replacing WHOIS.
The process is currently in Phase One, in which the Next-Gen PDP WG aims to reach consensus recommendations, through analysis of several key questions, relating to the uses/purposes of RDS, who should have access to the data, data accuracy, privacy, steps for replacing WHOIS and other important issues.
Running parallel to this process, is the Public Comment period on the Registration Data Access Protocol (RDAP) Operational Profile for gTLD Registries and Registrars, which is a likely candidate to eventually replace WHOIS. The RDAP was developed by the IETF’s Web Extensible Internet Registration Data Services (WEIRDS) WG. Some major improvements that come with the RDAP include a standardized data model, internationalization for non-ASCII languages, and differentiated access.
As these processes develop, the outdated WHOIS Policy and Protocol will receive an overdue update, making improvements on the issues of accuracy, privacy and access. The Next-Gen PDP WG will hold their first face-to-face meeting at ICANN 55 to work toward their phase one goals.
Universal Acceptance is the state in which all valid domains names are useable in all Internet enabled applications, devices, and systems regardless of length, script, or newness to the DNS.
The issue of Universal Acceptance has been present since the 2000 gTLD expansion round, but has been exacerbated by the introduction of IDN’s and ICANN’s new gTLD Program, which has introduced over 900 new TLDs to the DNS root zone.
Prior to the early 2000s, developers of user interfaces and security rules made certain assumptions about TLDs:
Valid TLDs include, three letter TLDs, such as .com, .org, .gov, etc and ccTLDs, based on the ISO 3122 two letter codes
TLDs and email addresses contained only ASCII
The Root Zone is relatively static, with few changes made
In the early 2000s, the gTLD expansion made assumption of TLD length obsolete. TLD lengths became even more varied with the implementation of ICANN’s new gTLD Program. Also in 2010, the first IDN was introduced, which broke the assumption that ASCII is the only valid character set, introducing UTF-8. The final assumption was then broken by the new gTLD Program, which made the root zone much more dynamic, with new TLDs and IDNs being added every week.
Universal Acceptance is currently being addressed by the community through several initiatives, including the Universal Acceptance Steering Group (UASG), chartered in March 2015. The UASG’s primary objective is to help developers and website owners understand how to update their systems to treat all TLDs consistently, which will enable more customer choice, faith in the DNS and allow users of IDNs to fully utilize the Internet.
The UASG and ICANN launched a Universal Acceptance study for new gTLDs and reported the results in September 2015. The study and report: An Analysis of New gTLD Universal Acceptance, found that there were no problems with Universal Acceptance on the infrastructure level and that non-IDNs performed strongly across the board. However, IDNs still require some changes to software and systems to be reach full usability. Particularly complex and interesting issues arise in regard to scripts that run from right to left.
Usability of TLDs and IDNs requires five criteria outlined above in the UASG’s definition of Universal Acceptance:
Acceptance - Web applications and services that take domain names and email addresses as inputs should accept all valid extensions.
Validation - obsolete techniques and criteria used to validate domain names and email addresses should be updated
Storing - Applications and services require storage of domain names and email addresses, should be stored in RFC-defined or other compatible formats.
Processing - Domain names or email addresses should be able to be processed in programs or features without issues.
Displaying - Domain names and email strings should be properly displayed to users. This includes displaying the address in the proper script, without displaying the punycode text.
The work on Universal Acceptance will not be complete until “all valid domain names and email addresses are accepted, validated, stored, processed, and displayed correctly and consistently by all Internet-enabled applications, devices and systems.” At ICANN 55, the UASG will be holding an all day workshop for participants to develop solutions to the issues.