|
The
problems of long, complex Internet addresses and unwieldy search-engine
results are being eroded by a new search method. Stuart Mark explores
the latest advances in name resolution technology.
Common
Names Resolution Protocol
The
Internet has established itself as part of everyday life. It is faster,
can be accessed cheaply and provides a much improved quality of site and
service. But amid all this progress, the means by which a web site is
located and contacted is still extremely cumbersome.
There
is nothing more frustrating than typing a long website address full of
symbols and meaningless character sequences into the ‘Open’ or
‘Address’ field of your browser. The alternative of using a search
engine isn’t much better as, with all but the most stringently
constructed search strings, the result can be thousands of sites, many
of which are duplicated or bewilderingly obscure.
Finding A Site
Each
website on the Internet is identified by a Universal Resource Locator
(URL) or Univesal Resource Identifier (URI). This contains a domain name
which has a familiar format to most people, especially since the advent
of the dot com business model. The names are held on Domain Name System
(DNS) servers scattered around the web and their hierarchical
dot-notation format allow them to have geographic and functional
significance as in .uk and .org. Each name must be unique and this is
the first problem with URIs. Many organisations and individuals find
that their first choice name has already been used and have to settle
for something less fitting and, in many cases, a name that doesn’t
effectively describe the service they offer.
The
second problem with the URI is that it can easily turn into a typing
nightmare. In its most basic form, a URI will contain just a DNS name
and will provide access to a site’s homepage, for example
www.itweek.co.uk. If, however, direct access to a specific page deep
within the site is required, the URI will contain the DNS name followed
by the file pathname or script.
If the
URI of a site is not known, a search engine must be used. These are
ubiquitous on the web and allow a search to be carried out on any word
or phrase. The need for such a facility is obvious but the way in which
most search engines work severely restrict their effectiveness. In order
for a website to be located by a search engine, the site owner must have
first registered the site with the search engine operatator. In most
cases, reqistration involves the notification of the URI after which a
‘Crawler’ or ‘Spider’ robot program will interrogate the site. The
information gathered will be used by subsequent searches. This means
that a search based on a specific word will locate all sites containing
that word or, even worse, all pages within the same site containing the
same word. This is why a search will often return numerous instances of
the same URI.
Common
Names
To
address the problems of website search and access, the concept of common
names resolution has been formed and is gaining industry-wide
acceptance. A common name is an easily recalled identifier that can be
used to provide direct access to a website. The idea is that a site can
be located simply by typing a keyword or phrase into the ‘Open’ or
‘Address’ field of a browser instead of a URI. Alternatively, a common
names website can be used to perform a search.
Common
names are held in a database and reference specific areas of a
particular website. The difference between a search-engine search and a
common name search is that the common name method does not actually
search websites but instead relies on the database to return results.
This allows much closer control of search parameters and more accurate
results. A number of commercial common names solutions are already
available.
Netword invites subscriptions from web organisations for a yearly fee. A
company can choose brand, product, service or any other type of name and
reference it to any location in their website. Prospective customers can
search from the Netword website or download a free plug-in for their
browser to allow searches from the address field.
Even
more impressive is the offering from RealNames. Again companies register
names for a fee but anyone with Internet explorer 4.x or higher or the
NeoPlanet browser can perform common names searches direct from their
browser without the need for a plug-in. This is because RealNames has
formed pertnerships with Microsoft for browser functionality and with a
numerous web portals and search engines including MSN, Google and Alta
Vista. If you use IE, try entering a word into your browser; Microsoft,
Novell, Nasa and even Pokemon will connect you directly to these
websites. Even spaces and symbols can be used. Mutiple hits will take
you to the MSN search site and RealNames hits will be highlighted with a
special icon on the results page.
Both
Netword and RealNames use proprietary names databases and transport
methods but, such is the potential for this technology in the context of
ecommerce, B2B solutions and a more efficient marketplace, it has become
the subject of an Internet Engineering Task Force (IETF) working group.
CNRP
The
Common Names Resolution Protocol is intended to become a standardised
method of request-repsonse connections using common names. It will use
what the (IETF) refer to as Parameterised Resolution which means that
flexible descriptors will be used as query parameters to resolve
specific URIs. These descriptors will be permitted to contain the
symbols and different language characters available from UTF-8 encoding
instead of the 37 character ASCII limitation imposed by DNS. XML will be
used as a transport on TCP port 1096. Also the shorter names should
benefit wireless applications.
Informational RFC 2972 lays out the design goals for CNRP and the
framework for a Global Registry has recently been announced by the
Internet Corporation for Assigned Names and Numbers (ICANN), part of
which should provide a standard repository for Common Names. This could
replace the proprietary databases used at present using CNRP for
communication with search clients. Logical centralisation of the
registry may be achieved by a CNRP referral model which would share name
information between databases in much the same way that DNS forwarding
works today.
Namespaces are intended to be the method of storing names. These are
defined by the RFC as sets of common names within a category such as
Movie Titles under Entertainment or Company Names under Plumbers under
Home Services. CNRP is required to include simple search filter
functions which, when used in conjunction with a set of predefined
categories, will greatly reduce the size of search results compared to
those from search engines. One other advantage of categorisation is name
duplication. Where URIs must be unique, identical common names can be
registered by different organisations. If a name search finds
duplications, it can return results listed by category.
At
present, only one Informational and two Internet-Draft RFCs exist, but
the latter are expected to move to Proposed-Standard status shortly. The
most widely available implementation of Common Names Resolution, that
from RealNames, is likely to be one of the first commercial
implementations of CNRP. The company is working with the IETF in the
development of CNRP and has recently announced a restructuring in
response to the advent of the Global Registry.
However, there is little evidence of interest from the major browser
suppliers. Microsoft’s partnership with RealNames should force CNRP into
IE but Netscape may not follow until business pressures prove too great.
Either way, the days of wandering aimlessly around the web appear to be
numbered.
www.realnames.com
www.netword.com
www.ietf.org
www.icann.com
www.msn.com
Key
Points
-
CNRP will be the standard
for Common Names searches on the web
-
Normal words can be used
in browsers instead of complex URIs
-
IE4.x and above is
capable of a proprietary names search function now, supplied by
RealNames
-
CNRP will not replace DNS
but will operate on top of it.
-
CNRP is not a search
engine but relies on databases containg categories and namespaces
-
A global registry to
accomodate Common Names and Internationalised searching has recently
been announced by ICANN
|