Monitoring and Displaying Traffic on the World Wide Web
by Evangelos P. Markatos and Athanasios E. Papathanasiou
Developed by the Computer Architecture and VLSI Group at the Institute
of Computer Science - FORTH, Palantir is an application that can
be used to display the origin, volume and type of the incoming
requests of a web server. A good knowledge of the geographic distribution
and access patterns of the clients creating these requests may
indicate a more efficient way of erving them.
World-Wide Web traffic continues to increase at impressive rates.
Busy web servers may get as many as several millions of hits (accesses)
in a day. Accesses may originate from all over the world and may
result in a rush hour that lasts 24-hours per day. Web traffic
will probably continue to increase as more people gain access
and new applications (including commercial ones) are emerging.
To meet the demands of this ever-increasing traffic, webmasters
should design their web servers in such a way as to disseminate
information (and sell or advertise products) effectively and reliably.
A first step towards effective information dissemination is understanding
a web servers client base, and reaching out to it. Palantir is
a web traffic monitor and visualization tool that can be used
to display the origin, volume and type of the incoming requests
of a web server. A good knowledge of the geographic distribution
and access patterns of the clients creating these requests may
indicate a more efficient way of serving them.
Palantir reads the log files of a web server and after categorizing
the requests into six major types (text, graphics, audio, video,
scientific, and other - that is any other kind of request) displays
the origin of the requests on a geographical landscape (in pictorial
form). The geographical origin of the request is found by comparing
its IP address to databases of information about geographical
locations of the address. In the case of addresses registered
in the US, the addresses are compared to registry databases and
then that information in compared against a zipcode database to
determine the geographic origination of the address. The type
and magnitude of requests that originate from each region are
shown in the map either stacked bars or as concentric circles.
Different types of requests are easily identified by being displayed
with different colors.
Palantir can animate the web traffic in static mode or in dynamic
mode. In the first case, the requests, which have occurred during
a specific period of time and are contained in the selected log
files, are animated in the viewer. Each request remains displayed
until the end of the simulation (it has an unlimited time life).
Thus, the stacked bars (or the concentric circles) present the
total amount of requests cumulatively (summary of traffic over
a specified period). In the Dynamic mode, Palantirs viewer tries
to capture the instant traffic of requests. Each request, contained
in the log file, is considered to have a limited time life. As
time passes, new requests are displayed on the viewer, while those
that have exceeded their time life (old requests) are deleted.
In this way a visualization of the way that network traffic varies
with time is given! Moreover, Palantir checks continually the
log file to display new incoming requests at real-time.
To facilitate its use, Palantir provides an graphical interface
with several functions most important of which are:
- aggregation: requests originating from several predefined (or
user-defined) regions are aggregated into a single stacked bar
(or concentric cycle)
- zooming: Palantir may zoom in a specific location in order to
study more effectively the traffic that originates from a particular
geographic region.
- filtering: Palantir provides two kinds of filters: Domain Filter
and Request Filter. The Domain Filter is used to display only
requests that come from a specified domain, while the Request
Filter displays only those requests that ask for a specific kind
of files.
Palantir is completely written in Java, mainly for portability
reasons and may be accessed though its Home Page at ICS-FORTH:
http://archvlsi.ics.forth.gr/OS/www.html.
A Palantir server is currently running at http://sappho.ics.forth.gr:9000,
which may be freely used for downloading Log files and viewing
Network Traffic Visualizations.
Current plans for Palantir include visualizing a domains outgoing
requests. A simple way for achieving this is by monitoring outgoing
TCP packets that have as a destination port the 80 port (default
port for http requests) of a host.
Please contact
Evangelos P. Markatos - ICS-FORTH
Tel: +30 81 391655
E-mail: markatos@ics.forth.gr
Athanasios E. Papathanasiou - ICS-FORTH
Tel: +30 81 391437
E-mail: papathan@ics.forth.gr