Next: Notification Log Analysis
Up: Data Characteristics
Previous: Description of Data Logs
Types of Analyses
We now discuss the types of analyses that we perform on the notification and
browse logs, and the motivations for doing these analysis.
- 1.
- Content analysis: We are interested in questions such as: (i) what
are the most popular content categories, and (ii) what is the distribution of message
sizes? We believe such questions are important to (i) content providers who need
to understand better how to prioritize and use the system and network resources
efficiently, and to (ii) web site developers who are interested in supporting
fast access to popular content.
- 2.
- Popularity analysis: We are interested in the popularity
distribution of notification and browse documents. In particular, we are
interested in comparing these accesses to the well-known Zipf-like
distribution as reported in previous web
studies [4,7,10,14,16], and in determining how
concentrated are the number of requests/transmissions for popular
documents. This has significant implication for the effectiveness of web
caching and multicast delivery.
- 3.
- User-behavior analysis: We are interested in classifying users
according to their access patterns. This is useful for personalization,
targeted advertising, prioritizing, and capacity planning. Specifically,
we look at the following aspects of user behavior:
- Spatial Locality: whether users in the same
geographical region tend to receive/request similar
notification and browsing content.
- Temporal Stability: whether users are interested in browsing similar documents over time.
- User Load Distribution: how different users place load on the web
site; for service providers, this distribution has implications on pricing.
Next: Notification Log Analysis
Up: Data Characteristics
Previous: Description of Data Logs
Lili Qiu
2002-04-17