Clustering Techniques for the Identification of Web User Session
The web user-session can be defined as a set of several TCP connections generated by a single user while surfing the web during a given time frame. An activity period, i.e. session, is terminated by a long silent period. This activity period is comprised of several TCP connections which may be used to transfer data. However, identification of active and silent period is not trivial. Correct identification of session is the main goal of the authors' study. Traditional method used threshold-based mechanism for the identification of web user-sessions which required a priori definition of the threshold value. This method is very sensitive to the threshold value, which is very difficult to set correctly.