WeBrowse Traces
The following traces, which are available for the
research community, refer to HTTP traffic only.
Traces have been collected and organized with the support of the mPlane project.
Traces for WeBrowse
To build WeBrowse we employed a dataset of HTTP traces collected using the Tstat probe installed at the egress link at our campus network, plus a set of ground thruth traces which we employed to evaluate the accuracy of the classifiers building WeBrowse.
The trace log_http_complete.anonim is one hour long and cointains logs of HTTP requests observed by Tstat.
In order to respect privacy, IP addresses have been anonymized, and any private sensitive information has been removed.
The ground truth archive contains a list of websites we manually visited, together with the list of URLs actually contacted by the browser to render such websites.
All files are available in gzip format only.
|
|