Summaries Created from URLs

Any log format that contains a URL will produce a range of Summaries enabling parts of the URL to be analyzed separately.

Some log formats contain more than one URLs such those that include the Referrer Url in addition to the normal requested URL. The following Summaries are created from URLs:

  • Origin Domain
    This is the result of the Fastvue Site Clean engine that attempts to show real sites visited instead of CDNs, advertising and social sharing widget URLs. If you want to see a clean list of websites visited in your Reports, we recommend using this Summary.
  • Site Domain
    This is domain part of the URL, excluding sub-domains.
  • Site Extension
    This is the extension of the URL, such as .js (javascript), .png, .gif etc.
  • Site Keywords
    If the URL contains a common search query (such as q= used by google) this Summary shows the value of the search.
  • Site Name
    This is the domain part of the URL, including sub-domains such as www., images. etc
  • Site Port
    This is the Port number used, most often set to 80 or 443 for web traffic. If the port is not explicitly defined in the URL, the port will be automatically set to either 80 or 443 depending on the scheme of the URL (http or https).
  • Site Profile
    This Summary displays the result of WebSpy Vantage's Profile engine. It looks for certain keywords in the URL and returns a Category such as Search Engines or Adult.

    Note: If your log format includes a URL Category field, we recommend using that Category summary in your Report Templates, rather than Vantage's Site Profile Summary. Generally speaking, web gateways are better at determining the category of a web page, over WebSpy Vantage's profiling feature that only has access to a URL in a log file.

  • Site Protocol
    This Summary displays the protocol of the URL, such as http, https or ftp.

  • Site Query
    This is the Query part of the URL (everything after the ? character)
  • Site Resource
    This summary displays the Path portion of URL, which is everything between the Domain and the Query.
  • Site Top Level Domain
    This summary displays the Top Level Domain of a URL, such as .com, .org and .co.uk
  • Site URL
  • This summary displays the full URL, excluding the Query.

Example

For example, the URL: https://www.google.com:443/search.php?q=my+search+term will return the following Summaries and Values:

  • Origin Domain: google.com
  • Site Domain: google.com
  • Site Extension: .php
  • Site Keywords: my search term
  • Site Name: www.google.com
  • Site Port: 443
  • Site Profile: Search Engines
  • Site Protocol: https
  • Site Query: ?q=my+search+term
  • Site Resource: /search.php
  • Site Top Level Domain: .com
  • Site URL: http://www.google.com:443/search.php?q=my+search+term