Here are a few common web analytic metrics generated from clickstream data. Keeping in mind the goal of web mining and following the metrics listed below will help us better understand their use. Broadly all web mining goals can be classified as: 1] Increasing revenue, 2] reducing costs and 3] improving customer loyalty.
Conversion Rate = Transactions / Visit. This number answers, what percentage of customers bought a product, subscribed to a service, or filled out a form. [9]
Page Depth = Totals Pages Viewed / Visits. Page depth is a key metric for determining site stickiness. Stickiness determines how engaging a website is to a visitor. Typically, there is a direct correlation between page depth and conversion rates. The more pages a visitor sees, the better the chance the visitor converts. [9]
Bounce-Rate: A “bounce” occurs when a person leaves a website immediately without having viewed any page but the entry page. The number of bounces is compared to those who visit more than one page to give a ‘Bounce Rate’. There are two main problems that lead to a high bounce rate: Attracting the wrong kind of traffic and not giving the visitor what they were looking for. High bounce rate is also a good indicator to detect click frauds. [10]
A drop-out rate refers to a given process (say a purchasing process or a registration process) and the % of people who fail to get past that process successfully.
Path Analysis – gives the movement of the flow of visitors. It gives the sequence of hyperlinks one or more website visitors follows on a given site. The ideal path through the site should go from the homepage to the products page to the orders page, and finally to the checkout page. Deviations might include paths to tutorials, articles and other information pages. The marketer can select a particular visitor, or drop-out and then drill down to the detail page to reveal every page visited and path taken, as well as the amount of time spent viewing each page.
Page View Duration - Average amount of time that visitors spend on each page of the site. As with Session Duration, this metric is complicated by the fact that analytics programs can not measure the length of the final page view.
Cost per Acquisition = Cost / Visits. How much does it cost to bring a visitor to the website? CPA can help identify efficient and inefficient traffic sources/mediums. [9]
ROI = (Revenue – Cost) / Cost). ROI is the Holy Grail of Key Performance Indicators for any campaign that has an associate cost.
Some other information that can be inferred form clickstream data is: the IP-address of the user, the page (URL) the user requested, and the timestamp (down to seconds) of the event. From these 3 facts, we can derive a great deal of extra information. First, we can identify individual users, through the IP-address with IP geolocation which can then be mapped to the user’s country and city of origin. We can identify user sessions, including start page, end page and all pages visited between those two.
Finally, we can also group users by time between clicks and the time of day/week users are on the site. This is useful to monitor traffic from specific geographies and to detect click frauds.
These metrics are processed under different algorithms to infer the different types of information.
Some examples include:
Association rules are employed to discover associated events, products and pages.
Clustering is used to discover visitor groups with common properties, interests and common behavior.
Classification is used to characterize visitors with respect to a set of predefined classes. It is also used to detect card frauds.
It might be appropriate to include a reference to Search Engine optimization (SEO) at this stage. Search engine optimization is the process of changing various elements of a web site to optimized targeted traffic though search engines. This process is achieved by using the web analytic metrics to analyze the site. The goal of this analysis is to analyze the market, the target audience, and the result that the site owner wants to achieve. Some SEO metrics used are: number of links pointing to the site, keyword density, keyword proximity, number of searches per keyword, internal site links, URL normalization and page design. The analysis phase is followed by the implementation of these changes. The results are tracked and monitored to achieve better search engine ranking and a higher targeted traffic.
Search engine optimization primarily uses web content mining, and web structure mining.
[8] http://www.bipminstitute.com/datamining/propensity-cluster-associationpatterns.php
[9] http://www.weblinc.com/Our_Services/Metrics_Analytics/