Charles Explorer logo
🇬🇧

Visual Similarity of Web Pages

Publication at Faculty of Mathematics and Physics |
2010

Abstract

In this paper we introduce an experiment with two methods for evaluating similarity of Web pages. The results of these methods can be used in different ways for the reordering and clustering a Web page set.

Both of these methods belong to the field of Web content mining. The first method is purely focused on the visual similarity of Web pages.

This method segments Web pages and compares their layouts based on image processing and graph matching. The second method is based on detecting of objects that result from the user point of view on the Web page.

The similarity of Web pages is measured as an object match on the analyzed Web pages.