A survey on web page de duplication using web mining techniques
A survey on web page de duplication using web mining techniques
The presence of duplicate web pages affects the speed of searching, the relevant documents to be retrieved and thereby the search engine performance. Web mining is the application of data mining techniques to discover patterns from the World Wide Web. Web mining can be divided into three different types.