WEB CONTENT MINING USING GRAPH THEORETICAL MODEL AND GENETIC ALGORITHM

Show simple item record

dc.contributor.author ADELOLA, Moses Akinjide
dc.date.accessioned 2020-10-28T12:15:54Z
dc.date.available 2020-10-28T12:15:54Z
dc.date.issued 2016-06
dc.identifier.uri http://196.220.128.81:8080/xmlui/handle/123456789/553
dc.description M.TECH THESIS en_US
dc.description.abstract The World Wide Web (www) is arguably the largest repository of data and continues to expand in size and complexity daily. The growth in size, however poses a lot of challenges to web users. Retrieving required web pages and information efficiently and effectively has become a herculean task for web users due to information overload and limitations of some information retrieval tools. Web mining adopts data mining techniques to automatically discover and retrieve information from web documents and services. Web content mining gathers, categorizes, organizes and presents accurate information available online to the user of the web. The research work designed a technique for performing data mining on web documents which utilize graph representations of document content and it addressed the problems of initialization, convergence to local minimal and failure to handle large datasets which are the limitations of reviewed works. Genetic algorithm is adapted to work with the graph representation of the web documents while the graph serves as data items to the genetic algorithm. The design of the system is in three phases namely: contents extraction, preprocessing and database of mined data. Maximum common sub graph (MCS) was used to calculate the distance between clusters. The research was conducted using a Computer System with 2GHz processor and 1GB RAM running Window 7 operating system, web scraper (import.io), PHP 6 and MySQL5 as the database. en_US
dc.description.sponsorship FEDERAL UNIVERSITY OF TECHNOLOGY AKURE en_US
dc.language.iso en en_US
dc.publisher FEDERAL UNIVERSITY OF TECHNOLOGY, AKURE en_US
dc.subject Internet en_US
dc.subject GRAPH THEORETICAL MODEL en_US
dc.subject GENETIC ALGORITHM en_US
dc.title WEB CONTENT MINING USING GRAPH THEORETICAL MODEL AND GENETIC ALGORITHM en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search FUTAspace


Advanced Search

Browse

My Account