Learning from Web: Review of Approaches

04/13/2005
by Vitaly Schetinin, et al.

Knowledge discovery is defined as the non-trivial extraction of implicit, previously unknown, and potentially useful information from given data. Knowledge extraction from web documents deals with unstructured, free-format documents whose number is enormous and rapidly growing. Artificial neural networks are well suited to the problem of knowledge discovery from web documents because trained networks can more accurately and easily classify the learning and testing examples that represent the text mining domain. However, neural networks that consist of a large number of weighted connections and activation units often produce text classification models that are hard to understand. This problem also affects the most powerful recurrent neural networks, which employ feedback links from hidden or output units to their input units. Thanks to these feedback links, recurrent neural networks are able to take the context within a document into account. To make neural networks useful for data mining, self-organizing neural network techniques of knowledge extraction have been explored and developed. Self-organization principles were used to create an adequate neural-network structure and to reduce the dimensionality of the features used to describe text documents. These principles are of interest because they can reduce neural-network redundancy and considerably facilitate knowledge representation.
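
To illustrate how feedback links let a recurrent network take context into account, the following is a minimal sketch and not the authors' method. It assumes an Elman-style layout, in which the hidden activations are fed back alongside the next input. The vocabulary, the weights (hand-picked, not trained), and the function names are all hypothetical. It contrasts the recurrent encoding with an order-insensitive bag-of-words count.

```python
import math
from collections import Counter


def elman_step(x, h_prev, w_in, w_rec):
    """Compute new hidden activations from the current input and the
    hidden state fed back from the previous step."""
    h = []
    for j in range(len(h_prev)):
        total = sum(w_in[j][i] * x[i] for i in range(len(x)))
        # Feedback links: previous hidden activations act as extra inputs.
        total += sum(w_rec[j][k] * h_prev[k] for k in range(len(h_prev)))
        h.append(math.tanh(total))
    return h


def encode(tokens, vocab, w_in, w_rec):
    """Run the tokens through the recurrent layer; return the final state."""
    h = [0.0] * len(w_rec)
    for tok in tokens:
        x = [1.0 if tok == v else 0.0 for v in vocab]  # one-hot input
        h = elman_step(x, h, w_in, w_rec)
    return h


def bag_of_words(tokens):
    """Order-insensitive representation, for contrast."""
    return Counter(tokens)


# Hypothetical toy vocabulary and hand-picked weights (illustration only).
VOCAB = ["not", "good", "bad"]
W_IN = [[0.5, 1.0, -1.0],
        [1.0, -0.5, 0.5]]
W_REC = [[0.0, -1.5],
         [0.8, 0.0]]

doc_a = ["not", "good"]
doc_b = ["good", "not"]

h_a = encode(doc_a, VOCAB, W_IN, W_REC)
h_b = encode(doc_b, VOCAB, W_IN, W_REC)

# Same words, different order: the counts match, the recurrent states do not.
print(bag_of_words(doc_a) == bag_of_words(doc_b))
print(h_a == h_b)
```

With these weights the two documents have identical word counts but different final hidden states, which is the sense in which the feedback links carry context. A trained network would learn `W_IN` and `W_REC` from data rather than use fixed values.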

