Discovering knowledge such as causal relations among objects in large data
collections is very important in many decision-making processes. In this pa
per, we present our development of an integrated environment acting as a so
ftware agent for discovering correlative attributes of data objects from mu
ltiple heterogeneous resources. The environment provides necessary supporti
ng tools and processing engines for acquiring, collecting, and extracting r
elevant information from multiple data resources, and then forming meaningf
ul knowledge patterns. The agent system is featured with an interactive use
r interface that provides useful communication channels for human superviso
rs to actively engage in necessary consultation and guidance in the entire
knowledge discovery processes. A cross-reference technique is employed for
searching and discovering coherent set of correlative patterns from the het
erogeneous data resources. A Bayesian network approach is applied as a know
ledge representation scheme for recording and manipulating the discovered c
ausal relations. The system employs common data warehousing and OLAP techni
ques to form integrated data repository and generate database queries over
large data collections from various distinct data resources. (C) 2001 Elsev
ier Science Ltd All rights reserved.