Insights: Software that extracts knowledge from data. Model. Predict. Simulate. Discover. Associate. Gain new Knowledge and Insights from noisy data. Easily and Reliably. Ultra-fast, parallel, self-organizing, high-dimensional modeling of real-world processes and complex systems. INSIGHTS
Selected Publications.
Self-organizing Modeling applied in different domains.
Global sensitivity analysis using sparse high dimensional model representations generated by the group method of data handling
Romain S.C. Lamberta, Frank Lemkeb, Sergei S. Kucherenkoa, Shufang Songa,c, Nilay Shaha
a Centre for Process Systems Engineering, Department of Chemical Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
b KnowledgeMiner Software, 13187 Berlin, Germany
c School of Aeronautics, Northwestern Polytechnical University, Xi’an 710072, China

Abstract.
In this paper, the parameter selection capabilities of the group method of data handling (GMDH) as an inductive self-organizing modelling method are used to construct sparse random sampling high dimensional model representations (RS-HDMR), from which the Sobol’s rst and second order global sensitivity indices can be derived. The proposed method is capable of dealing with high-dimensional problems without the prior use of a screening technique and can perform with a relatively limited number of function evaluations, even in the case of under-determined modelling problems. Four classical benchmark test functions are used for the evaluation of the proposed technique.
Full Paper: (580k - PDF, Mathematics and Computers in Simulation 128 (2016) 42–54


Prediction of Wastewater Pre-precipitation Variables using Self-Organizing Networks
P. Isaias, S. Nilsson, A. Stathaki and R. E. King

Abstract.
This paper describes the derivation and design of an array of self-organizing networks trained by inductive learning for one step ahead prediction of the outputs of the pre-precipitation stage of a wastewater treatment plant with a view to model predictive control of the stage.
Full Paper: (116k - PDF, Proceedings 13th IEEE MED'05 Conference, 2005


A GMDH Approach to Modelling Gibbsite Solubility in Bayer Process Liquors
Frederick R. Bennett , Peter Crew, Jennifer K. Muller

Abstract.
The most widely employed industrial process for producing alumina (Bayer process) involves the dissolution of available aluminium hydroxide minerals present in raw bauxite into high temperature sodium hydroxide solutions. On cooling of the solution, or liquor in the industrial vernacular, Al is precipitated from solution in the form of gibbsite (Al(OH)3). In order to optimise the process, a detailed knowledge of factors influencing gibbsite solubility is required, a problem that is confounded by the presence of liquor impurities. In this paper, the use of the Group Method of Data Handling (GMDH) polynomial neural network for developing a gibbsite equilibrium solubility model for Bayer process liquors is discussed. The resulting predictive model appears to correctly incorporate the effects of liquor impurities and is found to offer a level of performance comparable to the most sophisticated phenomenological model presented to date.
Full Paper: (156k - PDF, International Journal of Molecular Sciences, 2004, 5, 101-109 ©MDPI)


Group method of data handling (GMDH) in modelling of growth dynamics of trees irrigated with wastewater
Ioannis Kalavrouziotisa , Volodymyr Stepashkob, Vladimir Vissikirskyb, Panagiotis Drakatosc
a Department of Environmental and Natural Resources Management , University of Ioannina, Greece
b Department for Information Technologies of Inductive Modeling, the International UNESCO Center of Information Technologies and Systems of the National Academy of Sciences of Ukraine
c Department of Mechanical and Aeronautics Engineering, Laboratory of Special Mechanical Engineering, Rion, University of Patras, Greece

Abstract.
This paper describes the formulation and analysis of growth dynamics models for trees irrigated with wastewater. The models can be used to obtain the characteristics of a species, depending on treatment conditions and climate factors, at all stages of growth. The experiments were carried out at the University of Patras to identify the characteristics (height rate and mortality) of the forest tree Pinus brutia cultivated under different treatment conditions. The growth dynamics models are designed on the basis of the group method of data handling. This principle generates sets of estimation models with different complexity and accuracy. By analysing their structures, qualitative features of the models may be assessed, and general linear models for different treatment cases compiled.
Link to Full Paper: (International Journal of Environment and Pollution (IJEP), Vol. 21, No. 4, 2004 ©InderScience Publishers)


Forecasting Commodity Prices for Predictive Decision Support Systems
Tamer Shahwana, Frank Lemkeb
a Humboldt-Universitaet zu Berlin, School of Business and Economics, Institute of Banking, Stock Exchanges and Insurance, Germany
b KnowledgeMiner Software, Germany

Abstract.
The builders of traditional decision support systems have regularly used game theory and operations research to build intelligent decision support systems. However, we have seen a rapid acceptance of new technology like neural networks and data mining which implement state-of-the art decision support system based on knowledge management to solve a wide range of business problems like classification and forecasting. Thus, we will illustrate our claims through the investigation of the forecasting performance of two time series forecasting techniques, namely elman neural network and self-organizing data mining against the autoregressive integrated moving average (ARIMA) model and futures prices as benchmarks. As an attempt to improve the accuracy of elman neural networks, the global search capability of Genetic Algorithms will be used to determine the optimal architecture of elman neural network. Real data sets of commodity prices are used to examine the forecasting accuracy of the proposed models.
Full Paper: (396k - PDF, Proceedings of EFITA and WCCA 2005, joint congress on on information technology in agriculture (25-28 July 2005), Vila Real, Portugal)

Data-driven Modeling and Prediction of Acute Toxicity of Pesticide Residues
F. Lemkea, E. Benfenatib, J.-A. Muellera

a KnowledgeMiner Software, Germany
b Istituto di Ricerche Farmacologiche "Mario Negri", Milan, Italy

Abstract.
This paper outlines and implements a concept for developing alternative tools for toxicity modeling and prediction of chemical compounds to be used for evaluation and authorization purposes of public regulatory bodies to help minimizing animal tests, costs, and time associated with registration and risk assessment processes. Starting from a general problem description we address and introduce concepts of multileveled self-organization for high-dimensional modeling, model validation, model combining, and decision support within the frame of a knowledge discovery from noisy data.
Full Paper: (1.2m - PDF, SIGKDD Explorations Special Issue, 8(2006)1, pp. 71-79)
Deep Learning and noise immunity of GMDH adaptive self-learning and self-organizing data mining, forecasting and classification of streams of data