Induction of decision trees using genetic programming for modelling ecotoxicity data: adaptive discretization of real-valued endpoints
Wang, X. Z.; Buontempo, F. V.; Young, A.; Osborn, D.. 2006 Induction of decision trees using genetic programming for modelling ecotoxicity data: adaptive discretization of real-valued endpoints. SAR and QSAR in Environmental Research, 17. 451-471. 10.1080/10629360600933723Full text not available from this repository.
Recent literature has demonstrated the applicability of genetic programming to induction of decision trees for modelling toxicity endpoints. Compared with other decision tree induction techniques that are based upon recursive partitioning employing greedy searches to choose the best splitting attribute and value at each node that will necessarily miss regions of the search space, the genetic programming based approach can overcome the problem. However, the method still requires the discretization of the often continuous-valued toxicity endpoints prior to the tree induction. A novel extension of this method, YAdapt, is introduced in this work which models the original continuous endpoint by adaptively finding suitable ranges to describe the endpoints during the tree induction process, removing the need for discretization prior to tree induction and allowing the ordinal nature of the endpoint to be taken into account in the models built.
|Programmes:||CEH Programmes pre-2009 publications > Biogeochemistry > SE01B Sustainable Monitoring, Risk Assessment and Management of Chemicals|
|CEH Sections:||_ Ecological Risk|
|Additional Keywords:||toxicity, QSAR, inductive learning, genetic and evolutionary programming, decision tree, discretization|
|NORA Subject Terms:||Computer Science
Ecology and Environment
|Date made live:||15 May 2008 08:32|
Actions (login required)