Tool stats & trends
Looking to identify usage trends or leading experts?
|Interface||Graphical user interface|
|Restrictions to use||Academic or non-commercial use|
|Input data||A file where the first line indicates the variable type (response, nominal or ordinal) with no particular order.|
|Output data||A tree structure.|
|Operating system||Unix/Linux, Mac OS, Windows|
|Programming languages||C++, Java|
No version available
- person_outline Heping Zhang
- person_outline Minghui Wang
- person_outline Xiang Chen
Publication for Willows
Incorporating epistasis interaction of genetic susceptibility single nucleotide polymorphisms in a lung cancer risk prediction model
[…] was selected as the one with the maximum prediction accuracy and cross-validation consistency and evaluated statistically using 1000-fold permutation test.For comparison, we used the freely available Willows software package for generating RF (). RF ranks variables by a variable importance index, a measure which reflects the ‘importance’ of a variable on the basis of the classification accuracy, w […]
The phenotypic manifestations of rare genic CNVs in autism spectrum disorder
[…] selected predictor variables (here, clinical phenotype variables). A collection of each of these trees is termed a forest., , Here, a random forest analysis using the DOS command line version of the Willows software package was used to investigate the phenotypic differences between cases with and without CNVs impacting ASD/ID or DBE genes. Forests were created from 10 000 trees with a minimum ter […]
Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?
[…] lled Random Jungle (RJ) was developed . It is currently the fastest implementation of RF, allows parallel computation of trees and is therefore very suited for the analysis of genome-wide data. The Willows package was also designed for tree-based analysis of genome-wide data by maximizing the use of computer memory . The WEKA workbench  is a data mining environment that includes several mach […]
Bioinformatics challenges for genome wide association studies
[…] on (McKinney et al., ). Advantages of this approach include its basis on decision trees and the availability of the algorithm in many different open source software packages including R. In fact, the Willows package was designed specifically for tree-based analysis of SNP data (Zhang et al., ). […]
Looking to check out a full list of citations?
Be the first to review Willows