Type: | Package |
Title: | Clustered Random Forests for Optimal Prediction and Inference of Clustered Data |
Version: | 1.1.0 |
Maintainer: | Elliot H. Young <ey244@cam.ac.uk> |
Description: | A clustered random forest algorithm for fitting random forests for data of independent clusters, that exhibit within cluster dependence. Details of the method can be found in Young and Buehlmann (2025) <doi:10.48550/arXiv.2503.12634>. |
License: | GPL-3 |
Encoding: | UTF-8 |
RoxygenNote: | 7.2.3 |
LinkingTo: | Rcpp |
Imports: | Rcpp, rpart |
Depends: | R (≥ 4.2.0) |
Suggests: | knitr, rmarkdown, testthat |
NeedsCompilation: | yes |
Packaged: | 2025-03-18 17:40:09 UTC; elliotyoung |
Author: | Elliot H. Young [aut, cre] |
Repository: | CRAN |
Date/Publication: | 2025-03-20 09:20:06 UTC |
Clustered random forest fitting
Description
Clustered random forest fitting
Usage
crf(
formula,
data,
B = 500,
L = 100,
beta = 0.9,
weight_optimiser = "Training MSE",
correlation = "equicorr",
maxdepth = 30,
minbucket = 10,
cp = 0,
x0 = NULL,
test_data = NULL,
fixrho = FALSE,
honesty = TRUE,
verbose = TRUE,
seed = NULL
)
Arguments
formula |
an object of class 'formula' describing the model to fit. |
data |
training dataset for fitting the CRF. Note that group ID must be given by the column |
B |
the total number of trees (or trees per little bag if |
L |
the total number of little bags if providing a bootstrap of little bags estimate for inference. To not include set |
beta |
the subsampling rate. Default is |
weight_optimiser |
the method used to construct weights. Options are 'Pointwise variance', 'Training MSE' or 'Test MSE'. Default is 'Training MSE'. |
correlation |
the weight structure implemented. Currently supported options are 'ar1' and 'equicorr'. Default is 'equicorr'. |
maxdepth |
the maximum depth of the decision tree fitting. Default is 30. |
minbucket |
the minbucket of the decision tree fitting. Default is 10. |
cp |
the complexity paramter for decision tree fitting. Default is 0. |
x0 |
the covariate point to optimise weights towards if 'weightoptimiser' set to 'Pointwise variance'. |
test_data |
the test dataset to optimise weights towards if 'weightoptimiser' set to 'Test MSE'. |
fixrho |
fixes a pre-specified weight structure, given by the relevant 'ar1' or 'equicorr' parameter. Default is 'FALSE' (optimise weights). |
honesty |
whether honest or dishonest trees to be fit. Default is 'TRUE'. |
verbose |
Logical indicating whether or not to print computational progress. Default is 'TRUE'. |
seed |
Random seed for sampling. Default is NULL. |
Value
A clustered random forest fitted object
Predictions from a crf given newdata
Description
Predictions from a fitted crf
clustered random forest on newdata newdata
.
Usage
## S3 method for class 'crf'
predict(object, newdata, sderr = FALSE, ...)
Arguments
object |
a fitted |
newdata |
dataset on which predictions are to be performed. |
sderr |
whether 'bootstrap of little bags' standard errors should be additionally outputted. Default is |
... |
additional arguments |
Value
Fitted values, potentially alongside standard errors (see sderr
).
Summary for a crf fitted object
Description
Summary of a fitted crf
clustered random forest object fitted by crf
.
Usage
## S3 method for class 'crf'
summary(object, ...)
Arguments
object |
a fitted |
... |
additional arguments |
Value
Prints summary output for crf
object