13-3 bagging和pasting
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import BaggingClassifier
bagging_clf = BaggingClassifier(DecisionTreeClassifier(),n_estimators=500, max_samples=100, bootstrap=True)
# 决策树这种非参数的算法更容易产生差异较大的子模型
# 所有集成学习如果要集成成百上千个子模型,通常首先决策树
# n_estimators:子模型数
# max_samples:每个子模型看的样本树
# bootstrap:放回取样
bagging_clf.fit(X_train, y_train)
bagging_clf.score(X_test, y_test)Last updated