.. DO NOT EDIT. .. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY. .. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE: .. "auto_examples/model_evaluation/plot_estimator_report.py" .. LINE NUMBERS ARE GIVEN BELOW. .. only:: html .. note:: :class: sphx-glr-download-link-note :ref:`Go to the end ` to download the full example code. .. rst-class:: sphx-glr-example-title .. _sphx_glr_auto_examples_model_evaluation_plot_estimator_report.py: .. _example_estimator_report: =============================================================== `EstimatorReport`: Get insights from any scikit-learn estimator =============================================================== This example shows how the :class:`skore.EstimatorReport` class can be used to quickly get insights from any scikit-learn estimator. .. GENERATED FROM PYTHON SOURCE LINES 13-19 Loading our dataset and defining our estimator ============================================== First, we load a dataset from skrub. Our goal is to predict if a healthcare manufacturing companies paid a medical doctors or hospitals, in order to detect potential conflict of interest. .. GENERATED FROM PYTHON SOURCE LINES 21-27 .. code-block:: Python from skrub.datasets import fetch_open_payments dataset = fetch_open_payments() df = dataset.X y = dataset.y .. rst-class:: sphx-glr-script-out .. code-block:: none Downloading 'open_payments' from https://github.com/skrub-data/skrub-data-files/raw/refs/heads/main/open_payments.zip (attempt 1/3) .. GENERATED FROM PYTHON SOURCE LINES 28-32 .. code-block:: Python from skrub import TableReport TableReport(df) .. raw:: html

	Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Dispute_Status_for_Publication	Name_of_Associated_Covered_Device_or_Medical_Supply1	Name_of_Associated_Covered_Drug_or_Biological1	Physician_Specialty
0	ELI LILLY AND COMPANY	No			Allopathic & Osteopathic Physicians\|Pediatrics\|Pediatric Rheumatology
1	ELI LILLY AND COMPANY	No			Allopathic & Osteopathic Physicians\|Internal Medicine\|Nephrology
2	ELI LILLY AND COMPANY	No			Allopathic & Osteopathic Physicians\|Internal Medicine\|Rheumatology
3	ELI LILLY AND COMPANY	No			Allopathic & Osteopathic Physicians\|Internal Medicine\|Endocrinology, Diabetes & Metabolism
4	ELI LILLY AND COMPANY	No		EFFIENT	Allopathic & Osteopathic Physicians\|Pediatrics\|Pediatric Hematology-Oncology

73,553	GlaxoSmithKline, LLC.	No		ZIAGEN
73,554	ALERE SCARBOROUGH, INC.	No	Alere PBP2a
73,555	NovoCure Limited	No
73,556	Wright Medical Technology, Inc.	No		HIPS
73,557	Alcon Research Ltd	No		Express

Column	Column name	dtype	Is sorted	Null values	Unique values
0	Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	ObjectDType	False	0 (0.0%)	1466 (2.0%)
1	Dispute_Status_for_Publication	ObjectDType	False	0 (0.0%)	2 (< 0.1%)
2	Name_of_Associated_Covered_Device_or_Medical_Supply1	ObjectDType	False	43088 (58.6%)	4372 (5.9%)
3	Name_of_Associated_Covered_Drug_or_Biological1	ObjectDType	False	36233 (49.3%)	2262 (3.1%)
4	Physician_Specialty	ObjectDType	False	3996 (5.4%)	513 (0.7%)

Column 1	Column 2	Cramér's V
Name_of_Associated_Covered_Device_or_Medical_Supply1	Name_of_Associated_Covered_Drug_or_Biological1	0.263
Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Name_of_Associated_Covered_Drug_or_Biological1	0.214
Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Name_of_Associated_Covered_Device_or_Medical_Supply1	0.132
Name_of_Associated_Covered_Device_or_Medical_Supply1	Physician_Specialty	0.0962
Dispute_Status_for_Publication	Physician_Specialty	0.0960
Dispute_Status_for_Publication	Name_of_Associated_Covered_Drug_or_Biological1	0.0895
Name_of_Associated_Covered_Drug_or_Biological1	Physician_Specialty	0.0646
Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Physician_Specialty	0.0510
Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Dispute_Status_for_Publication	0.0308
Dispute_Status_for_Publication	Name_of_Associated_Covered_Device_or_Medical_Supply1	0.0284

Please enable javascript

The skrub table reports need javascript to display correctly. If you are displaying a report in a Jupyter notebook and you see this message, you may need to re-execute the cell or to trust the notebook (button on the top right or "File > Trust notebook").

.. GENERATED FROM PYTHON SOURCE LINES 33-35 .. code-block:: Python TableReport(y.to_frame()) .. raw:: html

	status
0	disallowed
1	disallowed
2	disallowed
3	disallowed
4	disallowed

73,553	allowed
73,554	allowed
73,555	allowed
73,556	allowed
73,557	allowed

Column	Column name	dtype	Is sorted	Null values	Unique values	Mean	Std	Min	Median	Max
0	status	ObjectDType	True	0 (0.0%)	2 (< 0.1%)

Please enable javascript

.. GENERATED FROM PYTHON SOURCE LINES 36-43 Looking at the distributions of the target, we observe that this classification task is quite imbalanced. It means that we have to be careful when selecting a set of statistical metrics to evaluate the classification performance of our predictive model. In addition, we see that the class labels are not specified by an integer 0 or 1 but instead by a string "allowed" or "disallowed". For our application, the label of interest is "allowed". .. GENERATED FROM PYTHON SOURCE LINES 43-45 .. code-block:: Python pos_label, neg_label = "allowed", "disallowed" .. GENERATED FROM PYTHON SOURCE LINES 46-53 Now, we need to define a predictive model. Thankfully, `skrub` provides a convenient function (:func:`skrub.tabular_pipeline`) when it comes to getting strong baseline predictive models with a single line of code. As its feature engineering is generic, it does not provide some handcrafted and tailored feature engineering but still provides a good starting point. So let's create a classifier for our task. .. GENERATED FROM PYTHON SOURCE LINES 53-58 .. code-block:: Python from skrub import tabular_pipeline estimator = tabular_pipeline("classifier") estimator .. raw:: html

Pipeline(steps=[('tablevectorizer',
                     TableVectorizer(low_cardinality=ToCategorical())),
                    ('histgradientboostingclassifier',
                     HistGradientBoostingClassifier())])

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

.. GENERATED FROM PYTHON SOURCE LINES 59-72 Getting insights from our estimator =================================== Introducing the :class:`skore.EstimatorReport` class ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Now, we would be interested in getting some insights from our predictive model. One way is to use the :class:`skore.EstimatorReport` class which we will construct using the `evaluate` function. This function will detect that our estimator is unfitted and will fit it for us on the training data and return an :class:`~skore.EstimatorReport` object. Specifying a `splitter` of 0.2 will perform a 80/20 train-test split. .. GENERATED FROM PYTHON SOURCE LINES 72-78 .. code-block:: Python from skore import evaluate report = evaluate(estimator, X=df, y=y, pos_label=pos_label, splitter=0.2) report .. raw:: html

Metric	HistGradientBoostingClassifier
Score	0.951196
Accuracy	0.951196
Precision	0.728595
Recall	0.450549
ROC AUC	0.936815
Log loss	0.131858
Brier score	0.036981
Fit time (s)	5.248361
Predict time (s)	0.611445

Pipeline(steps=[('tablevectorizer',
                     TableVectorizer(low_cardinality=ToCategorical())),
                    ('histgradientboostingclassifier',
                     HistGradientBoostingClassifier())])

Pipeline

?Documentation for PipelineiFitted

Parameters

	steps steps: list of tuples List of (name of step, estimator) tuples that are to be chained in sequential order. To be compatible with the scikit-learn API, all steps must define `fit`. All non-last steps must also define `transform`. See :ref:`Combining Estimators <combining_estimators>` for more details.	[('tablevectorizer', ...), ('histgradientboostingclassifier', ...)]
	transform_input transform_input: list of str, default=None The names of the :term:`metadata` parameters that should be transformed by the pipeline before passing it to the step consuming it. This enables transforming some input arguments to ``fit`` (other than ``X``) to be transformed by the steps of the pipeline up to the step which requires them. Requirement is defined via :ref:`metadata routing <metadata_routing>`. For instance, this can be used to pass a validation set through the pipeline. You can only set this if metadata routing is enabled, which you can enable using ``sklearn.set_config(enable_metadata_routing=True)``. .. versionadded:: 1.6	None
	memory memory: str or object with the joblib.Memory interface, default=None Used to cache the fitted transformers of the pipeline. The last step will never be cached, even if it is a transformer. By default, no caching is performed. If a string is given, it is the path to the caching directory. Enabling caching triggers a clone of the transformers before fitting. Therefore, the transformer instance given to the pipeline cannot be inspected directly. Use the attribute ``named_steps`` or ``steps`` to inspect estimators within the pipeline. Caching the transformers is advantageous when fitting is time consuming. See :ref:`sphx_glr_auto_examples_neighbors_plot_caching_nearest_neighbors.py` for an example on how to enable caching.	None
	verbose verbose: bool, default=False If True, the time elapsed while fitting each step will be printed as it is completed.	False

Fitted attributes

Name	Type	Value
classes_ classes_: ndarray of shape (n_classes,) The classes labels. Only exist if the last step of the pipeline is a classifier.	ndarray[object](2,)	['allowed','disallowed']
feature_names_in_ feature_names_in_: ndarray of shape (`n_features_in_`,) Names of features seen during :term:`fit`. Only defined if the underlying estimator exposes such an attribute when fit. .. versionadded:: 1.0	list	['Ap...me', 'Di...on', 'Na...y1', 'Na...l1', ...]
n_features_in_ n_features_in_: int Number of features seen during :term:`fit`. Only defined if the underlying first estimator in `steps` exposes such an attribute when fit. .. versionadded:: 0.24	int	5

tablevectorizer: TableVectorizer

Parameters

	low_cardinality	ToCategorical()
	high_cardinality	StringEncoder()
	numeric	PassThrough()
	datetime	DatetimeEncoder()
	cardinality_threshold	40
	specific_transformers	()
	drop_null_fraction	1.0
	drop_if_constant	False
	drop_if_unique	False
	datetime_format	None
	null_strings	None
	n_jobs	None

Fitted attributes

Name	Type	Value
all_outputs_	list	['Ap...00', 'Ap...01', 'Ap...02', 'Ap...03', ...]
all_processing_steps_	dict	{'Ap...me': [CleanNullStrings(), DropUninformative(), ToStr(), StringEncoder(), ...], 'Di...on': [CleanNullStrings(), DropUninformative(), ToStr(), ToCategorical()], 'Na...y1': [CleanNullStrings(), DropUninformative(), ToStr(), StringEncoder(), ...], 'Na...l1': [CleanNullStrings(), DropUninformative(), ToStr(), StringEncoder(), ...], ...}
column_to_kind_	dict	{'Ap...me': 'hi...ty', 'Di...on': 'lo...ty', 'Na...y1': 'hi...ty', 'Na...l1': 'hi...ty', ...}
feature_names_in_	list	['Ap...me', 'Di...on', 'Na...y1', 'Na...l1', ...]
input_to_outputs_	dict	{'Ap...me': ['Ap...00', 'Ap...01', 'Ap...02', 'Ap...03', ...], 'Di...on': ['Di...on'], 'Na...y1': ['Na...00', 'Na...01', 'Na...02', 'Na...03', ...], 'Na...l1': ['Na...00', 'Na...01', 'Na...02', 'Na...03', ...], ...}
kind_to_columns_	dict	{'da...me': [], 'hi...ty': ['Ap...me', 'Na...y1', 'Na...l1', 'Ph...ty'], 'lo...ty': ['Di...on'], 'numeric': [], ...}
n_features_in_	int	5
output_to_input_	dict	{'Ap...00': 'Ap...me', 'Ap...01': 'Ap...me', 'Ap...02': 'Ap...me', 'Ap...03': 'Ap...me', ...}
transformers_	dict	{'Ap...me': StringEncoder(), 'Di...on': ToCategorical(), 'Na...y1': StringEncoder(), 'Na...l1': StringEncoder(), ...}

numeric

PassThrough

Parameters

datetime

DatetimeEncoder

Parameters

	resolution	'hour'
	add_weekday	False
	add_total_seconds	True
	add_day_of_year	False
	periodic_encoding	None

low_cardinality

['Dispute_Status_for_Publication']

ToCategorical

Parameters

high_cardinality

['Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name', 'Name_of_Associated_Covered_Device_or_Medical_Supply1', 'Name_of_Associated_Covered_Drug_or_Biological1', 'Physician_Specialty']

StringEncoder

Parameters

	n_components	30
	vectorizer	'tfidf'
	ngram_range	(3, ...)
	analyzer	'char_wb'
	stop_words	None
	random_state	None
	vocabulary	None

121 features

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_00

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_01

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_02

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_03

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_04

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_05

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_06

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_07

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_08

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_09

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_10

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_11

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_12

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_13

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_14

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_15

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_16

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_17

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_18

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_19

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_20

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_21

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_22

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_23

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_24

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_25

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_26

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_27

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_28

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_29

Dispute_Status_for_Publication

Name_of_Associated_Covered_Device_or_Medical_Supply1_00

Name_of_Associated_Covered_Device_or_Medical_Supply1_01

Name_of_Associated_Covered_Device_or_Medical_Supply1_02

Name_of_Associated_Covered_Device_or_Medical_Supply1_03

Name_of_Associated_Covered_Device_or_Medical_Supply1_04

Name_of_Associated_Covered_Device_or_Medical_Supply1_05

Name_of_Associated_Covered_Device_or_Medical_Supply1_06

Name_of_Associated_Covered_Device_or_Medical_Supply1_07

Name_of_Associated_Covered_Device_or_Medical_Supply1_08

Name_of_Associated_Covered_Device_or_Medical_Supply1_09

Name_of_Associated_Covered_Device_or_Medical_Supply1_10

Name_of_Associated_Covered_Device_or_Medical_Supply1_11

Name_of_Associated_Covered_Device_or_Medical_Supply1_12

Name_of_Associated_Covered_Device_or_Medical_Supply1_13

Name_of_Associated_Covered_Device_or_Medical_Supply1_14

Name_of_Associated_Covered_Device_or_Medical_Supply1_15

Name_of_Associated_Covered_Device_or_Medical_Supply1_16

Name_of_Associated_Covered_Device_or_Medical_Supply1_17

Name_of_Associated_Covered_Device_or_Medical_Supply1_18

Name_of_Associated_Covered_Device_or_Medical_Supply1_19

Name_of_Associated_Covered_Device_or_Medical_Supply1_20

Name_of_Associated_Covered_Device_or_Medical_Supply1_21

Name_of_Associated_Covered_Device_or_Medical_Supply1_22

Name_of_Associated_Covered_Device_or_Medical_Supply1_23

Name_of_Associated_Covered_Device_or_Medical_Supply1_24

Name_of_Associated_Covered_Device_or_Medical_Supply1_25

Name_of_Associated_Covered_Device_or_Medical_Supply1_26

Name_of_Associated_Covered_Device_or_Medical_Supply1_27

Name_of_Associated_Covered_Device_or_Medical_Supply1_28

Name_of_Associated_Covered_Device_or_Medical_Supply1_29

Name_of_Associated_Covered_Drug_or_Biological1_00

Name_of_Associated_Covered_Drug_or_Biological1_01

Name_of_Associated_Covered_Drug_or_Biological1_02

Name_of_Associated_Covered_Drug_or_Biological1_03

Name_of_Associated_Covered_Drug_or_Biological1_04

Name_of_Associated_Covered_Drug_or_Biological1_05

Name_of_Associated_Covered_Drug_or_Biological1_06

Name_of_Associated_Covered_Drug_or_Biological1_07

Name_of_Associated_Covered_Drug_or_Biological1_08

Name_of_Associated_Covered_Drug_or_Biological1_09

Name_of_Associated_Covered_Drug_or_Biological1_10

Name_of_Associated_Covered_Drug_or_Biological1_11

Name_of_Associated_Covered_Drug_or_Biological1_12

Name_of_Associated_Covered_Drug_or_Biological1_13

Name_of_Associated_Covered_Drug_or_Biological1_14

Name_of_Associated_Covered_Drug_or_Biological1_15

Name_of_Associated_Covered_Drug_or_Biological1_16

Name_of_Associated_Covered_Drug_or_Biological1_17

Name_of_Associated_Covered_Drug_or_Biological1_18

Name_of_Associated_Covered_Drug_or_Biological1_19

Name_of_Associated_Covered_Drug_or_Biological1_20

Name_of_Associated_Covered_Drug_or_Biological1_21

Name_of_Associated_Covered_Drug_or_Biological1_22

Name_of_Associated_Covered_Drug_or_Biological1_23

Name_of_Associated_Covered_Drug_or_Biological1_24

Name_of_Associated_Covered_Drug_or_Biological1_25

Name_of_Associated_Covered_Drug_or_Biological1_26

Name_of_Associated_Covered_Drug_or_Biological1_27

Name_of_Associated_Covered_Drug_or_Biological1_28

Name_of_Associated_Covered_Drug_or_Biological1_29

Physician_Specialty_00

Physician_Specialty_01

Physician_Specialty_02

Physician_Specialty_03

Physician_Specialty_04

Physician_Specialty_05

Physician_Specialty_06

Physician_Specialty_07

Physician_Specialty_08

Physician_Specialty_09

Physician_Specialty_10

Physician_Specialty_11

Physician_Specialty_12

Physician_Specialty_13

Physician_Specialty_14

Physician_Specialty_15

Physician_Specialty_16

Physician_Specialty_17

Physician_Specialty_18

Physician_Specialty_19

Physician_Specialty_20

Physician_Specialty_21

Physician_Specialty_22

Physician_Specialty_23

Physician_Specialty_24

Physician_Specialty_25

Physician_Specialty_26

Physician_Specialty_27

Physician_Specialty_28

Physician_Specialty_29

HistGradientBoostingClassifier

?Documentation for HistGradientBoostingClassifier

Parameters

	loss loss: {'log_loss'}, default='log_loss' The loss function to use in the boosting process. For binary classification problems, 'log_loss' is also known as logistic loss, binomial deviance or binary crossentropy. Internally, the model fits one tree per boosting iteration and uses the logistic sigmoid function (expit) as inverse link function to compute the predicted positive class probability. For multiclass classification problems, 'log_loss' is also known as multinomial deviance or categorical crossentropy. Internally, the model fits one tree per boosting iteration and per class and uses the softmax function as inverse link function to compute the predicted probabilities of the classes.	'log_loss'
	learning_rate learning_rate: float, default=0.1 The learning rate, also known as shrinkage. This is used as a multiplicative factor for the leaves values. Use ``1`` for no shrinkage.	0.1
	max_iter max_iter: int, default=100 The maximum number of iterations of the boosting process, i.e. the maximum number of trees for binary classification. For multiclass classification, `n_classes` trees per iteration are built.	100
	max_leaf_nodes max_leaf_nodes: int or None, default=31 The maximum number of leaves for each tree. Must be strictly greater than 1. If None, there is no maximum limit.	31
	max_depth max_depth: int or None, default=None The maximum depth of each tree. The depth of a tree is the number of edges to go from the root to the deepest leaf. Depth isn't constrained by default.	None
	min_samples_leaf min_samples_leaf: int, default=20 The minimum number of samples per leaf. For small datasets with less than a few hundred samples, it is recommended to lower this value since only very shallow trees would be built.	20
	l2_regularization l2_regularization: float, default=0 The L2 regularization parameter penalizing leaves with small hessians. Use ``0`` for no regularization (default).	0.0
	max_features max_features: float, default=1.0 Proportion of randomly chosen features in each and every node split. This is a form of regularization, smaller values make the trees weaker learners and might prevent overfitting. If interaction constraints from `interaction_cst` are present, only allowed features are taken into account for the subsampling. .. versionadded:: 1.4	1.0
	max_bins max_bins: int, default=255 The maximum number of bins to use for non-missing values. Before training, each feature of the input array `X` is binned into integer-valued bins, which allows for a much faster training stage. Features with a small number of unique values may use less than ``max_bins`` bins. In addition to the ``max_bins`` bins, one more bin is always reserved for missing values. Must be no larger than 255.	255
	categorical_features categorical_features: array-like of {bool, int, str} of shape (n_features) or shape (n_categorical_features,), default='from_dtype' Indicates the categorical features. - None : no feature will be considered categorical. - boolean array-like : boolean mask indicating categorical features. - integer array-like : integer indices indicating categorical features. - str array-like: names of categorical features (assuming the training data has feature names). - `"from_dtype"`: dataframe columns with dtype "Categorical" and "Enum" are considered to be categorical features. The input must be a dataframe that is supported by narwhals (or supports it): :func:`narwhals.from_native` must work. This is the case, for instance, for pandas and polars DataFrames. For each categorical feature, there must be at most `max_bins` unique categories. Negative values for categorical features encoded as numeric dtypes are treated as missing values. All categorical values are converted to floating point numbers. This means that categorical values of 1.0 and 1 are treated as the same category. Read more in the :ref:`User Guide <categorical_support_gbdt>`. .. versionadded:: 0.24 .. versionchanged:: 1.2 Added support for feature names. .. versionchanged:: 1.4 Added `"from_dtype"` option. .. versionchanged:: 1.6 The default value changed from `None` to `"from_dtype"`.	'from_dtype'
	monotonic_cst monotonic_cst: array-like of int of shape (n_features) or dict, default=None Monotonic constraint to enforce on each feature are specified using the following integer values: - 1: monotonic increase - 0: no constraint - -1: monotonic decrease If a dict with str keys, map feature to monotonic constraints by name. If an array, the features are mapped to constraints by position. See :ref:`monotonic_cst_features_names` for a usage example. The constraints are only valid for binary classifications and hold over the probability of the positive class. Read more in the :ref:`User Guide <monotonic_cst_gbdt>`. .. versionadded:: 0.23 .. versionchanged:: 1.2 Accept dict of constraints with feature names as keys.	None
	interaction_cst interaction_cst: {"pairwise", "no_interactions"} or sequence of lists/tuples/sets of int, default=None Specify interaction constraints, the sets of features which can interact with each other in child node splits. Each item specifies the set of feature indices that are allowed to interact with each other. If there are more features than specified in these constraints, they are treated as if they were specified as an additional set. The strings "pairwise" and "no_interactions" are shorthands for allowing only pairwise or no interactions, respectively. For instance, with 5 features in total, `interaction_cst=[{0, 1}]` is equivalent to `interaction_cst=[{0, 1}, {2, 3, 4}]`, and specifies that each branch of a tree will either only split on features 0 and 1 or only split on features 2, 3 and 4. See :ref:`this example<ice-vs-pdp>` on how to use `interaction_cst`. .. versionadded:: 1.2	None
	warm_start warm_start: bool, default=False When set to ``True``, reuse the solution of the previous call to fit and add more estimators to the ensemble. For results to be valid, the estimator should be re-trained on the same data only. See :term:`the Glossary <warm_start>`.	False
	early_stopping early_stopping: 'auto' or bool, default='auto' If 'auto', early stopping is enabled if the sample size is larger than 10000 or if `X_val` and `y_val` are passed to `fit`. If True, early stopping is enabled, otherwise early stopping is disabled. .. versionadded:: 0.23	'auto'
	scoring scoring: str or callable or None, default='loss' Scoring method to use for early stopping. Only used if `early_stopping` is enabled. Options: - str: see :ref:`scoring_string_names` for options. - callable: a scorer callable object (e.g., function) with signature ``scorer(estimator, X, y)``. See :ref:`scoring_callable` for details. - `None`: :ref:`accuracy <accuracy_score>` is used. - 'loss': early stopping is checked w.r.t the loss value.	'loss'
	validation_fraction validation_fraction: int or float or None, default=0.1 Proportion (or absolute size) of training data to set aside as validation data for early stopping. If None, early stopping is done on the training data. The value is ignored if either early stopping is not performed, e.g. `early_stopping=False`, or if `X_val` and `y_val` are passed to fit.	0.1
	n_iter_no_change n_iter_no_change: int, default=10 Used to determine when to "early stop". The fitting process is stopped when none of the last ``n_iter_no_change`` scores are better than the ``n_iter_no_change - 1`` -th-to-last one, up to some tolerance. Only used if early stopping is performed.	10
	tol tol: float, default=1e-7 The absolute tolerance to use when comparing scores. The higher the tolerance, the more likely we are to early stop: higher tolerance means that it will be harder for subsequent iterations to be considered an improvement upon the reference score.	1e-07
	verbose verbose: int, default=0 The verbosity level. If not zero, print some information about the fitting process. ``1`` prints only summary info, ``2`` prints info per iteration.	0
	random_state random_state: int, RandomState instance or None, default=None Pseudo-random number generator to control the subsampling in the binning process, and the train/validation data split if early stopping is enabled. Pass an int for reproducible output across multiple function calls. See :term:`Glossary <random_state>`.	None
	class_weight class_weight: dict or 'balanced', default=None Weights associated with classes in the form `{class_label: weight}`. If not given, all classes are supposed to have weight one. The "balanced" mode uses the values of y to automatically adjust weights inversely proportional to class frequencies in the input data as `n_samples / (n_classes * np.bincount(y))`. Note that these weights will be multiplied with sample_weight (passed through the fit method) if `sample_weight` is specified. .. versionadded:: 1.2	None

Fitted attributes

Name	Type	Value
classes_ classes_: array, shape = (n_classes,) Class labels.	ndarray[object](2,)	['allowed','disallowed']
do_early_stopping_ do_early_stopping_: bool Indicates whether early stopping is used during training.	bool	True
feature_names_in_ feature_names_in_: ndarray of shape (`n_features_in_`,) Names of features seen during :term:`fit`. Defined only when `X` has feature names that are all strings. .. versionadded:: 1.0	ndarray[object](121,)	['Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_00', 'Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_01', 'Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name_02',..., 'Physician_Specialty_27','Physician_Specialty_28', 'Physician_Specialty_29']
is_categorical_ is_categorical_: ndarray, shape (n_features, ) or None Boolean mask for the categorical features. ``None`` if there are no categorical features.	ndarray[bool](121,)	[False,False,False,...,False,False,False]
n_features_in_ n_features_in_: int Number of features seen during :term:`fit`. .. versionadded:: 0.24	int	121
n_iter_ n_iter_: int The number of iterations as selected by early stopping, depending on the `early_stopping` parameter. Otherwise it corresponds to max_iter.	int	85
n_trees_per_iteration_ n_trees_per_iteration_: int The number of tree that are built at each iteration. This is equal to 1 for binary classification, and to ``n_classes`` for multiclass classification.	int	1
train_score_ train_score_: ndarray, shape (n_iter_+1,) The scores at each iteration on the training data. The first entry is the score of the ensemble before the first iteration. Scores are computed according to the ``scoring`` parameter. If ``scoring`` is not 'loss', scores are computed on a subset of at most 10 000 samples. Empty if no early stopping.	ndarray[float64](86,)	[-0.24,-0.21,-0.19,...,-0.1 ,-0.1 ,-0.1 ]
validation_score_ validation_score_: ndarray, shape (n_iter_+1,) The scores at each iteration on the held-out validation data. The first entry is the score of the ensemble before the first iteration. Scores are computed according to the ``scoring`` parameter. Empty if no early stopping or if ``validation_fraction`` is None.	ndarray[float64](86,)	[-0.24,-0.2 ,-0.19,...,-0.12,-0.12,-0.12]

	Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	Dispute_Status_for_Publication	Name_of_Associated_Covered_Device_or_Medical_Supply1	Name_of_Associated_Covered_Drug_or_Biological1	Physician_Specialty	status
0	GC America Inc.	No	Restorative, Dental			allowed
1	Bayer HealthCare LLC	No	Jetstream		Respiratory, Developmental, Rehabilitative and Restorative Service Providers\|Orthotic Fitter	disallowed
2	Smith & Nephew, Inc.	No		Regranex	Allopathic & Osteopathic Physicians\|Family Medicine	disallowed
3	ViiV Healthcare Company	No		ZIAGEN	Allopathic & Osteopathic Physicians\|Internal Medicine\|Infectious Disease	disallowed
4	Covidien LP	No	Vascular		Allopathic & Osteopathic Physicians\|Colon & Rectal Surgery	disallowed

73,553	Carl Zeiss Meditec, Inc.	No			Allopathic & Osteopathic Physicians\|Otolaryngology\|Otolaryngology/Facial Plastic Surgery	disallowed
73,554	Novo Nordisk Inc	No		Victoza	Allopathic & Osteopathic Physicians\|Preventive Medicine\|Public Health & General Preventive Medicine	disallowed
73,555	Tactile Systems Technology Inc	No	Flexitouch		Allopathic & Osteopathic Physicians\|Radiology\|Radiation Oncology	disallowed
73,556	Cook Incorporated	No	OHNS - Biodesign		Speech, Language and Hearing Service Providers\|Audiologist-Hearing Aid Fitter	disallowed
73,557	Boston Scientific Corporation	No	METAL STENTS G.I.		Allopathic & Osteopathic Physicians\|Surgery	disallowed

Column	Column name	dtype	Is sorted	Null values	Unique values
0	Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name	ObjectDType	False	0 (0.0%)	1466 (2.0%)
1	Dispute_Status_for_Publication	ObjectDType	False	0 (0.0%)	2 (< 0.1%)
2	Name_of_Associated_Covered_Device_or_Medical_Supply1	ObjectDType	False	43088 (58.6%)	4372 (5.9%)
3	Name_of_Associated_Covered_Drug_or_Biological1	ObjectDType	False	36233 (49.3%)	2262 (3.1%)
4	Physician_Specialty	ObjectDType	False	3996 (5.4%)	513 (0.7%)
5	status	ObjectDType	False	0 (0.0%)	2 (< 0.1%)

Please enable javascript

1 issue(s), 1 tip(s), 3 passed, 6 not applicable, 0 ignored.

.. GENERATED FROM PYTHON SOURCE LINES 79-82 Once the report is created, we get some information regarding the available tools allowing us to get some insights from our specific model on our specific task by calling the :meth:`~skore.EstimatorReport.help` method. .. GENERATED FROM PYTHON SOURCE LINES 83-85 .. code-block:: Python report.help() .. raw:: html

.. GENERATED FROM PYTHON SOURCE LINES 86-87 Be aware that we can access the help for each individual sub-accessor. For instance: .. GENERATED FROM PYTHON SOURCE LINES 88-90 .. code-block:: Python report.metrics.help() .. raw:: html

.. GENERATED FROM PYTHON SOURCE LINES 91-99 Metrics computation with aggressive caching ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ At this point, we might be interested to have a first look at the statistical performance of our model on the validation set that we provided. We can access it by calling any of the metrics displayed above. Since we are greedy, we want to get several metrics at once and we will use the :meth:`~skore.EstimatorReport.metrics.summarize` method. .. GENERATED FROM PYTHON SOURCE LINES 100-107 .. code-block:: Python import time start = time.time() metric_report = report.metrics.summarize().frame() end = time.time() metric_report .. raw:: html

	HistGradientBoostingClassifier
Metric
Score	0.951196
Accuracy	0.951196
Precision	0.728595
Recall	0.450549
ROC AUC	0.936815
Log loss	0.131858
Brier score	0.036981
Fit time (s)	5.248361
Predict time (s)	0.611445

.. GENERATED FROM PYTHON SOURCE LINES 108-110 .. code-block:: Python print(f"Time taken to compute the metrics: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the metrics: 0.00 seconds .. GENERATED FROM PYTHON SOURCE LINES 111-118 An interesting feature provided by the :class:`skore.EstimatorReport` is the the caching mechanism. Indeed, when we have a large enough dataset, computing the predictions for a model is not cheap anymore. For instance, on our smallish dataset, it took a couple of seconds to compute the metrics. The report will cache the predictions and if we are interested in computing a metric again or an alternative metric that requires the same predictions, it will be faster. Let's check by requesting the same metrics report again. .. GENERATED FROM PYTHON SOURCE LINES 119-125 .. code-block:: Python start = time.time() metric_report = report.metrics.summarize().frame() end = time.time() metric_report .. raw:: html

	HistGradientBoostingClassifier
Metric
Score	0.951196
Accuracy	0.951196
Precision	0.728595
Recall	0.450549
ROC AUC	0.936815
Log loss	0.131858
Brier score	0.036981
Fit time (s)	5.248361
Predict time (s)	0.611445

.. GENERATED FROM PYTHON SOURCE LINES 126-128 .. code-block:: Python print(f"Time taken to compute the metrics: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the metrics: 0.00 seconds .. GENERATED FROM PYTHON SOURCE LINES 129-131 Note that when the model is fitted or the predictions are computed, we additionally store the time the operation took: .. GENERATED FROM PYTHON SOURCE LINES 132-134 .. code-block:: Python report.metrics.timings() .. rst-class:: sphx-glr-script-out .. code-block:: none {'fit_time': 5.248360689999998, 'predict_time_train': 2.436770182999993, 'predict_time_test': 0.6114447749999954} .. GENERATED FROM PYTHON SOURCE LINES 135-137 Since we obtain a pandas dataframe, we can also use the plotting interface of pandas. .. GENERATED FROM PYTHON SOURCE LINES 138-141 .. code-block:: Python ax = metric_report.plot.barh() _ = ax.set_title("Metrics report") .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_001.png :alt: Metrics report :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_001.png :class: sphx-glr-single-img .. GENERATED FROM PYTHON SOURCE LINES 142-144 Whenever computing a metric, we check if the predictions are available in the cache and reload them if available. So for instance, let's compute the log loss. .. GENERATED FROM PYTHON SOURCE LINES 145-151 .. code-block:: Python start = time.time() log_loss = report.metrics.log_loss() end = time.time() log_loss .. rst-class:: sphx-glr-script-out .. code-block:: none 0.1318581181377569 .. GENERATED FROM PYTHON SOURCE LINES 152-154 .. code-block:: Python print(f"Time taken to compute the log loss: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the log loss: 0.00 seconds .. GENERATED FROM PYTHON SOURCE LINES 155-157 We can show that without initial cache, it would have taken more time to compute the log loss. .. GENERATED FROM PYTHON SOURCE LINES 158-165 .. code-block:: Python report.clear_cache() start = time.time() log_loss = report.metrics.log_loss() end = time.time() log_loss .. rst-class:: sphx-glr-script-out .. code-block:: none 0.1318581181377569 .. GENERATED FROM PYTHON SOURCE LINES 166-168 .. code-block:: Python print(f"Time taken to compute the log loss: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the log loss: 1.23 seconds .. GENERATED FROM PYTHON SOURCE LINES 169-172 By default, the metrics are computed on the test set only. However, if a training set is provided, we can also compute the metrics by specifying the `data_source` parameter. .. GENERATED FROM PYTHON SOURCE LINES 173-175 .. code-block:: Python report.metrics.log_loss(data_source="train") .. rst-class:: sphx-glr-script-out .. code-block:: none 0.09950479151151193 .. GENERATED FROM PYTHON SOURCE LINES 176-180 Be aware that we can also benefit from the caching mechanism with our own custom metrics. Skore only expects that we define our own metric function to take `y_true` and `y_pred` as the first two positional arguments. It can take any other arguments. Let's see an example. .. GENERATED FROM PYTHON SOURCE LINES 181-195 .. code-block:: Python def operational_decision_cost(y_true, y_pred, amount): mask_true_positive = (y_true == pos_label) & (y_pred == pos_label) mask_true_negative = (y_true == neg_label) & (y_pred == neg_label) mask_false_positive = (y_true == neg_label) & (y_pred == pos_label) mask_false_negative = (y_true == pos_label) & (y_pred == neg_label) fraudulent_refuse = mask_true_positive.sum() * 50 fraudulent_accept = -amount[mask_false_negative].sum() legitimate_refuse = mask_false_positive.sum() * -5 legitimate_accept = (amount[mask_true_negative] * 0.02).sum() return fraudulent_refuse + fraudulent_accept + legitimate_refuse + legitimate_accept .. GENERATED FROM PYTHON SOURCE LINES 196-200 In our use case, we have a operational decision to make that translate the classification outcome into a cost. It translate the confusion matrix into a cost matrix based on some amount linked to each sample in the dataset that are provided to us. Here, we randomly generate some amount as an illustration. .. GENERATED FROM PYTHON SOURCE LINES 201-212 .. code-block:: Python import numpy as np from sklearn.metrics import make_scorer rng = np.random.default_rng(42) amount = rng.integers(low=100, high=1000, size=len(report.y_test)) report.metrics.add(metric=make_scorer(operational_decision_cost, amount=amount)) cost = report.metrics.summarize(metric="operational_decision_cost") cost.frame() .. raw:: html

	HistGradientBoostingClassifier
Metric
Operational Decision Cost	-134634.96

.. GENERATED FROM PYTHON SOURCE LINES 213-216 By the way, skore caches the model predictions. It is really handy because it means that we can compute some additional metrics without having to recompute the the predictions. .. GENERATED FROM PYTHON SOURCE LINES 217-221 .. code-block:: Python report.metrics.summarize( metric=["precision", "recall", "operational_decision_cost"] ).frame() .. raw:: html

	HistGradientBoostingClassifier
Metric
Precision	0.728595
Recall	0.450549
Operational Decision Cost	-134634.960000

.. GENERATED FROM PYTHON SOURCE LINES 222-229 Effortless one-liner plotting ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ The :class:`skore.EstimatorReport` class also implements a number of the most common data science plots. As for the metrics, we only provide the meaningful set of plots for the provided estimator. .. GENERATED FROM PYTHON SOURCE LINES 230-232 .. code-block:: Python report.metrics.help() .. raw:: html

.. GENERATED FROM PYTHON SOURCE LINES 233-234 Let's start by plotting the ROC curve for our binary classification task. .. GENERATED FROM PYTHON SOURCE LINES 235-238 .. code-block:: Python display = report.metrics.roc() display.plot() .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_002.png :alt: ROC Curve for HistGradientBoostingClassifier Positive label: allowed Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_002.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 239-243 The plot functionality is built upon the scikit-learn display objects. We return those display (slightly modified to improve the UI) in case we want to tweak some of the plot properties. We can have quick look at the available attributes and methods by calling the ``help`` method or simply by printing the display. .. GENERATED FROM PYTHON SOURCE LINES 244-246 .. code-block:: Python display.help() .. raw:: html

.. GENERATED FROM PYTHON SOURCE LINES 247-251 .. code-block:: Python fig = display.plot() fig.axes[0].set_title("Example of a ROC curve") fig .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_003.png :alt: ROC Curve for HistGradientBoostingClassifier Positive label: allowed Data source: Test set, Example of a ROC curve :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_003.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 252-256 Similarly to the metrics, we aggressively use the caching to avoid recomputing the predictions of the model. We also cache the plot display object by detection if the input parameters are the same as the previous call. Let's demonstrate the kind of performance gain we can get. .. GENERATED FROM PYTHON SOURCE LINES 257-264 .. code-block:: Python start = time.time() # we already trigger the computation of the predictions in a previous call display = report.metrics.roc() fig = display.plot() end = time.time() fig .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_004.png :alt: ROC Curve for HistGradientBoostingClassifier Positive label: allowed Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_004.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 265-267 .. code-block:: Python print(f"Time taken to compute the ROC curve: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the ROC curve: 0.04 seconds .. GENERATED FROM PYTHON SOURCE LINES 268-269 Now, let's clean the cache and check if we get a slowdown. .. GENERATED FROM PYTHON SOURCE LINES 270-272 .. code-block:: Python report.clear_cache() .. GENERATED FROM PYTHON SOURCE LINES 273-279 .. code-block:: Python start = time.time() display = report.metrics.roc() fig = display.plot() end = time.time() fig .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_005.png :alt: ROC Curve for HistGradientBoostingClassifier Positive label: allowed Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_005.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 280-282 .. code-block:: Python print(f"Time taken to compute the ROC curve: {end - start:.2f} seconds") .. rst-class:: sphx-glr-script-out .. code-block:: none Time taken to compute the ROC curve: 1.28 seconds .. GENERATED FROM PYTHON SOURCE LINES 283-284 As expected, since we need to recompute the predictions, it takes more time. .. GENERATED FROM PYTHON SOURCE LINES 286-291 Visualizing the confusion matrix ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Another useful visualization for classification tasks is the confusion matrix, which shows the counts of correct and incorrect predictions for each class. .. GENERATED FROM PYTHON SOURCE LINES 293-294 Let's first start with a basic confusion matrix: .. GENERATED FROM PYTHON SOURCE LINES 294-297 .. code-block:: Python cm_display = report.metrics.confusion_matrix() cm_display.plot() .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_006.png :alt: Confusion Matrix Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_006.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 298-302 In binary classification, a confusion matrix depends on the decision threshold used to convert predicted probabilities into class labels. By default, skore uses a threshold of 0.5, but confusion matrices are actually computed at every threshold internally. .. GENERATED FROM PYTHON SOURCE LINES 302-308 .. code-block:: Python # To visualize the confusion matrix at a different threshold, use the # ``threshold_value`` parameter. For example, a threshold of 0.3 will classify # more samples as positive: cm_display.plot(threshold_value=0.3) .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_007.png :alt: Confusion Matrix Decision threshold: 0.30 Positive label: allowed Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_007.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 309-311 We can normalize the confusion matrix to get percentages instead of raw counts. Here we normalize by true labels (rows): .. GENERATED FROM PYTHON SOURCE LINES 311-313 .. code-block:: Python cm_display.plot(normalize="true") .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_008.png :alt: Confusion Matrix Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_008.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 314-316 More plotting options are available via ``heatmap_kwargs``, which are passed to seaborn's heatmap. For example, we can customize the colormap and number format: .. GENERATED FROM PYTHON SOURCE LINES 316-319 .. code-block:: Python cm_display.set_style(heatmap_kwargs={"cmap": "Greens", "fmt": ".2e"}) cm_display.plot() .. image-sg:: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_009.png :alt: Confusion Matrix Data source: Test set :srcset: /auto_examples/model_evaluation/images/sphx_glr_plot_estimator_report_009.png :class: sphx-glr-single-img .. rst-class:: sphx-glr-script-out .. code-block:: none

.. GENERATED FROM PYTHON SOURCE LINES 320-322 Finally, the confusion matrix can also be exported as a pandas DataFrame for further analysis: .. GENERATED FROM PYTHON SOURCE LINES 322-325 .. code-block:: Python cm_display.frame() .. raw:: html

	true_label	predicted_label	value
0	allowed	allowed	451
1	allowed	disallowed	550
2	disallowed	allowed	168
3	disallowed	disallowed	13543

.. GENERATED FROM PYTHON SOURCE LINES 326-330 .. seealso:: For using the :class:`~skore.EstimatorReport` to inspect your models, see :ref:`example_feature_importance`. .. rst-class:: sphx-glr-timing **Total running time of the script:** (0 minutes 32.020 seconds) .. _sphx_glr_download_auto_examples_model_evaluation_plot_estimator_report.py: .. only:: html .. container:: sphx-glr-footer sphx-glr-footer-example .. container:: sphx-glr-download sphx-glr-download-jupyter :download:`Download Jupyter notebook: plot_estimator_report.ipynb ` .. container:: sphx-glr-download sphx-glr-download-python :download:`Download Python source code: plot_estimator_report.py ` .. container:: sphx-glr-download sphx-glr-download-zip :download:`Download zipped: plot_estimator_report.zip ` .. only:: html .. rst-class:: sphx-glr-signature `Gallery generated by Sphinx-Gallery `_

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name

Dispute_Status_for_Publication

Name_of_Associated_Covered_Device_or_Medical_Supply1

Name_of_Associated_Covered_Drug_or_Biological1

Physician_Specialty

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name

Dispute_Status_for_Publication

Name_of_Associated_Covered_Device_or_Medical_Supply1

Name_of_Associated_Covered_Drug_or_Biological1

Physician_Specialty

Please enable javascript

status

status

Please enable javascript

Applicable_Manufacturer_or_Applicable_GPO_Making_Payment_Name

Dispute_Status_for_Publication

Name_of_Associated_Covered_Device_or_Medical_Supply1

Name_of_Associated_Covered_Drug_or_Biological1

Physician_Specialty

status

Please enable javascript