dabl.plot.plot_classification_continuous¶Plots for continuous features in classification.
Selects important continuous features according to F statistics. Creates univariate distribution plots for these, as well as scatterplots for selected pairs of features, and scatterplots for selected pairs of PCA directions. If there are more than 2 classes, scatter plots from Linear Discriminant Analysis are also shown. Scatter plots are determined “interesting” is a decision tree on the two-dimensional projection performs well. The cross-validated macro-average recall of a decision tree is shown in the title for each scatterplot.
Input data including features and target.
Identifier of the target column in X.
Output of detect_types on X. Can be used to avoid recomputing the types.
Alpha values for scatter plots. ‘auto’ is dirty hacks.
Marker size for scatter plots. ‘auto’ is dirty hacks.
Supported: ‘histogram’ and ‘kde’.
Whether to drop outliers when plotting.
Whether to create pairwise plots. Can be a bit slow.
How many pairwise interactions to consider (ranked by univariate f scores). Runtime is quadratic in this, but higher numbers might find more interesting interactions.
Random state used for subsampling for determining pairwise features to show.
Notes
important kwargs parameters are: scatter_size and scatter_alpha.