-
Classification Metrics Sparse Support Bug (Issue #32036): A bug where classification metrics in scikit-learn claim sparse matrix support in docstrings but raise an error when used with sparse inputs. The issue is reliably reproducible with provided code steps, expected (support) vs. actual behavior (TypeError), and environment details in the traceback. No major missing elements. Link
-
RandomizedSearchCV Feature Request (Issue #32032): A proposal to add weights for controlling the probability of selecting items in a list of parameter distributions, useful for complex pipelines with interdependent hyperparameters. This is a feature enhancement, not a bug, and includes clear examples and rationale. Link
-
CI Failure on Linux Build (Issue #32022): Reported CI failure on a specific build configuration, with a reference to logs but no detailed steps to reproduce, expected behavior, or root cause analysis. More information on the failure context would be helpful for quicker resolution—feel free to add details like error logs or reproduction steps! Link
-
Website Logo Truncation (Issue #32011): A UI issue where the scikit-learn logo appears truncated on the website, with a suggestion to use the existing SVG file for better scaling. It's easily reproducible by visiting the site, and includes visual examples, but no specific environment details are needed. Low-impact cosmetic fix. Link
-
Themes: The issues cover core functionality bugs (e.g., sparse data handling), feature enhancements for advanced users (e.g., hyperparameter tuning), infrastructure reliability (e.g., CI failures), and minor UI improvements (e.g., website aesthetics). A common thread is improving usability and accuracy in data handling and development workflows.
-
Prioritization Based on Impact:
- High Priority: Address the sparse matrix bug and CI failure first, as they could affect user functionality and team productivity (e.g., sparse data is critical for large-scale applications, and CI issues may block merges).
- Medium Priority: The RandomizedSearchCV feature request could enhance efficiency for complex models, benefiting users with advanced needs.
- Low Priority: The logo truncation is a quick win for polish but has minimal impact on core operations—consider it if resources allow for minor updates.
Using
openai/gpt-5-mini
. All the previous ones usedxai/grok-3-mini
Classification metrics don't seem to support sparse labels (bug)
Allow weighting of list items passed to RandomizedSearchCV (feature request)
CI failure: Linux_Nightly.pylatest_pip_scipy_dev job (infrastructure/ci)
Website logo is truncated; propose using SVG from repo (UI/website)