Research & code: Research director @inria
►Data, Health, & Computer science
►Python coder, (co)founder of scikit-learn, joblib, & @probabl.bsky.social
►Sometimes does art photography
►Physics PhD
Gaël Varoquaux
In this high-level yet detailed blog post, I expand on the statistics that hold together AI and data science:
blog.probabl.ai/data-science...
- It is now possible to pass arguments to the scorers in Data Ops, such as sample weights.
- Diagrams for the Learner and parameter searches now include the full DataOp graph in their notebook repr.
- It is now possible to find nodes by name in the DataOp graph.
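As a rough illustration of what passing sample weights to a scorer means, here is a minimal plain-Python sketch of a weighted accuracy. The function name `weighted_accuracy` and the toy data are hypothetical, and this is not the skrub API; it only shows how weights change the score.

```python
def weighted_accuracy(y_true, y_pred, sample_weight=None):
    """Fraction of correct predictions, each sample counted by its weight.

    Hypothetical helper for illustration only (not part of skrub).
    """
    if sample_weight is None:
        # Without weights, every sample counts equally.
        sample_weight = [1.0] * len(y_true)
    total = sum(sample_weight)
    correct = sum(
        w for t, p, w in zip(y_true, y_pred, sample_weight) if t == p
    )
    return correct / total


# Toy data: the first sample is correct and carries three times the weight.
y_true = [1, 0, 1, 1]
y_pred = [1, 1, 1, 0]
weights = [3.0, 1.0, 1.0, 1.0]

unweighted = weighted_accuracy(y_true, y_pred)
weighted = weighted_accuracy(y_true, y_pred, sample_weight=weights)
```

With these toy values, two of four predictions are correct, so the unweighted score is 0.5, while the weighted score rises to 4/6 because the heavily weighted sample is predicted correctly.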
The CfP deadline for Compute! Paris 2026 was extended to Sunday, June 7! Just a few days left to submit a proposal on Open Source scientific compute, data science, ML & AI topics.
Conference dates and venue: November 25–26, 2026, Sorbonne Université · Paris
compute.events/paris2026/cf...
FAQ on NeurIPS Europe:
NeurIPS Europe is an official NeurIPS 2026 satellite event taking place in Paris, France, alongside the main conference in Sydney and the other satellite event in Atlanta.
NeurIPS authors can present their papers at any of the three locations, subject to space availability.
- The TableReport now uses plot_distributions and compute_associations to control the distribution and association tabs respectively.
- The cleaner now lets you control whether numeric-looking strings ("['1', '2', '3']") should be parsed to float.
- fuzzy_join and Joiner now let you choose the metric used for matching.
- ApplyToCols now has the exclude_cols parameter, to define which columns should not be transformed.