Frank Hutter @ NeurIPS @FrankRHutter, Twitter Profile

Frank Hutter @ NeurIPS @FrankRHutter

2 years ago

This may revolutionize data science: we introduce TabPFN, a new tabular data classification method that takes 1 second & yields SOTA performance (better than hyperparameter-optimized gradient boosting in 1h). Current limits: up to 1k data points, 100 features, 10 classes. 🧵1/6



Frank Hutter @ NeurIPS @FrankRHutter

2 years ago

TabPFN is radically different from previous ML methods. It is meta-learned to approximate Bayesian inference with a prior based on principles of causality and simplicity. Here's a qualitative comparison to some sklearn classifiers, showing very smooth uncertainty estimates. 2/6



Frank Hutter @ NeurIPS @FrankRHutter

2 years ago

If you'd like to play with TabPFN yourself, here is a direct link to the Colab with a scikit-learn-like interface: colab.research.google.com/drive/194mCs6S…


JFPuget 🇺🇦 @JFPuget

2 years ago

@FrankRHutter Why is there a need for fit() given you say it is pretrained already? Your sklearn example has a fit() step before the prediction.

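The thread does not include an answer to this question. One plausible reading, offered here as an assumption rather than as the authors' explanation, is that for a pretrained in-context model fit() does no training at all: it only stores the labelled rows so that predict() can condition on them. The sketch below illustrates that pattern with a hypothetical class, ToyInContextClassifier, that is not part of TabPFN. A simple nearest-neighbour vote stands in for the pretrained transformer, and the size checks reuse the limits quoted in the first tweet.

```python
import math
from collections import Counter


class ToyInContextClassifier:
    """Hypothetical stand-in, NOT the TabPFN implementation.

    fit() learns no parameters; it only validates and stores the labelled
    rows as context. All computation happens in predict().
    """

    # Limits quoted in the first tweet of the thread.
    MAX_ROWS, MAX_FEATURES, MAX_CLASSES = 1000, 100, 10

    def fit(self, X, y):
        if len(X) == 0 or len(X) != len(y):
            raise ValueError("X and y must be non-empty and of equal length")
        if len(X) > self.MAX_ROWS:
            raise ValueError("more than 1000 data points")
        if any(len(row) > self.MAX_FEATURES for row in X):
            raise ValueError("more than 100 features")
        if len(set(y)) > self.MAX_CLASSES:
            raise ValueError("more than 10 classes")
        # No optimisation step: just keep the data as context.
        self.X_ = [list(row) for row in X]
        self.y_ = list(y)
        return self

    def predict(self, X, k=3):
        # Assumption: a k-nearest-neighbour vote replaces the pretrained
        # model's forward pass over (stored context, query rows).
        predictions = []
        for query in X:
            nearest = sorted(
                zip(self.X_, self.y_),
                key=lambda pair: math.dist(pair[0], query),
            )[:k]
            votes = Counter(label for _, label in nearest)
            predictions.append(votes.most_common(1)[0][0])
        return predictions


if __name__ == "__main__":
    X_train = [[0, 0], [0, 1], [1, 0], [5, 5], [5, 6], [6, 5]]
    y_train = [0, 0, 0, 1, 1, 1]
    clf = ToyInContextClassifier().fit(X_train, y_train)
    print(clf.predict([[0.2, 0.1], [5.5, 5.5]]))  # prints [0, 1]
```

Under this reading, keeping fit() preserves the familiar scikit-learn calling convention even though the expensive work was done once, ahead of time, during pretraining.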

Sebastian Raschka @rasbt

2 years ago

@FrankRHutter Just read through the paper, awesome fresh idea combining approximate Bayesian inference and transformers for tabular data! Re the 1k in your tweet: if I understood correctly, the synthetic datasets for the priors had up to 1024 examples, but in the paper you are referring to 2k?

