.. TALENT documentation master file ======================================================== TALENT: A Tabular Analytics and Learning Toolbox ======================================================== .. image:: ../resources/TALENT-LOGO.png :width: 1000px :align: center Welcome to **TALENT**, a comprehensive machine learning toolbox designed to enhance model performance on tabular data. TALENT integrates advanced deep learning models, classical algorithms, and efficient hyperparameter tuning, offering robust preprocessing capabilities to optimize learning from tabular datasets. The toolbox is user-friendly and adaptable, catering to both novice and expert data scientists. .. important:: If you use any content of this repo for your work, please make sure to cite the relevant papers as described in the `Citing TALENT` section below. ========================== Citing TALENT ========================== If you use **TALENT** in your research, please consider citing the following works: .. code-block:: bibtex @article{ye2024closerlookdeeplearning, title={A Closer Look at Deep Learning on Tabular Data}, author={Han-Jia Ye and Si-Yang Liu and Hao-Run Cai and Qi-Le Zhou and De-Chuan Zhan}, journal={arXiv preprint arXiv:2407.00956}, year={2024} } @article{liu2024talenttabularanalyticslearning, title={TALENT: A Tabular Analytics and Learning Toolbox}, author={Si-Yang Liu and Hao-Run Cai and Qi-Le Zhou and Han-Jia Ye}, journal={arXiv preprint arXiv:2407.04057}, year={2024} } ========================== What's New ========================== Here are the recent updates to **TALENT**: - [2025-06]🌟 Add `TabAutoPNPNet `_ (Electronics 2025) - [2025-06]🌟 Add `TabICL `_ (ICML 2025). The current code is based on TabICL v0.1.2. - [2025-05]🌟 Check out our three papers `MMTU `_, `Tabular-Temporal-Shift `_, and `BETA `_ accepted at ICML 2025! - [2025-04]🌟 Check out our new survey `Representation Learning for Tabular Data: A Comprehensive Survey `_ (`Repo `_). We organize existing methods into three main categories according to their generalization capabilities: specialized, transferable, and general models, which provides a comprehensive taxonomy for deep tabular representation methods.🚀🚀🚀 - [2025-02]🌟 Add `T2Gformer `_ (AAAI 2023). - [2025-02]🌟 Add `TabPFN v2 `_ (Nature). - [2025-02]🌟 Thanks to `Hengzhe Zhang `_ for providing a `Scikit-Learn compatible wrapper `_ for TALENT! - [2025-01]🌟 Check out our new baseline `ModernNCA `_ (**ICLR 2025**), inspired by traditional **Neighbor Component Analysis**, which outperforms both tree-based and other deep tabular models, while also reducing training time and model size!🚀🚀🚀 - [2025-01]🌟 Check out our `latest version of the benchmark paper `_ for updated and expanded results and analysis! - [2025-01]🌟We have curated and released `new benchmark datasets `_, along with updated `results `_ of the dataset across a broader range of methods. This update focuses on enhancing dataset quality, including removing duplicates, and correcting tasks where bin-class was mistakenly treated as regression. We have also separated the larger datasets and formed the basic benchmark (300 datasets, including 120 bin-class, 80 multi-class, and 100 regression), and the large benchmark (22 datasets). - [2024-12]🌟 Add `TabM `_ (ICLR 2025). - [2024-09]🌟 Add `Trompt `_ (ICML 2023). - [2024-09]🌟 Add `AMFormer `_ (AAAI 2024). - [2024-08]🌟 Add `GRANDE `_ (ICLR 2024). - [2024-08]🌟 Add `Excelformer `_ (KDD 2024). - [2024-08]🌟 Add `MLP_PLR `_ (NeurIPS 2022). - [2024-07]🌟 Add `RealMLP `_. - [2024-07]🌟 Add `ProtoGate `_ (ICML 2024). - [2024-07]🌟 Add `BiSHop `_ (ICML 2024). - [2024-06]🌟 Check out our new baseline `ModernNCA `_, inspired by traditional **Neighbor Component Analysis**, which outperforms both tree-based and other deep tabular models, while also reducing training time and model size! - [2024-06]🌟 Check out our `benchmark paper `_ about tabular data, which provides comprehensive evaluations of classical and deep tabular methods based on our toolbox in a fair manner! .. note:: If you want to view **benchmark results**, please visit: `https://6sy666.github.io/TALENT-Results/ `_ To explore **default hyperparameters and search spaces** of methods in this toolbox, check: `https://6sy666.github.io/TALENT-Configs/ `_ ========================== Contents ========================== .. toctree:: :maxdepth: 2 :caption: Tutorials tutorials .. toctree:: :maxdepth: 2 :caption: Methods methods .. toctree:: :maxdepth: 2 :caption: Dependencies dependencies .. toctree:: :maxdepth: 2 :caption: Benchmark_Datasets benchmark_datasets .. toctree:: :maxdepth: 2 :caption: Experimental_Results experimental_results .. toctree:: :maxdepth: 1 :caption: API Docs api/core api/classical_methods api/deep_learning api/lib .. toctree:: :maxdepth: 2 :caption: Acknowledgements acknowledgements