Conversation
Related PR for inverting the label cleaner: autogluon/autogluon#5482
We ran a few benchmarks and are in contact with the first author of the model to improve the method and fix some bugs in their package. We are waiting for the final method and for all bugs to be fixed before running the full benchmark.
Notes for later:
# Conflicts:
#	tabarena/pyproject.toml
#	tabflow_slurm/run_setup_slurm_jobs.py
I updated to the newest version and verified that the label column name is passed to the model. Once compute is available, I will rerun the results with tuning, submit them to the official leaderboard, and then merge TabSTAR.
Started TabArena-Lite runs as a sanity check; then I will move on to the full benchmark and get results for the submission.
I reduced the maximum number of configurations for HPO to 50 for now. Since I stopped the runs midway (after about two weeks), the final submission may contain datasets with more than 50 configs. If needed, we can re-evaluate running more configs later.
It is still running into issues and taking too long, so I reduced it to 25 random configs.
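For illustration, capping random-search HPO at 25 configurations can be sketched as below. This is a minimal, hypothetical example: `sample_hpo_configs` and the search-space shape are assumptions for this sketch, not the benchmark's actual config sampler.

```python
import random

def sample_hpo_configs(search_space, max_configs=25, seed=0):
    """Draw at most `max_configs` random configurations from a discrete search space.

    `search_space` maps hyperparameter names to lists of candidate values.
    (Hypothetical helper, not the benchmark's real sampler.)
    """
    rng = random.Random(seed)  # fixed seed so the sampled configs are reproducible
    return [
        {name: rng.choice(values) for name, values in search_space.items()}
        for _ in range(max_configs)
    ]

# Example usage with a toy search space
space = {"learning_rate": [1e-4, 3e-4, 1e-3], "batch_size": [32, 64, 128]}
configs = sample_hpo_configs(space, max_configs=25)
print(len(configs))  # 25
```

Each config is drawn independently, so duplicates are possible for small spaces; the cap only bounds how many configs get evaluated.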
We now have results with at least 25 configs per dataset:
Raw data: https://data.lennart-purucker.com/tabarena/leaderboard_submissions/data_TabSTAR_02032026.zip
I will merge this PR once I am done with perpetual as well (and then rebase, etc.).
# Conflicts:
#	tabarena/pyproject.toml
#	tabarena/tabarena/benchmark/models/model_registry.py
#	tabarena/tabarena/models/utils.py
#	tabflow_slurm/run_setup_slurm_jobs.py
#	tabflow_slurm/setup_slurm_base.py
#	tabflow_slurm/simple_evaluation/run_eval_for_new_model.py
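The "at least 25 configs per dataset" check can be sketched with a simple count over the result records. This is purely illustrative: `datasets_with_min_configs` and the `(dataset, config_id)` record shape are assumptions for this sketch, not the actual leaderboard tooling.

```python
from collections import Counter

def datasets_with_min_configs(results, min_configs=25):
    """Return dataset names whose number of evaluated configs meets the threshold.

    `results` is an iterable of (dataset_name, config_id) records
    (hypothetical shape for this sketch).
    """
    counts = Counter(dataset for dataset, _ in results)
    return sorted(d for d, n in counts.items() if n >= min_configs)

# Toy example: one dataset with 30 evaluated configs, one with only 10
records = [("adult", i) for i in range(30)] + [("covertype", i) for i in range(10)]
print(datasets_with_min_configs(records))  # ['adult']
```

Datasets below the threshold would then be flagged for additional runs rather than silently included.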

Add support for the TabSTAR model (https://arxiv.org/abs/2505.18125).
Working on a few more TODOs before starting the benchmark.