Refactor SequentialRecommender and add Sequential models: SASRec, BERT4Rec, GPT2Rec, FPMC by hieuddo · Pull Request #694 · PreferredAI/cornac

hieuddo · 2026-05-29T15:29:57Z

Description

Refactor NextItemRecommender to SequentialRecommender with two settings: session-based and session-aware.
Rename NextItemEvaluation to SequentialEvaluation
Add new sequential models: SASRec, BERT4Rec, GPT2Rec (similar to BERT4Rec, but with GPT2 architecture)
Refactor existing GRU4Rec compatible with session-aware setting

Quick run with 10 epochs from examples/sequential_diginetica.py:

Model	AUC	MRR	NDCG@10	NDCG@50	Recall@10	Recall@50	Train (s)	Test (s)
FPMC-sbr	0.7579	0.1254	0.1430	0.1686	0.2242	0.3398	2.8742	0.4422
GRU4Rec-sbr	0.8051	0.0611	0.0632	0.1035	0.1068	0.2907	6.0934	0.7539
SASRec-sbr	0.7902	0.3884	0.4216	0.4295	0.5289	0.5657	2.2551	1.4375
BERT4Rec-sbr	0.7880	0.3812	0.4140	0.4221	0.5201	0.5569	6.3330	1.4493
GPT2Rec-sbr	0.8111	0.3702	0.4021	0.4139	0.5096	0.5604	1.3811	1.4056
FPMC-sar	0.7480	0.1195	0.1341	0.1614	0.2067	0.3275	0.3568	0.3948
GRU4Rec-sar	0.8029	0.0754	0.0820	0.1146	0.1349	0.2837	3.4652	1.9199
SASRec-sar	0.7890	0.3891	0.4208	0.4297	0.5236	0.5639	1.6576	1.3925
BERT4Rec-sar	0.7910	0.3768	0.4080	0.4184	0.5114	0.5552	1.4889	1.3984
GPT2Rec-sar	0.8243	0.3790	0.4084	0.4217	0.5079	0.5674	1.3703	1.3644

Related Issues

Checklist:

I have added tests.
I have updated README.md (if you are adding a new model).
I have updated examples/README.md (if you are adding a new example).

…ding FPMC, BERT4Rec, SASRec, GPT2Rec, and updated GRU4Rec

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

qtuantruong · 2026-05-29T22:32:10Z

holy moly, really long PR

hieuddo · 2026-05-30T03:49:40Z

holy moly, really long PR

Haha. Most of these are from https://github.com/PreferredAI/CoVE/tree/main/models in our joint work with @lthoang

qtuantruong · 2026-05-31T04:19:02Z

Things to check carefully:

`SequentialEvaluation` breaks `SIT` / `SITJson` session-based evaluation

ranking_eval always iterates with test_set.usi_iter(...) and evaluates only session_items[-1]. For SIT formats, every row is mapped to uid=None, so all sessions become one pseudo-user. The updated Yoochoose examples will score only the last test session instead of all sessions.

References:

cornac/eval_methods/sequential_evaluation.py:64
cornac/data/dataset.py:1176

Session-aware evaluation never includes training history

history_items is built only from sessions present in the test or validation split, while train_set is only used for item masks. With a common one-held-out-session-per-user split, session-aware models are evaluated with only the current test prefix, or no history for first / any, so the cross-session behavior being added is not actually measured.

Reference:

eval_methods/sequential_evaluation.py:97

`user_based` is ignored

The function accepts user_based, but aggregation always averages over users via user_sessions.keys() with no session-level branch. Experiment(user_based=False) therefore cannot reproduce the old session-averaged next-item behavior.

Reference:

cornac/eval_methods/sequential_evaluation.py:123

FPMC cold-start user fallback does not trigger for unseen eval users

Because the shared uid_map is mutated while building test and validation sets, self.total_users includes users that were never trained. score() compares against total_users, so those users get random untrained embeddings instead of the padding row. Compare against self.num_users or explicitly track train-known users.

Reference:

cornac/models/fpmc/recom_fpmc.py:253

qtuantruong · 2026-05-31T04:40:40Z

If you don't mind, I would recommend to break it down into multiple small PRs so the verification is manageable.

hieuddo · 2026-05-31T05:48:59Z

If you don't mind, I would recommend to break it down into multiple small PRs so the verification is manageable.

Yeah. I'm closing this PR.

Will discuss with @lthoang further about the session-based vs. session-aware design.

Currently, all the models we're adding are session-based models. We don't have any session-aware model yet. I'm trying to anticipate and future-proof the design for both session-based and session-aware settings, and it turns out to be more complicated than necessary. The reason for this is because https://github.com/PreferredAI/CoVE are meant for session-aware setting.

Would you recommend dropping all session-aware support and keeping it simple with session-based only, then work on that later (because maybe YAGNI lol)?

qtuantruong · 2026-05-31T17:26:19Z

Yes, I’d recommend dropping session-aware support from this round and keeping the first PR session-based only. We can rename NextItemEvaluation to SequentialEvaluation, but preserve the existing NextItemEvaluation behavior, including SIT/SITJson examples and evaluation semantics. Then we can add the session-based models in separate PRs. Finally, we can open a dedicated design PR/RFC for session-aware support after we agree on the evaluation contract.

hieuddo added 9 commits May 29, 2026 22:42

feat: refine session-based recommendation framework with models inclu…

122a59d

…ding FPMC, BERT4Rec, SASRec, GPT2Rec, and updated GRU4Rec

feat: model selection

979601e

migrate NextItemRecommender to SequentialRecommender

7747078

add alias for SBR and SAR

e91947c

add example for sequential models

28006af

model_selection for FPMC

05831f8

add requirements.txt for sequential models

24b30df

add FPMC to example

d808b5b

update models in README

3db6f09

Copilot AI review requested due to automatic review settings May 29, 2026 15:29

Copilot started reviewing on behalf of hieuddo May 29, 2026 15:30 View session

Copilot AI reviewed May 29, 2026

View reviewed changes

remove unused nonlocal

71ee633

hieuddo requested a review from lthoang May 29, 2026 15:57

hieuddo closed this Jun 1, 2026

hieuddo deleted the seq branch June 4, 2026 05:27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor SequentialRecommender and add Sequential models: SASRec, BERT4Rec, GPT2Rec, FPMC#694

Refactor SequentialRecommender and add Sequential models: SASRec, BERT4Rec, GPT2Rec, FPMC#694
hieuddo wants to merge 10 commits into
PreferredAI:masterfrom
hieuddo:seq

hieuddo commented May 29, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

qtuantruong commented May 29, 2026

Uh oh!

hieuddo commented May 30, 2026

Uh oh!

qtuantruong commented May 31, 2026

Uh oh!

qtuantruong commented May 31, 2026

Uh oh!

hieuddo commented May 31, 2026

Uh oh!

qtuantruong commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hieuddo commented May 29, 2026

Description

Related Issues

Checklist:

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

qtuantruong commented May 29, 2026

Uh oh!

hieuddo commented May 30, 2026

Uh oh!

qtuantruong commented May 31, 2026

Things to check carefully:

SequentialEvaluation breaks SIT / SITJson session-based evaluation

Session-aware evaluation never includes training history

user_based is ignored

FPMC cold-start user fallback does not trigger for unseen eval users

Uh oh!

qtuantruong commented May 31, 2026

Uh oh!

hieuddo commented May 31, 2026

Uh oh!

qtuantruong commented May 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

`SequentialEvaluation` breaks `SIT` / `SITJson` session-based evaluation

`user_based` is ignored