fix: use NOT EXISTS for superseding score set filter to prevent row m… by bencap · Pull Request #707 · VariantEffect/mavedb-api

bencap · 2026-04-14T00:43:20Z

The score set search query filtered out superseded score sets using a LEFT OUTER JOIN on the superseding_score_set relationship. Because replaces_id has no unique constraint, score sets with multiple superseding versions produced N rows per original, all counted against the SQL LIMIT. This caused paginated searches to return fewer unique score sets than requested (~84 instead of 100 on prod).

Replace the LEFT JOIN + OR filter with a NOT EXISTS subquery (via .has()), which produces exactly one row per score set regardless of how many superseders exist. Also strengthens the regression test to use multiple keywords per experiment and adds a new test for the multiple-superseders scenario.

Opens #706, which is the root cause of this issue. This fix mitigates consequences for the search endpoint specifically, but does not address all issues caused by the bug.

Although it turns out the joined_loads weren't the root cause of this specific issue, I'm leaving the new select_in_loads as they still represent an improvement over the prior code and could protect us from future row multiplication.

…ultiplication in search The score set search query filtered out superseded score sets using a LEFT OUTER JOIN on the superseding_score_set relationship. Because replaces_id has no unique constraint, score sets with multiple superseding versions produced N rows per original, all counted against the SQL LIMIT. This caused paginated searches to return fewer unique score sets than requested (~84 instead of 100 on prod). Replace the LEFT JOIN + OR filter with a NOT EXISTS subquery (via .has()), which produces exactly one row per score set regardless of how many superseders exist. Also strengthens the regression test to use multiple keywords per experiment and adds a new test for the multiple-superseders scenario.

bencap linked an issue Apr 14, 2026 that may be closed by this pull request

Score set search returns fewer results than expected due to row multiplication in query #675

Open

bencap requested review from jstone-dev and sallybg April 14, 2026 00:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: use NOT EXISTS for superseding score set filter to prevent row m…#707

fix: use NOT EXISTS for superseding score set filter to prevent row m…#707
bencap wants to merge 1 commit intorelease-2026.1.3from
bugfix/bencap/675/search-row-multiplication

bencap commented Apr 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

bencap commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

bencap commented Apr 14, 2026 •

edited

Loading