Enable callback logging only for full eval on GEPA #9050

TomeHirata · 2025-11-12T07:30:00Z

Instead of relying on the "capture_trace" option, enable or disable callback logging based on the sample size, since the sample size is a more accurate indicator of whether an evaluation is a full eval.


LakshyAAAgrawal · 2025-11-12T08:53:03Z

Hi @TomeHirata, one thing to be aware of is that GEPA will soon support running eval on a subset of the valset rather than on all len(valset) examples, similar to what MIPRO does, so this scheme may stop working then. I am trying to think of a good solution; maybe tracking reflection_minibatch_size would be better here?

TomeHirata · 2025-11-12T09:51:45Z

@LakshyAAAgrawal thanks for sharing the plan. So should we disable logging when len(valset) == reflection_minibatch_size?

LakshyAAAgrawal · 2025-11-13T00:48:47Z

I think len(valset) <= reflection_minibatch_size should be good.

Updated the DspyAdapter and GEPA classes to replace the full_eval_size parameter with reflection_minibatch_size for improved clarity. Adjusted the evaluate method and corresponding tests to reflect this change, ensuring callback metadata is correctly generated based on the new parameter.

Signed-off-by: TomuHirata <[email protected]>

TomeHirata · 2025-11-13T03:11:05Z

@LakshyAAAgrawal sounds good, updated
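
For readers following the thread, the rule agreed on above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code merged in this PR: the function names `should_log_callbacks` and `build_callback_metadata` and the `disable_logging` key are hypothetical, and the choice to log everything when `reflection_minibatch_size` is unset is also an assumption.

```python
from typing import Optional, Sequence


def should_log_callbacks(batch: Sequence, reflection_minibatch_size: Optional[int]) -> bool:
    """Return True when the batch looks like a full eval rather than a reflection minibatch.

    Hypothetical helper: logging is disabled when
    len(batch) <= reflection_minibatch_size, as proposed in the discussion.
    """
    if reflection_minibatch_size is None:
        # Assumption: with no minibatch size configured, treat every eval as a full eval.
        return True
    return len(batch) > reflection_minibatch_size


def build_callback_metadata(batch: Sequence, reflection_minibatch_size: Optional[int]) -> dict:
    """Build illustrative callback metadata; the key name is a placeholder."""
    return {"disable_logging": not should_log_callbacks(batch, reflection_minibatch_size)}


if __name__ == "__main__":
    valset = list(range(50))
    minibatch = valset[:3]
    print(build_callback_metadata(valset, reflection_minibatch_size=3))
    print(build_callback_metadata(minibatch, reflection_minibatch_size=3))
```

Using `<=` rather than `==` means a batch smaller than the configured minibatch size (for example, the last partial minibatch) is also treated as a minibatch eval and does not trigger logging.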

TomeHirata added 3 commits September 20, 2025 14:57

Limit callback metadata to trace capture path

7f05c25

use batch length for eval

2c41ae0

Merge remote-tracking branch 'dspy/main' into codex/add-callback_meta…

4a6b33e

…data-option-to-evaluate

TomeHirata requested a review from LakshyAAAgrawal November 12, 2025 07:30

LakshyAAAgrawal approved these changes Nov 13, 2025


TomeHirata merged commit 4dd085c into stanfordnlp:main Nov 13, 2025
12 checks passed
