Varying model fit accuracy of custom BoTorch model (Multi-task problem) #2961
-
Hi everyone! I am working on transfer learning in the Bayesian Optimization problem, where I aim to transfer knowledge from one system (source function) to another (target function) to facilitate Bayesian Optimization of the latter (see figure below). ![]() Initially, I employed MultiTaskGP; however, I encountered the so-called negative* transfer, a phenomenon when knowledge from less related source task "hurt" the target performance. Especially when the source function contains numerous points in regions that differ significantly from the target function, leading to a poor (over-confident) model fit at the initial stage of optimization. ![]() To address this issue, I implemented a method called envGP, which was suggested by Shilton et al. to mitigate negative transfer in multi-task BO. However, the model fit is somewhat unstable, i.e. sometimes the model fit is successful and sometimes not (see figure). Do you have any ideas why this could be happening and how to overcome this behavior? I would appreciate any feedback and/or suggestions. I also include my code implementation for convenience.
TEST
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 5 replies
-
Interesting and thanks for the great visualizations! I'm wondering whether model-fitting truly has failed vs. whether there are many different sets of hyperparameter values that can fit the data equally well. What are the mll values from each model fit? I also wonder if changing the |
Beta Was this translation helpful? Give feedback.
We're using transfer learning under the assumption that the target and source tasks are nicely correlated. If I don't know anything else about the target task and I can't assume that they're well correlated, there isn't anything to transfer. So,
MultiTaskGP
is a bit of a gamble that when we don't have much information about the target task, the source task will help us generate good candidates.But then what happens if they're actually not that related? Before generating candidates, we can evalua…