Does the current IA3 include the two additional loss terms? #930
-
I was going through the source code of IA3, but it doesn't seem to contain the two additional loss terms proposed by the author. unlikelihood loss (Lul) and a length-normalized loss (Lln). Am I missing anything here? Could someone point to me where it's implemented? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 6 replies
-
Hi, IA3 - “Infused Adapter by Inhibiting and Amplifying Inner Activation" is simply the PEFT method that adds additional parameters. You seem to be referring to the loss functions in the full T-Few recipe. You should have a look at the official implementation from the authors here |
Beta Was this translation helpful? Give feedback.
Yes that's right! Custom loss functions don't seem to fit in with the goal of the PEFT library (which is why it only implements IA3, not the full training recipe in T-few), so I don't think this is coming anytime soon. I will still cc @younesbelkada for a final word.