Making this work with relative position bias from XTransformers

Is there a way to make this work with `RelativePositionBias`. Currently this produces an attention bias of size $BHN^2$ where B is batch size, H is number of heads and N is input size. Can this be chunked and computed per chunk?