🚀 The feature, motivation and pitch
Right now, we are doing some redundant/duplicate computation in those to compute different types of metadata for various attention backends. Revisit those and cut down on the required ops
Alternatives
No response
Additional context
No response
Before submitting a new issue...