[Feature]: Integrate AITER MLA V1 from Upstream

### 🚀 The feature, motivation and pitch

After syncing with ROCm/vllm main, cherry-pick the upstream changes that add AITER MLA support for the V1 engine.

Prepare the command to run:
```bash
VLLM_ATTENTION_BACKEND=ROCM_AITER_MLA VLLM_USE_V1=1 VLLM_ROCM_USE_AITER=1 \
  vllm serve /amdhome/models/DeepSeek-R1 \
  --trust-remote-code \
  -tp 8 \
  --max-model-len 32768 \
  --block-size 1 \
  --no-enable-prefix-caching \
  --max-num-batched-tokens 32768 \
  --max-num-seqs 1024
```

Suggest a few configurations for the testing team to benchmark.
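
As a starting point for that list, the sweep below is a minimal sketch that only prints one client-side benchmark command per combination of input length, output length, and concurrency. The lengths, concurrency levels, and the `4 x concurrency` prompt count are placeholder assumptions, not agreed test points, and the sketch assumes `benchmarks/benchmark_serving.py` with its random dataset options is available in the synced checkout.

```bash
#!/usr/bin/env bash
# Sketch: print (not run) one benchmark command per configuration.
set -euo pipefail

MODEL=/amdhome/models/DeepSeek-R1   # same model path as the serve command
MAX_MODEL_LEN=32768                 # matches --max-model-len on the server

# Assumed sweep values; replace with whatever the testing team agrees on.
INPUT_LENS=(1024 8192)
OUTPUT_LENS=(128 1024)
CONCURRENCIES=(16 64 256)

gen_bench_cmds() {
  local in_len out_len conc
  for in_len in "${INPUT_LENS[@]}"; do
    for out_len in "${OUTPUT_LENS[@]}"; do
      for conc in "${CONCURRENCIES[@]}"; do
        # Skip shapes that cannot fit in the server's context window.
        if (( in_len + out_len > MAX_MODEL_LEN )); then
          continue
        fi
        # Assumption: 4 prompts per concurrent slot is enough to reach steady state.
        echo "python benchmarks/benchmark_serving.py --backend vllm" \
             "--model ${MODEL} --dataset-name random" \
             "--random-input-len ${in_len} --random-output-len ${out_len}" \
             "--max-concurrency ${conc} --num-prompts $(( conc * 4 ))"
      done
    done
  done
}

gen_bench_cmds
```

Printing the commands instead of executing them keeps the list easy to review and trim before any GPU time is spent; piping the output to a file gives the testing team a plain run list.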

### Alternatives

_No response_

### Additional context

_No response_

### Before submitting a new issue...

- [x] Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the [documentation page](https://docs.vllm.ai/en/latest/), which can answer lots of frequently asked questions.


[Feature]: Integrate AITER MLA V1 from Upstream #62
