vllm.ir.ops.layernorm ¶
fused_add_rms_norm ¶
fused_add_rms_norm(
x: Tensor,
x_residual: Tensor,
weight: Tensor | None,
epsilon: float,
variance_size: int | None = None,
) -> tuple[Tensor, Tensor]
Fused add and weighted root-mean-square layer normalization
Source code in vllm/ir/ops/layernorm.py
rms_norm ¶
rms_norm(
x: Tensor,
weight: Tensor | None,
epsilon: float,
variance_size: int | None = None,
) -> Tensor
Weighted root-mean-square layer normalization