Abstract: Transformer-based models have achieved significant success in sequential recommendation tasks. However, they often suffer from over-smoothing and inference inefficiency when handling long ...