Clarify DSU L3D counter behavior under write-streaming
Ascertain whether the comparable values observed for the Arm DynamIQ Shared Unit (DSU) L3D performance counter when write-streaming mode is enabled versus disabled on Cortex-A55–based Rockchip RK3568 and RK3588 systems are caused by (i) a defect or undocumented behavior in the Cortex-A55 write-streaming mechanism that still allocates or accounts traffic in the L3 cache, or (ii) a mis-specification or implementation issue in the DSU L3D performance counter; and rigorously characterize the correct event-counting semantics of DSU L3D in write-streaming mode.
References
This counters the expectation that L3D does not count in write-streaming mode as the data accessed are not being allocated to the cache. We are not sure if this is an error in the behavior of write-streaming mode or counting implementation of L3D counter.