I would guess you will just make a dashboard(s) to compare them, the same way you will do it for any other metric.
Is there something specific that you think won’t work? (obviously, if it’s datadog specific, it’s better to ask their support as they will likely be better equipped to help you).
I guess the biggest problem will be the fact that unlike most other measurements you will have huge(week-long) holes in your data, but you can just zero this out and have 2 dashboards with different time intervals shown. So it will look something like the cloud test comparision.
If you want just a single number … I guess datadog has away to do as well, I have seen dashboards doing it, but I am not versed in datadog - again their support will likely be able to help you a lot more on datadog specific questions.
On the other hand just because (for example) the p(95) of something is lower this week, doesn’t mean that it’s all better, maybe the median has raised 10x and the p(99) is also huge just the p(95) is down, so you will need to figure out what is important to your case and monitor it.