Yes, they are comparable, simply because they are binary compatible.
On the other hand, in recent years even comparing two CPUs of the same model has stopped being reliable. Current benchmarking methods have not kept up with advances in dynamic power management, which change the clock rate while the test runs. To bring some comparability back, the big question is: what clock rate was each test's processor core running at during testing?
That extra clock rate data would itself be reported as an average, shown beside each individual score.
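As a rough illustration of the idea, here is a minimal Python sketch that samples the clock rate while a workload runs and reports the average next to the elapsed time. It assumes a Linux system that exposes the cpufreq sysfs file `scaling_cur_freq` for core 0; the names `read_cpu0_khz` and `run_with_clock` are made up for this example, and the workload is not pinned to core 0, so treat the number as an approximation rather than a precise measurement.

```python
import threading
import time
from statistics import mean

# Assumption: Linux cpufreq sysfs interface; the value is reported in kHz.
CPU0_FREQ_PATH = "/sys/devices/system/cpu/cpu0/cpufreq/scaling_cur_freq"


def read_cpu0_khz(path=CPU0_FREQ_PATH):
    """Return core 0's current frequency in kHz, or None if it cannot be read."""
    try:
        with open(path) as f:
            return int(f.read().strip())
    except (OSError, ValueError):
        return None


def run_with_clock(workload, read_khz=read_cpu0_khz, interval=0.01):
    """Run workload() and return (elapsed_seconds, average_mhz or None)."""
    samples = []
    stop = threading.Event()

    def sampler():
        # Take at least one sample, then keep sampling until asked to stop.
        while True:
            khz = read_khz()
            if khz is not None:
                samples.append(khz)
            if stop.wait(interval):
                break

    thread = threading.Thread(target=sampler)
    start = time.perf_counter()
    thread.start()
    try:
        workload()
    finally:
        stop.set()
        thread.join()
    elapsed = time.perf_counter() - start

    avg_mhz = mean(samples) / 1000 if samples else None
    return elapsed, avg_mhz


if __name__ == "__main__":
    # Hypothetical stand-in for a real benchmark kernel.
    elapsed, avg_mhz = run_with_clock(lambda: sum(i * i for i in range(5_000_000)))
    print(f"score (seconds): {elapsed:.3f}, average clock (MHz): {avg_mhz}")
```

The sampling thread itself adds a small load, so a real benchmark would more likely read hardware counters or pin the workload to the core being sampled. The point is only that each score gets an average clock rate published beside it.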