Using your example, I give one student a Math101 test and another Adv. Calc. The student aces the 101 test but the Adv Calc student gets a 'C'. Which student did better? As you keep saying, 'context matters'. Who they faced seems highly contextual.
I will say at least PFF is consistent. They rated Moore the worst LT while rating the guy he faced the best edge. It would be a little embarrassing if it didn't work out that way.
Bookmarks