(Edit - I took a while working on this, and marks1080 posted while I was typing)
Come on guys, look at the scale. If you want to talk about small systems, like an industrial facility or a small city, or even something the size of our distribution service territory, then sure there's one frequency and there's a bit of minor phase angle difference. The two machine model equations work quite well given a few combinations into equivalent machines. That's one scale, that's where a lot of people have concentrated.
But the OP started out asking about "over a large region". So let's look a a large region.
From Williston Lake (Bennett Dam) in northern British Columbia to El Paso Texas is 1840 miles (2960km) as the crow flys (a very tired crow). They are both part of the Western Interconnect (WECC). For reference, Stockholm to Rome is only about 2/3 that distance. Assuming there was a great circle line from Williston Lake to El Paso an electrical impulse would take nearly 10 milliseconds to travel the distance. In the two machine model systems there's not much elasticity and any power transfer across an angle greater than 90 degrees slides off into instability. I've seen phase angle plots where northern British Columbia is leading San Diego by 135 degrees and there are stable power flows north to south across the entire distance. The power marketers can even sell BC Hydro power to SDG&E. It works because it isn't a two machine system; there are hundreds of machines along that path, each doing its part; the flow moves along a series of smaller angular differences. Each one of those nodes introduces a certain amount of elasticity into the system
Over those distances, and resulting times, things can happen. Most of the time you can't tell the difference in frequency between Williston Lake and El Paso, just the rotor angle difference. But now and then the system gets thumped. Frequencies deviate, RAS systems do their thing, system stability is maintained, the system recovers, nothing slips poles. Afterward plots are produced that show frequency (at least the units on the vertical axis are Hz, not degrees of rotor angle) at various locations around the interconnection. The frequencies aren't all the same, but they all get back to the same place eventually.
Maybe it's all one frequency that definitely changes and a whole bunch of rotor angle differences, maybe its different frequencies. It depends. From a practical point of view it's different frequencies and I'd expect underfrequency load shedding schemes to operate in certain areas and not in other areas. The relays that run the UF schemes will see different frequencies based on how they determine frequency.
If we go back many posts to Bill's line shaft analogy, if over a certain period of time the line shaft (a really long one) absorbs 720 degrees of twist before the torsional forces even things out again did the two ends have the same speed the whole time? Or was one end faster than the other for a while before the initially slower end caught up and ran faster to work the twist out of the shaft? Over some period of time the two ends made exactly the same number of revolutions, and the difference in accumulated revolutions was never more than two, but did they always turn at the same speed?