I think we've got a handle on it now. Some of the servers didn't get the event data - was causing some strange results and it took a while to figure out what was happening. At least according to my monitoring, it looks like the loops are breaking.

Why loops happen in the first place is an excellent question - going to follow up on that to make sure this can't happen again.