There isn't necessarily a contradiction there, and there may be an explanation.
Go back and look at the strength tests that the SPTs were correlated with. In the case of T&P, and I believe also Sowers, the reference test was the unconfined compressive strength. Less-plastic materials suffer more disturbance in sampling and handling, which may explain why they (at least on average) give lower qu for a given N. Regarding Stroud and Butler, Delijosi wrote "SPT N value was referenced to lab Su," but was not specific. Assuming that was something other than unconfined, the effects of disturbance are compensated to some extent; hence, there could be less of a trend or an opposite trend in Su vs PI, for a given N.
How good were the fits? [I haven't seen the actual data for either T&P or Sowers (or Stroud).] Did they control for all the relevant variables that would create apparent trends where none exist? For example, did all of their high-PI data come from a particular stratum that happened to be at a greater depth or had higher Ns? One correlation I remember, but can't find, shows the slope of Su vs N decreasing with increasing N. Was that effect involved? Were the same hammers used at all the sites, or did some get tested with doughnut hammers while others had CME auto hammers? Does sensitivity affect it, and can that sensitivity be linked with PI in such a way as to explain the trend? Were enough different sites and materials included to show that the trends are valid in general and not just between site A and site B?
Considering the mechanism by which clayey soil resists the penetration of split-barrel sampler, is there any theoretical reason why a more (or less) plastic clay with a given Su should provide greater penetration resistance?
BTW - where was Stroud and Butler 1975 published?