I found some instances where the same publication gets completely different publication_text in various versions. The texts don't seem to be related to the article at all. The situation seems to be happening quite often to The News Lens (producer id 5030BBED81FE11EA8627F23C92E71BAD) and ebc.net (producer id 5030A5E181FE11EA8627F23C92E71BAD)
Here's are the two examples I found (one for each aforementioned producer):
select * from publication
where publication_id = unhex("51868BFA0DB0464C8A94B5F3C3DEEB35")
and
select * from publication
where publication_id = unhex("69A12537455F429B9C264F25775130F4")