John will certainly remember the old-school SGML data modelers, who said "just model the data", i.e. ignore any usage or application constraints.
But I think the onus needs to be on content architects who develop new intermediate formats to demonstrate that those formats represent the shortest distance between the inputs and outputs. (Or that they have some other hard, non-theoretical benefit.)
(I was sacked from a job once, in the 90s, for suggesting that extended HTML would suffice.)