There is something especially elegant about letting the hash of data identify the data. This is used to a degree in the Data Bank proposal. A complex part of that proposal is establishing the lifetime of data: how long must the bank keep it? Indexing by hash may also substantially increase cost for things the size of pages.
Hash roots, on the other hand, may be well suited to identifying checkpoints.
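
As a rough illustration of both ideas, the sketch below stores pages under their own hash and then folds the page hashes into a single root that could name a checkpoint. Everything specific here is an assumption for illustration only: the choice of SHA-256, the in-memory dictionary, the names `page_id`, `PageStore`, and `checkpoint_root`, and the reading of "hash root" as a Merkle-style root. None of it is taken from the Data Bank proposal.

```python
import hashlib
from typing import Dict, List


def page_id(page: bytes) -> bytes:
    """Identify a page by the SHA-256 hash of its contents (assumed hash choice)."""
    return hashlib.sha256(page).digest()


class PageStore:
    """Hypothetical in-memory store that indexes pages by their hash."""

    def __init__(self) -> None:
        self._pages: Dict[bytes, bytes] = {}

    def put(self, page: bytes) -> bytes:
        key = page_id(page)
        # Identical pages hash to the same key, so they are stored once.
        self._pages[key] = page
        return key

    def get(self, key: bytes) -> bytes:
        return self._pages[key]

    def __len__(self) -> int:
        return len(self._pages)


def checkpoint_root(ids: List[bytes]) -> bytes:
    """Fold an ordered list of page ids into one Merkle-style root hash."""
    if not ids:
        return hashlib.sha256(b"").digest()
    level = list(ids)  # copy so the caller's list is not modified
    while len(level) > 1:
        if len(level) % 2 == 1:
            # Odd count: pair the last entry with itself.
            level.append(level[-1])
        level = [
            hashlib.sha256(level[i] + level[i + 1]).digest()
            for i in range(0, len(level), 2)
        ]
    return level[0]


# Usage: store three pages, then name the resulting state by a single root.
store = PageStore()
pages = [b"a" * 4096, b"b" * 4096, b"c" * 4096]
ids = [store.put(p) for p in pages]
root = checkpoint_root(ids)
```

The sketch also hints at the cost concern: each stored page carries a 32-byte key and a hash computation over the whole page, whereas a checkpoint needs only one root regardless of how many pages it covers.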