Wednesday, June 27, 2012

Doubles are sufficient for all data relations

I was interested a while ago in RDF, which stores information in triples, like "the sky" -> "has the color" -> "blue". I think there's too much structure in the triples structure. This can all be reduced to doubles, or just a simple directed graph, through judicious use of blank nodes. So the example becomes "the sky" -> blank node that could be interpreted as having-a-property, that blank node -> "has the color", that blank node -> "blue". (If you want you could add "has the color" -> "relation" and "blue" -> "property" or similar.)

This is of course not necessarily the most efficient way to store information, or the easiest to work with, but it reduces the structure so that all the primitives are of the same type, and I don't think that the structure can be simplified any further, so in that sense it represents a kind of absolute in data structuring.

I don't know what this kind of data structure is called by others or if there is any existing work around it. I would like to know!

No comments: