I think it is cool - too look at individual letters as shapes - relative to other shapes and building SW to interpret these shapes and assemble patters - much the same way as one would communicate using pictures - for example, one might communicate that one is satisfied by presenting an image of a person smiling with a full stomach in a temperate setting. Of course, images of shapes is less complex, but the logic would be similar and allow a computer program to more accurately interpret text. I wish this board supported mathematical symbols so one could express how this might work - but for example, let;s say we took a reference sample of text [say 10,000 business letters] and we first determined the Incidence of Coincidence or IC. One could determine the IC of Observed [for each and in aggregate] and based upon this, further determine the IC for Random - based upon the frequency of shapes present in vowel/consonant and consonant/vowel combinations, or Isomorphs and Idiomorphs. Then drawing a table into which these values would be written into both vowel and consonant lines, where the relative and absolute frequency amongst shapes could be measured and expressed as values - each value, IC, IM/IS and VC/CV would be added to weight a value as compared to the absolute values in the sample texts. This could be used to perhaps more accurately interpret and predict the values of shapes - much the same way we as people can still extract the intended contextual meaning when someone speaks improperly.
At a minimum, it makes for an interesting puzzle and means to examine different written and spoken languages. Curiously, this could also be applied to programming languages - perhaps making syntax a simple process of diarization.
|