Is there text that cannot be analyzed?
The preceding discussion has discussed general considerations about how text can be analyzed and turned into a database. Indeed most text can be analyzed using the techniques described.
But is it possible that there is text that defies conventional analysis? The answer is absolutely yes.
Consider the writing of two world famous writers, William Faulkner and Ernest Hemingway. Both writers are famous and successful.
However from a writing style the two writers could not be more disparate. Faulkner had a style where his sentences were gargantuan. For example, there is a long paragraph that Faulkner wrote that was one sentence and can be very difficult to understand. Hemingway on the other hand wrote in a clear, simple style. There is very little doubt about what Hemingway was saying.
Trying to use textual ETL on Faulkner-style writing probably would produce very little value. It is inevitable that faced with a sentence from Faulkner there...