Consciousness, Fractals and Great Literature
You want mathematics in your understanding of writing, you got mathematics. You want hard science? This science is hard. And to paraphrase that pithyest of pithers, S.J. Perelman: "There is nothing like a report (see below) filled with decimal points and guarded generalizations to put a glaze on your eye like a sung vase (long a, please).
I'm of the school that writers write, editors edit, readers read, agents take ten percent of the gross, and reviewers do something, no one is sure exactly what. Literary theoreticians, to continue the cascade, scare the hell out of everyone. I have images of graduate assistants assigning nascent authors underlining the fractals, mono-fractals and multi-fractals in a story. And you thought parsing a sentence was a chore.
Finally, if you understand any of this research, anything at all, even one pseudo-fractal, kindly post your explanation for the rest of us at the end of this story. In the meantime, I need a drink. Perhaps a double. Remember, writers, as Hemingway is alleged to have said, "Write drunk, edit sober." Take your pick before plowing through this research.
As far as many bookworms are concerned, advanced equations and graphs are the last things which would hold their interest, but there's no escape from the math. Physicists from the Institute of Nuclear Physics of the Polish Academy of Sciences (IFJ PAN) in Cracow, Poland, performed a detailed statistical analysis of more than one hundred famous works of world literature, written in several languages and representing various literary genres. The books, tested for revealing correlations in variations of sentence length, proved to be governed by the dynamics of a cascade. This means that the construction of these books is in fact a fractal. In the case of several works their mathematical complexity proved to be exceptional, comparable to the structure of complex mathematical objects considered to be multi-fractal. Interestingly, in the analyzed pool of all the works, one genre turned out to be exceptionally multi-fractal in nature.
Multi-fractals are more highly advanced mathematical structures: fractals of fractals. They arise from fractals 'interwoven' with each other in an appropriate manner and in appropriate proportions. Multi-fractals are not simply the sum of fractals and cannot be divided to return back to their original components, because the way they weave is fractal in nature. The result is that in order to see a structure similar to the original, different portions of a multi-fractal need to expand at different rates. A multi-fractal is therefore non-linear in nature.
"Analyses on multiple scales, carried out using fractals, allow us to neatly grasp information on correlations among data at various levels of complexity of tested systems. As a result, they point to the hierarchical organization of phenomena and structures found in nature. So we can expect natural language, which represents a major evolutionary leap of the natural world, to show such correlations as well. Their existence in literary works, however, had not yet been convincingly documented. Meanwhile, it turned out that when you look at these works from the proper perspective, these correlations appear to be not only common, but in some works they take on a particularly sophisticated mathematical complexity," says Prof. Stanislaw Drozdz (IFJ PAN, Cracow University of Technology).
The study involved 113 literary works written in English, French, German, Italian, Polish, Russian and Spanish by such famous figures as Honore de Balzac, Arthur Conan Doyle, Julio Cortazar, Charles Dickens, Fyodor Dostoevsky, Alexandre Dumas, Umberto Eco, George Elliot, Victor Hugo, James Joyce, Thomas Mann, Marcel Proust, Wladyslaw Reymont, William Shakespeare, Henryk Sienkiewicz, JRR Tolkien, Leo Tolstoy and Virginia Woolf, among others. The selected works were no less than 5,000 sentences long, in order to ensure statistical reliability.
To convert the texts to numerical sequences, sentence length was measured by the number of words (an alternative method of counting characters in the sentence turned out to have no major impact on the conclusions). The dependences were then searched for in the data -- beginning with the simplest, i.e. linear. This is the posited question: if a sentence of a given length is x times longer than the sentences of different lengths, is the same aspect ratio preserved when looking at sentences respectively longer or shorter?
"All of the examined works showed self-similarity in terms of organization of the lengths of sentences. Some were more expressive -- here The Ambassadors by Henry James stood out -- while others to far less of an extreme, as in the case of the French seventeenth-century romance Artamene ou le Grand Cyrus. However, correlations were evident, and therefore these texts were the construction of a fractal," comments Dr. Pawel Oswiecimka (IFJ PAN), who also noted that fractality of a literary text will in practice never be as perfect as in the world of mathematics. It is possible to magnify mathematical fractals up to infinity, while the number of sentences in each book is finite, and at a certain stage of scaling there will always be a cut-off in the form of the end of the dataset.
Things took a particularly interesting turn when physicists from the IFJ PAN began tracking non-linear dependence, which in most of the studied works was present to a slight or moderate degree. However, more than a dozen works revealed a very clear multifractal structure, and almost all of these proved to be representative of one genre, that of stream of consciousness. The only exception was the Bible, specifically the Old Testament, which has so far never been associated with this literary genre.
"The absolute record in terms of multifractality turned out to be Finnegan's Wake by James Joyce. The results of our analysis of this text are virtually indistinguishable from ideal, purely mathematical multifractals," says Prof. Drozdz.
The most multi-fractal works also included A Heartbreaking Work of Staggering Genius by Dave Eggers, Rayuela by Julio Cortazar, The US Trilogy by John Dos Passos, The Waves by Virginia Woolf, 2666 by Roberto Bolano, and Joyce's Ulysses. At the same time a lot of works usually regarded as stream of consciousness turned out to show little correlation to multi-fractality, as it was hardly noticeable in books such as Atlas Shrugged by Ayn Rand and A la recherche du temps perdu by Marcel Proust.
"It is not entirely clear whether stream of consciousness writing actually reveals the deeper qualities of our consciousness, or rather the imagination of the writers. It is hardly surprising that ascribing a work to a particular genre is, for whatever reason, sometimes subjective. We see, moreover, the possibility of an interesting application of our methodology: it may someday help in a more objective assignment of books to one genre or another," notes Prof. Drozdz.
Multifractal analyses of literary texts carried out by the IFJ PAN have been published in Information Sciences, a journal of computer science. The publication has undergone rigorous verification: given the interdisciplinary nature of the subject, editors immediately appointed up to six reviewers.
Story Source: Materials provided by The Henryk Niewodniczanski Institute of Nuclear Physics Polish Academy of Sciences. Stanisław Drożdż, Paweł Oświȩcimka, Andrzej Kulig, Jarosław Kwapień, Katarzyna Bazarnik, Iwona Grabska-Gradzińska, Jan Rybicki, Marek Stanuszek. Quantifying origin and character of long-range correlations in narrative texts. Information Sciences, 2016.
I'm of the school that writers write, editors edit, readers read, agents take ten percent of the gross, and reviewers do something, no one is sure exactly what. Literary theoreticians, to continue the cascade, scare the hell out of everyone. I have images of graduate assistants assigning nascent authors underlining the fractals, mono-fractals and multi-fractals in a story. And you thought parsing a sentence was a chore.
Finally, if you understand any of this research, anything at all, even one pseudo-fractal, kindly post your explanation for the rest of us at the end of this story. In the meantime, I need a drink. Perhaps a double. Remember, writers, as Hemingway is alleged to have said, "Write drunk, edit sober." Take your pick before plowing through this research.
* * * * *
The world's greatest literature reveals
multi-fractals and cascades of consciousness
As far as many bookworms are concerned, advanced
equations and graphs are the last things which would hold
their interest, but there's no escape from the math.
As far as many bookworms are concerned, advanced
equations and graphs are the last things which would hold
their interest, but there's no escape from the math.
Credit: Source: IFJ PAN
Sequences of sentence lengths (as measured by number of words) in four
literary works representative of various degree of cascading character.
|
James Joyce, Julio Cortazar, Marcel Proust, Henryk Sienkiewicz and Umberto Eco. Regardless of the language they were working in, some of the world's greatest writers appear to be, in some respects, constructing fractals*. Statistical analysis carried out at the Institute of Nuclear Physics of the Polish Academy of Sciences, however, revealed something even more intriguing. The composition of works from within a particular genre was characterized by the exceptional dynamics of a cascading or avalanche narrative structure. This type of narrative turns out to be multi-fractal. That is, fractals of fractals are created.
*A fractal is a never-ending pattern. Fractals are infinitely complex patterns that are self-similar across different scales. They are created by repeating a simple process over and over in an ongoing feedback loop. Fractals are images of dynamic systems – the pictures of Chaos. Geometrically, they exist in between our familiar dimensions. Fractal patterns are extremely familiar, since nature is full of fractals. For instance: trees, rivers, coastlines, mountains, clouds, seashells, hurricanes, etc. Abstract fractals can be generated by a computer calculating a simple equation over and over.
A Sierpinski square-based pyramid and its 'inverse'. |
Fractals are self-similar mathematical objects: when we begin to expand one fragment or another, what eventually emerges is a structure that resembles the original object. Typical fractals, especially those widely known as the Sierpinski triangle and the Mandelbrot set, are mono-fractals, meaning that the pace of enlargement in any place of a fractal is the same, linear: if they at some point were rescaled x number of times to reveal a structure similar to the original, the same increase in another place would also reveal a similar structure.
Multi-fractals are more highly advanced mathematical structures: fractals of fractals. They arise from fractals 'interwoven' with each other in an appropriate manner and in appropriate proportions. Multi-fractals are not simply the sum of fractals and cannot be divided to return back to their original components, because the way they weave is fractal in nature. The result is that in order to see a structure similar to the original, different portions of a multi-fractal need to expand at different rates. A multi-fractal is therefore non-linear in nature.
"Analyses on multiple scales, carried out using fractals, allow us to neatly grasp information on correlations among data at various levels of complexity of tested systems. As a result, they point to the hierarchical organization of phenomena and structures found in nature. So we can expect natural language, which represents a major evolutionary leap of the natural world, to show such correlations as well. Their existence in literary works, however, had not yet been convincingly documented. Meanwhile, it turned out that when you look at these works from the proper perspective, these correlations appear to be not only common, but in some works they take on a particularly sophisticated mathematical complexity," says Prof. Stanislaw Drozdz (IFJ PAN, Cracow University of Technology).
The study involved 113 literary works written in English, French, German, Italian, Polish, Russian and Spanish by such famous figures as Honore de Balzac, Arthur Conan Doyle, Julio Cortazar, Charles Dickens, Fyodor Dostoevsky, Alexandre Dumas, Umberto Eco, George Elliot, Victor Hugo, James Joyce, Thomas Mann, Marcel Proust, Wladyslaw Reymont, William Shakespeare, Henryk Sienkiewicz, JRR Tolkien, Leo Tolstoy and Virginia Woolf, among others. The selected works were no less than 5,000 sentences long, in order to ensure statistical reliability.
To convert the texts to numerical sequences, sentence length was measured by the number of words (an alternative method of counting characters in the sentence turned out to have no major impact on the conclusions). The dependences were then searched for in the data -- beginning with the simplest, i.e. linear. This is the posited question: if a sentence of a given length is x times longer than the sentences of different lengths, is the same aspect ratio preserved when looking at sentences respectively longer or shorter?
"All of the examined works showed self-similarity in terms of organization of the lengths of sentences. Some were more expressive -- here The Ambassadors by Henry James stood out -- while others to far less of an extreme, as in the case of the French seventeenth-century romance Artamene ou le Grand Cyrus. However, correlations were evident, and therefore these texts were the construction of a fractal," comments Dr. Pawel Oswiecimka (IFJ PAN), who also noted that fractality of a literary text will in practice never be as perfect as in the world of mathematics. It is possible to magnify mathematical fractals up to infinity, while the number of sentences in each book is finite, and at a certain stage of scaling there will always be a cut-off in the form of the end of the dataset.
Things took a particularly interesting turn when physicists from the IFJ PAN began tracking non-linear dependence, which in most of the studied works was present to a slight or moderate degree. However, more than a dozen works revealed a very clear multifractal structure, and almost all of these proved to be representative of one genre, that of stream of consciousness. The only exception was the Bible, specifically the Old Testament, which has so far never been associated with this literary genre.
"The absolute record in terms of multifractality turned out to be Finnegan's Wake by James Joyce. The results of our analysis of this text are virtually indistinguishable from ideal, purely mathematical multifractals," says Prof. Drozdz.
The most multi-fractal works also included A Heartbreaking Work of Staggering Genius by Dave Eggers, Rayuela by Julio Cortazar, The US Trilogy by John Dos Passos, The Waves by Virginia Woolf, 2666 by Roberto Bolano, and Joyce's Ulysses. At the same time a lot of works usually regarded as stream of consciousness turned out to show little correlation to multi-fractality, as it was hardly noticeable in books such as Atlas Shrugged by Ayn Rand and A la recherche du temps perdu by Marcel Proust.
"It is not entirely clear whether stream of consciousness writing actually reveals the deeper qualities of our consciousness, or rather the imagination of the writers. It is hardly surprising that ascribing a work to a particular genre is, for whatever reason, sometimes subjective. We see, moreover, the possibility of an interesting application of our methodology: it may someday help in a more objective assignment of books to one genre or another," notes Prof. Drozdz.
Multifractal analyses of literary texts carried out by the IFJ PAN have been published in Information Sciences, a journal of computer science. The publication has undergone rigorous verification: given the interdisciplinary nature of the subject, editors immediately appointed up to six reviewers.
Story Source: Materials provided by The Henryk Niewodniczanski Institute of Nuclear Physics Polish Academy of Sciences. Stanisław Drożdż, Paweł Oświȩcimka, Andrzej Kulig, Jarosław Kwapień, Katarzyna Bazarnik, Iwona Grabska-Gradzińska, Jan Rybicki, Marek Stanuszek. Quantifying origin and character of long-range correlations in narrative texts. Information Sciences, 2016.
Comments
Post a Comment