Week 5: Tree of Codes

Tree of Codes by Jonathan Safran Foer is, in many respects, an artist’s book that experiments with the material possibilities of print. It was published by Visual Editions in 2010, using the die-cutting technique to create a book that is sculptural and complex in nature. Each page cuts out various lines of text, leaving physical spaces in the paper through which the reader can see onto the next page, which then contains its own unique formation of text (see Figure 1). It reads like an experimental work since it very much occupies the grey area between poetry and prose. In actuality the text is from a book of short stories called Street of Crocodiles by Polish author Bruno Schulz. Foer then takes a pre-existing text and attempts to create a new piece by experimenting with the unique capabilities of the printed book and its encompassing technologies.

5264794042_0c33b206a9 Figure 1.

The practicalities of attempting to represent this text in code seem beyond the scope of the required skills of the course, but I still wanted to blog about it because of the interesting thought experiment it poses. The main issue at the heart of rendering such a text in code would be around representing the space or negative space within a printed text, versus the screen.

The problem then is that when you attempt to go down the path of encoding, the impulse is to focus on representing the content how it falls on the page using our traditional sense of what a page is. However the process of representing the text and space on any given page is not terribly different from marking up for regular prose, and the end result could easily be an accurate representation of the text but with its variety represented only in one dimensional space. The problem that then arises is that it leaves out the key structural feature of the book: the three dimensional space that is  created by the depth of the cutouts in the book.

The caveat of this book is really that it is an artist’s book, a book that is self-reflexive in nature and very much in conversation with the unique capabilities of print. Even its title reflects its dependency on paper. Rendering it in code would change the project completely. Trying to really stay true to representing the physical book in a digital format might then be more effective if it were to be simply photographed, and since I don’t have an e-reader, I wasn’t able to get a sense of how digital versions cope with this.

On the other hand, the title is also in conversation with our understanding of language as “code,” and one could imagine the possibilities of the new values it could take on through the process of markup. Attempting to represent this text in code would be engaging with much of the same processes that Foer engaged with when he decided to take a pre-existing text and make it something new by drastically altering its physical structure. Coding this text seems like natural next step in the process of experimenting with the reader’s perception of a single text in various artistic formats. My group has discussed using concrete poetry as being a challenge to code, which is of course, because of the challenge of representing the way poetry plays with word placement and space. Tree of Codes looks at the same problem in multiple dimensions and would be doubly challenging, and to consider how to cope with that would be an artistic undertaking in and of itself. For that reason we won’t be pursuing it, but I thought it presented an interesting conundrum for considering something like XML.

 

Bibliography

Foer, Jonathan Safran. Tree of Codes. Belgium: Visual Editions, 2011. Print.

Encoding Escapades with Robert Bringhurst…

As Holly mentioned, she and I have chosen to work on a poem by the poet Robert Bringhurst for our encoding challenge. We have selected a poem from Bringhurst’s Selected Poems, published in 2009 by Gaspereau Press.

bringhurst3

In an epigraph to the collection, Bringhurst notes that, “when conditions are right, it is good for poems to be spoken aloud. I mean that the poems themselves may benefit, not just the creatures who speak them and hear them. But in some of the poems in this book, two or three voices are speaking at once, contradicting or enlarging or refining one another as they go. Here, the overlapping voices are printed in different colors. Poems in which this occurs can be read in silence by one person alone or spoken aloud with one or two friends, if conditions are right. Which, in the presence of one or two friends, they just might be” (5). We felt that focusing on one of these poems with multiple voices was a great opportunity to explore what TEI allows us to capture of the experience of interacting with poetry. To begin, we looked at New World Suite No. 3: Four Movements for Three Voices:

bringhurst

If you’ll excuse the blurry photo…you can see perhaps that the three colours (black, blue, and red) represent the voice of the viola, the violin, and the cello, respectively. Bringhurst intends for the lines to be spoken simultaneously, at times, or in sequence. In some cases, the voices will finish each other’s sentences, or contradict one another. This sort of variety of reading experience immediately made me recall Prof. Galey’s example of his own work as part of his Visualizing Variation project, specifically his Animated Variants prototype (http://individual.utoronto.ca/alangaley/visualizingvariation/animated.html). Prof. Galey describes the project: “it was thought that digital editions could represent variants dynamically, presenting their ambiguity to readers not as a problem to solve, but as a field of interpretive possibility. Very few digital editions, however, have realized this possibility in their interfaces. One thing a digital visualization can do is make a virtue of ambiguity in ways that print cannot, combining the elements of time and motion to represent variants in ways that challenge the idea that texts are fixed and immutable.” This made me wonder if it would be possible to translate Bringhurst’s vision for a multi-vocal, performatory reading experience just as ‘dynamically’ as it is on the page, yet through XML. Exactly as Prof. Galey points out the potential for digital visualization, Bringhurst seems to be looking for the fluidity, the mutability, and the unfixed nature of this poetic interaction.

Upon considering the reflection further, Holly and I looked to another example of Bringhurst’s polyphonic poetry, The Blue Roofs of Japan: Duet for interpenetrating voices:

bringhurst2

In an introduction to this poem, Bringhurst notes that, “The Blue Roofs of Japan is a poem for two voices – in principle little different from a sonata for cello and piano, except that here the instruments speak; they don’t quite sing. The full text of the poem is printed on both the righthand and lefthand pages of the book, but since the two voices frequently overlap, the two parts are not always legible on any one page. The lefthand pages give prominence to one voice, the righthand pages to the other. Facing pages should be read not in sequence but together. Reading the poem aloud requires two people…one reader, in any case, reads the black ink on the lefthand page while the other reads the black ink on the right. Under his or her own lines, each reader can see the other’s voice in blue. Enough blue ink is visible on every page to allow both readers to keep pace with one another. There isn’t, and in my view musn’t be, a metronome. The only thing the readers have to pace themselves against is each other” (175). It is exactly this sentiment that we hope to capture in our encoding. A sense of flow, overlap, and cohesion. As Galey points out, “animated variants also drive home the simple yet unsettling point that textual transmission is more often a matter of change than fixity: texts sometimes change even when readers aren’t looking.” Bringhurst opens up this possibility to his readers. While we’re unsure whether or not we’ll be able to execute our vision, we love the idea of applying something like CSS transitions (see: http://individual.utoronto.ca/alangaley/visualizingvariation/samples/animatedVariants/animatedVariants_v02_sonnet129.xml) to give the poem movement, colour, and ‘voice’, just as Bringhurst intended.

Bibliography:

Bringhurst, Robert. (2009). Selected poems. Canada: Gaspereau Press.

Galey, Alan. Visualizing variation. http://individual.utoronto.ca/alangaley/visualizingvariation/index.html#using.

Travel Guides: An Encoding Challenge

Stephanie and I decided on Karl Baedeker’s “Southern France Including Corsica: Handbook for Travellers” (1907) for the encoding challenge. Stephanie had previously spent some time in Nice and we are both fascinated by the format and content of the guide.IMG_0520

Here’s a little history: Beadeker’s guides set the standard for authoritative travel guides and introduced information on routes, accommodations and travel, an innovation at the time. Beadeker created guides that focused on the traveler and reader. The guides featured information regarding transportation, restaurants, tipping, sights, and prices for a variety of things, including taxi fares. In addition, the guides include “pull out” maps that are referred to throughout the tex. These guides were designed to provide maximum coverage of the destination so that the traveler would not be required to look for information outside of the guide.

IMG_0526Since these guides contain a boat load of information, they are very dense and detailed (see the image below). Some challenges we will face for encoding will arise when it comes to decisions on format and content. The guides were formatted to fit the most content in the smallest amount of space. This resulted in large paragraphs containing content that is better presented in a list format. Our challenge will be to determine if we want to focus on presenting a true encoding of the material, in both content and format, or if we want to focus our attention on presenting the content in a different format, like including lists.

image1

Another challenge we will face is encoding the references to the maps included in the guides or the references to other pages in the book. It will be interesting to play around with XML tagging to find a way to adequately represent these cross references.

image4Both Stephanie and I are super excited to take on this challenge and can’t wait to see how it turns out!

 

The encoding challenge that might have been: Lyric Philosophy

Marlena and I are collaborating on the encoding challenge. We are both interested in Canadian ecological poetry, which led us to consider the work of Robert Bringhurst and Jan Zwicky as options for our encoding challenge.

In what follows I reflect on Jan Zwicky’s Lyric Philosophy, the revised second edition, published in 2011 by Gaspereau Press.

This text fascinates me for many reasons, one of which being the complexity of its form, and the relationship in the text between form and function. Non-linear in nature, Zwicky takes a polyphonic approach to her reflections on time and resonant meaning. On each recto page is a passage by Zwicky, while on each verso are the thoughts of another thinker. While not directly commenting on the other thinker’s work, Zwicky’s placement of the passages allow them to exist alongside, and speak to, one another. In the introduction, she writes: “The relation of the two texts to one another is somewhere between counterpoint and harmony, somewhere between a double helix and the allemande of the earth and moon.” It gestures towards what David Gelernter describes as deep beauty: “An idealized integration of form and function” (Kirschenbaum, 2004). The text is shaped by the physicality and spatiality of pages and signatures. The negative space on each page also adds to the meaning of the text by both inviting the reader’s hands into the margins and suggesting where to pause.

The form of the book challenges understandings of interface. Kirschenbaum (2004) writes: “… the interface is also where representation and its attendant ideologies are most conspicuous to our critical eyes. Ostensibly wedded to the ideal of user-friendliness, the interface is also where we deploy our most creative features and imaginative functions.” The structure of Zwicky’s work is complex: the relationship between form and meaning is not an obvious one, and is certainly not designed to be user-friendly. The form invites the reader to work to make meaning, and much effort is required on the reader’s part to decode the text. The structure of Lyric Philosophy challenges the notion of user-friendliness. It is this challenge which makes Zwicky’s work an intriguing candidate for translation into XML. In the end, we chose a poem by Robert Bringhurst to encode. Lyric Philosophy will have to wait for another time…

Encoding Meets Overlapping

My terrible photo below (low light + bad phone) shows two pages of Canadian poet Lisa Robertson‘s long poem, “Debbie: An Epic.”

IMG_20160211_201329

To start, let’s just consider the left page. There are a couple of interesting features to consider if encoding this text.

Most obviously, there is the overlapping content in the top right corner. (This is repeated on other pages – but it is specific to this poem.) What exactly would you tag the non-traditional text? Where would it be sequenced in relation to the standard column of poetry? Before? After? What about the issue of legibility? As discussed in class, the detail of classification of the tags – as well as the amount of effort one might put into specifying a particular feature – might depend on the audience. Another key consideration would be how much time a scholar has to encode a text. If this were part of a well-funded project that expects the most detailed possible encoding work, than more specificity and consideration could be applied to this feature.

Other notable features are the line numbers (which are specific to the poem, not the anthology) and the dagger footnote. For the line numbers, how would you make it clear that these are an integral part of the poem rather than an editor’s addition? For the footnote, is the shape important? Indeed, these are the kind of questions that make applying TEI as tricky as the text at hand.

Bibliography:

Robertson, Lisa. “Debbie: An Epic.” In The New Long Poem Anthology, edited by Sharon Thesen, 347-371. Vancouver: Talonbooks, 2001.

Week 5: Encoding Roads Not Taken

Our group chose to tackle House of Leaves by Mark Z. Danielewski for the encoding challenge. House of Leaves is a deeply complex novel, both in content and form. It is a description of a documentary film, written as a fragmented manuscript by a blind man named Zampano, which is compiled by a young man named Johnny Truant, whose own story unfolds in a series of footnotes, and the whole is commented on by nameless Editors. The documentary is about a bizarre house inhabited by a fictional Pulitzer-prize winning photographer, Will Navidson, and his family. After moving in, the family discovers new spaces appearing inside the house. The interior appears to be larger than the exterior. Passages lead into endless dark spaces full of branching ways, an enormous spiral staircase, and an ominous growl. Navidson and his brother take photographs and video of the phenomenon, eventually taking a documentary film team to explore the recesses of the house.

I always think about this book when considering form and format. It was the first book I read that really played with the link between form and content. Some of my favourite pages in the book arrange the text to reflect Navidson’s physical journey through the house while describing it.

Version 2 IMG_1102

However, those passages have too much form and too little content for this assignment.

There are several pages we considered for the encoding project, each having their own challenges. The pages below were an option that we didn’t go with.

IMG_1106

Page 336 begins in the middle of a footnote to the main text, where Zampano is quoting another scholar’s interpretation of the documentary. Most of the footnote is struck out, and the main text itself on the next page has been damaged and burned, which is indicated by square brackets enclosing blank space. The destroyed text compares Navidson’s house to Minos’ mythological labyrinth, infamous home of the Minotaur. Page 337, meanwhile, describes the deaths of some members of Navidson’s documentary team, followed in the footnotes by Zampano’s scholarship and Johnny Truant’s reflections. The content of these two pages are in fact a complex web of referentiality, while the form is an equally complex mix of layout and design elements.

With any part of this book, the content is mostly nonlinear, which is a real challenge for encoding. While I have been treating Zampano’s manuscript as the main text in this post, this is really only my interpretation. Johnny Truant’s spiral into paranoia and madness could be equally or more important to some readers. Essentially, all parts of the book are equally important in terms of meaning, and so translating this into hierarchical XML requires implementing a specific reading of the text. Our group is currently figuring out an interpretation of our chosen passage, which is not as visually complex as this one, but is quite complex in its content.

Encoding Challenge Example

House of Leaves is a novel that involves several layers of character narrative, a documentary film manuscript and editorial notes all mixed together to make a unique story. Though I have not read the book, from what I have seen of the novel is further complicated by an unusual presentation, orientation and purpose of the different types on the paper. It had definitely intrigued me and I look forward to reading it in the future.

Our group has chosen to do a page from the novel that involves markup regarding the multiple character stories and comments on footnotes. For this post I have chosen two pages from within the book that present an interesting challenge to markup in terms of the orientation and break up of print. Though it is the same speaker it appears to be a quote and a reflection of the quote.

Questions about it’s markup included whether the quotes and comments on the quotes should be separated or represented separately, as the comment breaks up the flow of reading the quote. Another question about markup is whether or not to markup both pages together as the quote transcends the page. Furthermore there was the question of the sentence breaks- though they go with the theme of the quote, they break mid-sentence and don’t appear to be important to the content.

Kali

IMG_0885

Week 4: Colonial Despatches

Having a deep passion for history, my attention was almost immediately drawn to the University of Victoria’s “Colonial Despatches” project presented in the TEI project list. The project is a digital archive containing “transcriptions of virtually the complete correspondences between the British colonial authorities and the successive governors of the nascent Vancouver Island and British Columbia colonies”, among other historically valuable documents. These artifacts provide the history of Vancouver Island and British Columbia from 1846 to 1871 from the perspective of the individuals that were closely tied to the governance and development of the land, its resources and its population.

The original project was created in the 1980s using files encoded in Waterloo Script ( a text-encoding language processed using SCRIPT). These files have now been converted into XML, and the University of Victoria has built a web application to make these files readable and searchable. The files were converted into XML based on TEI P5 Guidelines.

The project provides a detailed guideline that outlines the mark-up scheme used for each record. This document demonstrates how the TEI guidelines were used in the creation of the XML files. The project guide also includes details about the tags used within the markup, offering an explanation for the purpose of the tag and examples on how to use them.

The XML code for all documents in the project are available to the public, making this a very useful example for our own encoding challenge if you are working with handwritten or annotated works.

Bibliography:

Colonial Despatches: The Colonial Despatches of Vancouver Island and British Columbia 1846-1871. University of Victoria (n.d.). Retrieved from: http://bcgenesis.uvic.ca/about.htm

TEI in the wild

For this week’s task, identifying TEI in the wild, I selected the New York Public Library’s Digital Schomburg: African American Woman Writers of the 19th Century. This project makes its use of TEI explicit, detailing its use of machine-readable form under a link called “Technical Notes.” According to the site, Digital Schomburg uses Standard General Markup Language (SGML), according to the Text Encoding Initiative Lite Document Type Definition.

On the home page, beneath the technical notes link is a link titled “Editorial Methods.” This includes a comparison of the print and electronic forms of the text, which claims “Not a single character of text has been deleted or excluded from the source documents in their conversion to the Digital Schomburg Edition” (http://digital.nypl.org/schomburg/writers_aa19/technotes.html). It describes its use of TEI as “literary,” and explains the difference between a TEI header, encoding description, revision history description, and text profile description.

While the project clearly describes its method, the editors do not go so far as to describe challenges encountered during their work. They stick to a descriptive approach to their use of XML. Having scoured the project’s website, I found no trace of accessible code. While the editors are willing to describe their process, they protect it by refusing to share it with visitors to the site. This closes down the possibilities for scholarly engagement with the text, assuming users’ interest in the project’s form is fairly superficial.

The link to the Digital Schomburg project can be found here: http://digital.nypl.org/schomburg/writers_aa19/technotes.html

TEI in the wild – Vincent van Gogh’s letters

‘Vincent van Gogh – The Letters’ is a “scholarly edition of all extant letters (902) sent by or to Vincent van Gogh (1853-1890)” (TEI, 2010). The TEI project page notes that, “Van Gogh’s correspondence is a unique resource for the insight it provides into both his artistic practice and his personal life. The full digital edition is available online; a reading edition is available as a six volume book edition. While intended for a scholarly audience, the edition is expected to serve the interest of a much wider public” (TEI, 2010).

The project, which is hosted by the Huygens Instituut, in collaboration with the Van Gogh Museum Amsterdam, provides for each letter, “a zoomable facsimile, a transcription of the original text (mostly Dutch or French), a new translation into English, and extensive annotation. More than 2000 illustrations are given of the works of art that Van Gogh mentions in his letters. Introductory essays discuss Van Gogh, his letters and his circle. Other material includes a timeline, maps, indices and a bibliography” (TEI, 2010).

Interestingly, the TEI project page also notes that, “the project started before the era of the web, and it was only later decided the web would be its main publication platform. The letters and annotations were created in a word-processing program and later converted semi-automatically into TEI (something that you probably want to avoid doing)” (TEI, 2010). It also describes how the project, “created one TEI (P5) document per letter, holding header information (we introduced some new header elements), facsimile information, transcription, translation and annotation. Other TEI documents hold secondary texts such as the essays and bibliography” (TEI, 2010).

The website itself describes the use of XML generally – though it takes a bit of searching to find the link (under ‘About this edition’ – ‘The web edition’). It notes that: “This edition, like many modern digital editions, is based on XML (eXtensible Markup Language) documents. XML is a standard for the creation of documents in which the document text is interspersed with ‘tags’, brief labels that describe the nature and properties of the text fragments that they surround. The Text Encoding Initiative (TEI) has proposed guidelines for the names and types of the tags to be employed in humanities texts. Out of the 400+ existing tags, a so-called ‘schema’ can be created that contains exactly those tags that are applicable to a certain type of document (such as a letter that is prepared for a scholarly edition). New tags can be defined when the existing tagset is insufficient. The schema describes the required and permitted tags in a class of XML documents. It can be used to check the correctness of these documents. A dedicated schema was created for the Van Gogh edition. A number of non-standard tags were used, some of which were ‘borrowed’ from the DALF (Digital Archive of Letters in Flanders) project” (Jansen, Luijten, and Bakker (eds.), 2009).

It then goes on to explain the specific use of XML for the project: “One XML document was created for each of Van Gogh’s letters and each related document. It holds letter-level metadata (title, number, date, correspondents, etc.), the full transcription, the translation, the notes, the textual notes, and the information that connects transcribed pages with images of those pages (facsimile elements). The XML files were created in an automatic conversion from word-processor documents. The conversion result was checked and extensively corrected. The XML files were manually indexed to facilitate searching and cross-referencing” (Jansen, Luijten, and Bakker (eds.), 2009).

I find it incredible to think that this XML was converted from word documents and not born digital!

The page even mentions that, “for those interested in technical matters, we provide somesample XML files. In the zip file we also include the so-called ‘ODD’ file which is used to customise the TEI schema and the schema files generated from the ODD-file.  We use W3C schema rather than Relax NG because the contractors who performed the conversion to XML were more familiar with that format” (Jansen, Luijten, and Bakker (eds.), 2009).

Here is an example of one of the XML files opened in word format:

screenshot

The site also describes in detail the software tools that support the project. Overall, I am impressed with the degree of detail provided by the project, and their transparency with regards to the technical process.

Bibliography:

Text Encoding Initiative. (2010). Vincent van Gogh – The letters. http://www.tei-c.org/Activities/Projects/vi02.xml.

Jansen, Leo, Luitjen, Hans, and Bakker, Nienke (eds.) (2009). Vincent van Gogh – The Letters. Amsterdam & The Hague: Van Gogh Museum & Huygens ING. http://vangoghletters.org.