The audiobook and the reading by an artificial intelligence: intermedial frontiers
DOI:
https://doi.org/10.18617/liinc.v19i1.6295
Keywords:
Audiobook, E-book, Artificial intelligence, Intermediality, Re-mediation
Abstract
Historically, the audiobook has been a medium that depends on recording a text read aloud so that the reading can be played back. But what happens when this “reading” is performed by Artificial Intelligence in real time, for example by a virtual assistant such as Alexa? Can this mediation be regarded as an audiobook? Since this question emerges from the new relations among digital media, this article aims to define what kind of phenomenon it is. To support the analysis, we draw on Intermediality Studies, based on the models proposed by Lars Elleström (2021) and on other authors who help us fill specific gaps in the theory. As an example, we use the e-book and the audiobook of The Alchemist, by Paulo Coelho, to compare the reading done by a human being with the “reading” done by Artificial Intelligence. We conclude that the “reading” carried out by Alexa is in fact an "oralization" (BAJARD, 2014), that is, a process of decoding words into audio. Thus, whereas an audiobook read by a human is the result of a transmediation, Alexa’s decoding is a “re-mediation”: a re-exhibition of the material, spatiotemporal, sensorial, and potentially semiotic modalities of an e-book by means of a different technical medium of display.
References
ADAMOPOULOU, E.; MOUSSIADES, L. Chatbots: History, technology, and applications. In: Machine Learning with Applications. Vol. 2. 2020.
BAJARD, Élie. Ler e dizer: compreensão e comunicação do texto escrito. São Paulo, SP: Cortez, 2014.
BOLTER, Jay David; GRUSIN, Richard. Remediation: Understanding New Media. Cambridge (MA): MIT Press, 2000.
BROCH, José Carlos. O conceito de affordance como estratégia generativa no design de produtos orientado para a versatilidade. [Online] Dissertation (Master's in Design and Technology). Porto Alegre, RS: Universidade Federal do Rio Grande do Sul. [Accessed 22 January 2022]. Available at: https://www.lume.ufrgs.br/bitstream/handle/10183/25510/000752864.pdf
BRUHN, Jørgen; SCHIRRMACHER, Beate. Intermedial studies. In: BRUHN, Jørgen; SCHIRRMACHER, Beate. Intermedial studies: an introduction to meaning across media. New York, NY: Routledge, 2022.
CHARMEUX, Eveline. Apprendre à lire: échec à l’échec. Paris: Milan, 1987.
CHION, Michel. The three listening modes. In: STERNE, Jonathan. The Sound Studies Reader. New York, NY: Routledge, 2012.
COELHO, Paulo. O Alquimista. Narrator: Beth Goulart. São Paulo, SP: Paralela, 2021. Audiobook.
COELHO, Paulo. O Alquimista. São Paulo, SP: Paralela, 2017. E-book.
ELLESTRÖM, Lars. As modalidades das mídias II: um modelo expandido para compreender as relações intermidiais. Translated by Beatriz Alves Cerveira, Júlia de Oliveira Rodrigues and Juliana de Oliveira Schaidhauer. Porto Alegre: EDIPUCRS, 2021.
ELLESTRÖM, Lars. The Modalities of Media: A Model for Understanding Intermedial Relations. In: ELLESTRÖM, Lars (ed.). Media Borders, Multimodality and Intermediality. Basingstoke, England: Palgrave Macmillan, 2010. p. 11-48.
GOLD, Ben; MORGAN, Nelson; ELLIS, Dan. Speech and audio signal processing: processing and perception of speech and music. New Jersey, NJ: Wiley, 2011.
HUANG, Xuedong; ACERO, Alex; HON, Hsiao-Wuen. Spoken Language Processing: A Guide to Theory, Algorithm and System Development. New Jersey, NJ: Prentice Hall, 2001.
LECUN, Y.; BENGIO, Y.; HINTON, G. (2015). Deep learning. In: Nature, 521(7553), p. 436-444.
MITCHELL, T. M. (1997). Machine learning. Burr Ridge, IL: McGraw Hill, 45(37), p. 870-877.
MITCHELL, William John Thomas. Picture Theory: Essays on Verbal and Visual Representation. Chicago: University of Chicago Press, 1994.
RABINER, L. R.; SCHAFER, R. W. Introduction to digital speech processing. New Jersey, NJ: Prentice Hall, 2010.
REZENDE, Solange Oliveira. Sistemas Inteligentes – Fundamentos e Aplicações. São Paulo: Manole, 2003.
RUSSELL, Stuart J.; NORVIG, Peter. Inteligência Artificial. Rio de Janeiro: Elsevier, 2004.
JENSEN, Signe Kjær; SALMOSE, Niklas. Media and modalities – Film. In: BRUHN, Jørgen; SCHIRRMACHER, Beate (Eds.). Intermedial studies: an introduction to meaning across media. 1st ed. New York: Routledge, 2022. p. 28-41.
SANTAELLA, Lucia. Comunicação Ubíqua: repercussões na cultura e na educação. São Paulo: Paulus, 2013.
SANTAELLA, Lucia. Matrizes da linguagem do pensamento: sonora visual verbal: aplicações na hipermídia. 3rd ed. São Paulo: Iluminuras, 2005.
SANTAELLA, Lucia. Neo-humano: a sétima revolução do Sapiens. São Paulo: Paulus, 2022. Kindle edition.
SONNENSCHEIN, David. Sound design: the expressive power of music, voice, and sound effects in cinema. Michigan: Michael Wiese Productions, 2001.
TAYLOR, Paul. Text-to-Speech Synthesis. New York, NY: Cambridge University Press, 2009.
ZUMTHOR, Paul. Performance, recepção, leitura. Translated by Jerusa Pires Ferreira and Suely Fenerich. São Paulo, SP: Ubu Editora, 2018.
ZUMTHOR, Paul. “A poesia e a voz.” In: ZUMTHOR, Paul. Escritura e nomadismo: entrevistas e ensaios. Cotia, SP: Ateliê Editorial, 2005.
License
Copyright (c) 2023 Jaimeson Machado Garcia, Ana Cláudia Munari Domingos, Rejane Frozza

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
Authors retain copyright and grant Liinc em Revista the right of first publication with the work simultaneously licensed under a Creative Commons Attribution 4.0 International License.
The authors have permission and are encouraged to deposit their manuscripts and versions of record (VoR) on their personal web pages or in institutional repositories, generic repositories, etc., before (pre-print) or after (post-print) publication in Liinc em Revista, according to its open access depositing policy registered in the Directory of Editorial Policies of Brazilian Journals (DIADORIM), kindly providing a link to the article published on Liinc's website.
Liinc em Revista, published by Instituto Brasileiro de Informação em Ciência e Tecnologia, is licensed under a Creative Commons Attribution 4.0 International License – CC BY 4.0