This article presents ethnographic research in the field of algorithmic speech synthesis, conducted in universities in Italy and Germany and in a speech synthesis company. The methodology integrates ethnography, software studies and media archaeology in order to account for the complex socio-technical network of algorithmic systems. Fieldwork included interviews with programmers as well as the examination of speech synthesis algorithms at work. Following analysis of the empirical research, the study discusses epistemological and socio-cultural aspects of data assemblages, focusing on changes in programming practices related to deep learning, a technology that bypasses domain-knowledge and human models of speech to refer directly to the observation of examples. Highlighting the tension between technical operations and social representations of these operations, the paper suggests that the sense-making of algorithms is not to be found in automation, but in the shift in programmers’ position and in the associated subjectivation processes.
"Where is the voice of the machine?". An ethnography of artificial voice socio-technical networks / Napolitano, Domenico. - In: ETNOGRAFIA E RICERCA QUALITATIVA. - ISSN 1973-3194. - 3/2020:(2020), pp. 351-372. [10.3240/99549]
"Where is the voice of the machine?". An ethnography of artificial voice socio-technical networks
Domenico Napolitano
2020
Abstract
This article presents ethnographic research in the field of algorithmic speech synthesis, conducted in universities in Italy and Germany and in a speech synthesis company. The methodology integrates ethnography, software studies and media archaeology in order to account for the complex socio-technical network of algorithmic systems. Fieldwork included interviews with programmers as well as the examination of speech synthesis algorithms at work. Following analysis of the empirical research, the study discusses epistemological and socio-cultural aspects of data assemblages, focusing on changes in programming practices related to deep learning, a technology that bypasses domain-knowledge and human models of speech to refer directly to the observation of examples. Highlighting the tension between technical operations and social representations of these operations, the paper suggests that the sense-making of algorithms is not to be found in automation, but in the shift in programmers’ position and in the associated subjectivation processes.File | Dimensione | Formato | |
---|---|---|---|
D. Napolitano - Where's the voice of the machine [ERQ 3-2020].pdf
non disponibili
Dimensione
1.71 MB
Formato
Adobe PDF
|
1.71 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.