Mark-up and the narrative structure of TV news / Venuti, Marco; Marchi, A. - Print. - Research in Corpus and Discourse (2009), pp. 27-47.

Mark-up and the narrative structure of TV news

Venuti, Marco
2009

Abstract

The CorDis television corpus is an XML (eXtensible Mark-up Language) TEI (Text Encoding Initiative)-conformant collection of texts representing a significant portion of the television news discourse on the 2003 Iraqi conflict. It comprises four subcorpora: the evening news broadcasts of BBC, CBS, RAI Uno and Canale 5 from 20 March to 18 April (see Introduction, this volume). The main purpose of this paper is to show the function and importance of mark-up for the retrieval of discourse-specific information in a television news corpus. To do so, two preliminary issues have to be addressed: (1) the role of annotation in the creation of a harmonized and consistent corpus, with specific reference to TEI mark-up of spoken discourse, and (2) the composition of the corpus and the relevant categories that have been encoded. The focus is on the function of mark-up associated with television news discourse: mark-up gives access to meta-linguistic information by telling part of the parallel story constituted by the visual text, and so permits the recovery of non-verbal data, a fundamental characteristic of both the medium (television) and the genre (television news). Finally, we argue that such a homogeneously encoded corpus is a valuable resource for research, both because it enhances reliability and favours reusability by making the data easily retrievable, and because it gives access to a whole set of information that would otherwise be lost.
ISBN: 9781847061768
Files in this record:
Mark-up and the narrative structure of television news.pdf (Adobe PDF, 1.07 MB; not available)
Type: Post-print document
Licence: Private/restricted access

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11588/115657