Journal article

Mixing and merging for spoken document retrieval

  • Crestani, Fabio Facoltà di scienze informatiche, Università della Svizzera italiana, Svizzera
Published in:
  • Lecture notes in computer science. - Springer. - 2009, vol. 1513, p. 397-407
English This paper describes a number of experiments that explored the issues surrounding the retrieval of spoken documents. Two such issues were examined. First, attempting to find the best use of speech recogniser output to produce the highest retrieval effectiveness. Second, investigating the potential problems of retrieving from a so-called "mixed collection", i.e. one that contains documents from both a speech recognition system (producing many errors) and from hand transcription (producing presumably near perfect documents). The result of the first part of the work found that merging the transcripts of multiple recognisers showed most promise. The investigation in the second part showed how the term weighting scheme used in a retrieval system was important in determining whether the system was affected detrimentally when retrieving from a mixed collection.
  • English
Computer science and technology
License undefined
  • RERO DOC 12697
  • ARK ark:/12658/srd1318283
Persistent URL

Document views: 20 File downloads:
  • crestani_LNCS_2009_2.pdf: 44