Article | Jan 1, 2012 | Closed access

WIT3: Web Inventory of Transcribed and Translated Talks

Abstract

We describe here a Web inventory named WIT3 that offers access to a collection of transcribed and translated talks. The core of WIT3 is the TED Talks corpus, which essentially redistributes the original content published by the TED Conference website (http://www.ted.com). Since 2007, the TED Conference, based in California, has been posting all video recordings of its talks together with subtitles in English and their translations in more than 80 languages. Aside from its cultural and social relevance, this content, which is published under the Creative Commons BY-NC-ND license, also represents a precious language resource for the machine translation research community, thanks to its size, variety of topics, and…

Citation impact

Total citations: 580
FWCI: 55.71
Percentile: 100%
References: 15

Authors

3

Topics & keywords

Keywords
  • World Wide Web
  • Aside
  • Computer science
  • Machine translation
  • License
  • Variety (cybernetics)
  • Resource (disambiguation)
  • Library science
UN Sustainable Development Goals
  • Quality Education