- Início
- Bibliografia
- The first Mirandese text-to-speech system
The first Mirandese text-to-speech system
Autores
Tipología
Capítulo de livro
Título del libro
Language Documentation and Conservation in Europe
Editoras de livros
Ferreira, Vera; Bouda, Peter
Localização
Honolulu
Editorial
University of Hawai‘i Press
Ano
2016
Páginas
150-158
ISBN
978-0-9856211-5-5
Sítio web
Sinopse do conteúdo
[Resumen extraído de la fuente original]
This paper describes the creation of base NLP resources and tools for an underresourced minority language spoken in Portugal, Mirandese, in the context of the generation of a text-to-speech system, a collaborative citizenship project between Microsoft, ILTEC, and ALM – Associaçon de la Lhéngua Mirandesa. Development efforts encompassed the compilation of a large textual corpus, definition of a complete phone-set, development of a tokenizer, inflector, TN and GTP modules, and creation of a large phonetic lexicon with syllable segmentation, stress mark-up, and POS. The TTS system will provide an open access web interface freely available to the community, along with the other resources. We took advantage of mature tools, resources, and processes already available for phylogenetically-close languages, allowing us to cut development time and resources to a great extent, a solution that can be viable for other lesser-spoken languages which enjoy a similar situation.
Notas
Language Documentation & Conservation, Special Publication No. 9.
Linguagem
Tema
Área geográfica
Última modificação
2021-05-29 13:24