SATELLITE WORKSHOP AT INTERSPEECH 2019, GRAZ, AUSTRIA
PLURICENTRIC LANGUAGES IN SPEECH TECHNOLOGY
CALL FOR PAPERS:
DESCRIPTION AND OBJECTIVES OF THE WORKSHOP:
1. Pluricentric languages (PLCLs) are a common type among the languages of the world. Presently, 43 languages have been identified as belonging to this category, including English, Spanish, Portuguese, Bengali, Hindi and Urdu. These languages are used in at least two nations, where they have an official function and form national varieties of their own with specific linguistic and pragmatic features. In addition to the variation at the level of national standard varieties, there is also so-called “second level variation” at the regional and local level. It often occurs in diglossic speech situations, where code switching is a salient feature and two or more varieties are used within the same utterance.
2. The amount of linguistic variation in pluricentric languages is considerable and poses a challenge for speech recognition in particular and human language technology in general.
3. The motivation for the satellite workshop is the observation that pluricentric languages have not been dealt with sufficiently. This is particularly the case with the so-called “non-dominant varieties”, which often suffer from a lack of documentation and treatment in speech technology. The satellite workshop will therefore focus on these varieties, as they share many features with endangered languages.
THE ORGANIZERS WELCOME PAPERS THAT DEAL WITH:
- Speech recognition and development of language resources for under-resourced pluricentric languages and varieties of languages, in particular the so-called non-dominant varieties. Examples include, among others, Scots, Saami, Karelian Finnish, Ruthenian, Kashubian, Tajik and Frisian, as well as diverse American and African languages: Aymara, Bambara, Fulfulde, Lingala, Malinke, Soninke, Tuareg, Xhosa etc.
- Language and speech resources development (parallel corpora, pronunciation databases, tagging etc.) especially for non-dominant varieties.
- Speech technologies such as speech recognition, text-to-speech and speech-to-speech for the national varieties of pluricentric languages, both at the level of standard varieties and at the level of so-called “informal speech”.
- Empirical studies on the phonetics and phonology of national varieties of different pluricentric languages.
- Speech and language technologies that are able to cope with the variation in pluricentric languages, particularly with respect to non-dominant varieties and under-resourced languages.
- Speech and language processing for code-switched speech in national varieties of pluricentric languages.
ABSTRACTS: Abstracts of 400-600 words should be sent to: firstname.lastname@example.org
(The call is open from 01 February to 05 April 2019.)
LENGTH OF PRESENTATIONS: 15 minutes for the presentation plus 5 minutes for discussion.
PUBLICATION OF PRESENTATIONS: A special issue of a major journal will later be published, containing peer-reviewed, substantially extended versions of the abstracts contributed to the conference. Submissions to it will be by invitation.