Baza Publikacji Pracowników Politechniki Lubelskiej

Status:
Autorzy:	Mamyrbayev Orken, Pavlov Sergii V., Bekarystankyzy Akbayan, Oralbekova Dina, Zhumazhanov Bagashar, Azarova Larysa E., Mussayeva Dinara, Koval Tetiana, Gromaszek Konrad, Issimov Nurdaulet, Shiyapov Kadrzhan
Dyscypliny:
	Aby zobaczyć szczegóły należy się zalogować.
Wersja dokumentu:	Drukowana \| Elektroniczna
Język:	angielski
Strony:	1 - 8
Web of Science® Times Cited:	0
Scopus® Cytowania:	0
Bazy:	Web of Science \| Scopus
Efekt badań statutowych	NIE
Materiał konferencyjny:	TAK
Nazwa konferencji:	20th meeting in the series of National Symposia "Fiber Optics and their Applications"
Skrócona nazwa konferencji:	20th OFTA 2023
URL serii konferencji:	LINK
Termin konferencji:	11 września 2023 do 14 września 2023
Miasto konferencji:	Lublin
Państwo konferencji:	POLSKA
Publikacja OA:	NIE
Abstrakty:	angielski
	Nowadays, speech-processing technologies with different language systems are successfully used in mobile and stationary devices. Kazakh is considered a low-resource language, which poses various challenges for conventional speech recognition methods. This paper presents a proposed model capable of multitasking and handling concurrent speech recognition, dialect identification, and speaker identification, all in an end-to-end framework. The developed multitask model enables training three different tasks within a single model. A multitask recognition system is created based on the WaveNet-CTC model. Experiments show that for the concrete task end-to-end multitask model has better performance than other models.

Informacja o cookies

Neurorecognition visualization in multitask end-to-end speech

Fragment książki (Rozdział monografii pokonferencyjnej)