speech recognition | Academic Journals and Conferences

DEVELOPMENT OF THE MULTIMODAL HANDLING INTERFACE BASED ON GOOGLE API

Today, Artificial Intelligence is a daily routine, becoming deeply entrenched in our lives. One of the most popular and rapidly advancing technologies is speech recognition, which forms an integral part of the broader concept of multimodal data handling. Multimodal data encompasses voice, audio, and text data, constituting a multifaceted approach to understanding and processing information. This paper presents the development of a multimodal handling interface leveraging Google API technologies.

Development of a Web Application for Taking Tests by Blind People

The main purpose of this article is to de- scribethe process of creating a web application designed specifically for blind individuals to take tests. The author discusses the challenges that visually impaired individuals face when taking tests and how the new web application addresses these challenges. The application has been devel- oped using web accessibility guidelines and includes features such as screen reader compatibility, speech recognition, keyboard navigation, and high-contrast options.

Information system for converting audio in ukrainian language into its textual representation using nlp methods and machine learning

Speech recognition involves various models, methods and algorithms for analysing and processing the user’s recorded voice. This allows people to control different systems that support one type of speech recognition. A speech-to-text conversion system is a type of speech recognition that uses spoken data for further processing.