IoT System for Real-Time Audio Information Processing
This paper presents the development and investigation of a speech-to-text conversion and speaker identification system based on a Raspberry Pi microcomputer, designed for local audio data processing in environments with limited network connectivity. The system integrates Silero and WebRTC models for voice activity detection, SpeechBrain for speaker identification, and the Whisper family of models for speech recognition.
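The three-stage flow described above (voice activity detection, then speaker identification, then speech recognition) can be illustrated with a minimal sketch. It is not the implementation used in this work: the names `energy_vad`, `process_audio`, and `Utterance` are hypothetical, and a toy energy threshold stands in for the Silero and WebRTC detectors so that the example runs without any model. In a real deployment the `identify_speaker` and `transcribe` callables would presumably wrap SpeechBrain and Whisper, respectively.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

Samples = Sequence[float]
Segment = Tuple[int, int]  # [start, end) sample indices


@dataclass
class Utterance:
    start: int
    end: int
    speaker: str
    text: str


def energy_vad(samples: Samples, frame_len: int = 160,
               threshold: float = 0.01) -> List[Segment]:
    """Toy energy-based stand-in for a real VAD model (assumption)."""
    segments: List[Segment] = []
    seg_start = None
    for i in range(0, len(samples), frame_len):
        frame = samples[i:i + frame_len]
        energy = sum(x * x for x in frame) / len(frame)
        if energy >= threshold:
            if seg_start is None:
                seg_start = i  # speech begins at this frame
        elif seg_start is not None:
            segments.append((seg_start, i))  # speech ended before this frame
            seg_start = None
    if seg_start is not None:
        segments.append((seg_start, len(samples)))
    return segments


def process_audio(samples: Samples,
                  detect_speech: Callable[[Samples], List[Segment]],
                  identify_speaker: Callable[[Samples], str],
                  transcribe: Callable[[Samples], str]) -> List[Utterance]:
    """Run only the detected speech segments through the later stages."""
    results: List[Utterance] = []
    for start, end in detect_speech(samples):
        chunk = samples[start:end]
        results.append(
            Utterance(start, end, identify_speaker(chunk), transcribe(chunk))
        )
    return results


if __name__ == "__main__":
    # Silence, a burst of "speech", then silence again.
    audio = [0.0] * 320 + [0.5] * 480 + [0.0] * 160
    for utt in process_audio(
        audio,
        detect_speech=energy_vad,
        identify_speaker=lambda chunk: "speaker_0",         # placeholder
        transcribe=lambda chunk: f"{len(chunk)} samples",   # placeholder
    ):
        print(utt)
```

Passing each stage as a callable keeps the detector, the speaker model, and the recognizer interchangeable, and running the detector first means the costlier stages only see segments that contain speech, which matters on a device with limited compute.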