Anuj Solanki
GSoC Week 07 Summary
Date: 08 July 2024 - 14 July 2024
Introduction
In this week's update, I have made significant progress on integrating Whisper AI into the AGL Voice Assistant. The primary focus was on enhancing the Flutter application for speech recognition, refining the AGL Voice Agent service, and conducting thorough testing to ensure seamless functionality. Here’s a detailed breakdown of the tasks accomplished, steps to use the AGL Voice Assistant, and plans for the upcoming week.
Tasks Completed This Week
Updated the Flutter Speech Recognition Demo:
Enhanced the flutter-speechrecognition-demo app.
- Added a toggle button to choose between Vosk and Whisper for speech-to-text functionality.
- Included an option to toggle between online and offline speech-to-text functionality.
Tested the Complete Voice Agent Service:
- Conducted comprehensive testing with both the Flutter client and Python client on the AGL image.
Fixed the Kuksa Authorization Issue:
- Resolved the Kuksa authorization issue that was preventing the voice agent from connecting to the Kuksa server.
Use AGL Voice Assistant
Build the Voice-Agent into AGL Image
To integrate the voice-agent into the AGL image, follow these steps:
$ source master/meta-agl/scripts/aglsetup.sh -m qemux86-64 -b build-master agl-demo agl-devel agl-offline-voice-agent
$ source agl-init-build-env
$ bitbake agl-ivi-demo-flutter
Start the Voice-Agent Service
Login as root and start the voice-agent service with the following command:
$ voiceagent-service run-server --default
This command will start the agl-service-voiceagent. Now you can use either the Python client or Flutter client to execute voice commands.
Using Python Client
$ voiceagent-service run-client --server-address 127.0.0.1 --server-port 51053 --nlu snips --action ExecuteVoiceCommand
--mode manual --stt-framework whisper/vosk --online-mode 1/0
Using Flutter Client
Simply use the voice-assistant app.
AGL Voice-Assistant :-
- Voice Agent Service:
- Python Client:
- Flutter Client:
Tasks for Next Week
- Continue refining the Flutter app and push the changes to Gerrit.
- Improve the agl-service-voiceagent.
- Begin work on the NLU component to further improve the voice assistant's functionality.
Conclusion
This week marked a significant milestone in enhancing the AGL Voice Assistant with robust speech recognition capabilities using Whisper and Vosk.
-