SUSI.AI – The Privacy Aware Smart Assistant – Norbert Preining, Accelia Inc., FOSSASIA

Video source: The Linux Foundation

The landscape of smart assistants is constantly evolving. Many homes now feature voice-controlled devices. However, critical questions about privacy and data security often arise. The accompanying video introduces SUSI.AI, a privacy-aware smart assistant. This open-source project offers a compelling alternative to mainstream smart speakers. It shifts control back to the user.

This article expands on the insights shared in the video. It explores why SUSI.AI is a crucial development, highlights its approach to personal data protection, and describes the project’s deep roots in the open-source community. Understanding SUSI.AI’s core principles can help users move toward more secure smart home interactions.

The Rising Concern for Smart Speaker Privacy

Commercial smart speakers offer undeniable convenience. They can play music, check the weather, and set alarms. Many households worldwide rely on these devices daily. However, this convenience often comes at a cost. User data privacy is a significant concern. The core problem lies in their design. Voice commands are typically sent to remote servers for processing.

This server-side processing creates vulnerabilities. Recordings may be reviewed by human staff, typically to improve voice recognition systems, and such practices raise serious ethical questions. There have also been reported instances of unauthorized access in which personal conversations were exposed. User trust erodes when privacy is compromised. IoT security risks loom large as well: an unsecured device could become an entry point for malicious actors. SUSI.AI, a privacy-aware smart assistant, is designed to address these concerns.

Introducing SUSI.AI: A Different Approach to Smart Assistants

SUSI.AI presents a fundamental shift. It operates on principles of privacy and openness. This project is developed by the FOSSASIA community. FOSSASIA is recognized as a leading open-source community in Asia. Their mission involves fostering open technologies globally. SUSI.AI embodies this ethos directly.

The system is designed with privacy at its forefront. Offline capability is a key feature: many functions run without internet access, and local processing keeps information on the user’s device. For those functions, no data is sent to external servers, which significantly reduces privacy risks. User information therefore remains under personal control.

Key Pillars of the SUSI.AI Experience

Several features define SUSI.AI’s unique value. Its design aims for maximum user empowerment. These aspects directly counter commercial smart speaker limitations. Each pillar reinforces the commitment to privacy.

Offline Functionality: Your Data Stays Home

A significant advantage of SUSI.AI is its offline capability. Many basic tasks do not require an internet connection. Music playback from a local server is possible. Setting timers and reminders can be done locally. This offline mode minimizes data transmission. It ensures that sensitive interactions remain private. Such functionality is vital for users prioritizing security. It eliminates reliance on cloud services for common tasks.

Open Source Ecosystem: Transparency and Control

SUSI.AI is entirely open source. All its software components are freely available. This transparency allows for community auditing. Users can inspect the code for vulnerabilities. This contrasts sharply with proprietary systems. Commercial offerings often hide their internal workings. Open source fosters trust and collaboration. It also ensures long-term viability and adaptability.

Easy Skill Development: Customize Your Assistant

Developing new skills for SUSI.AI is made accessible. A simple skill language is used. It can be edited much like a wiki page. This encourages community contributions. Users can create custom commands and integrations. This flexibility is uncommon in commercial alternatives. It empowers users to tailor the assistant to their specific needs. Innovation is driven by a diverse developer base.
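
To make the idea of a simple, text-based skill more concrete, here is a minimal Python sketch of how question patterns could be paired with answers and matched against a query. It is an illustration only: the `Skill` class, the `*` wildcard, and the `$1$` placeholder are assumptions made for this example and are not taken from SUSI.AI's actual skill language or server code.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Skill:
    """A hypothetical skill: alternative question patterns and one answer."""
    patterns: List[str]  # e.g. ["hello", "hi"]; "*" marks a wildcard
    answer: str          # "$1$" is replaced by the first wildcard match


def _normalize(text: str) -> str:
    # Lowercase and drop punctuation (keeping "*") so "Hi!" matches "hi".
    return re.sub(r"[^\w\s*]", "", text.lower()).strip()


def _pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    # Each "*" becomes a capture group; everything else is literal text.
    parts = [re.escape(p) for p in _normalize(pattern).split("*")]
    return re.compile("^" + "(.+)".join(parts) + "$")


def answer(query: str, skills: List[Skill]) -> Optional[str]:
    """Return the answer of the first skill whose pattern matches, else None."""
    q = _normalize(query)
    for skill in skills:
        for pattern in skill.patterns:
            match = _pattern_to_regex(pattern).match(q)
            if match:
                reply = skill.answer
                # Substitute captured wildcards into the answer text.
                for i, group in enumerate(match.groups(), start=1):
                    reply = reply.replace(f"${i}$", group.strip())
                return reply
    return None


skills = [
    Skill(["hello", "hi"], "Hello, how are you?"),
    Skill(["set a timer for *"], "Timer set for $1$."),
]
print(answer("Hi!", skills))
print(answer("Set a timer for 5 minutes", skills))
```

Because a skill in this sketch is plain data, a contributor could add one by editing a few lines of text, which is the property that a wiki-style editing workflow relies on.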

Anatomy of a Privacy-Aware Smart Assistant: SUSI.AI’s Core Components

SUSI.AI is structured into distinct, interconnected parts. Each component plays a vital role. They work together to deliver its privacy-focused functionality. Understanding these parts helps appreciate the system’s design. This modularity also facilitates ongoing development and customization. The entire system is built for resilience and user control.

The SUSI Server: The Brain of the Operation

The SUSI Server is the central processing component. It interprets the skill language and performs natural language processing (NLP) on user questions, which allows the system to understand commands and determine which skill matches a query. It can run locally on a device like a Raspberry Pi. This local deployment is highly recommended, because it keeps all command processing within the user’s home network. The server also handles user and device management. Registration is optional, so the assistant can be used anonymously.

SUSI Skills: Enabling Diverse Functionality

Skills are the specific capabilities of SUSI.AI. They provide question-and-answer functionality, and a large collection of user-contributed skills exists. Some skills integrate with external APIs; fetching weather data or exchange rates, for example, might rely on them. Users can also develop and host their own skills locally, which means private skills never leave the home network. A wiki-style editing interface simplifies skill creation and enables a broad range of community contributions.

SUSI Frontends: Accessible Across Devices

Various frontends ensure wide accessibility. An Android application is available. An iOS version is currently being developed. A desktop client supports Linux and Mac systems. It can run in the background like other voice assistants. A web frontend also allows interaction via a browser. These frontends communicate with the SUSI Server. They can connect to a local server or the public susi.ai server. The desktop client prioritizes local server connection by default. This design choice reinforces privacy.
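
The "local server first" behavior can be sketched in a few lines of Python. This is a hedged illustration, not the desktop client's actual code: the server addresses, the port, the `/susi/chat.json` path, and the `q` parameter are all assumptions made for the example and should be checked against the real deployment.

```python
import socket
from typing import Callable, Iterable, Optional
from urllib.parse import urlencode, urlparse

# Assumed addresses, for illustration only.
LOCAL_SERVER = "http://localhost:4000"
PUBLIC_SERVER = "https://api.susi.ai"


def tcp_probe(base: str, timeout: float = 0.5) -> bool:
    """Return True if a TCP connection to the server's host and port succeeds."""
    parts = urlparse(base)
    port = parts.port or (443 if parts.scheme == "https" else 80)
    try:
        with socket.create_connection((parts.hostname, port), timeout=timeout):
            return True
    except OSError:
        return False


def choose_server(
    candidates: Iterable[str],
    is_reachable: Callable[[str], bool] = tcp_probe,
) -> Optional[str]:
    """Return the first reachable server; list order encodes the preference."""
    for base in candidates:
        if is_reachable(base):
            return base
    return None


def chat_url(base: str, query: str) -> str:
    # Endpoint path and parameter name are assumptions in this sketch.
    return f"{base}/susi/chat.json?{urlencode({'q': query})}"
```

Listing `LOCAL_SERVER` before `PUBLIC_SERVER` means a query only leaves the home network when no local server answers. A stricter setup could pass only the local address and treat `None` as "stay offline".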

Practical Privacy: Speech-to-Text and Text-to-Speech

Speech processing is a critical area for privacy. Translating spoken words to text is complex. It often requires significant computational resources. SUSI.AI addresses this with configurable options. Users can choose their preferred level of privacy.

By default, Mozilla’s DeepSpeech is used for speech-to-text. This open-source engine runs on the device, so voice data is processed locally. However, DeepSpeech currently has fewer language models, and its accuracy may not always match commercial offerings. Users can instead opt for the Google, Bing, or Watson speech-to-text services. These provide higher accuracy but send audio to external servers, giving up local processing. The choice is left to the user’s discretion.

Text-to-speech similarly offers a default and alternatives. Flite is the on-device default and provides acceptable, though not premium, pronunciation. Google or Watson can be selected for higher-quality output. This flexible configuration keeps users in control and lets them balance privacy against functionality.
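
The trade-off between privacy and quality can be expressed as a small configuration object. The following Python sketch uses the engine names mentioned above, but the identifiers, the `SpeechConfig` class, and its methods are hypothetical and do not reflect SUSI.AI's real configuration format.

```python
from dataclasses import dataclass

# Engine names come from the text; the lowercase identifiers are illustrative.
LOCAL_STT = {"deepspeech"}
CLOUD_STT = {"google", "bing", "watson"}
LOCAL_TTS = {"flite"}
CLOUD_TTS = {"google", "watson"}


@dataclass(frozen=True)
class SpeechConfig:
    """Hypothetical speech settings; the defaults keep everything on-device."""
    stt: str = "deepspeech"
    tts: str = "flite"

    def __post_init__(self) -> None:
        # Reject unknown engine names early instead of failing at runtime.
        if self.stt not in LOCAL_STT | CLOUD_STT:
            raise ValueError(f"unknown speech-to-text engine: {self.stt}")
        if self.tts not in LOCAL_TTS | CLOUD_TTS:
            raise ValueError(f"unknown text-to-speech engine: {self.tts}")

    def audio_leaves_device(self) -> bool:
        """True if microphone audio would be sent to an external service."""
        return self.stt in CLOUD_STT

    def text_leaves_device(self) -> bool:
        """True if the assistant's reply text would be sent out for synthesis."""
        return self.tts in CLOUD_TTS

    def fully_local(self) -> bool:
        return not (self.audio_leaves_device() or self.text_leaves_device())
```

With this shape, `SpeechConfig()` describes the private default, while `SpeechConfig(stt="google")` makes it explicit that recorded audio would leave the device in exchange for better recognition.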

Building Your Own SUSI.AI Smart Speaker: The Hardware Aspect

SUSI.AI isn’t just software. It also supports a dedicated hardware device. This allows for a complete, self-contained smart speaker experience. The hardware is designed with open principles in mind. It exemplifies the DIY spirit of the open-source community.

The SUSI.AI smart speaker is Raspberry Pi-based. Raspberry Pi 3+ or 4 models are recommended. These small computers provide sufficient processing power. A ReSpeaker HAT is used for microphone input and audio output. This provides essential voice interaction capabilities. A 3D-printed cover houses all components neatly. The SUSI.AI Server and Linux frontend run directly on the Raspberry Pi. An installer simplifies the setup process. It configures the device to function right out of the box. Users can set up Wi-Fi via a local hotspot. This avoids the need for cables. The device can operate entirely offline after initial setup. This ensures maximum privacy for smart home interactions.

Community, Challenges, and the Future of SUSI.AI

The development of SUSI.AI is a community-driven effort. Volunteers from diverse backgrounds contribute. This collaborative spirit is foundational to the project. Workshops, coding contests, and Gitter channels foster engagement. Initiatives like Google Summer of Code involve many students. This broad participation brings fresh ideas. It also creates a vibrant development environment.

However, such a large, volunteer-led project faces unique challenges. The vast scope of work is handled by part-time contributors. This can impact development speed. Diversity in coding skills and cultural backgrounds needs careful management. Maintaining consistent coding standards is a continuous effort. Furthermore, long-term volunteer involvement is often difficult to secure. Many participants join for specific coding programs. Technical hurdles also persist. The popularity of Java for server-side development has declined among students. Python’s frequent version updates can introduce compatibility issues. The future of DeepSpeech, a core component, is also uncertain. Mozilla has scaled back its development efforts for the project. These challenges are openly discussed within the FOSSASIA community. They highlight the complexities of large-scale open-source initiatives.

The roadmap for SUSI.AI is ambitious. The ultimate dream is a privacy-aware smart assistant in every household. Current efforts focus on improving the out-of-the-box experience for the hardware device. Desktop integration is another priority. This involves creating seamless voice recognition functionality. Better skill management tools are also planned. These tools will help users activate vetted skills more easily. A server mesh concept aims to further decentralize the system by letting multiple local SUSI Servers communicate with each other, which would enable device and user registration within a private household network. Sustaining steady development is a perpetual goal. The FOSSASIA community welcomes new contributors. Engagement through Gitter channels and personal contact is encouraged. This ensures the project continues to evolve. It keeps privacy and open-source principles at its core.

Your Questions for SUSI.AI: Exploring Privacy and Smart Capabilities

What is SUSI.AI?

SUSI.AI is an open-source, privacy-aware smart assistant. It offers an alternative to mainstream smart speakers by giving users more control over their data.

How is SUSI.AI different from common smart speakers?

Unlike many commercial smart speakers, SUSI.AI focuses on privacy by allowing many functions to run offline and processing data locally on your device. For those functions, your information is not sent to external servers.

What does it mean that SUSI.AI is ‘open-source’?

Being open-source means all of SUSI.AI’s software components are freely available for anyone to inspect, modify, and contribute to. This promotes transparency, trust, and community-driven development.

Can I build my own SUSI.AI smart speaker device?

Yes, you can build a SUSI.AI smart speaker using hardware like a Raspberry Pi computer and a ReSpeaker HAT for audio. The project encourages a DIY approach for a self-contained, private smart home experience.