University of Bahrain · Senior Project 2026 · ITCE 499

Augmented Understanding &
Real-Time Interpretation System

Real-Time · Speech-To-Text · Wearable · Accessible

Smart glasses-based assistive communication for the deaf and hard-of-hearing. Real-time speech-to-text, environmental noise detection, and speaker differentiation — all on-device.

Explore Features How It Thinks Technology

◆Speech To Text ◆Bluetooth Low Energy ◆Noise Detection ◆Speaker Differentiation ◆ESP32-S3 ◆OLED Display ◆Offline Mode ◆Flutter App ◆YAMNet AI ◆OpenAI Whisper ◆Speech To Text ◆Bluetooth Low Energy ◆Noise Detection ◆Speaker Differentiation ◆ESP32-S3 ◆OLED Display ◆Offline Mode ◆Flutter App ◆YAMNet AI ◆OpenAI Whisper

Capabilities

Built Different.

Six intelligent modules working in unison to break communication barriers.

🎙️

Real-Time Speech-to-Text

Spoken language is transcribed instantly and displayed as live captions on the OLED screen built into the user's glasses — no lag, no looking down at a phone.

📡

Bluetooth Low Energy

BLE ensures wireless, low-latency communication between the smartphone and the ESP32-S3 wearable device, with minimal power draw for all-day use.

🔔

Environmental Noise Detection

A hybrid fingerprint + YAMNet model identifies important sounds — doorbells, alarms, sirens — and alerts the user with on-screen labels and app notifications.

👥

Speaker Differentiation

Using Resemblyzer voice embeddings and cosine similarity, AURIS labels each speaker in a conversation — "Speaker 1", "Speaker 2" — in real time without prior registration.

🌐

Online & Offline Modes

Online mode uses OpenAI Whisper for highest accuracy across 96+ languages. Offline mode switches to Apple Speech Recognition locally on-device — no internet needed.

🔒

Privacy-First Architecture

Heavy processing happens on the smartphone — not in the cloud. Offline mode ensures that sensitive conversations stay entirely on the user's device when preferred.

Architecture

How It Thinks.

A real-time pipeline from sound capture to visual caption display.

🎤

Input Layer

MEMS Microphone

Compact, high-sensitivity microphone captures audio from the user's environment.

📱

Processing

Flutter Mobile App

Handles STT, noise detection routing, and BLE communication. Acts as the main compute unit.

🧠

Intelligence

Whisper / Apple STT + FastAPI

Dual-mode transcription with cloud Whisper for accuracy or on-device Apple STT for privacy.

📡

Communication

BLE Protocol

Bidirectional low-latency wireless link between smartphone and ESP32-S3 wearable.

⚙️

Embedded

ESP32-S3 Microcontroller

Receives captioned text and drives the OLED display; integrated BLE support.

🕶️

Output

OLED HUD Display

1.3″ OLED projects captions via beam-splitter optics directly into the user's field of view.

End-to-end Latency

1–3s

From speech input to caption visible on the OLED display.

Online Mode

OpenAI Whisper via backend server — highest accuracy, 96+ languages, noise robust.

Offline Mode

Apple Speech Recognition — fully on-device, privacy-safe, no internet required.

Database

Firebase

Realtime Database for user data & custom sounds. Firestore for persistent settings.

Noise Detection Engine

Hybrid

Custom fingerprint matching for personalized sounds + YAMNet for general classification.

Technology

The Stack Behind the Intelligence.

Mobile & Frontend

Flutter Dart iOS Native Speech API Firebase SDK BLE Flutter Plugin

AI & Backend

OpenAI Whisper ASR YAMNet (TensorFlow) Resemblyzer FastAPI (Python) Firebase Realtime DB Cloud Firestore

Hardware

ESP32-S3 Microcontroller 1.3″ OLED (SPI) MEMS Microphone (I2S) 3.7V LiPo Battery MT3608 Boost Converter Beam-Splitter HUD Optics 3D Printed Chassis

The Team

Built By.

Aya Mohamed Ahmed Ismail

Project Member

University ID 202206832

Institution Univ. of Bahrain

Mira Mahmoud Mousa Basiouny

Project Member

University ID 202204167

Institution Univ. of Bahrain

Amani Emad Ebrahim Hajjaj

Project Member

University ID 202200149

Institution Univ. of Bahrain

Project Supervisor

Dr. Aysha Al-Sayed

Project Supervisor · College of IT, UoB

Contact

Get In Touch.

Questions about the project, collaboration requests, or feedback — we'd love to hear from you.

✉

contact@auris-glasses.it.com

🏛

Institution

University of Bahrain

📐

Department

Computer Engineering, College of IT

📅

Academic Year

2025–2026 · Semester 2

Name

Message

Augmented Understanding & Real-Time Interpretation System

Built Different.

Real-Time Speech-to-Text

Bluetooth Low Energy

Environmental Noise Detection

Speaker Differentiation

Online & Offline Modes

Privacy-First Architecture

How It Thinks.

The Stack Behind the Intelligence.

Built By.

Aya Mohamed Ahmed Ismail

Mira Mahmoud Mousa Basiouny

Amani Emad Ebrahim Hajjaj

Get In Touch.

Augmented Understanding &
Real-Time Interpretation System