
Gesture-Controlled Info Hub

Sep 2025 - Dec 2025
Embedded C · C++ · ESP32 · Edge Impulse · IMU · Wi-Fi · OLED Display · I²C · Real-Time Systems
View on GitHub

Overview

A gesture-controlled embedded "info hub" that uses inertial sensing and on-device machine learning to trigger contextual information displays. The system allows a user to perform simple hand gestures to retrieve weather, stock prices, quotes, and jokes, all rendered on an OLED screen in real time.

The device uses an IMU to capture accelerometer data (AX/AY) at a fixed sample rate during short, touch-triggered recording windows. Each window spans a deterministic length of time, is converted into a feature vector, and is classified on-device by a model trained and deployed with Edge Impulse.
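
A minimal sketch of that capture-and-classify path, assuming an Edge Impulse Arduino export; the inferencing header name and the readAccelXY() IMU helper are placeholders, not the project's actual identifiers:

```cpp
#include <Arduino.h>
#include <gesture_hub_inferencing.h>  // Edge Impulse export; header name is project-specific

static float features[EI_CLASSIFIER_DSP_INPUT_FRAME_SIZE];

// Placeholder for the IMU driver: read one accelerometer sample (AX/AY).
void readAccelXY(float &ax, float &ay);

// Record one fixed-length window at the sample rate the model was trained on.
void recordWindow() {
    for (size_t i = 0; i < EI_CLASSIFIER_RAW_SAMPLE_COUNT; i++) {
        uint32_t next = millis() + (uint32_t)EI_CLASSIFIER_INTERVAL_MS;
        float ax, ay;
        readAccelXY(ax, ay);
        features[i * EI_CLASSIFIER_RAW_SAMPLES_PER_FRAME + 0] = ax;
        features[i * EI_CLASSIFIER_RAW_SAMPLES_PER_FRAME + 1] = ay;
        while (millis() < next) { /* hold the fixed sampling interval */ }
    }
}

// Wrap the buffer in a signal_t and run the Edge Impulse classifier on-device.
const char *classifyWindow() {
    signal_t signal;
    numpy::signal_from_buffer(features, EI_CLASSIFIER_DSP_INPUT_FRAME_SIZE, &signal);

    ei_impulse_result_t result = { 0 };
    if (run_classifier(&signal, &result, false) != EI_IMPULSE_OK) return nullptr;

    // Return the label with the highest confidence.
    size_t best = 0;
    for (size_t i = 1; i < EI_CLASSIFIER_LABEL_COUNT; i++) {
        if (result.classification[i].value > result.classification[best].value) best = i;
    }
    return result.classification[best].label;
}
```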

Four gestures were trained and mapped to system actions: W (Weather), S (Stock price), Q (Quote), and J (Joke). A capacitive touch pin is used to explicitly gate recording, ensuring clean data collection and preventing accidental triggers.
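
The gating and dispatch step can be pictured as below; the touch pin, the threshold, and the show*() display routines are assumptions for illustration, not the project's code:

```cpp
#include <Arduino.h>

// Placeholders for the project's display/fetch routines.
void showWeather();
void showStockPrice();
void showQuote();
void showJoke();

const int TOUCH_PIN       = T0;  // capacitive touch pin (GPIO4); assumed choice
const int TOUCH_THRESHOLD = 30;  // lower readings mean "touched" on the ESP32

// The recording window only opens while the touch pad is held.
bool touchHeld() {
    return touchRead(TOUCH_PIN) < TOUCH_THRESHOLD;
}

// Map the classifier's top label to one of the four info-hub actions.
void dispatchGesture(const char *label) {
    if      (strcmp(label, "W") == 0) showWeather();     // cached weather
    else if (strcmp(label, "S") == 0) showStockPrice();  // cached stock price
    else if (strcmp(label, "Q") == 0) showQuote();       // fetched live on demand
    else if (strcmp(label, "J") == 0) showJoke();        // fetched live on demand
}
```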

After establishing Wi-Fi connectivity, the system prefetches and caches network data (weather and stock prices) to avoid repeated API calls during runtime. Quotes and jokes are fetched live on demand. Results are rendered on a 128×64 OLED display, with output persisting until the next user interaction, creating a clear and intuitive user experience.
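
A sketch of the prefetch-and-cache step, assuming the ESP32 Arduino HTTPClient and ArduinoJson libraries; the endpoint URLs and JSON field names are placeholders:

```cpp
#include <WiFi.h>
#include <HTTPClient.h>
#include <ArduinoJson.h>

String cachedWeather;     // filled once after Wi-Fi connects
String cachedStockPrice;  // reused for every "S" gesture, avoiding repeat API calls

// Fetch a URL and return the response body, or an empty string on failure.
String httpGet(const char *url) {
    HTTPClient http;
    http.begin(url);
    String body = (http.GET() == HTTP_CODE_OK) ? http.getString() : String();
    http.end();
    return body;
}

// Called once after Wi-Fi is up; quotes and jokes stay live-fetched per gesture.
void prefetchData() {
    JsonDocument doc;  // ArduinoJson v7 (use StaticJsonDocument<N> for v6)

    String weather = httpGet("https://api.example.com/weather");  // placeholder URL
    if (weather.length() && deserializeJson(doc, weather) == DeserializationError::Ok) {
        cachedWeather = doc["temp"].as<String>();                 // placeholder field
    }

    String stock = httpGet("https://api.example.com/stock");      // placeholder URL
    if (stock.length() && deserializeJson(doc, stock) == DeserializationError::Ok) {
        cachedStockPrice = doc["price"].as<String>();             // placeholder field
    }
}
```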

Project Context

The project runs entirely on an ESP32 microcontroller and combines sensor data acquisition, real-time inference, network communication, and UI rendering within a tightly constrained embedded environment.

Key Features

  • Real-time gesture recognition using IMU data and on-device ML inference
  • Touch-gated data capture for reliable gesture segmentation
  • Four gesture-driven modes: weather, stocks, quotes, jokes
  • Wi-Fi HTTP integration with REST APIs and JSON parsing
  • Prefetching and caching of network data to reduce latency
  • Persistent OLED UI updated only on user interaction
  • Modular, event-driven embedded architecture (see the loop sketch after this list)
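
One way to picture that event-driven structure is a small state machine in loop(), reusing the helper names from the sketches above; the state names are illustrative, not the project's actual code:

```cpp
#include <Arduino.h>

bool touchHeld();                         // from the touch-gating sketch above
void recordWindow();                      // from the capture sketch above
const char *classifyWindow();             // from the capture sketch above
void dispatchGesture(const char *label);  // from the dispatch sketch above

enum class HubState { IDLE, RECORDING, CLASSIFYING };
HubState state = HubState::IDLE;

void loop() {
    switch (state) {
        case HubState::IDLE:            // wait for the touch gate to open
            if (touchHeld()) state = HubState::RECORDING;
            break;
        case HubState::RECORDING:       // deterministic, fixed-rate IMU capture
            recordWindow();
            state = HubState::CLASSIFYING;
            break;
        case HubState::CLASSIFYING: {   // on-device inference, then fetch/render
            const char *label = classifyWindow();
            if (label) dispatchGesture(label);
            state = HubState::IDLE;     // OLED output persists until the next gesture
            break;
        }
    }
}
```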

Challenges

  • Ensuring deterministic sensor sampling under real-time constraints
  • Balancing ML inference latency with network operations on a microcontroller
  • Designing a clean UI flow within a limited OLED resolution
  • Managing multiple subsystems (IMU, touch, Wi-Fi, display) on shared I²C resources (a bus-sharing sketch follows this list)
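
For the shared-bus point, a minimal sketch of how the OLED and IMU can coexist on one I²C bus; the specific parts (an SSD1306 at 0x3C, an MPU-series IMU at 0x68) and pin assignments are assumptions:

```cpp
#include <Wire.h>
#include <Adafruit_SSD1306.h>

Adafruit_SSD1306 display(128, 64, &Wire, -1);  // OLED and IMU share the same Wire bus

void setup() {
    Wire.begin(21, 22);                         // ESP32 default SDA/SCL pins (assumed)
    display.begin(SSD1306_SWITCHCAPVCC, 0x3C);  // SSD1306 OLED at address 0x3C
    // The IMU driver is initialized on the same Wire instance at its own
    // address (e.g. 0x68), and all bus transactions happen from the main loop,
    // so the two devices never contend for the bus from different contexts.
}
```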

Results

  • Reliable real-time gesture classification with a low false-trigger rate
  • Responsive, intuitive interaction loop driven entirely by gestures
  • Stable integration of ML inference, networking, and display rendering on constrained hardware
  • Fully self-contained embedded system demonstrating end-to-end ML deployment at the edge