SENTINEL
The assistant that watches you work — and obeys your hands.
It sees your posture and focus through the webcam, coaches you in real time, and lets you command your screen with bare-handed gestures. Every frame stays on your machine. No cloud. No account. Free.
It sees you the way a good coach would.
Presence
Knows when you sit down, step away, and return. Greets the work, not the empty chair.
Posture
Calibrates your upright baseline, then catches the slow slouch before your back does.
Stillness
Tracks how long you've been frozen in the chair and tells you to stand before the hour is gone.
Screen thrash
Counts how fast you're flipping between windows and calls it when you're flailing, not working.
Eleven controllers — run it all with your bare hands.
One line. The right amount of pressure.
The whole thing runs on the laptop in front of you.
Why these features — grounded in the science.
Sedentary risk
The WHO flags prolonged sitting as a real health risk; the remedy is frequent movement. → our sitting timer + stand-up nudges.
20-20-20
American Academy of Ophthalmology: every 20 min, look ~20 ft away for 20 s to ease screen eye-strain. → our eye-rest reminder.
Computer Vision Syndrome
American Optometric Association: screens sharply cut your blink rate and dry the eyes. → we surface your blinks/min.
“Tech neck”
Cervical-load research: a forward-tilted head multiplies the strain on your neck. → posture calibration + slouch catch.
Attention residue
Sophie Leroy's work: switching tasks leaves part of your mind stuck on the last one. → window-thrash detection.
Camera vitals (rPPG)
Poh, McDuff & Picard, MIT Media Lab: pulse and breathing from ordinary webcam video. → roadmap; research-grade, not in v1.
A real assistant — for the betterment of the human using it.
Built on open source — and what plugs in next.
Vision
Live: MediaPipe Pose / Face / Gesture + OpenCV — body skeleton, blink, hands, on-device. Next: YOLO-Pose, py-feat expressions, rPPG vitals (research-grade).
Voice
Live: faster-whisper (hears) + pyttsx3 (speaks). Next: openWakeWord (a real “hey jarvis” model), Piper neural TTS, Silero VAD.
Brain & control
Live: Claude (Jarvis) + pywin32 (windows, keys, cursor). Next: Ollama local LLM ($0, private), Playwright for precise in-app selection.
Wearables & IoT
Documented, not yet wired: Fitbit / Withings / Garmin APIs; ESP32 sensors (MAX30102 heart-rate, MPU-6050 posture); Home Assistant for TV / lights (“change the channel by blink”).
Two lines and it's watching.
Built as a personal tool — sibling to my FlowLite dictation app. Pure Python, no installer, no telemetry. The ~8MB gesture model downloads itself on first run.