A quarterly clinic model cannot see what determines outcomes.
Glaucosim is a browser-and-phone layer that runs a longitudinal home cadence of clinically grounded tests, captures medication adherence, and surfaces a trend to the eye-care professional before the next appointment.
Most home tools cover one test and depend on dedicated hardware. Glaucosim runs a multi-modality session on the devices the patient already owns.
Continuous IOP needs an implant or a contact lens (Eyemate, Triggerfish). Home perimetry needs a VR headset or a tablet kiosk (Olleyes, Heru, RadiusXR, Imo Vifa). Home anterior-segment imaging needs a clip-on lens. Home tools that ship without dedicated hardware cover a single test — refraction (EyeQue, Easee), VF (MRF, iPad ZEST), or screening (Peek Vision).
Glaucosim is the only point in the top-right quadrant covering visual function + anterior segment + IOP screen (β) + adherence in one home session, on devices the patient already owns. Peek Vision is the closest conceptual peer but is built for community-screening triage, not longitudinal glaucoma monitoring.
Per-measurement precision is lower than instrument-bound counterparts. The trade is an order-of-magnitude increase in sampling cadence, and slope-estimate variance falls as 1/n³ when test occasions are added [5].
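The 1/n³ scaling is the ordinary least-squares slope variance under an assumed linear trend with equally spaced test occasions — a sketch:

```latex
\operatorname{Var}(\hat{\beta})
  = \frac{\sigma^2}{\sum_{i=1}^{n} (t_i - \bar{t})^2},
\qquad
t_i = i\,\Delta
\;\Rightarrow\;
\sum_{i=1}^{n} (t_i - \bar{t})^2
  = \frac{\Delta^2\, n(n^2 - 1)}{12}
  \approx \frac{\Delta^2 n^3}{12},
\quad\text{so}\quad
\operatorname{Var}(\hat{\beta}) \approx \frac{12\,\sigma^2}{\Delta^2 n^3}.
```

Tripling the number of occasions at a fixed cadence Δ cuts slope-estimate variance by roughly 27× — the case for trading per-test precision for sampling density.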
How do we run clinical-grade tests remotely, without dedicated devices, and still trust the data?
Visual acuity, contrast and perimetry each assume a different luminance window. A test outside its window is not interpretable.
Stimulus angle, optotype size, and pixel pitch all depend on the patient-to-screen distance — and on which eye is actually being tested.
Peripheral perimetry assumes central fixation. A 4° saccade away from target makes the stimulus land at the wrong location.
No additional hardware. No data leaves the device until results are signed and synced.
Iris-pinhole projection from MediaPipe FaceMesh.
EAR + hand-landmark + iris occlusion fusion.
Iris-relative-to-canthi, Kalman-filtered.
Calibrated webcam-mean luminance proxy + glare.
Anterior-segment focus, exposure, framing scorer.
Each module reads all five channels before allowing a trial. Out-of-band readings prompt re-positioning or invalidate the affected stimulus. Every event is logged for retrospective audit.
We use the interpupillary distance (IPD) as the real-world anchor — the population mean for adults is 63 mm (SD ~3.5 mm) [7]. MediaPipe FaceMesh returns the two iris-center landmarks (468 left, 473 right). We measure the IPD in pixels and recover patient-to-screen distance from the pinhole projection.
d = fpx · IPD / IPDpx — d: patient-to-camera distance (mm) · fpx: camera focal length (px), recovered with a one-time on-screen calibration step · IPD: real interpupillary distance, fixed at 63 mm · IPDpx: live pixel distance between iris centers (FaceMesh 468 ↔ 473).
Why IPD and not iris diameter: the iris edge is harder to segment reliably under variable lighting and lashes, while iris centers are detected by MediaPipe with sub-pixel stability and remain visible even when the lid covers part of the limbus.
SIMILAR TRIANGLES · REAL IPD FIXED AT 63 MM · PIXEL IPD INVERTS WITH DISTANCE
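The similar-triangles relation, as a minimal sketch (helper names are illustrative; the production logic lives in Glaucosim's calibration code):

```javascript
// Pinhole-projection distance recovery from interpupillary distance.
// Assumes the population-mean IPD of 63 mm as the real-world anchor.
const IPD_MM = 63;

// fPx: camera focal length in pixels, from the one-time on-screen
//      calibration step.
// ipdPx: live pixel distance between FaceMesh iris centers (468, 473).
function distanceMm(fPx, ipdPx) {
  // Similar triangles: real IPD / distance = pixel IPD / focal length.
  return (fPx * IPD_MM) / ipdPx;
}
```

Pixel IPD inverts with distance: halving the measured pixel span doubles the recovered d.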
Monocular tests assume the operator knows which eye is tested. At home, a left-eye trial labelled as right-eye produces a clean, plausible, incorrect record. Glaucosim verifies cover state from three independent signals — any single one of which is brittle alone.
Eye Aspect Ratio (EAR): open eye ≈ 0.27–0.32; closed eye < 0.15. Threshold calibrated per subject over a 25-frame baseline at session start [8].
PER-EYE STATE GATES EVERY STIMULUS · LIVE @ ~30 HZ
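The EAR leg of the fusion can be sketched from landmark geometry alone (per-subject threshold calibration and the other two signals are omitted here):

```javascript
// Eye Aspect Ratio from six eyelid landmarks (Soukupova & Cech).
// p1..p6 are [x, y] points: p1/p4 the eye corners, p2/p3 the upper
// lid, p6/p5 the lower lid. Landmark indices come from FaceMesh.
const dist = (a, b) => Math.hypot(a[0] - b[0], a[1] - b[1]);

function eyeAspectRatio([p1, p2, p3, p4, p5, p6]) {
  // Vertical lid gaps normalised by the horizontal eye width:
  // open eye keeps EAR high; a closed lid collapses the numerator.
  return (dist(p2, p6) + dist(p3, p5)) / (2 * dist(p1, p4));
}
```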
Perimetry assumes the patient is looking at the central target. If gaze drifts, the stimulus meant to land at 21° lands at 17° or 25°, and the threshold at the labelled location is wrong without the algorithm knowing.
Gaze is computed as iris position relative to the eye corners, in a head-relative frame — so translating the head does not move the vector; only a saccade does. A 1-D Kalman filter is applied to each component, with measurement noise inflated during blinks.
Reads as g = (xiris − xcenter) / weye: the offset of the iris center from the eye's center, normalised by the width of the eye opening.
After a 30-frame baseline g₀ captured at session start, drift is Δ = g − g₀. Stimuli presented while ‖Δ‖ > 4° are flagged and excluded from the ZEST posterior update. Heijl-Krakau blind-spot catch trials run in parallel for the standard reliability indices.
DRIFTED STIMULI ARE DROPPED FROM THE BAYESIAN POSTERIOR · FL / FP / FN COMPUTED IN PARALLEL
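A minimal sketch of the per-component 1-D filter; the noise values q and r are illustrative, not Glaucosim's tuned constants:

```javascript
// 1-D Kalman filter over one gaze component, random-walk state model.
// During blinks the measurement noise R is inflated so the estimate
// coasts on the prediction instead of chasing a corrupted measurement.
function makeKalman({ q = 1e-3, r = 1e-2 } = {}) {
  let x = 0; // state estimate (gaze component)
  let p = 1; // estimate variance
  return function update(z, { blink = false } = {}) {
    const R = blink ? r * 100 : r; // inflate measurement noise on blink
    p += q;                        // predict: variance grows by q
    const k = p / (p + R);         // Kalman gain
    x += k * (z - x);              // correct toward measurement z
    p *= 1 - k;
    return x;
  };
}
```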
Visual function thresholds are luminance-dependent. Acuity assumes ISO 8596 background; Pelli-Robson assumes ~85 cd/m²; perimetry assumes a dim room so stimulus contrast reaches operating range.
Glaucosim derives an operational ambient proxy from the webcam: mean greyscale intensity of the central patch, exposure-compensated, calibrated against an on-screen reference step at session start.
Lproxy ≈ k · ⟨Igrey⟩ / ecam — ⟨Igrey⟩: mean intensity of central patch · ecam: camera exposure from MediaStream constraints · k: per-device constant from a 5 s on-screen reference.
EACH TEST DEFINES ITS OWN WINDOW · OUT-OF-WINDOW SESSIONS ARE TAGGED ADVISORY OR REJECTED
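A sketch of the gating logic; k, the window bounds, and the helper names are illustrative:

```javascript
// Ambient-luminance proxy from the webcam (sketch).
// meanGrey: mean greyscale intensity of the central patch (0-255)
// exposure: camera exposure read from MediaStream constraints
// k: per-device constant from the 5 s on-screen reference step
function ambientProxy(meanGrey, exposure, k) {
  return (k * meanGrey) / exposure; // exposure-compensated intensity
}

// Each test defines its own window; out-of-window sessions are
// tagged advisory or rejected before any trial runs.
function gateSession(proxy, [lo, hi]) {
  return proxy >= lo && proxy <= hi ? "ok" : "out-of-window";
}
```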
A patient's phone records a short anterior-segment clip per take. To be useful for surface review, each frame has to be in focus, well exposed, and framed on the iris. A quality scorer runs over every frame so the patient is guided in real time.
Q = α·Fvar + β·Ehist + γ·Riris − δ·Mblur — Fvar: Laplacian variance (focus) · Ehist: exposure flatness · Riris: iris coverage from FaceMesh ROI · Mblur: motion blur from optical-flow magnitude.
Only takes that pass the threshold are kept. The voice avatar tells the patient to come a little closer, hold still, or retake.
ONE FRAME PER EYE · PHONE OR LAPTOP · IMAGES ENCRYPTED AT REST
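A sketch of the per-frame scorer combining the four features; the weights α…δ and the keep/retake threshold here are placeholders, not the shipped constants:

```javascript
// Per-frame quality score: Q = a*fVar + b*eHist + g*rIris - d*mBlur.
// Features are assumed pre-normalised to [0, 1]; weights illustrative.
function qualityScore({ fVar, eHist, rIris, mBlur },
                      { a = 0.4, b = 0.2, g = 0.3, d = 0.1 } = {}) {
  return a * fVar + b * eHist + g * rIris - d * mBlur;
}

// Real-time guidance: only takes above threshold are kept; otherwise
// the voice avatar asks the patient to retake.
function guidance(q, threshold = 0.6) {
  return q >= threshold ? "keep" : "retake";
}
```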
ZEST Bayesian adaptive thresholding on the 54-location grid. Same family as SITA.
At each of the 54 locations, the threshold is treated as a probability distribution, not a single number. Every stimulus shifts that distribution toward the patient's true value. The test stops at a given location only when the distribution is tight enough to commit.
Termination when posterior SD < 1.5 dB. Drifted-gaze stimuli (Model 03) are dropped from the update.
Turpin showed ZEST ≈ SITA in threshold accuracy with fewer presentations [9]. Schulz validated iPad ZEST against HFA in glaucoma [10].
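The per-location update can be sketched as a discrete Bayesian filter; the psychometric-function parameters here are illustrative:

```javascript
// P(seen | threshold): the stimulus at stimDb attenuation is seen when
// the location's sensitivity (threshDb) exceeds it, softened by a
// logistic slope plus false-positive/false-negative floors.
function psi(stimDb, threshDb, { slope = 1, fp = 0.03, fn = 0.03 } = {}) {
  const p = 1 / (1 + Math.exp(-(threshDb - stimDb) / slope));
  return fp + (1 - fp - fn) * p;
}

// One ZEST trial: multiply the pmf over candidate thresholds (dB) by
// the likelihood of the observed response, then renormalise.
function zestUpdate(pmf, dbs, stimDb, seen) {
  const post = pmf.map((p, i) => {
    const ps = psi(stimDb, dbs[i]);
    return p * (seen ? ps : 1 - ps);
  });
  const z = post.reduce((s, v) => s + v, 0);
  return post.map(v => v / z);
}

// Terminate at a location once the posterior SD drops below 1.5 dB.
function posteriorMeanSd(pmf, dbs) {
  const mean = dbs.reduce((s, d, i) => s + d * pmf[i], 0);
  const variance = dbs.reduce((s, d, i) => s + (d - mean) ** 2 * pmf[i], 0);
  return { mean, sd: Math.sqrt(variance) };
}
```

A "seen" response at a given attenuation shifts posterior mass toward higher sensitivity; drifted-gaze trials are simply never passed to `zestUpdate`.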
Contactless pressure screen — the laptop speaker emits, the phone camera watches. Research-only signal. Not a replacement for Goldmann.
The eye is a viscoelastic ball under pressure. Drive it with low-frequency sound and it has a mechanical resonance whose frequency depends on the stiffness of the cornea + sclera — and within a patient, that stiffness is dominated by intraocular pressure. Higher IOP, stiffer eye, higher resonance frequency.
We don't listen for an echo with the microphone. The speaker drives the eye; the phone selfie camera tracks the iris landmarks frame-by-frame. The iris is rigidly coupled to the cornea, so its sub-pixel motion in the video is a direct read-out of the eye vibrating under acoustic excitation.
The pipeline: a linear chirp drives the eye from f0 = 12 Hz to f1 = 22 Hz over T = 5 s; the resonance peak f* is expected in 14–20 Hz; the per-patient mapping f* → mmHg lives in iop-mmhg.js.
Proposed validation at Shiley: acoustic mmHg estimate vs same-day Goldmann across IOP ranges, retest reliability over 4 weeks.
FOR RESEARCH ONLY · NOT A TONOMETER · NOT A SUBSTITUTE FOR GOLDMANN
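A sketch of the peak search over the iris-displacement trace — a single-bin DFT scanned across the expected band. Sample rate and band edges follow the parameters above; the mmHg mapping itself stays in iop-mmhg.js:

```javascript
// Power of the displacement trace at one candidate frequency
// (single-bin DFT; samples is the per-frame iris displacement).
function bandPower(samples, fs, freq) {
  let re = 0, im = 0;
  for (let n = 0; n < samples.length; n++) {
    const w = (2 * Math.PI * freq * n) / fs;
    re += samples[n] * Math.cos(w);
    im -= samples[n] * Math.sin(w);
  }
  return re * re + im * im;
}

// Scan the expected resonance band and return the peak frequency f*.
function resonancePeakHz(samples, fs, { lo = 14, hi = 20, step = 0.5 } = {}) {
  let best = lo, bestP = -Infinity;
  for (let f = lo; f <= hi; f += step) {
    const p = bandPower(samples, fs, f);
    if (p > bestP) { bestP = p; best = f; }
  }
  return best;
}
```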
ETDRS / Bailey-Lovie logMAR on a physically calibrated display, at the patient's measured distance.
A 20/20 letter is defined as one that occupies exactly 5 arcminutes of visual angle. The patient is rarely 4 m from a laptop, so the optotype is physically resized in real time to preserve that same angular subtense at the live measured distance.
Computing the letter height in millimetres is the easy step. Rendering that height correctly on a screen the browser refuses to describe is the hard one — DOM physical units (1cm, 1mm) are reference units pinned to 96 DPI, not the actual display.
Glaucosim recovers the device's pixel pitch by identifying the screen, not by asking the patient. The user agent, screen.width × screen.height, and devicePixelRatio together fingerprint the device against an internal database of iPads, iPhones, MacBooks, Android flagships and common external monitors (Studio Display, Dell UltraSharp, LG UltraFine, BenQ PD27) — each indexed to a known CSS DPI. For external displays on macOS we read the monitor label exposed by the Window Management API, which the OS derives from the EDID.
25.4 mm/inch divided by the device's CSS DPI gives mm per CSS pixel — the same CSS pixels that window.innerWidth reports. A webcam cross-check optionally validates the estimate by comparing the measured iris-pair pixel span against the size expected at the live distance. Source: core/calibration.js.
Sloan optotypes, 2-down-1-up staircase, 0.1 logMAR step, 5 reversals [12]. Clinically meaningful Δ ≈ 0.1 logMAR [13].
DISTANCE FROM MODEL 01 · OPTOTYPE HEIGHT RECOMPUTED PER FRAME
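The sizing computation, as a sketch; mmPerCssPx is assumed to come from the device-fingerprint calibration described above:

```javascript
// A 20/20 (logMAR 0) optotype subtends 5 arcminutes; each +0.1 logMAR
// scales the angular size by 10^0.1.
const ARCMIN = Math.PI / (180 * 60); // radians per arcminute

function letterHeightMm(distanceMm, logMar = 0) {
  const arcmin = 5 * Math.pow(10, logMar);
  // Physical height that subtends the target angle at this distance.
  return 2 * distanceMm * Math.tan((arcmin * ARCMIN) / 2);
}

// Convert to CSS pixels via the recovered pixel pitch; recomputed per
// frame as the live distance changes.
function letterHeightCssPx(distanceMm, mmPerCssPx, logMar = 0) {
  return letterHeightMm(distanceMm, logMar) / mmPerCssPx;
}
```

At 600 mm a 20/20 letter is under a millimetre tall, which is why the pixel-pitch calibration has to be exact.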
Pelli-Robson, age-normed. Background luminance gated by Model 04 before the run starts.
Pelli-Robson fixes letter size well above acuity threshold, then varies only one thing: contrast. Letters are shown in triplets that step down 0.15 log units of contrast. The contrast threshold is the last triplet the patient reads with at least two of three letters correct.
C: Michelson contrast — (Lmax − Lmin) / (Lmax + Lmin). Normal log CS ≈ 1.95; ≤ 1.5 is impaired [14].
CS loss often precedes detectable acuity change in early glaucoma — and is sensitive to drug-induced ocular-surface change.
LETTER SIZE FIXED · ONLY CONTRAST VARIES · LAST CORRECT TRIPLET = THRESHOLD
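The triplet progression, as a sketch; the chart layout is simplified (first triplet at full contrast) and may differ from the shipped constants:

```javascript
// Pelli-Robson steps down 0.15 log units of contrast per triplet.
function tripletContrast(tripletIndex) {
  const logCS = 0.15 * tripletIndex; // log contrast sensitivity
  return Math.pow(10, -logCS);       // contrast actually rendered
}

// Threshold = log CS of the last triplet read with >= 2/3 correct.
function logCsThreshold(correctPerTriplet) {
  let last = -1;
  correctPerTriplet.forEach((c, i) => { if (c >= 2) last = i; });
  return last < 0 ? null : 0.15 * last;
}
```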
Four graded outputs from a single frame per eye. Phone or laptop — patient picks the device.
Per-frame score combining Fvar (Laplacian focus), Ehist (exposure flatness), Riris (iris ROI coverage from FaceMesh) and Mblur (motion blur). Reported alongside the three clinical grades so reviewers see how confident the capture is.
MediaPipe FaceMesh segments the bulbar conjunctiva ROI in the primary-gaze frame. Redness index = ⟨R / (R + G + B)⟩ over the ROI, illumination-normalised against the patient's own ambient-lit cheek patch. Continuous score → ordinal Efron 0–4.
Upper-eyelid skin patch sampled in CIE L*a*b*. The melanin proxy ITA° = arctan( ( L* − 50 ) / b* ) · 180/π; we report the within-patient ΔITA° vs an infraorbital cheek reference patch, then map to the Periocular Hyperpigmentation Severity Scale (Sheth 2014).
Prostaglandin-associated periorbitopathy. MRD1 (pupil-center → upper-lid-margin) converted to mm via per-frame IPD scale, plus an upper-lid-sulcus depth proxy from shadow contrast. Mapped to the Aakalu 0–3 ordinal scale.
EVERY FRAME TAGGED WITH DEVICE · DISTANCE · LUX · Q · MODEL VERSION · TIME
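The ITA° computation above, as a standalone sketch — atan2 stands in for the bare arctan so the angle stays defined as b* → 0:

```javascript
// Individual Typology Angle from a CIE L*a*b* skin patch.
// lab: { L, b }. Lower ITA = darker/more pigmented skin.
function itaDegrees({ L, b }) {
  return (Math.atan2(L - 50, b) * 180) / Math.PI;
}

// Within-patient delta: upper-lid patch vs infraorbital cheek
// reference, isolating periocular darkening from baseline skin tone.
function deltaIta(lidLab, cheekLab) {
  return itaDegrees(lidLab) - itaDegrees(cheekLab);
}
```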
V0 ships with hand-engineered features per output. V1+ is a multi-task CNN, on the three clinical grades only, trained on labels the clinician writes in the dashboard. Image quality stays deterministic.
Each grading is computed deterministically from MediaPipe landmarks + per-pixel color in a stable ROI. Calibrated against published reference photographs of each scale.
Q = α·Fvar + β·Ehist + γ·Riris − δ·Mblur · gates retake in real time · reported alongside the three grades so reviewers see the capture's confidence.
ROI = bulbar conjunctiva mask (MediaPipe) · feature = ⟨R / (R+G+B)⟩ normalised against the patient's cheek patch · ordinal map to Efron 0–4 via reference-photo LUT.
Upper-lid skin patch in CIE L*a*b* · feature = ITA° + ΔITA° vs cheek · ordinal map to POHSS 0–3.
FaceMesh upper-lid + pupil → MRD1 (mm) via per-frame IPD scale · sulcus shadow contrast as a depth proxy · ordinal map to Aakalu PAP 0–3.
Every clinician review in the dashboard adds three ordinal labels per take. The platform is the labelling tool.
Versioned model files; predictions never overwrite labels. The dashboard surface lets a fellow drag a slider to re-grade — every correction lands in the training set. UCSD-labelled corpus stays UCSD-owned.
The standard 25-item PRO, voice or tap, on a home cadence rather than annual.
25 questions split into 12 subscales — general vision, near, distance, peripheral, ocular pain, role limitations, dependency, social, mental, color, driving, plus a general-health item. Each response is rescaled 0–100. Subscale = mean of items; composite = mean of vision-targeted subscales.
Calibrated and validated by Mangione et al. 2001 [15]. The shift here is cadence: we run it every 90 days at home, so trajectory becomes visible.
VOICE OR TAP · ~7 MIN · 90-DAY CADENCE
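The scoring rule, as a sketch — items are assumed pre-rescaled to 0–100, and the subscale keys are illustrative:

```javascript
const mean = xs => xs.reduce((s, x) => s + x, 0) / xs.length;

// Subscale = mean of its items; composite = mean of the vision-targeted
// subscales, with the general-health item excluded per the VFQ-25 rule.
function vfqComposite(subscales) {
  const visionTargeted = Object.entries(subscales)
    .filter(([name]) => name !== "general_health")
    .map(([, items]) => mean(items));
  return mean(visionTargeted);
}
```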
Reminders, single-tap confirmation, structured missed-dose reason — then overlaid on visual-function trend.
Adherence is invisible because self-report at the next visit overstates it by ~31% versus objective measurement [3]. Glaucosim turns adherence into a continuously logged variable: a reminder fires at every scheduled dose, the patient confirms with one tap, and missed doses are captured with a structured reason rather than a generic apology.
The clinician dashboard overlays missed-dose density on the MD trend, so adherence vs progression is one chart — and a behavioural conversation has a concrete artefact behind it.
ADHERENCE BECOMES A VARIABLE, NOT A SELF-REPORT
[1] Olthoff CMG et al. Ophthalmology 2005
[2] Newman-Casey PA et al. Ophthalmology 2015
[3] Friedman DS et al. IOVS 2014
[4] Stagg BC et al. JAMA Ophthalmol 2022
[5] Chauhan BC et al. Br J Ophthalmol 2008
[6] Sakata R et al. Am J Ophthalmol 2021
[7] Caroline P, André M. Contact Lens Spectrum 2002
[8] Soukupová T, Čech J. CVWW 2016 (Eye Aspect Ratio)
[9] Turpin A et al. IOVS 2003 (ZEST validation)
[10] Schulz AM et al. JAMA Ophthalmol 2018 (iPad ZEST)
[11] Heijl A et al. Acta Ophthalmol 1989
[12] Bailey IL, Lovie JE. Am J Optom 1976
[13] Rosser DA et al. Br J Ophthalmol 2003
[14] Pelli DG et al. Clin Vis Sci 1988
[15] Mangione CM et al. Arch Ophthalmol 2001 (NEI VFQ-25)