Instrument the attempt
Capture the learner's action, timing, confidence, error type and context. A score alone is too coarse; the trace is where the useful signal lives.
Mereth Labs takes education's oldest bottlenecks and treats them as systems you can actually measure — slow feedback, memory left to chance, incentives aimed at the wrong target, skill nobody can verify. It starts as research and is built to leave as software.
The evidence is old. The delivery stack is new.
Bloom's 1984 "2 sigma" paper made the benchmark vivid: one-to-one tutoring with mastery learning can shift achievement far beyond ordinary classroom instruction. The exact effect depends on domain, tutor quality and implementation, but the mechanism is not mysterious: frequent diagnosis, targeted feedback and progression after mastery.
Many education bottlenecks have the same structure. The learner has a changing hidden state. The system observes that state poorly, acts too late, or rewards the wrong signal. Computation matters because it lowers the cost of observing, estimating and acting on that state for each learner.
We turn a learning question into a loop you can measure, test and tighten — without pretending education is only a software problem.
Capture the learner's action, timing, confidence, error type and context. A score alone is too coarse; the trace is where the useful signal lives.
Maintain explicit beliefs about mastery, recall probability, misconception class and uncertainty. The model should say what it knows and what it does not.
Pick the next move from the estimated state: what feedback to give, when to schedule a review, how hard the next task should be, which way to route. The goal is a loop you can steer, not personalisation for its own sake.
Track learning gain, retention, transfer, calibration, latency and robustness to gaming. A fluent interface is not evidence of learning.
Each is filed by the kind of fix it needs: engineering, when the bottleneck is measurement, scheduling or latency; system dynamics, when the bottleneck is routing, incentives or verification.
One-to-one mastery tutoring is still the benchmark for adaptive instruction. Naming the fix was never the hard part — delivering diagnosis, feedback and progression cheaply enough for everyone is.
Memory weakens with time and strengthens with successful retrieval. The technical problem is estimating when a learner needs the next review, not telling everyone to revise more.
When a score becomes the target, it starts measuring strategy as much as learning. Assessment has to be designed as an incentive system, not only as a measurement instrument.
Age-graded cohorts are a routing policy: administratively simple, but weakly tied to mastery. A better system routes learners by evidence of readiness, not only by calendar time.
Feedback that arrives after the learner has left the attempt often loses corrective force. The engineering target is to return useful signals while the attempt is still actionable.
Degrees bundle instruction, selection and signalling into one coarse credential. Many jobs need a sharper signal: evidence that a person can perform a specific skill under defined conditions.
A paper that never leaves the page helps nobody. Everything we study is built to ship as a product you can inspect — assumptions, measurements, outcomes and all.
Software that gives a learner tighter feedback, better retrieval timing and clearer mastery signals than a fixed classroom clock can provide.
Tools for schools, training teams and employers that need stronger evidence than grades, attendance or coarse credentials.
If you build learning systems, run a classroom, evaluate skills, fund research, or just have a problem with measurable stakes — get in touch. We want collaborators who care about the evidence and the implementation in equal measure.
Get in touch