Kensink Labs
★ Industry vertical · EdTech AI06 service itemsKid-safe by default
INDUSTRY · EDTECH AI · K-12 + AFTER-SCHOOL

AI tutors kids actually learn from.

Production AI for K-12 platforms, after-school tutoring apps, university copilots, and learning operators. Teach-back loops that force active recall, voice-first explain-it pipelines with sub-five-second feedback, kid-safe guardrails by default, and a parent dashboard that turns a sticker chart into evidence.

Industry
EdTech · K-12 + tutoring
Safety
COPPA-aware · kid-safe
Voice
STT + TTS · <5s round-trip
Stack
Direct LLM · evals-first
[WHAT WE HEAR FROM EDTECH FOUNDERS AND CURRICULUM LEADS]

Three pains every edtech-AI team hits.

We have shipped against this shape with K-12 learning apps, after-school tutoring platforms, university student copilots, and corporate L&D operators. The shape repeats.

The fix is not a more polished avatar. The fix is the eval suite underneath, the voice pipeline that does not break for accent-heavy speech, and the parent visibility that turns parents from skeptics into evangelists.

PAIN · 0101 / 03

AI tutors hallucinate facts kids remember wrong.

A drilled fact a six-year-old learns becomes a foundational belief. An LLM that confidently says the wrong thing once creates a misconception that costs years to unlearn. The eval bar is unforgiving.

↓ How we fix it, below.
PAIN · 0202 / 03

Parents can't see if learning is happening.

Sticker charts and 'time on app' are not learning. Parents pay for results and need to see what their kid mastered tonight, in plain language, with evidence the AI is not making it up.

↓ How we fix it, below.
PAIN · 0303 / 03

Multilingual + regional curriculum is a sales blocker.

Markets outside English-first geographies want the tutor in their language, on their curriculum, with their grade-level pedagogy. Most AI tools fail this on day one and lose the deal.

↓ How we fix it, below.
[SIX SERVICE ITEMS · ONE TEAM]

Pick the edtech-AI problem.
We'll bring the build.

Eight-week engagements, kid-safe by default, eval-gated releases, and a deployment shape your COPPA reviewer can sign off on.

SERVICE · 01 / 06Teach-back core
AI tutor engine

A Feynman-method teach-back loop that forces active recall. Kid explains, AI evaluates, kid refines, mastery scored.

  • Five-step loop: Learn It > Explain It > Feedback > Refine > Master
  • Grade-level + subject-tuned prompts in a versioned registry
  • Per-child knowledge model that compounds across sessions
AnthropicPostgreSQLTypeScriptVoltAgent
SERVICE · 02 / 06Accuracy + safety
Eval suite for educational AI

Curriculum-validated golden sets, hallucination scoring on facts kids will remember, safety red-team on every release.

  • Golden set per grade and subject, curriculum-aligned
  • Hallucination + age-appropriateness + tone scoring
  • Red-team prompt sets for kid-safety regression
LangSmithPromptfooPythonClickHouse
SERVICE · 03 / 06Sub-5s round-trip
Voice-first STT + TTS pipeline

Kid explains by voice, gets a written + spoken feedback report in under five seconds. Accent-aware STT, grade-tuned TTS.

  • Whisper + custom STT post-processing for kid speech
  • ElevenLabs / Google TTS tuned for grade-school clarity
  • Round-trip under 5s so a six-year-old does not lose focus
WhisperElevenLabsCloudflare QueuesHono
SERVICE · 04 / 06COPPA-aware
Kid-safe guardrails

Input + output guardrails that block PII, off-topic prompts, and any attempt to socialize the AI as a friend. Tutor only, never chatbot.

  • PII redaction on every input the child speaks or types
  • Output filter for age-inappropriate content + tone
  • Refusal pattern for any out-of-curriculum prompt
TypeScriptZodOpenTelemetryAnthropic
SERVICE · 05 / 06Visibility that converts
Parent dashboard + reports

Per-session learning evidence, weekly summary that names what the child mastered, and weak-area alerts the parent can act on.

  • Per-session mastery snapshot with cited evidence from the transcript
  • Weekly summary in parent-friendly language, not curriculum-speak
  • Weak-area alerts surfaced with a 'practice this together' suggestion
Next.jsPostgreSQLTypeScriptCloudflare
SERVICE · 06 / 06Pre-build wedge
Architecture review

One-week audit of your edtech-AI stack. Eval shape, voice pipeline, guardrail posture, parent-visibility data model — all named in writing.

  • Eval suite shape: what golden sets to build first
  • Voice pipeline: STT + TTS providers scored on accent + cost
  • Parent-visibility data model: what to surface, what to never show
ADRsPostgresWhisperAnthropic

Most engagements bundle two: a tutor build (01, 03) paired with the discipline that keeps it credible (02, 05). Bring the shape closest to your blocker.

Scope your engagement →

Want to see the K-Framework discipline behind every item? Read the K-Framework.

[THE STACK · BY LAYER]

Boring infrastructure. Kids that learn.

Tools that already pass kid-safety reviews. Voice on the edge where latency matters, cloud where flexibility wins.

LAYER · DATA + RETRIEVAL

Data + retrieval.

The store, the index, the search

PostgreSQLpgvectorRedisClickHouseBigQueryOpenSearch
LAYER · MODEL LAYER

Model layer.

Embeddings, providers, fallbacks

OpenAIAnthropicCohere EmbedVoyageLlama (self-hosted)vLLM
LAYER · EVAL + OBSERVABILITY

Eval + observability.

The eval bar, the cost meter, the drift alarm

LangSmithPromptfooOpenTelemetryDatadogGrafanaSentry
LAYER · BACKEND + TRANSPORT

Backend + transport.

Type-safe everything

TypeScriptNext.jsPythonFastAPIgRPCBullMQtRPCZod
LAYER · MOBILE

Mobile.

iOS + Android, native or cross

React NativeExpoSwiftKotlinFCMAPNs
LAYER · CLOUD + DEPLOYMENT

Cloud + deployment.

Whatever your infra already runs

Cloudflare WorkersCloudflare R2AWSGCPVercelFly
✕ WHAT WE DO NOT SHIP

Direct against the model API. Voice + eval gated on every release.

  • No LangChain
  • No LlamaIndex
  • No agent framework
  • No orchestration vendor
  • No black-box ML platform
[PROOF · WHAT THE STACK DELIVERS]

Numbers that hold up
in the parent meeting.

MEASURED · WEIGHTED · 2024–2026
LATENCY
<5s

Voice explanation to feedback report

BREADTH
K-12

Curriculum-aligned eval coverage

REACH
BN + EN

Bilingual voice + UI from day 1

SAFETY
COPPA

Kid-safety default for production

EDTECH AI · APPLIED K-FRAMEWORK

Bring the learning problem.
We'll bring the build.

Eight weeks, fixed scope, eval suite + kid-safety review at handoff. Direct LLM engineering on top of the K-Framework. Two Q3 slots remain.

CYCLE
8 weeks · problem to kid
OUTPUT
Code · evals · runbook
SAFETY
COPPA-aware by default